Self-supervised learning of wrist-worn daily living accelerometer data improves the automated detection of gait in older adults

Brand, Yonatan E.; Kluge, Felix; Palmerini, Luca; Paraschiv-Ionescu, Anisoara; Becker, Clemens; Cereatti, Andrea; Maetzler, Walter; Sharrack, Basil; Vereijken, Beatrix; Yarnall, Alison J.; Rochester, Lynn; Del Din, Silvia; Muller, Arne; Buchman, Aron S.; Hausdorff, Jeffrey M.; Perlman, Or

doi:10.1038/s41598-024-71491-3

Download PDF

Article
Open access
Published: 06 September 2024

Self-supervised learning of wrist-worn daily living accelerometer data improves the automated detection of gait in older adults

Yonatan E. Brand^1,2,
Felix Kluge³,
Luca Palmerini^4,5,
Anisoara Paraschiv-Ionescu⁶,
Clemens Becker^7,8,
Andrea Cereatti⁹,
Walter Maetzler¹⁰,
Basil Sharrack¹¹,
Beatrix Vereijken¹²,
Alison J. Yarnall^13,14,15,
Lynn Rochester^13,14,15,
Silvia Del Din^13,15,
Arne Muller³,
Aron S. Buchman¹⁶,
Jeffrey M. Hausdorff^2,17,18,19 &
â¦
Or Perlman^1,18Â

Scientific Reports volumeÂ 14, ArticleÂ number:Â 20854 (2024) Cite this article

2078 Accesses
Metrics details

Subjects

Abstract

Progressive gait impairment is common among aging adults. Remote phenotyping of gait during daily living has the potential to quantify gait alterations and evaluate the effects of interventions that may prevent disability in the aging population. Here, we developed ElderNet, a self-supervised learning model for gait detection from wrist-worn accelerometer data. Validation involved two diverse cohorts, including over 1000 participants without gait labels, as well as 83 participants with labeled data: older adults with Parkinson's disease, proximal femoral fracture, chronic obstructive pulmonary disease, congestive heart failure, and healthy adults. ElderNet presented high accuracy (96.43âÂ±â2.27), specificity (98.87âÂ±â2.15), recall (82.32âÂ±â11.37), precision (86.69âÂ±â17.61), and F1 score (82.92âÂ±â13.39). The suggested method yielded superior performance compared to two state-of-the-art gait detection algorithms, with improved accuracy and F1 score (pâ<â0.05). In an initial evaluation of construct validity, ElderNet identified differences in estimated daily walking durations across cohorts with different clinical characteristics, such as mobility disability (pâ<â0.001) and parkinsonism (pâ<â0.001). The proposed self-supervised method has the potential to serve as a valuable tool for remote phenotyping of gait function during daily living in aging adults, even among those with gait impairments.

Self-supervised learning for human activity recognition using 700,000 person-days of wearable data

Article Open access 12 April 2024

Real-world gait speed estimation, frailty and handgrip strength: a cohort-based study

Article Open access 23 September 2021

A machine learning contest enhances automated freezing of gait detection and reveals time-of-day effects

Article Open access 06 June 2024

Introduction

Aging is associated with progressive loss of motor function. These deficits are heterogeneous and may manifest as reduced walking speed, poor balance, increased gait variability, increased fear of falling, and shorter stride length^1,2,3. Objective measures of gait obtained during brief supervised gait testing in a lab or clinic predict survival, adverse health outcomes, and loss of independent living^4,5,6. However, these brief assessments provide only a limited snapshot of an individual's gait abilities and may notÂ fully reflect function and variability during the manifold demands of daily living^7,8. Advances in unobtrusive sensor technology afford investigators the opportunity to obtain a more comprehensive assessment of mobility via remote multi-day recordings of daily living. However, the automated analytic tools employed for many commercially available devices focus nearly exclusively on healthy younger adults and do not account for the impairments observed in older adults during device development or validation^9,10. Hence, there is an urgent need for the development and validation of automated tools to quantify daily living gait among the full health spectrum of older adults who reside in community-settings^11,12.

Previous studies investigating real-world gait employed accelerometers worn on the lower back, leveraging the inherent quasi-periodicity of lumbar movement during walking¹³. While these studies have demonstrated the potential of assessing daily living gait, sensor placement on the lower back may present limitations for long-term adherence due to potential discomfort¹⁴. A different approach is to ask participants to wear a wrist-worn accelerometer. Wrist-worn accelerometers have gained widespread use to measure daily living physical activity ^15,16,17,18. In this regard, the ubiquity and popularity of smartwatches make wrist-worn accelerometers a practical choice for ensuring high compliance in daily living studies. Wrist-worn accelerometers enable the extraction of a wide range of daily living behaviors, including sleep patterns¹⁹, circadian metrics²⁰, and levels of physical activity²¹. While estimated physical activity levels can provide many insights^22,23, to date, most studies using a wrist-worn accelerometer lacked detailed and high-resolution information about other crucial facets of gait quality²¹. Therefore, recent efforts have focused on leveraging these accelerometers to assess walking and gait quality.

The first step in deriving gait metrics from an accelerometer is the detection of gait sequences from the raw accelerometer signals^24,25. Gait detection from a wrist-worn accelerometer is more challenging compared to other locations, such as lower limbs or lower back, due to the non-gait related hand movement and the fact that wrist movements often deviate from the expected periodic swinging during the gait cycle. This may occur, for instance, when an individual walks while simultaneously engaging in other activities, such as texting. This challenge is exacerbated for older adults and people with gait disturbances, such as Parkinsonâs disease who manifest reduced arm swing during walking²⁶. People with Parkinsonâs disease also exhibit symptoms of tremor and dyskinesia, which impact wrist movements and contribute to an overall less stable and consistent gait pattern, factors complicating gait detection algorithms²⁴.

Classical gait detection algorithms employ signal processing techniques, such as peak detection and wavelet analysis, to extract features both from the time and frequency domain^25,27. These features are then used to identify gait sequences based on the repeated periodic waveforms manifested during gait. However, the complex wrist movements render the differentiation between gait and non-gait movements very challenging. Alternative approaches are needed to detect gait from wrist-worn accelerometers.

Previous studies addressed this goal by employing supervised machine learning algorithms that were trained to identify patterns in the signal associated with gait^16,28,29. Kluge et al.²⁵ conducted a comprehensive analysis of gait detection algorithms using accelerometer data from lower-back and wrist-worn accelerometers. The algorithms were trained on data from healthy young adults and subsequently tested on diverse subsets of adults from the Mobilise-D technical validation study³⁰, including older adults with and without varied diagnoses. They found, not surprisingly, that algorithms based on lower-back data outperformed wrist-based algorithms. Yet, the reduced performance of wrist-based algorithms may be attributed, in part, to being trained on data from healthy young adults, potentially leading to suboptimal performance among older adults. This highlights the need to optimize wrist-based algorithms for older adults, who more commonly show heterogeneous gait abnormalities that do not occur as frequently in younger adults.

The best performing wrist-based algorithm identified in the study by Kluge et al. was initially developed and validated in Brand et al.²⁴. In that study, we employed a supervised convolutional neural network with U-Net architecture³¹ for gait detection, focusing on older adults and people with Parkinsonâs disease (PD). The results were then compared to those of a control group comprising healthy young adults. Our findings indicated that biological meaningful measures of gait quality (e.g., cadence and gait regularity) and quantity (e.g. daily walking duration) could be derived from a wrist-worn accelerometer. However, it is crucial to note that the model's performance was reduced when applied to older adults and individuals with PD, compared to the healthy young adult control group. An important impediment for training a supervised model that can be applied to older adults and varied clinical conditions derives from the scarcity of ground-truth labels indicating the temporal location of the gait sequences, especially for recordings of unsupervised movement during daily living.

Recently, there has been a growing interest in leveraging self-supervised learning (SSL) methods to overcome the gap imposed by the shortage of labeled data³². SSL generally comprises two main stages. First, learning feature representations of the signals using a substantial amount of unlabeled data, which can be achieved through methods such as multi-task learning (MTL)³³ and contrastive learning^32,34. An example of contrastive learning is the SimCLR method: âA Simple Framework for Contrastive Learning of Visual Representationsâ³². In these approaches, the model's objective is to predict characteristics of the signal that do not require any labels. This stage is commonly referred to as the 'pretext' stage. The second stage involves fine-tuning the SSL model with a smaller set of labeled data in a supervised manner for a downstream task (e.g., gait detection).

The SSL approach has demonstrated significant potential in several human activity recognition tasks^35,36,37. For example, Yuan et al.³⁸ utilized the UK Biobank dataset, which comprised daily living recordings from a wrist-worn accelerometer, to develop an SSL model for activity recognition and exhibited improved performance in several tasks and datasets. Small et al.³⁹ fine-tuned this SSL model for gait detection on a semi-living dataset, termed OxWalk, which included approximately one hour of recording in a home environment. However, the dataset used for fine-tuning included only healthy adults (Nâ=â39, mean ageâ=â38.5 years). Thus, their model may not be optimized for older adults or individuals with gait disturbances.

Here, we developed and evaluated a gait detection deep learning approach, termed ElderNet, that was oriented and optimized for older adults and, in particular, those who might have impaired gait. The first stage involved the training of an SSL model, utilizing the pre-trained UK Biobank model of Yuan et al.³⁸. This SSL model was extensively modified in both architecture and training cohorts to include a large unlabeled dataset of more than 1000 older adults with and without impaired gait who wore a wrist-worn device for up to 10 days (mean age 83 years old) and were participating in the RUSH Memory and Aging Project (MAP)^40,41,42. Next, we fine-tuned the model on a labeled dataset consisting of 83 older adults (mean ageâ=â71.9 years) from the Mobilise-D technical validation study³⁰, each wearing a wrist-worn accelerometer for approximately 2.5 h. The Mobilise-D dataset is one of the largest available labeled datasets that include daily living recordings from a wrist-worn accelerometer in older adults. It contains a ground-truth reference for indicating the presence or absence of gait sequences. Additionally, the dataset contains participants with different health conditions, presenting a range of gait patterns, including individuals with Parkinson's disease, proximal femoral fracture, chronic obstructive pulmonary disease, congestive heart failure, and healthy adults. To explore the added value of the putative enhancements of the ElderNet, we compared it to two state-of-the-art algorithms: the U-Net architecture, which achieved the highest results in the study by Kluge et al.²⁵, and the model developed by Small et al., termed OxWalk, utilizing the strong UK Biobank SSL model.

Finally, we applied ElderNet to a set of new participantsânot previously trained by the modelâto begin to explore its construct validity and generalizability. Construct validity refers to the degree to which a measurement tool, like ElderNet, accurately evaluates its intended purpose, specifically gait detection. In this context, we examined walking duration obtained through ElderNet across cohorts whose clinical status is likely to lead to reduced daily living walking.

Results

Performance of the gait detection algorithm

To develop ElderNet, an SSL model was trained using the MAP database constituting 950 participants. Next, the labeled data from Mobilise-D was used for fine-tuning ElderNet and evaluating its performance (Fig. 1). 83 participants were included in the Mobilise-D dataset. Table 1 summarizes the characteristics of the Mobilise-D dataset.

Table 1 Characteristics of older adults in the Mobilise-D technical validation study.

Full size table

The model predictions made by ElderNet significantly outperformed the two other state-of-the-art algorithms^24,39 both in terms of accuracy and F1 score. The median accuracy for ElderNet was 96.86%, surpassing the U-Net at 93.69%, and OxWalk at 92.83% (pâ<â0.001). In terms of F1 scores, ElderNet achieved a score of 86.52%, outperforming the U-Net and OxWalk models which achieved scores of 67.29% (pâ=â0.046) and 73.51% (pâ<â0.01), respectively (Fig. 2). ElderNet exhibits high results across all the various cohorts, with F1 scores above 80% for all cohorts (Table 2). While the other models tended to poorly identify the gait bouts for the PFF cohort (with average F1 score of 56.29% and 51.53% for the OxWalk and U-Net, respectively), ElderNet achieved higher F1 score of 81.95% for this cohort. Figure 3 shows a representative example of a raw acceleration signal containing gait sequences, along with the predictions of the different models and the corresponding ground-truth labels.

Table 2 Performance of the algorithms across various cohorts in the Mobilise-D test dataset.

Full size table

Model selection: the effect of the SSL approach

We compared the performance of different SSL approaches, specifically MTL and SimCLR, with different model heads on top of the pre-trained UK Biobank model. The results showed that the fully-connected head with non-linearity performed the best under MTL, achieving the highest F1 score of 84.74âÂ±â0.51 (Table 3). This configuration was selected for further evaluations.

Table 3 The effect of the SSL approach.

Full size table

The impact of self-supervised learning

We compared ElderNet with its supervised counterpart (Table 4). ElderNet exhibited superior performance compared to its supervised counterpart, achieving an F1 score of 84.74 for ElderNet, compared to 79.21 for the supervised model.

Table 4 The effect of using SSL compared to supervised counterparts.

Full size table

Exploring construct validity

To examine the construct validity of the output of ElderNet, we first applied it on an unseen portion of the MAP dataset (Nâ=â157) that was not utilized during the training of ElderNet. Table 5 summarizes the characteristics of this test dataset. A preliminary analysis based on the detected gait events, revealed a few statistically significant differences across different subject populations and disease cohorts. The average daily walking duration displayed variations among participants in different demographic and clinical groups, as demonstrated in Fig. 4. A significant difference in daily walking durations was observed between age groups, indicating a decline in walking activity with age, supporting its utility. To account for this, we performed partial correlation analyses, adjusting for age, sex, and BMI in subsequent comparisons, and found that the differences between groups remained statistically significant.

Table 5 Average daily walking durations across demographic and clinical factors in the MAP unseen dataset.

Full size table

Furthermore, Fig. 4 illustrates that participants with a mobility disability score of 0 (no mobility disability) walked significantly more minutes per day than those with scores of 2 (pâ<â0.0001) and 3 (pâ<â0.0001). Additionally, participants with a mobility disability score of 1 also showed a significant difference from those with a score of 3 (pâ=â0.048). Examining participants with different parkinsonism scores in terms of the number of parkinsonian signs, we observed that individuals without any parkinsonian signs walked significantly more than those with 1 sign (pâ<â0.01) or two or more signs (pâ<â0.01).

Discussion

In this work, we developed and validated a gait detection algorithm (ElderNet), specifically designed for older adults with and without gait impairments. ElderNet demonstrated superior performance compared to the two state-of-the-art algorithms. It achieved the highest accuracy, significantly surpassing the OxWalk model³⁹. Moreover, its F1 score was higher than both OxWalk and the U-Net²⁴ models. Additionally, ElderNet achieved at least comparable results in other metrics such as specificity, recall, and precision. The imbalance between gait and non-gait sequences in daily living is often expressed by a significant trade-off between precision and recall²⁴. While the U-net and the OxWalk models indeed exhibited such a trade-off, our model was prominent with stable precision and recall, resulting in a high F1 score. This suggests that ElderNet is well-suited for daily living data, capable of identifying most existing gait sequences (i.e., high recall) with high confidence (i.e., high precision).

ElderNet exhibited stable and high performance across all different cohorts (recall Table 2). Importantly, it performed relatively well on the PFF cohort, patients with poorer mobility as indicated by low scores on the SPPB, where other state-of-the-art algorithms encountered difficulties. In this study, participants in the PFF cohort were, on average, 132.1 days (approximately19 weeks) post-surgery and had a mean SPPB score of 5.9. This aligns with prior studies that reported SPPB scores ranging from 4 to 8 approximately 12 weeks post-hip fracture^44,45. Previous studies have reported lower performance in the PFF cohort, attributing it to several factors that significantly impact the accuracy of gait detection algorithm^25,46. Patients with PFF often demonstrate altered gait patterns due to pain, muscle weakness, and impaired mobility. Additionally, their gait may be asymmetric, making it challenging for algorithms to identify regular gait patterns. Therefore, ElderNet's success in identifying gait sequences in this cohort highlights its potential to detecting gait even in individuals with relatively impaired gait.

Gait detection algorithms often lack labeled data from daily living datasets, particularly for older adults and individuals with gait impairments. This scarcity of labeled data prevents the algorithms from being optimized for these populations, whose gait signals can be diverse and abnormal. Here, an SSL method was utilized to address this gap. First, a pre-trained model trained on the UK Biobank data was leveraged. The UK Biobank dataset consists of 100,000 participants who wore a wrist-worn accelerometer in their daily lives, making it the largest dataset of its kind. Due to its size, we anticipated benefits from incorporating this pre-trained model into our SSL phase. Indeed, utilizing this pre-trained model led to a higher F1 score (82.59) than training the SSL model from scratch (F1 score of 77.15, Supplementary Table S1).

Our objective was to develop a gait detection algorithm tailored for older adults, aiming to bridge the current accuracy gap observed in algorithms designed for this population²⁴. While the UK Biobank dataset included a large number of older adults, its participants were recruited in the age range of 45â69, with a mean age of 62 for the visits that involved wearing the wrist accelerometer. To address this limitation, we leveraged the MAP dataset with a mean age of 83.6 years old (range 62â103) and more than 1000 participants. We found that integrating the MAP data into our combined model enhanced its overall performance (Supplementary Table S2). This improvement may be attributed to the fact that the extensive MAP data used to train ElderNet better represented the characteristics of the target population i.e., older adults that were also reflected in the test set Mobilise-D data.

Two different SSL approaches were explored, namely MTL and SimCLR. Overall, both methods yielded similar performance, with a slight advantage favoring the MTL results, but with no significant difference (Table 3). These findings are consistent with a previously published paper that observed similar results for SimCLR and MTL in human activity recognition tasks using acceleration data from the wrist³⁵. Finally, we compared ElderNet with its supervised counterpart (Table 4). Remarkably, our model exhibited superior performance compared to its supervised counterpart, underscores the potential of leveraging large unlabeled data to learn feature representations of the data.

In this study, the Mobilise-D data was utilized for the fine-tuning phase, leveraging its unique characteristics. Firstly, the dataset incorporates a robust reference system, the INDIP system, whose accuracy has been previously validated against an optical motion capture system. The results showed excellent absolute agreement (ICCâ>â0.95) within a laboratory setting^47,48. Although the validation performance was lower in simulated daily activity tests, it was still relatively high, with an ICCâ>â0.86 for all cohorts except the PFF cohort, which had an ICC of 0.76. This establishes the INDIP system as a reliable method for obtaining reference data in real-world environments. Moreover, the Mobilise-D dataset contains daily living data from older adult populations, particularly those with specific medical conditions that affect mobility²⁵. While we acknowledge that the 2.5-h assessment used in Mobilise-D data may not fully capture the complete variability of real-world walking, this dataset remains one of the largest available with comprehensive gait and non-gait reference information across various disease indications with labels.

Notably, the Mobilise-D cohort includes older adults who utilize walking aids, exhibiting abnormal gait signals from the wrist accelerometer, thereby complicating gait detection²⁵. However, the test set in our study included only few participants who used walking aids during the recordings, limiting our ability to draw precise conclusions about gait detection stratified by walking aid usage. Future work should focus on exploring this aspect more deeply to understand the applicability of ElderNet in detecting gait patterns among older adults who use walking aids.

The establishment of ElderNet sets the stage for subsequent studies aimed at extracting meaningful digital mobility outcomes related to gait quantity and quality from the identified gait sequences^16,49. Gait measures have already been shown to serve as potential biomarkers for age-related health outcomes^5,50. Notably, gait speed has been shown to be associated with survival rates in older adults⁵¹. A recent study has demonstrated that using a simple model based solely on mean acceleration data can facilitate the prodromal diagnosis of Parkinson's disease⁵². We hypothesize that incorporating higher-level gait measures into such models can augment their predictive capabilities, leading to better identification of multiple neurological conditions that manifest with gait impairments.

It is important to highlight that we standardized the sampling rate of all datasets to 30 Hz to align with the frequency used in the pre-trained UK Biobank model. This relatively low sampling rate allowed for the efficient use of long-duration recordings. Exploring the ramifications of using different sampling rates should be addressed in future work. While the MAP data utilized in the SSL phase and the participants from the Mobilise-D data shared similarities in their emphasis on older adults, there were notable differences between them. Particularly, the average age of the MAP is higher (83.6 years) than that of the Mobilise-D data (71.9 years). Additionally, the Mobilise-D dataset predominantly includes participants with specific medical conditions, unlike the MAP data which is not exclusively focused on populations with diseases. We attempted to address this by standardizing both datasets (MAP and Mobilise-D) using a zero-mean unit-variance whitening³⁵. However, we observed that standardizing the MAP data, but not the Mobilise-D data, resulted in improved outcomes (Supplementary Fig. S1).

The data was segmented into non-overlapping 10-s windows, both in the SSL and fine-tuning steps, to align with the UK Biobank pre-trained model, which utilizes the same window size. Consequently, we defined windows containing 5 s or more as gait windows in our labeled dataset, omitting gait sequences shorter than five seconds. However, this approach can lead to an underestimation of the number of gait sequences that occur in daily living. A potential consequence of this approach could be the estimated daily walking duration, as observed in the construct validity step (recall Fig. 4), which was found to be slightly lower than reported in the literature⁵³. To address this issue, we explored the use of dense labeling, involving a shift to per-sample labels and outputs in the fine-tuning model. Despite this modification, the model's performance was found to be lower compared to using window-based labeling, and there was no meaningful change observed in the estimated daily walking time (Supplementary Table S3, Supplementary Fig. S2). This suggests that the alternative dense labeling strategy does not provide a significant improvement in capturing daily walking patterns.

We evaluated ElderNet's performance based on the length of gait sequences, specifically comparing sequences shorter than 30 s to those longer than 30 s (see supplementary Table S4). The results show that ElderNet performs better on longer sequences (>â30 s), particularly in terms of precision. For shorter sequences, the precision was 76.28%, whereas for long sequences, it was 100%, indicating no false positives for sequences longer than 30 s. These findings are consistent with previous studies that demonstrated higher gait detection performance in longer sequences²⁹. This could be due to the higher stability of the acceleration signal within a window (10 s) during longer activities. Since many daily living gait bouts are short, future work should consider ways of improving gait detection performance for short walking bouts.

Conclusions

This study introduced ElderNet, a novel gait detection model developed and validated for older adults with and without known health conditions that can affect gait. The model demonstrated high performance in accurately identifying real-world gait sequences extracted from wrist recordings. When applied to unlabeled daily living data, ElderNet successfully revealed differences between different clinical groups supporting further clinical testing of its efficacy. Given that many older adults experience gait impairments, a reliable system for gait quantification is crucial for obtaining a comprehensive characterization of gait function remotely during daily living. ElderNet addresses that need.

Methods

This study was composed of four stages:

1.
Self-supervised learning: training an SSL model on a large amount of unlabeled activity data to learn the feature representation of daily living acceleration data.
2.
Fine-tuning: utilizing the model from the SSL step for training a supervised gait detection system (ElderNet) using labeled data.
3.
Gait Detection Test Phase: comparing the results of the gait detection model with 2 state-of-the-art algorithms on an independent test set.
4.
Exploring construct validity: applying ElderNet on another unseen dataset to examine the potential of gait-based analysis for identifying differences between cohorts of different clinical characteristics.

Preprocessing

To maintain uniformity in comparison with state-of-the-art algorithms, we standardized the acceleration data across the various cohorts by resampling to a 30 Hz resolution and dividing the signals into 10-s non-overlap windows, following a methodology similar to the UK Biobank study^38,39. We considered the window as a gait window only when half or more of it was labeled as gait. Given that the typical gait frequency is less than 10 Hz, the 30 Hz sampling rate surpasses the Nyquist frequency, preventing any loss of essential signal information.

Stage 1: self-supervised learning

Participants and wearable sensors

Participants were community-dwelling older adults enrolled in an ongoing cohort study of chronic conditions of aging, known as Rush Memory and Aging Project^40,41,42. A total of 1117 participants aged between 61 and 103 years (mean 83.77âÂ±â7.37 SD) (76% female) participated in the study. The dataset was divided into two sets: 85% of the data (Nâ=â950, mean ageâ=â83.6âÂ±â7.3 years, 76% female) was utilized for the SSL model training, while the remaining 15% (Nâ=â167, mean ageâ=â84.2âÂ±â7.6 years, 80% female) was reserved for construct validity step (see the construct validity section). Written informed consent was obtained, and the study was conducted by the latest version of the Declaration of Helsinki and was approved by Rush University Medical Center Institutional Review Board.

Participants wore the GENEActiv device (Activinsights Ltd.; Cambridgeshire, UK), a triaxial accelerometer, on their non-dominant wrist for 24 h/day for up to ten consecutive days. Acceleration data were sampled at either 40 Hz or 60 Hz, depending on the time of recording. Specifically, data recorded in the first half of 2018 were sampled at 60 Hz, while data recorded from the second half of 2018 onwards were sampled at 40 Hz. The device had a range ofâÂ±â8 gravitational acceleration units (g).

Self-supervised approaches

Typically, SSL models consist of a main trunk, usually a convolutional neural network, referred to as a feature extractor, which produces a vector containing feature representations. The feature vector is then adjusted to a different dimension to match the âpretextâ task associated with the chosen SSL approach. In this study, we investigated two SSL approaches, namely MTL and contrastive learning (SimCLR). We selected these approaches based on their demonstrated superior performance in downstream human activity recognition tasks, as identified through an extensive exploration of various SSL approaches using wearable sensors³⁵.

In the MTL approach, each acceleration window undergoes data augmentation, where the objective of the model is to predict the augmentation of the signal (pretext task). Following the methodology of Yuan et al.³⁸, 4 distinct augmentations were employed: (1) Reversing the signal. (2) Permutation of different segments of the window, with each segment comprising 10 samples. (3) Time warping, which alters arbitrary segments of the signal by stretching and compressing them. (4) Scaling each of the acceleration axes with a random factor. Each window has a random probability of undergoing each of the augmentations, and the model predicts whether the window underwent the augmentation, resulting in four binary outputs. The modelâs loss is calculated using the cross-entropy function for all four augmentations and then averaged to produce the final loss.

The SimCLR contrastive learning method also employs data augmentations. In SimCLR, each window undergoes two augmentations, resulting in two distinct views of the same window. Views originating from the same source window are considered âpositiveâ pairs, while views stemming from different sources are considered ânegativeâ pairs. For instance, if we initially have N windows of acceleration signal, the transformation yields 2N views of the windows. Thus, for every positive pair of windows, there are 2N-2 negatives. In this study, we utilized a 3D rotation transformation as the augmentation function. In this augmentation, a random axis in 3D and a random rotation angle are drawn from a uniform distribution, and the corresponding rotation is applied to the window. This can be considered as a way to simulate different sensor placements³⁴, making it especially effective for wrist accelerometers where the axis orientation frequently changes. We specifically chose this augmentation due to its demonstrated superior performance in downstream human activity recognition tasks associated with the SimCLR approach³⁴. The different views of the windows pass through the model encoder (i.e., feature extractor), resulting in an output that reflects the different windows as feature vectors. Next, a contrastive loss function is employed to calculate the relationships between pairs of vectors using cosine similarity. The objective of the loss function is to maintain proximity in the feature space for vector representations of "positive pairs" while ensuring that "negative" pairs remain distant in this space. This loss is also known as the normalized temperature-scaled cross-entropy loss (NT-Xent)³².

Model configurations

To enhance the model's performance, the incorporation of a pre-trained model as the feature extractor of the SSL model was used. Specifically, we employed a model developed by Yuan et al.³⁸, which utilized the diverse UK Biobank dataset to train an SSL model using the MTL approach. The architecture of the pre-trained model was ResNet-V2 with 18 layers. The input acceleration data underwent through the pre-trained model, resulting in an intermediate output- a vector with dimensions (1024, 1). Subsequently, we introduced additional layers on top of the pre-trained model, referred to as a model's head. The intermediate vector then traversed through these additional layers to produce the final output, suitable for the pretext task. While the weights of the pre-trained model were frozen during the training of our model, indicating they were not updated during gradient calculations, the weights of the modelâs head were updated. This modification to the pre-trained model allowed us to tailor our model to older adults using the MAP data, considering that the pre-trained UK Biobank model did not exclusively focus on older adults. We termed our combined model ElderNet. Figure 1 illustrates the pipeline of our model.

We experimented with three different versions for the model's head, each with increasing complexity: (1) Using three fully-connected layers without non-linearity between them. (2) Using the same fully-connected layers, but with ReLU non-linear activation function between the layers. (3) Utilizing the U-Net with an architecture similar to the model employed during the testing phase. Supplementary Table S5 provides more details on the models' hyperparameters and implementation.

Stage 2: fine-tuning

Participants and wearable sensors

For optimizing and evaluating algorithms for gait detection, a dataset from the Mobilise-D technical validation study was used. This multi-center observational dataset, originally aimed at validating real-world digital mobility outcomes included different patient and healthy populations. Participants were recruited at five sites: The Newcastle upon Tyne Hospitals NHS Foundation Trust, UK (Sponsor of the study) and Sheffield Teaching Hospitals NHS Foundation Trust, UK (ethics approval granted by London-Bloomsbury Research Ethics committee, 19/LO/1507); Tel Aviv Sourasky Medical Center, Israel (ethics approval granted by the Helsinki Committee, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel, 0551-19TLV), Robert Bosch Foundation for Medical Research, Germany (ethics approval granted by the ethical committee of the medical faculty of The University of TÃ¼bingen, 647/2019BO2), University of Kiel, Germany (ethics approval granted by the ethical committee of the medical faculty of Kiel University, D438/18). Informed consent was provided by all participants to take part in the study and all research was performed in accordance with the Declaration of Helsinki. A comprehensive description of the study's experimental protocol, incorporating all inclusion and exclusion criteria, can be found in³⁰.

Briefly, 112 participants across five different disease cohorts and one cohort of healthy adults were studied. The patient groups included chronic obstructive pulmonary disease, Parkinson's disease, multiple sclerosis, proximal femoral fracture, and congestive heart failure patients. We excluded the multiple sclerosis group (Nâ=â20, mean ageâ=â48.7 years) as we aimed to customize the model to older adults and the MS cohort comprises also young adults. In addition, nine participants were also excluded due to missing data, resulting in 83 participants overall used for this step. All participants gave written informed consent before participation. The participants were monitored during 2.5 h of real-world living undergoing their normal daily activities without a specific protocol. The participants were equipped with an accelerometer worn at the wrist on the non-dominant hand and a validated multi-sensor system, the INDIP as reference^30,47. The INDIP system provided annotations (i.e., labels) regarding the temporal locations of the gait sequences.

Fine-tuning procedure

The fine-tuning step involved a supervised learning procedure. The model's input comprised the Mobilise-D dataset, which contains labels indicating the temporal location of the gait sequences. We divided the Mobilise-D data into 75â25%, where 75% of the data was used for training and validation of the supervised model, as well as for assessing different model configurations, and the remaining 25% was reserved for testing the model. We selected this ratio to ensure comparable distributions between the training and test sets, ensuring that each cohort has at least three participants in the test set. The divisions were made subject-wise, ensuring that the data points belonging to a particular subject were entirely contained within one subdivision and did not get shared across other subdivisions. We utilized the trained model from the SSL step to train a gait detection model. That is, the weights learned on the extensive unlabeled data served as a robust starting point for training a supervised gait detection model. To adapt the SSL model for gait detection, we modified its last layer to function as a linear layer producing a binary output (i.e., gait/non-gait). During the fine-tuning process, we allowed the model to update all of its weights. This decision was based on prior studies that demonstrated the preference for not freezing weights in the fine-tuning procedure^38,54.

In the fine-tuning process, we again split the training set, corresponding to 75% of the entire data, into an 80â20 ratio. Eighty percent of this subset was used for training and 20% for validation. We applied five-fold cross-validation on the training set, stratified by class label and grouped by participant. An early-stopping mechanism was implemented to halt training when the loss stopped decreasing for five consecutive epochs. The cross-validation process was repeated with three different seeds, representing three different divisions of the folds, to obtain more generalizable results independent of a specific order of the data. The results from the three iterations were averaged to derive final performance metrics. The fine-tuning and performance evaluation processes were implemented for all different SSL configurations (refer to the Models Configuration section), utilizing only the training set. For each unique configuration, its performance after fine-tuning the Mobilise-D data was recorded. The configuration that yielded the best results was then selected as our model for comparison and further analysis. FigureÂ 5 illustrates the flow of this process.

Ablation studies

To further explore the influence of different components of the SSL model on its downstream performance, several ablation studies were conducted. Initially, the impact of utilizing the pre-trained weights from the UK Biobank model was investigated³⁸. For this purpose, the same architecture of the pre-trained SSL model was evaluated (i.e., ResNet) twice- once with the pre-trained weights from the UK Biobank model initialized, and once trained from random initialization on the MAP dataset.

To assess the contribution of the MAP dataset in tailoring the model to older adults, the combined network (the pre-trained model with the newly added layers) was utilized, and its performance was evaluated with and without utilizing the MAP data. This investigation allowed us to discern whether the performance difference stemmed solely from the expansion of the pre-trained model architecture (by adding the new layers) or if the use of a dataset focused on older adults, such as the MAP data, also played a role.

Stage 3: testing

The F1 score from the fine-tuning step was used to select the best model configuration. The choice of using the F1 score for model selection is based on the inherent imbalance of daily living data in terms of gait, where gait sequences are much less frequent than non-gait ones. In imbalanced datasets, the F1 score provides a more realistic and unbiased assessment of the model's performance²⁴. Model Performance was tested at the window level (i.e., comparing the prediction and the label of each window).

Model comparison

We compared the resulting ElderNet model with two state-of-the-art gait detection algorithms. The first comparison algorithm employed a U-Net architecture, developed and validated in our recent publication^24,25. The U-Net model was originally trained on healthy young adults. The second model in our comparative analysis was an SSL algorithm pre-trained on the UK Biobank dataset and subsequently fine-tuned for gait detection in healthy adults, which was referred to as the OxWalk dataset³⁹. We tested these 3 models on 25% of the Mobilise-D data, which was not used in the fine-tuning step. The performance metrics were calculated for each of the 21 participants in the test set, and then averaged to obtain the final performance.

Stage 4: assessing construct validity

As a preliminary exploration of the clinical potential of the gait-detection information introduced by ElderNet, we applied the model to an unseen portion of the MAP dataset, ensuring that participants used in this step were distinct from those involved in the SSL phase. A total of 167 participants were assigned to this stage. To accurately analyze participant activity, we excluded time segments indicating participants who were not wearing the device. These non-wear periods were defined as consistent low movement (low STD) across all acceleration axes for at least 30 minutes^55,56. For each participant, we extracted data from four full (24-h-long) days, as a recent study has shown that this duration provides reliable gait quantity measures⁵⁷. Ten participants were excluded, due to an insufficient amount of activity (less than 96 h of data), resulting in a final number of 157 participants who were included in this stage. ElderNet was applied to the four days of data to identify the gait sequences. Subsequently, for each day, we summed the number of gait sequences and defined the median value as the daily walking time.

The Mobilise-D test set was not included for the construct validity investigation due to the relatively small sample size within each clinical cohort (only 3â5 participants per cohort). This small sample size would make it difficult to reliably explore associations between gait duration and disease severity. Using the training set (~â75% of the Mobilise-D dataset) for construct validity is also not appropriate because the model was directly trained on this data for gait detection. Therefore, we used the larger and unseen MAP dataset for investigating construct validity.

To examine the construct validity of ElderNet, differences in daily walking time among participants belonging to different clinical cohorts were investigated. Specifically, we examined 2 motor-related clinical variables: the mobility disability score, assessed using the Rosow-Breslau scale⁵⁸, and the number of parkinsonism signs⁵⁹. The modified version of the motor portion of the United Parkinson's Disease Rating Scale (UPDRS III) was used to assess the presence of four Parkinsonian signs: bradykinesia, gait, rigidity, and tremor⁶⁰. Participants were categorized into three cohorts (no sign, 1 sign, 2+âsigns). We hypothesized that daily walking time would differ between these cohorts, with individuals without mobility disability spending more time walking than those with mobility disabilities²³. Additionally, we expected individuals without Parkinsonian signs to spend more time walking than those with 1 or more Parkinsonian signs⁶¹.

Statistical analysis

The KruskalâWallis test was performed to identify significant differences between ElderNet and state-of-the-art algorithms across the test performance metrics. Dunn's post-hoc analysis was applied to reveal the sources of difference among the models. In the context of construct validity, the KruskalâWallis test assessed differences in daily walking durations across cohorts with distinct demographic and clinical statuses. The corresponding Dunn's post-hoc analysis was then used to pinpoint the sources of variation in walking durations. To address multiple comparisons in all instances, the Bonferroni correction was applied. The KruskalâWallis test and Dunn's post-hoc analysis were implemented using the 'kruskal' function from the scipy.stats library and the posthoc_dunn function from the scikit_posthocs library, respectively. Partial correlation analyses were performed to adjust for age, sex, and BMI, using IBM SPSS Statistics software (Version 29.0.0.0).

Data availability

Raw data of a representative participant (dataset YAR, participant 0002) can be found on Zenodo: https://doi.org/https://doi.org/10.5281/zenodo.7185429. The full data set will be made available by the Mobilise-D consortium after June 2024. All MAP data included in these analyses are available via the Rush Alzheimerâs Disease Center Research Resource Sharing Hub, which can be found at www.radc.rush.edu (accessed on 17 April 2023). It has descriptions of the studies and available data. Any qualified investigator can create an account and submit requests for deidentified data.

Code availability

The code supporting this study is accessible on GitHub at the following link: https://github.com/yonbrand/ElderNet.

References

Cruz-Jimenez, M. Normal changes in gait and mobility problems in the elderly. Phys. Med. Rehabil. Clin. N. Am. 28, 713â725 (2017).
ArticleÂ PubMedÂ Google ScholarÂ
Freiberger, E., Sieber, C. C. & Kob, R. Mobility in older community-dwelling persons: A narrative review. Front. Physiol. 11, 881 (2020).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Osoba, M. Y., Rao, A. K., Agrawal, S. K. & Lalwani, A. K. Balance and gait in the elderly: A contemporary review. Laryngosc. Investig. Otolaryngol. 4, 143â153 (2019).
ArticleÂ Google ScholarÂ
Mirelman, A. et al. Executive function and falls in older adults: New findings from a five-year prospective study link fall risk to cognition. PLoS One 7, e40297 (2012).
ArticleÂ ADSÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Brodie, M. A. et al. Gait as a biomarker? Accelerometers reveal that reduced movement quality while walking is associated with Parkinsonâs disease, ageing and fall risk. Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. 2014, 5968â5971 (2014).
PubMedÂ Google ScholarÂ
Buchman, A. S. et al. Different combinations of mobility metrics derived from a wearable sensor are associated with distinct health outcomes in older adults. J. Gerontol. A Biol. Sci. Med. Sci. 75, 1176â1183 (2020).
ArticleÂ PubMedÂ Google ScholarÂ
Hillel, I. et al. Is every-day walking in older adults more analogous to dual-task walking or to usual walking? Elucidating the gaps between gait performance in the lab and during 24/7 monitoring. Eur. Rev. Aging Phys. Act. 16, 1â12 (2019).
ArticleÂ Google ScholarÂ
Warmerdam, E. et al. Long-term unsupervised mobility assessment in movement disorders. Lancet Neurol. 19, 462â470 (2020).
ArticleÂ PubMedÂ Google ScholarÂ
Feehan, L. M. et al. Accuracy of Fitbit devices: Systematic review and narrative syntheses of quantitative data. JMIR Mhealth Uhealth 6, e10527 (2018).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Brodie, M. A. et al. Big data vs accurate data in health research: Large-scale physical activity monitoring, smartphones, wearable devices and risk of unconscious bias. Med. Hypotheses 119, 32â36 (2018).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
von Coelln, R. et al. Quantitative mobility metrics from a wearable sensor predict incident parkinsonism in older adults. Parkinsonism Relat. Disord. 65, 190â196 (2019).
ArticleÂ Google ScholarÂ
Wohlrab, M. et al. The value of walking: A systematic review on mobility and healthcare costs. Eur. Rev. Aging Phys. Act. 19(1), 31 (2022).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Bonci, T., Keogh, A., Din, S. D., Scott, K. & MazzÃ , C. An objective methodology for the selection of a device for continuous mobility assessment. Sensors 20, 6509 (2020).
ArticleÂ ADSÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Kirk, C. et al. Mobilise-D insights to estimate real-world walking speed in multiple conditions with a wearable device. Sci. Rep. 14, 1754 (2024).
ArticleÂ ADSÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Doherty, A. et al. Large scale population assessment of physical activity using wrist worn accelerometers: The UK Biobank Study. PLoS One 12, e0169649 (2017).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Chan, L. L. Y., Choi, T. C. M., Lord, S. R. & Brodie, M. A. Development and large-scale validation of the watch walk wrist-worn digital gait biomarkers. Sci. Rep. 12, 16211 (2022).
ArticleÂ ADSÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Leroux, A. et al. Organizing and analyzing the activity data in NHANES. Stat. Biosci. 11, 262â287 (2019).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Burq, M. et al. Virtual exam for Parkinsonâs disease enables frequent and reliable remote measurements of motor function. npj Digit. Med. 5, 65 (2022).
Lim, A. S. P., Kowgier, M., Yu, L., Buchman, A. S. & Bennett, D. A. Sleep fragmentation and the risk of incident Alzheimerâs disease and cognitive decline in older persons. Sleep 36, 1027â1032 (2013).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Cai, R. et al. Circadian disturbances and frailty risk in older adults. Nat. Commun. 14:, 7219 (2023).
ArticleÂ ADSÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Lin, W. et al. Can gait characteristics be represented by physical activity measured with wrist-worn accelerometers?. Sensors 23, 8542 (2023).
ArticleÂ ADSÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Zhao, A., Cui, E., Leroux, A., Lindquist, M. A. & Crainiceanu, C. M. Evaluating the prediction performance of objective physical activity measures for incident Parkinsonâs disease in the UK Biobank. J. Neurol. 270, 5913â5923 (2023).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Buchman, A. S. et al. Correlates of person-specific rates of change in sensor-derived physical activity metrics of daily living in the rush memory and aging project. Sensors (Basel) 23, 4152 (2023).
Brand, Y. E. et al. Gait detection from a wrist-worn sensor using machine learning methods: A daily living study in older adults and people with Parkinsonâs disease. Sensors 22, 7094 (2022).
ArticleÂ ADSÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Kluge, F. et al. Real-world gait detection using a wrist-worn inertial sensor: Validation study. JMIR Form Res. 8, e50035 (2024).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Mirelman, A. et al. Effects of aging on arm swing during gait: The role of gait speed and dual tasking. PLoS One 10, e0136043 (2015).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Keren, K. et al. Quantification of daily-living gait quantity and quality using a wrist-worn accelerometer in Huntingtonâs disease. Front. Neurol. 12 (2021).
Willetts, M., Hollowell, S., Aslett, L., Holmes, C. & Doherty, A. Statistical machine learning of sleep and physical activity phenotypes from sensor data in 96,220 UK Biobank participants. Sci. Rep. 8, 7961 (2018).
ArticleÂ ADSÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Soltani, A., Paraschiv-Ionescu, A., Dejnabadi, H., Marques-Vidal, P. & Aminian, K. Real-world gait bout detection using a wrist sensor: An unsupervised real-life validation. IEEE Access 8, 102883â102896 (2020).
ArticleÂ Google ScholarÂ
MazzÃ , C. et al. Technical validation of real-world monitoring of gait: A multicentric observational study. BMJ Open 11 (2021).
Ronneberger, O., Fischer, P. & Brox, T. U-Net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted InterventionâMICCAI 2015. 234â241 (Springer, 2015).
Chen, T., Kornblith, S., Norouzi, M. & Hinton, G. A simple framework for contrastive learning of visual representations. In Proceedings of the International Conference on Machine Learning. 1597â1607 (PMLR, 2020).
Saeed, A., Ozcelebi, T. & Lukkien, J. Multi-task self-supervised learning for human activity detection. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 3, 1â30 (2019).
ArticleÂ Google ScholarÂ
Tang, C. I., Perez-Pozuelo, I., Spathis, D. & Mascolo, C. Exploring Contrastive Learning in Human Activity Recognition for Healthcare. Preprint at https://arxiv.org/abs/2011.11542 (2020).
Haresamudram, H., Essa, I. & PlÃ¶tz, T. Assessing the state of self-supervised human activity recognition using wearables. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 6 (2022).
Haresamudram, H., Essa, I. & PlÃ¶tz, T. Contrastive predictive coding for human activity recognition. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 5 (2021).
Sridhar, N. & Myers, L. Human Activity Recognition on Wrist-Worn Accelerometers Using Self-Supervised Neural Networks. Preprint at https://arxiv.org/abs/2112.12272 (2021).
Yuan, H. et al. Self-supervised learning for human activity recognition using 700,000 person-days of wearable data. npj Digit. Med. 7, 91 (2024).
Small, S. R. et al. Development and validation of a machine learning wrist-worn step detection algorithm with deployment in the UK Biobank. Preprint at https://doi.org/10.1101/2023.02.20.23285750 (2023).
Bennett, A., Schneider, A., Arvanitakis, J. & Wilson, S. Overview and findings from the religious orders study. Curr. Alzheimer Res. 9, 628â645 (2012).
ArticleÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Bennett, D. A. et al. Religious orders study and rush memory and aging project. J. Alzheimerâs Dis. 64, S161âS189 (2018).
ArticleÂ Google ScholarÂ
Bennett, A. et al. Overview and findings from the rush Memory and Aging Project. Curr. Alzheimer Res. 9, 646â663 (2012).
ArticleÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Guralnik, J. M., Ferrucci, L., Simonsick, E. M., Salive, M. E. & Wallace, R. B. Lower-extremity function in persons over the age of 70 years as a predictor of subsequent disability. N. Engl. J. Med. 332, 556â562 (1995).
ArticleÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Latham, N. K. et al. Performance-based or self-report measures of physical function: Which should be used in clinical trials of hip fracture patients?. Arch. Phys. Med. Rehabil. 89, 2146â2155 (2008).
ArticleÂ PubMedÂ Google ScholarÂ
Koudouna, S. et al. Rehabilitation prognostic factors following hip fractures associated with patientâs pre-fracture mobility and functional ability: A prospective observation study. Life 13, 1748 (2023).
ArticleÂ ADSÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
MicÃ³-Amigo, M. E. et al. Assessing real-world gait with digital technology? Validation, insights and recommendations from the Mobilise-D consortium. J. NeuroEng. Rehabil. 20, 78. https://doi.org/10.1186/s12984-023-01108-5 (2023).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Salis, F. et al. A multi-sensor wearable system for the assessment of diseased gait in real-world conditions. Front. Bioeng. Biotechnol. 11 (2023).
Salis, F. et al. A method for gait events detection based on low spatial resolution pressure insoles data. J. Biomech. 127, 110687 (2021).
ArticleÂ CASÂ PubMedÂ Google ScholarÂ
Soltani, A., Dejnabadi, H., Savary, M. & Aminian, K. Real-world gait speed estimation using wrist sensor: A personalized approach. IEEE J. Biomed. Health Inform. 24, 658â668 (2020).
ArticleÂ PubMedÂ Google ScholarÂ
Hausdorff, J. M. et al. Everyday stepping quantity and quality among older adult fallers with and without mild cognitive impairment: Initial evidence for new motor markers of cognitive deficits?. J. Gerontol. A Biol. Sci. Med. Sci. 73, 1078â1082 (2018).
ArticleÂ PubMedÂ Google ScholarÂ
Studenski, S. Gait speed and survival in older adults. JAMA 305, 50â58 (2011).
ArticleÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Schalkamp, A. K., Peall, K. J., Harrison, N. A. & Sandor, C. Wearable movement-tracking data identify Parkinsonâs disease years before clinical diagnosis. Nat. Med. 29, 2048â2056 (2023).
Del Din, S. et al. Falls risk in relation to activity exposure in high-risk older adults. J. Gerontol. Ser. A Biol. Sci. Med. Sci. 75, 1198â1205 (2020).
ArticleÂ Google ScholarÂ
Fortes Rey, V., Nshimyimana, D. & Lukowicz, P. Donât freeze: Finetune encoders for better self-supervised HAR. In Adjunct Proceedings of the 2023 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2023 ACM International Symposium on Wearable Computing. 195â196 (2023).
van Hees, V. T. et al. Estimation of daily energy expenditure in pregnant and non-pregnant women using a wrist-worn tri-axial accelerometer. PLoS One 6, e22922 (2011).
ArticleÂ ADSÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Sabia, S. et al. Association between questionnaire- and accelerometer-assessed physical activity: The role of sociodemographic factors. Am. J. Epidemiol. 179, 781â790 (2014).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Bianchini, E. et al. Four days are enough to provide a reliable daily step count in mild to moderate Parkinsonâs disease through a commercial smartwatch. Sensors 23, 8971 (2023).
ArticleÂ ADSÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Buchman, A. S., Boyle, P. A., Leurgans, S. E., Evans, D. A. & Bennett, D. A. Pulmonary function, muscle strength, and incident mobility disability in elders. Proc. Am. Thorac. Soc. 6, 581â587 (2009).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Buchman, A. S. et al. Incident parkinsonism in older adults without Parkinson disease. Neurology 87, 1036â1044 (2016).
ArticleÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Mhyre, T. R., Boyd, J. T., Hamill, R. W. & Maguire-Zeiss, K. A. Parkinsonâs disease. Subcell Biochem. 65, 389â455 (2012).
ArticleÂ CASÂ PubMedÂ PubMed CentralÂ Google ScholarÂ
Buchman, A. S. et al. Associations between quantitative mobility measures derived from components of conventional mobility testing and Parkinsonian gait in older adults. PLoS One 9, e86262 (2014).
ArticleÂ ADSÂ PubMedÂ PubMed CentralÂ Google ScholarÂ

Download references

Acknowledgements

The authors express their gratitude to the entire Mobilise-D Work Package 2 team for ongoing discussions and valuable insights. Special appreciation is extended to the study participants for their dedicated time and enthusiastic involvement, particularly amidst the challenges of the COVID-19 pandemic. Furthermore, heartfelt thanks go to all contributors of data from the RUSH Memory and Aging Project. The authors also acknowledge the supportive staff at the Rush Alzheimerâs Disease Center.

Funding

This work was supported, in part, by grants from the NIH (R01AG017917; R01AG056352, R01AG79133, R01AG078256) and by the Mobilise-D project. The Mobilise-D project received funding from the Innovative Medicines Initiative 2 Joint Undertaking (JU) under grant agreement No. 820820. This JU receives support from the European Union's Horizon 2020 research and innovation program and the European Federation of Pharmaceutical Industries and Associations (EFPIA). Content in this publication reflects the authorsâ view and neither IMI nor the European Union, EFPIA, or any Associated Partners are responsible for any use that may be made of the information contained herein.

Author information

Authors and Affiliations

Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
Yonatan E. BrandÂ &Â Or Perlman
Center for the Study of Movement, Cognition and Mobility, Neurological Institute, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
Yonatan E. BrandÂ &Â Jeffrey M. Hausdorff
Biomedical Research, Novartis Pharma AG, Basel, Switzerland
Felix KlugeÂ &Â Arne Muller
Department of Electrical, Electronic and Information Engineering Guglielmo Marconi, University of Bologna, Bologna, Italy
Luca Palmerini
Health Sciences and Technologies-Interdepartmental Center for Industrial Research (CIRI-SDV), University of Bologna, Bologna, Italy
Luca Palmerini
Laboratory of Movement Analysis and Measurement, Ecole Polytechnique Federale de Lausanne, Lausanne, Switzerland
Anisoara Paraschiv-Ionescu
Robert Bosch Gesellschaft fÃ¼r Medizinische Forschung, Stuttgart, Germany
Clemens Becker
Unit Digitale Geriatrie, UniversitÃ¤tsklinikum Heidelberg, Heidelberg, Germany
Clemens Becker
Department of Electronics and Telecommunications, Politecnico di Torino, Turin, Italy
Andrea Cereatti
Department of Neurology, University Medical Center Schleswig-Holstein, Campus Kiel, Kiel, Germany
Walter Maetzler
Department of Neuroscience and Sheffield NIHR Translational Neuroscience BRC, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, UK
Basil Sharrack
Department of Neuromedicine and Movement Science, Norwegian University of Science and Technology, Trondheim, Norway
Beatrix Vereijken
Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle Upon Tyne, UK
Alison J. Yarnall,Â Lynn RochesterÂ &Â Silvia Del Din
The Newcastle Upon Tyne Hospitals NHS Foundation Trust, Newcastle Upon Tyne, UK
Alison J. YarnallÂ &Â Lynn Rochester
National Institute for Health and Care Research (NIHR) Newcastle Biomedical Research Centre (BRC), Newcastle University, The Newcastle Upon Tyne Hospitals NHS Foundation Trust, Newcastle Upon Tyne, UK
Alison J. Yarnall,Â Lynn RochesterÂ &Â Silvia Del Din
Department of Neurological Sciences, Rush Alzheimerâs Disease Center, Rush University Medical Center, Chicago, IL, USA
Aron S. Buchman
Department of Physical Therapy, Faculty of Medical and Health Sciences, Tel Aviv University, Tel Aviv, Israel
Jeffrey M. Hausdorff
Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel
Jeffrey M. HausdorffÂ &Â Or Perlman
Rush Alzheimerâs Disease Center and Department of Orthopedic Surgery , Rush University, Chicago, IL, USA
Jeffrey M. Hausdorff

Authors

Yonatan E. Brand
View author publications
You can also search for this author in PubMedÂ Google Scholar
Felix Kluge
View author publications
You can also search for this author in PubMedÂ Google Scholar
Luca Palmerini
View author publications
You can also search for this author in PubMedÂ Google Scholar
Anisoara Paraschiv-Ionescu
View author publications
You can also search for this author in PubMedÂ Google Scholar
Clemens Becker
View author publications
You can also search for this author in PubMedÂ Google Scholar
Andrea Cereatti
View author publications
You can also search for this author in PubMedÂ Google Scholar
Walter Maetzler
View author publications
You can also search for this author in PubMedÂ Google Scholar
Basil Sharrack
View author publications
You can also search for this author in PubMedÂ Google Scholar
Beatrix Vereijken
View author publications
You can also search for this author in PubMedÂ Google Scholar
Alison J. Yarnall
View author publications
You can also search for this author in PubMedÂ Google Scholar
Lynn Rochester
View author publications
You can also search for this author in PubMedÂ Google Scholar
Silvia Del Din
View author publications
You can also search for this author in PubMedÂ Google Scholar
Arne Muller
View author publications
You can also search for this author in PubMedÂ Google Scholar
Aron S. Buchman
View author publications
You can also search for this author in PubMedÂ Google Scholar
Jeffrey M. Hausdorff
View author publications
You can also search for this author in PubMedÂ Google Scholar
Or Perlman
View author publications
You can also search for this author in PubMedÂ Google Scholar

Contributions

Participant recruitment and clinical oversight: B.S, W.M, C.B, J.M.H, A.J.Y, L.R. Algorithm development: Y.E.B, O.P, J.M.H. Data analysis, statistical analysis: Y.E.B. Figures and tables preparation: Y.E.B. Data interpretation: Y.E.B, J.M.H, O.P. Drafting of the manuscript: Y.E.B, J.M.H, O.P. Intellectual contribution: Y.E.B, F.K, L.P, K.A, C.B, A.C, W.M, B.S, B.V, A.J.Y, L.R, S.DD, A.M, A.S.B, J.M.H, O.P. All authors have provided critical intellectual input during the revision of the manuscript. All authors have reviewed the manuscript and approved the submitted version.

Corresponding author

Correspondence to Or Perlman.

Ethics declarations

Competing interests

A. Mueller and F. Kluge are employees of and may hold stock in Novartis. L. Palmerini is co-founder and own share of mHealth Technologies (https://mhealthtechnologies.it/). C. Becker are consultants of Philipps Healthcare, Bosch Healthcare, Eli Lilly, Gait-up. S. Del Din and J. Hausdorff report consultancy activity with Hoffmann-La Roche Ltd. outside of this study. The other authors have no competing interests.

Ethics approval and consent to participate

For the Rush Memory and Aging project, all participants provided written informed consent before participation. The study was approved by the Rush University Medical Center Institutional Review Board and conducted in accordance with the Declaration of Helsinki. For the Mobilise-D study, ethical approval was obtained at the individual sites (London-Bloomsbury Research Ethics Committee, 19/LO/1507; Helsinki Committee, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel, 0551-19TLV; ethical committee of the medical faculty of The University of TÃ¼bingen, 647/2019BO2; ethical committee of the medical faculty of Kiel University, D438/18) and all participants gave informed consent before participating.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Brand, Y.E., Kluge, F., Palmerini, L. et al. Self-supervised learning of wrist-worn daily living accelerometer data improves the automated detection of gait in older adults. Sci Rep 14, 20854 (2024). https://doi.org/10.1038/s41598-024-71491-3

Download citation

Received: 14 March 2024
Accepted: 28 August 2024
Published: 06 September 2024
DOI: https://doi.org/10.1038/s41598-024-71491-3

Subjects

Abstract

Similar content being viewed by others

Self-supervised learning for human activity recognition using 700,000 person-days of wearable data

Real-world gait speed estimation, frailty and handgrip strength: a cohort-based study

A machine learning contest enhances automated freezing of gait detection and reveals time-of-day effects

Introduction

Results

Performance of the gait detection algorithm

Model selection: the effect of the SSL approach

The impact of self-supervised learning

Exploring construct validity

Discussion

Conclusions

Methods

Preprocessing

Stage 1: self-supervised learning

Participants and wearable sensors

Self-supervised approaches

Model configurations

Stage 2: fine-tuning

Participants and wearable sensors

Fine-tuning procedure

Ablation studies

Stage 3: testing

Model comparison

Stage 4: assessing construct validity

Statistical analysis

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethics approval and consent to participate

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links