Identifying patterns in amyotrophic lateral sclerosis progression from sparse longitudinal data

Ramamoorthy, Divya; Severson, Kristen; Ghosh, Soumya; Sachs, Karen; Glass, Jonathan D.; Fournier, Christina N.; Herrington, Todd M.; Berry, James D.; Ng, Kenney; Fraenkel, Ernest

doi:10.1038/s43588-022-00299-w

Download PDF

Article
Open access
Published: 08 September 2022

Identifying patterns in amyotrophic lateral sclerosis progression from sparse longitudinal data

Nature Computational Science volumeÂ 2,Â pages 605â616 (2022)Cite this article

11k Accesses
16 Citations
40 Altmetric
Metrics details

Subjects

Abstract

The clinical presentation of amyotrophic lateral sclerosis (ALS), a fatal neurodegenerative disease, varies widely across patients, making it challenging to determine if potential therapeutics slow progression. We sought to determine whether there were common patterns of disease progression that could aid in the design and analysis of clinical trials. We developed an approach based on a mixture of Gaussian processes to identify clusters of patients sharing similar disease progression patterns, modeling their average trajectories and the variability in each cluster. We show that ALS progression is frequently nonlinear, with periods of stable disease preceded or followed by rapid decline. We also show that our approach can be extended to Alzheimerâs and Parkinsonâs diseases. Our results advance the characterization of disease progression of ALS and provide a flexible modeling approach that can be applied to other progressive diseases.

Temporal stratification of amyotrophic lateral sclerosis patients using disease progression patterns

Article Open access 08 July 2024

Data-driven modelling of neurodegenerative disease progression: thinking outside the black box

Article 08 January 2024

Beyond the usual suspects: multi-factorial computational models in the search for neurodegenerative disease mechanisms

Article Open access 23 September 2024

Main

Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease with a complex pathophysiology resulting in heterogeneous symptoms and progression^1,2. The median length of survival from symptom onset is approximately three years; however, some individuals survive decades with the disease³. Longitudinal functional clinical metrics have gained widespread use as a tool to measure ALS progression. These clinical metrics, such as the Revised ALS Functional Rating Scale (ALSFRS-R), are a proxy for disease progression. However, they are imperfect measures. Individuals are often evaluated at different stages of their disease, making comparisons across patients challenging⁴. While people with ALS invariably decline over time, some of the measures can increase for short durations or reach plateaus⁵. Additionally, the metrics of clinical disease progression are based on subjective assessments of patientsâ daily functioning, such as the ability to climb stairs ânormallyâ or âslowlyâ, which introduces a potential source of error⁶. Furthermore, interventions such as percutaneous endoscopic gastronomy and non-invasive ventilation can affect these clinical metrics. The interconnectedness of function and the variability in the measurement of these clinical metrics present challenges in modeling ALS progression.

The heterogeneity of ALS makes it difficult to determine if a disease-modifying therapy is effectively slowing progression^7,8. Traditional modeling approaches have dealt with the complexities in ALS by first assuming that ALS outcome measures, particularly the ALSFRS-R, progress in a linear fashion^9,10,11. Many ALS clinical trials use changes in the linear slope of ALSFRS-R over time or changes in ALSFRS-R from baseline as primary endpoints^12,13,14. For example, edaravone was approved in the USA on the basis of a 2.5 ALSFRS-R point difference in decline between the treatment and control arms over 6âmonths (ref. ¹⁴), and the estimated effect from the ALS sodium phenylbutyrateâtaurursodiol clinical trial was a change in slope of 0.42 points per month¹². Large global crowdsourcing analyses designed to produce better models for clinical trials have also assumed a linear decline in ALSFRS-R^15,16.

Despite the widespread use of linear models in predicting patient progression, there is evidence that ALS progression can be nonlinear and can vary across disease severity^17,18,19. Several approaches have been proposed to deal with nonlinearity in the context of clinical trials. For example, nonlinear parametric models, which assume a particular shape of the trajectory in advance, have been used to capture these complexities. One notable example is the D50 model, which represents the progression of ALS with a two-parameter sigmoid^20,21. However, by requiring a particular parametric form, these models are restricted to identifying prespecified trajectory shapes^{17,18,22,23,24}, which may not represent the actual heterogeneity in disease progression patterns. Models such as the mixed model for repeated measures can be used in conjunction with unstructured time and covariance structures that reduce reliance on parametric assumptions; however, these models can suffer from being statistically underpowered, especially in clinical cohorts with sparse longitudinal data²⁵.

Less attention has been paid to a more fundamental question: are there common patterns of clinical progression in ALS? If such patterns exist, they could be used to improve patient stratification, which can impact clinical trial planning^8,26. Defining distinct clusters can also enable research aiming to identify disease mechanisms that contribute to modulating disease progression in ALS. Heterogeneity in the clinical progression of ALS may reflect environmental or genetic modifiers of disease, and robustly characterizing heterogeneity in progression patterns can aid in the search for these modifiers. Current clustering efforts for disease progression patterns are limited by requiring parametric assumptions in their cluster assignments or outcome measures^15,19,24. There is a need for computational methods that can flexibly identify patient clusters with minimal assumptions.

To model the full complexity of ALS progression, we turned to computational methods that are more flexible than traditional parametric models. We propose a framework for aggregating patient trajectories into clusters using Gaussian processes^27,28 and determining the number of clusters using a Dirichlet process mixture model^29,30. We also modify this mixture of Gaussian processes model to incorporate prior clinical knowledge. For example, since patients with ALS are expected to decline over time, we incorporate monotonic biases into our model, which encourage declining trajectories to be identified but also allow for the detection of patterns that do not fit prior expectations.

We show that this method can improve the characterization of ALS progression patterns, identifying clusters of participants with similar trajectories from longitudinal clinical scores. The nonlinear progression patterns are robust to sparse data and consistent across study populations, and correspond to survival outcomes. While we focus on clinical ALS outcome measures, we also demonstrate that the method can analyze Alzheimerâs and Parkinsonâs data. Our results provide an advance in modeling progression patterns in ALS and other diseases.

Results

Modeling approach

We developed a mixture of Gaussian processes model with strong inductive bias towards monotonic decline (MoGP) to characterize patterns in disease progression (Extended Data Fig. 1). The model leverages two Bayesian non-parametric methods: Gaussian process regression^27,28 and Dirichlet process clustering^29,30. Gaussian process regression does not require the specification of a particular functional form, but instead learns trajectories from data, enabling the model to capture a wide variety of possibly nonlinear progression patterns. Dirichlet process clustering can be used to identify clusters from data when it is difficult to specify an expected number of clusters a priori. The use of Dirichlet processes is motivated by the uncertainty in the existence and number of ALS progression subtypes and avoids restrictive modeling assumptions.

Our approach includes notable improvements over previous MoGP models^28,31,32, in that we incorporate clinical knowledge relevant to ALS progression. Specifically, we implement a monotonic inductive bias as well as clinically informed parameter priors for Gaussian process regression and Dirichlet process clustering components. Each component of the model is discussed in more detail in Methods.

Elucidating ALS disease progression trajectory patterns

We sought longitudinal ALSFRS-R scores from a wide range of sources. The model was evaluated on five study populations (Supplementary Tables 1 and 2). Three observational studies were used, Answer ALS (AALS) (https://www.answerals.org/), the Emory ALS Clinic database (EMORY)³ and the ALS/MND Natural History Consortium database (NATHIST)³³, and two overlapping clinical trial datasets, the Pooled Resource Open-Access ALS Clinical Trials (PRO-ACT)^34,35 and the Clinical Trial of Ceftriaxone in ALS (CEFT)^13,36.

To characterize patterns in ALS progression, we first applied MoGP to PRO-ACT, which is the largest publicly available dataset of ALSFRS-R scores (Fig. 1 and Extended Data Fig. 2). The analysis identified diverse clusters, including some clusters identifying slow-progression populations (Fig. 1n) and others capturing faster-progression groups (Fig. 1r).

**Fig. 1: Identifying trajectory clusters with varying patterns of decline, using a mixture of Gaussian processes model.**

Notably, in many cases, the patterns of decline were highly nonlinear, with some following sigmoidal (Fig. 1d,k), convex (Fig. 1m,u,v) and concave (Fig. 1o,q) curves. Linear patterns were also detected in some clusters (Fig. 1g,j,t). To estimate how well a linear model fit in the first year generalizes to subsequent timepoints, we computed the slope of the mean function of each cluster in the first year after symptom onset (âfirst-year slopeâ). This is relevant to previous studies that utilize a first-year slope calculation, including previous ALS DREAM Challenges^15,16. While this first-year slope closely reflects actual trajectories for the linear clusters (Fig. 1g,j,t), for others it is either an overestimation (Fig. 1i,k,l,o,x) or an underestimation (Fig. 1s,v), indicating nonlinearity in the trajectory pattern. These errors in the first-year estimations can be large; for instance, it overestimates the disease trajectory in cluster K by 24.20 ALSFRS-R points and underestimates the trajectory in cluster V by 9.48 ALSFRS-R points when both are evaluated 3âyr after symptom onset. This diversity highlights the complexity of progression trajectories in ALS. Analysis of other study populations (Extended Data Figs. 3â6) also revealed many clusters that were highly nonlinear.

Identifying nonlinear patterns across heterogeneous studies

We compared the performance of MoGP against two other approaches, which assume that progression is linear. A standard model in the field is to calculate linear slopes fit to patient data with an onset anchor³⁷ (slope model: SM). The slope model is fit to each patient separately and does not identify clusters. We also benchmarked our model against a mixture of Gaussian processes model with a linear kernel (linear kernel model: LKM). The LKM retains the ability to cluster trajectories using a Dirichlet process but does not allow for nonlinear functions, allowing us to separate the contribution of clustering and the assumption of linearity in our models.

For all study populations, the error was lower in the MoGP model than in the LKM and SM (Fig. 2a). Across the populations, using the MoGP reduced error by more than one ALSFRS-R point as compared with the LKM for at least 27.16% of participants; at least 8.33% of patients have an improvement in accuracy greater than two ALSFRS-R points (Supplementary Table 3). Importantly, the error of the MoGP was lower even though the LKM used a larger number of clusters to model the data (Supplementary Table 6). It is also notable that the MoGP, which identified clusters as large as 88 participants, was able to match or outperform the patient-specific SM (Fig. 2a and Supplementary Table 4), which would have been expected to markedly outperform MoGP if substantial nonlinear structure did not exist in the data. The results are replicated across the five different datasets, suggesting that complex nonlinearity is a common feature of ALS progression and is not a unique feature of a single dataset.

**Fig. 2: Estimating nonlinearity of trajectories.**

The clusters with the most substantial nonlinearity often followed sigmoidal trajectory patterns, with varying inflection points (Fig. 2b). In some of these clusters, patients had slow progression for a period of time, followed by a consistent sharp decline. This pattern of progression appears consistent with a sudden loss of ability to carry out functions that we refer to as a âfunctional cliffâ. In other cases, the pattern is more consistent with a rapid period of decline followed by a slower phase. Since there are many settings in which patient-specific parametric models are very useful, we compared our model with a patient-specific sigmoidal model (SG)²⁰. Somewhat surprisingly, despite the fact that the MoGP models groups of patients, rather than individuals, MoGP outperforms a patient-specific sigmoid model by one or more ALSFRS-R points for 4.20â9.43% of patients across the studies (Supplementary Table 5). This indicates that, while a sigmoidal model captures much of the nonlinearity, it does not represent the full complexity of progression patterns.

MoGP clusters varied considerably in their rates of progression and the stability of their progression patterns. MoGP enables the characterization of each of these properties through the mean function slope and kernel function length-scale parameters respectively, both of which are learned and optimized through the training process. The model provides estimates for each of these parameters, and these can be used to approximate similarity between clusters depending on the desired clustering property (Extended Data Fig. 7).

Clustering trajectories on the basis of the optimized slope and length-scale parameters reveals interesting patterns (Extended Data Fig. 7). The dominant clinical progression patterns in ALS are sigmoidal fast progression (Extended Data Fig. 7b, 17.48% of individuals), stable slow progression (Extended Data Fig. 7e, 17.38%), unstable slow progression (Extended Data Fig. 7f, 32.98%) and unstable medium progression (Extended Data Fig. 7d, 30.82%). As might have been expected, some types of progression were associated with specific sites of onset. Clusters with fast sigmoidal progression have the highest percentage of individuals with bulbar onset (30.14% of individuals), while those with stable slow progression have the highest percentage of individuals with limb onset (76.97%) (Supplementary Table 8).

Overall, the MoGP model promotes the ability to learn these complex disease progression trajectories better than currently used clinical models, while stratifying patients to reveal common patterns of disease.

Evaluating the robustness of the clusters to sparse data

As clinical data for ALS patients are often incomplete or sparse, we sought to evaluate MoGP performance in these settings. We tested robustness using PRO-ACT, which is the largest of our sources and is a compendium of data from several clinical trials. We also tested robustness using data from CEFT, which is a small clinical cohort within PRO-ACT that may be more representative of common clinical settings. We compared MoGPâs performance against LKM and SM.

We first evaluated the modelâs ability to recreate randomly withheld data points (âinterpolationâ). Across all interpolated tests for PRO-ACT, we found that the clusters identified by MoGP had lower reconstruction error than the LKM (Fig. 3a, Pââ¤â1âÃâ10^â4), and a lower error than the SM when 50% and 75% of training data are included (Fig. 3a, Pââ¤â1âÃâ10^â³). These trends persisted when compared with CEFT (Fig. 3c).

**Fig. 3: Evaluating robustness of cluster assignments with sparse datasets.**

One of the most common uses for trajectory modeling is to predict future ALSFRS-R scores. We therefore evaluated the modelâs ability to predict future ALSFRS-R scores for patients with right-censored data (âpredictionâ). In clinical trials, these predictions are often made with the SM. For PRO-ACT, when only three or six months of data from baseline were provided, the SM and LMK were the most accurate (Fig. 3b). However, when one or more years of training data were provided, the MoGP model outperformed the LKM and SM (Fig. 3b, Pââ¤â1âÃâ10^â2, except for 1.5âyr, where Pâ=â1.34âÃâ10^â¹ for SM), and more accurately predicted future disease progression by more than 0.22, 0.41 and 1.28 ALSFRS-R points at 1, 1.5 and 2âyr respectively. This trend was strengthened in CEFT, in which six months of training data were sufficient to see an improvement in progression forecasting (Fig. 3d, Pââ¤â1âÃâ10^â¹).

For the majority of comparisons, the MoGP identified fewer clusters per mixture model than the SM or LKM, indicating that the lower reconstruction error was not due to overfitting of the cluster assignments (Supplementary Fig. 1).

Transferring trajectories across study populations

Because ALS is heterogeneous and characteristics of study populations can differ considerably, it is important to test whether trajectory models capture patterns that are consistent across populations. To answer this question, we trained MoGP on a large database (âreference modelâ) and used it to predict patient trajectories in other study populations that varied in data collection frequency and follow-up period. We benchmarked the MoGP results against models in which both the test and training sets were derived from the same study population (âstudy-specific modelsâ). The study-specific models allow us to evaluate possible overfitting of the reference model. If the reference model was overfit, we would expect it to have a much higher error than the study-specific models.

We found that the reference model, trained on PRO-ACT, demonstrated strong performance on external datasets, indicating that the trajectory clusters are not overfit to the reference model data (Fig. 4a). Importantly, we found that for all test datasets the reference model outperformed the study-specific models (Fig. 4b, Pâ=â0.0312 for AALS, CEFT, EMORY; Pâ=â0.0625 for NATHIST). AALS had the lowest error when the reference model was used (2.16 ALSFRS-R points), followed by CEFT (2.25), EMORY (2.32) and NATHIST (2.59) (Fig. 4b). These errors were similar to the baseline error (1.88 ALSFRS-R points) when the reference model was tested on the held-out data from PRO-ACT, the study on which it was trained. We additionally benchmarked the model against a reference model for which the cluster labels were randomly shuffled. This randomized control had a mean error of 11.74 ALSFRS-R points, which was much higher than the errors of the reference models. Given that CEFT is a subset of PRO-ACT, it is interesting that the CEFT study-specific model had a higher error than its reference model counterpart; these results suggest that the larger size of the PRO-ACT dataset may allow it to capture trajectories more accurately. The reference modelâs ability to outperform all of the study-specific models is strong evidence that the trajectory patterns identified by MoGP are transferable across ALS study populations.

**Fig. 4: Assessing trajectory consistency across datasets.**

Corresponding survival outcomes with trajectory clusters

Next, we evaluated if the MoGP clusters, which were trained only on ALSFRS-R data, were able to reflect the duration of patient survival from symptom onset to death. The results of the KaplanâMeier analysis are presented in Fig. 5. Some clusters (Fig. 5c,e) reflected longer survival durations, with very few deaths recorded, while other clusters reflected shorter durations. For example, cluster D had a median survival of 2.90âyr from symptom onset, corresponding to faster progression (Fig. 5d,i). Of all pairwise combinations of clusters, 63.40% corresponded to differential survival outcomes when MoGP was used. By contrast, when LKM was used 50.99% of pairwise combinations of clusters corresponded to differential survival outcomes (Pâ<â0.05). These results demonstrate that incorporating nonlinearity improves the correspondence of clusters to survival outcomes and provides evidence that these progression clusters are clinically relevant.

**Fig. 5: Survival outcomes for trajectory clusters.**

Characterizing patterns of decline in alternative ALS measures

In addition to ALSFRS-R scores, there are other important clinical metrics that can be used to monitor ALS disease progression. One is forced vital capacity, which is a spirometer-based measure of lung function and has been used as an indicator of survival and disease progression³⁸. Furthermore, while the ALSFRS-R total is commonly used as an aggregate measure, its component subscores measuring fine motor, gross motor, bulbar and respiratory function can also be analyzed to identify subscore-specific patterns. When we applied MoGP to forced vital capacity and ALSFRS-R subscores from PRO-ACT, we saw that the nonlinearity persisted in these domains. The nonlinear trajectories were particularly pronounced for forced vital capacity and bulbar function (Fig. 6).

**Fig. 6: MoGP trajectory patterns for secondary endpoints of ALS disease progression.**

A key advance of this work is the identification of clusters of patients, which can be used to investigate genetic or environmental causes that may underlie ALS progression. For instance, the C9orf72 repeat expansion is the most common of the known causes of ALS, and it is associated with faster-progression ALS, as indicated by reduced survival^39,40. However, even among patients who share this common genetic cause of ALS, there is some evidence of uncharacterized heterogeneity in ALS progression patterns⁴¹. As an example use-case, we asked if MoGP can be used to stratify patients who carry this repeat expansion. We analyzed data from AALS, a study population with both clinical and molecular data available. The patients with the C9orf72 repeat expansion did not correspond to a single cluster, supporting the hypothesis of heterogeneous progression within this group. As more data accumulate, such analyses could aid in the search for genetic or environmental variables that modify the aggressiveness of C9orf72.

Revealing patterns in Alzheimerâs and Parkinsonâs endpoints

The MoGP approach can be applied to functional rating scales that are widely used in other neurodegenerative diseases. We applied MoGP to the Alzheimerâs Disease Assessment ScaleâCognitive Subscale (ADAS-Cog-13 (refs. ^42,43). The model showed a range of disease progression patterns, with varying severities of progression (Extended Data Fig. 8a and Supplementary Fig. 5). The majority of the largest clusters showed linear trajectories, in which the first-year slope appropriately captured later progression; clusters E and H, while largely linear, deviate from the first-year slope, showing counterexamples to this trend. It is noteworthy that the clusters varied substantially in the rates of conversion of mild cognitive impairment (MCI) to Alzheimerâs disease. Ninety percent of those in cluster F had an MCI diagnosis at baseline, compared with 5.26% in cluster G (Supplementary Table 9).

Similarly to ALS and Alzheimerâs disease, Parkinsonâs disease is heterogeneous in its symptom presentation and progression, which creates challenges in therapeutic discovery. Unlike ALS and Alzheimerâs disease, there are widely used medications for Parkinsonâs disease that can provide symptomatic relief, although they do not slow or stop the progression of the disease⁴⁴. We characterized patterns in motor decline by applying MoGP to Part III of the Movement Disorder SocietyâUnified Rating Scale (MDS-UPDRS)⁴⁵ using only data from the âoff stateâ, that is, when not affected by medications. MoGP identified a number of progression trajectories (Extended Data Fig. 8b and Supplementary Fig. 6), with some showing stability of motor scores (clusters C, F), while others showed clear motor function decline (clusters A, B, D).

Interesting trends emerged from this analysis. Over 90% of individuals in clusters with an unstable slow progression pattern (Supplementary Fig. 8 and Supplementary Table 10) had tremor-dominant (TD)⁴⁶ Parkinsonâs disease, as opposed to postural instability/gait difficulty (PIGD). In contrast with previous studies of the linearity of MDS-UPDRS scores⁴⁷, our results also point to nonlinear complexity in some clusters (clusters C, E, G) (Extended Data Fig. 8b). These analyses demonstrate that MoGPâs flexibility enables it to characterize long-term heterogeneity in time-series metrics in a number of diverse clinical settings.

Discussion

The improved performance of an MoGP model over the slope and linear kernel models indicates that linear models are insufficient to capture the heterogeneity in ALS disease progression. While some patients do indeed have linear trajectories, a substantial portion of patients have nonlinear trajectories. Our work also finds that, while a simple parametric nonlinear modelâa two-parameter sigmoidâis better than a linear model, it still fails to capture the full range of patient trajectory patterns, motivating the use of non-parametric models that can capture both linear and nonlinear trajectories.

Previous work has suggested that the functional cliff patterns seen here may be a result of inconsistencies in the ALSFRS-R or issues related to the ordinal scale used in ALSFRS-R as opposed to a linearly weighted interval scale^48,49. However, the consistency of MoGP-identified patterns across different study populations suggests that the patterns are not the result of deficiencies in the ALSFRS-R. Critically, we also observed nonlinear patterns in vital capacity scores, which are measured independently of ALSFRS-R scores. These findings support the view that nonlinearity is reflective of changes in patient function and not problems in measurement. These findings also have implications for the analysis of clinical trials, many of which use ALSFRS-R and vital capacity metrics as primary or secondary endpoints. In many trajectory clusters, functional cliffs or sigmoidal patterns in disease progression may obscure the detection of therapeutic efficacy if linear models are used. Our results support accounting for nonlinearity when evaluating ALS clinical trial efficacy, with particular salience for clinical trials that are 1âyr in duration or longer.

Our work also demonstrates how existing clinical databases in ALS can be leveraged to enable the characterization of disease progression in sparse datasets from different study populations. A MoGP model trained on the PRO-ACT database accurately predicted trajectories for clinical datasets from AALS, EMORY, CEFT and NATHIST datasets. The transferability of MoGP-identified clusters across the datasets indicates that the trajectory cluster patterns are robust to batch effects due to clinician or site differences, and may reflect underlying disease processes. One of the properties of the Dirichlet process model underlying MoGP clustering is that it will naturally scale the number of identified clusters within a given dataset depending on the number of samples in that dataset; we can use a reference model on a clinical cohort of any size. This non-parametric property of the model underlies the difference in the total number of clusters found in the varying datasets. Conversely, when it is useful to analyze fewer clusters the trajectories can easily be grouped together on the basis of their mean slope and length scale, revealing dominant modes of disease progression. The identification of these clusters creates an opportunity to search for molecular, environmental or other factors that may modify disease progression.

As in many clinical studies, the datasets and therefore the progression patterns in this analysis are influenced by both selection bias and attrition bias. Selection bias refers to the sample of the population that is included in each study. Studies such as AALS, which require enrollment and consent to undergo additional monitoring, tend to be biased towards slower-progressing ALS. The EMORY dataset, which has a high percentage of enrollment from the clinic, is likely to be more reflective of a clinical population, although it reflects a group of patients with higher rates of progression on average. Overall though, observational studies tend to have less standardized frequencies of data collection and sparser measurements. On the flip side, clinical trial datasets typically collect extensive longitudinal data, but because of enrollment criteria can be skewed towards faster-progression individuals. The variation in ages of onset and prevalence of sites of onset differ across clinical cohorts, which can indicate additional potential selection biases. Other variables that can be used to evaluate selection bias but were partially missing or unavailable across our studies include diagnostic delay, forced vital capacity, frontotemporal dementia and C9orf72 status. Attrition bias also plays a strong role in ALS datasets, given the rapid pace of disease progression, with patient monitoring becoming increasingly difficult in late-stage disease; this bias may particularly affect the tail end of the identified trajectory patterns. Given the large sample size in our study, and the consistency of the patterns across datasets, we expect that we are sampling the clinical population as broadly as possible, although future work will involve determining the extent to which these trajectories remain consistent in new datasets.

Ultimately, by identifying clusters of patients who have similar disease progression trajectories, these models could be used to identify molecular correlates that may be associated with ALS progression subtypes. While this work focuses on ALSFRS-R and vital capacity, the field of ALS has identified a growing number of molecular biomarkers and clinical metrics in which progression is poorly understood^50,51. This paper points to the complexity of disease progression in ALS and the necessity of more accurately accounting for heterogeneous trajectory patterns in clinical trial models and research studies.

Methods

Study populations

ALS data in this study were collected from five cohorts: PRO-ACT, CEFT, AALS, EMORY and NATHIST (Supplementary Tables 1 and 2). All scores used for this analysis are clinician reported. The populations varied in size, with PRO-ACT having the largest total number of participants (2,923 participants with at least three ALSFRS-R visits recorded). The populations differed in the median number of months followed (between 11 and 17âmonths) and the median frequency of clinical visits (between four and nine visits). The median slope between the populations also varied, with CEFT and EMORY having the fastest-progression populations (â0.84 and â0.89 ALSFRS-R points/month, respectively), and AALS having the slowest-progression population (â0.55 ALSFRS-R points/month). CEFT had a median of 16.80âmonths of follow-up, while PRO-ACT had a median of 11.95âmonths, indicating that CEFT participants likely comprise some of the longest subject records in PRO-ACT. The differences between the populations allowed us to measure the robustness of our model to data collection methods, frequency of clinical visits and duration of follow-up.

Modeling approach

We characterize disease progression in ALS using a framework for identifying trajectory patterns from longitudinal data. While previous work on disease progression modeling has focused on patient-specific prediction models^16,52,53, a critical advance of this work is the characterization of distinct and large trajectory clusters. Furthermore, we provide a principled approach to characterizing the shapes of disease progression patterns in ALS, which leverages Bayesian non-parametric methods to minimize the number of assumptions that are required for regression models. We show that this method can flexibly be applied to a number of functional clinical measures for progressive diseases. Each component of the model is detailed further below. Further details, including the mathematical specification of the model, can be found in Supplementary notes.

The modeling approach of clustering over temporal progression patterns has been shown to improve the characterization of disease progression in other conditions. For example, Peterson et al. demonstrated the use of an autoregressive Gaussian process model for predicting metrics of Alzheimerâs progression⁵⁴; however, the model made a fundamentally different assumption about the structure of the dataâthat there is a single global progression type, and that each patient follows a noisy version of this global progression typeâwhich is an assumption that does not capture the full heterogeneity of ALS phenotypes. Furthermore, the model requires fixed time intervals of visits, which are not available in many clinical ALS datasets⁵⁴. Zhao et al. present a related clustering approach in multiple sclerosis, although their model relies heavily on prior domain knowledge on how to group patients into subgroups, which has not as yet been clearly defined in ALS⁵⁵. Other models, such as additive Gaussian process regression⁵⁶, can be used to characterize patterns in time-series data, although they lack the ability to stratify patients into disease subtypes.

Gaussian process regression

Gaussian process regression allows the identification of nonlinear trajectory patterns while making minimal assumptions about the shape of the trajectory functions^27,28. A Gaussian process is specified by a mean function and a covariance kernel. Because we expect ALS trajectories to be smooth functions with no discontinuities, our MoGP model uses a squared exponential kernel. The squared exponential kernel has two parameters: the signal variance, which determines the average distance of the function from the mean, and the length scale, which specifies the smoothness of the function. Each of these parameters is determined during the learning phase using the training data.

Monotonic inductive bias

Because ALS trajectories are expected to decline over time, we use a negative linear function in the Gaussian process models of MoGP. To further encourage declining trajectories, we modify the Dirichlet process clustering algorithm, such that an individual can only be placed in a cluster if their score at their initial visit is not substantially higher than the mean function of the current cluster at that point. We also impute an onset-anchor value, a maximum score of a clinical metric assigned to the date corresponding to symptom onset, which has been previously shown to improve prediction in ALS trajectories³⁷.

Dirichlet process clustering

Dirichlet process mixtures^29,30 can be used to identify clusters in data without needing to specify an expected number of clusters in advance. This unsupervised learning model begins by assuming that an infinite number of clusters can exist, and then narrows its prediction to a limited number of components best supported by the observed data. In our case, each mixture component is a function drawn from a Gaussian process. The resulting model clusters patient trajectories by probabilistically assigning them to those components that best explain them. The number of patients in each cluster is also learned from the model, and clusters can differ in size from each other. Through this data-driven approach, the algorithm can learn clusters of ALS patients who share disease progression patterns. The method can also predict the cluster membership and the disease progression pattern of a participant not included in the model, and provide an estimate of the confidence of this prediction.

Model evaluation

Evaluating trajectory nonlinearity

We evaluated how generalizable a linear model trained in the first year of disease progression is to subsequent data points, by calculating an anchored first-year slope. This was computed as the following: (48âââcluster mean function at 1âyr)/time from symptom onset. Anchoring indicates that a score of 48 (the maximum of the ALSFRS-R scale) is imputed at the time of symptom onset³⁷. We compared our MoGP model against two benchmark linear models: an anchored slope model (SM), which is patient specific, and a mixture of Gaussian processes model with a linear kernel (LKM), which clusters patients using a linear parametric model. Additionally, to evaluate the extent to which a nonlinear parametric model represents ALS progression, we compared our model against a patient-specific two-parameter sigmoidal model (SG)²⁰.

Because we are proposing the use of a data-driven model, we aimed to be as conservative as possible in removing patients from the dataset so as to not introduce additional selection bias. For this analysis, participants were excluded from the model if fewer than three complete ALSFRS-R visits were recorded, the first visit was more than 7âyr from symptom onset or an increase of more than six points in ALSFRS-R between consecutive visits was recorded (Supplementary Table 1). The ALSFRS-R is an updated version of the previously used ALSFRS metric, and includes additional questions measuring dyspnea, orthopnea and respiratory insufficiency⁶. The ALSFRS-R measure was used here because it is the current standard in clinical trial analysis^12,13,14. Seven years was selected as the point at which longitudinal data became sparse. Six ALSFRS-R points was selected because a jump such as this was unlikely to be seen unless there was a data-entry error.

For each model, the RMSE between a participantâs measured scores and their predicted cluster mean function were calculated. The RMSE was compared between the models; a lower RMSE indicates reduced error in that model and better model performance.

Robustness to sparse data

We simulated sparsity by withholding data and assessed the modelâs ability to perform two tasks: (1) interpolation of ALSFRS-R scores for a patient with randomly withheld data points, and (2) forecasting future ALSFRS-R disease progression for patients with right-censored data. We tested this using PRO-ACT and CEFT. To have sufficient longitudinal measurements, for interpolation experiments we only included participants with ten or more longitudinal ALSFRS-R visits, and for prediction experiments we only included participants with four or more visits (Supplementary Table 7).

The reconstruction error for each participant was calculated using the RMSE between the original withheld data points and predicted values from the mean function for the participantâs trajectory cluster. This was done across all interpolated tests, in which 25%, 50% and 75% of clinic visits per patient were provided as training data, with selections randomly interspersed across visits. We additionally evaluated the ability of MoGP to predict future progression by using right-censored data with varying numbers of training data (including visits within 0.25, 0.5, 1, 1.5 and 2âyr since baseline visit).

Model generalizability

To evaluate whether clusters derived from one study population could be used to model external study populations, we trained a reference model and evaluated the transferability of this model to unseen ALS patient data. We predicted the cluster membership for each participant, and calculated the RMSE between the participant ALSFRS-R scores and the mean function of their predicted cluster.

We split all of our study populations into test and training datasets (60% train, 40% test; repeated across five randomly split testâtrain datasets). For our reference MoGP model, we used the training data from PRO-ACT, which was chosen because it contained the largest number of samples and is publicly available. For AALS, EMORY, CEFT and NATHIST, we used the training data from each study to train a separate model (study-specific model). For each studyâs remaining test data, we predicted the trajectory function using the reference model and the study-specific model. To approximate the minimum error expected, we calculate the reconstruction error when the reference model is applied to the test set from PRO-ACT, the same study on which it is trained (âbaseline errorâ). We also benchmark against the error when cluster labels on the reference model are randomly shuffled (ârandom cluster assignmentâ).

Relationship to alternative outcome measures

We calculated the KaplanâMeier survival probability curves for the largest MoGP clusters identified from PRO-ACT. If no death was recorded, the participant was marked as censored using the latest date of a recorded ALSFRS-R score.

We trained MoGP models on forced vital capacity percentages (calculated as the maximum of three trials) and ALSFRS-R subscores (fine motor, gross motor, respiratory and bulbar domains). A maximum score of 100% was used for the forced vital capacity percentage model, and a maximum score of 12 was used for ALSFRS-R subscores.

We applied our method to ADAS-Cog-13 (refs. ^42,43) from the Alzheimerâs Disease Neuroimaging Initiative (ADNI⁵⁷). Individuals with a confirmed Alzheimerâs disease diagnosis at any point of the data collection were included in the model; this also included individuals who began the study with MCI and then converted to an Alzheimerâs disease diagnosis. To ensure sufficient longitudinal data, individuals with fewer than three longitudinal visits were excluded, with a total of 331 individuals included in the model. The correlation between the learned clusters and MCI to Alzheimerâs disease conversion was then calculated.

We also applied our method to the MDS-UPDRS⁴⁵ scale from the Parkinsonâs Progression Markers Initiative dataset⁵⁸. In contrast to ALS and AD, for Parkinsonâs disease there are medications that can mitigate symptoms although not long-term progression of the disease⁴⁴. Because we were interested in characterizing progression patterns when not affected by medications, we focused on measurements of the MDS-UPDRS Part III in the off state, which is defined as either before the initiation of medication or after abstaining from medication for at least 12âh. Individuals with fewer than three longitudinal off-medication scores or a first visit more than 10âyr from symptom onset were excluded, with a total of 397 individuals included in the model. We calculated the correlation between the clusters and Parkinsonâs disease subtypes of PIGD and TD, with the designation of PIGD/TD calculated following the method previously described by Stebbins et al.⁴⁶. For the purpose of analyzing the Parkinsonâs disease subtype correlation with cluster membership, we focused on individuals with a stable PIGD/TD designation (one that does not change over the course of the disease).

Statistics and reproducibility

To compare the cumulative distribution function of the RMSE between a participantâs predicted cluster membership and cluster model mean, P values were calculated with KolmogorovâSmirnov two-sample two-sided tests. For interpolation and prediction experiments, to determine if a model error had decreased between the LKM or SM and the MoGP, a Wilcoxon signed-rank one-sided test was used. To assess trajectory consistency between reference models and study-specific models, a Wilcoxon signed-rank one-sided test was used. To calculate survival curves, a KaplanâMeier estimator was used, with P values calculated via the logrank test with FDR correction. P values for cluster correlations were calculated using a hypergeometric test.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

A pretrained reference model for this study can be downloaded here: http://fraenkel.mit.edu/mogp

Source Data for Figs. 2â4 and Extended Data Fig. 7 are available with this manuscript. Source Data for Fig. 1 and Extended Data Fig. 2 are available as a Python object from http://fraenkel.mit.edu/mogp. Other source data are unavailable at this time because they contain patient-level clinical data; however, all figures can be generated using the code provided, after downloading the datasets listed below.

Clinical data for this study can be obtained from the following sources.

AALS (ClinicalTrials.gov identifier NCT02574390) is available for download in the Answer ALS data portal (data.answerals.org). PRO-ACT can be downloaded from the PRO-ACT database (https://nctu.partners.org/ProACT). CEFT (ClinicalTrials.gov identifier NCT00349622) can be downloaded from National Institute of Neurological Disorders and Stroke (NINDS) (https://www.ninds.nih.gov/Current-Research/Research-Funded-NINDS/Clinical-Research/Archived-Clinical-Research-Datasets). EMORY is restricted access at this time due to containing information that could compromise patient privacy, but available with permission from Dr. Jonathan Glass (jglas03@emory.edu) for legitimate research. Response to requests will be provided within two weeks, all data provided will be fully de-identified, a DUA will need to be established and the source data will need to be acknowledged in any publications. NATHIST is available from the ALS/MND Natural History Consortium (https://www.data4cures.org/requestingdata) with a summary of proposed data use, data elements requested and publication intent. The Parkinsonâs Progression Markers Initiative can be downloaded, with a data use agreement, online application and compliance with publication policy (https://www.ppmi-info.org/access-data-specimens/download-data). Applications for data access are reviewed by the Data and Publications Committee within one week of receipt. ADNI can be downloaded through the LONI Image and Data Archive (https://adni.loni.usc.edu/data-samples/access-data/#access_data). Access is contingent on adherence to the ADNI Data Use Agreement and its publication policies. The application process includes the acceptance of a data use agreement and submission of an online application form. The application must include the investigatorâs institutional affiliation and the proposed uses of the ADNI data. ADNI data may not be used for commercial products or redistributed in any way.

Code availability

We provide the Python code for the MoGP framework as well as a pretrained reference model that researchers can use to generate predictions of cluster membership and trajectory function from input patient data. We also provide a pip-installable Python package associated with this work (mogp). All code used for data processing, modeling and figure generation can be found at https://github.com/fraenkel-lab/mogp. Code is also deposited on Zenodo (license BSD 3-Clause; https://doi.org/10.5281/zenodo.6744399)⁵⁹.

References

Brown, R. H. & Al-Chalabi, A. Amyotrophic lateral sclerosis. N. Engl. J. Med. 377, 162â172 (2017).
ArticleÂ Google ScholarÂ
Mandrioli, J. et al. Heterogeneity in ALSFRS-R decline and survival: a population-based study in Italy. Neurol Sci. 36, 2243â2252 (2015).
ArticleÂ Google ScholarÂ
Traxinger, K., Kelly, C., Johnson, B. A., Lyles, R. H. & Glass, J. D. Prognosis and epidemiology of amyotrophic lateral sclerosis. Neurol. Clin. Pract. 3, 313â320 (2013).
ArticleÂ Google ScholarÂ
Proudfoot, M., Jones, A., Talbot, K., Al-Chalabi, A. & Turner, M. R. The ALSFRS as an outcome measure in therapeutic trials and its relationship to symptom onset. Amyotroph. Lateral Scler. Frontotemporal Degener. 17, 414â425 (2016).
ArticleÂ Google ScholarÂ
Bedlack, R. S. et al. How common are ALS plateaus and reversals?. Neurology 86, 808â812 (2016).
ArticleÂ Google ScholarÂ
Cedarbaum, J. M. et al. The ALSFRS-R: a revised ALS functional rating scale that incorporates assessments of respiratory function. J. Neurol. Sci. 169, 13â21 (1999).
ArticleÂ Google ScholarÂ
Goyal, N. A. et al. Addressing heterogeneity in amyotrophic lateral sclerosis CLINICAL TRIALS. Muscle Nerve 62, 156â166 (2020).
ArticleÂ Google ScholarÂ
Kiernan, M. C. et al. Improving clinical trial outcomes in amyotrophic lateral sclerosis. Nat. Rev. Neurol. 17, 104â118 (2021).
ArticleÂ Google ScholarÂ
Armon, C. et al. Linear estimates of disease progression predict survival in patients with amyotrophic lateral sclerosis. Muscle Nerve 23, 874â882 (2000).
ArticleÂ Google ScholarÂ
Elamin, M. et al. Predicting prognosis in amyotrophic lateral sclerosis: a simple algorithm. J. Neurol. 262, 1447â1454 (2015).
ArticleÂ Google ScholarÂ
Labra, J., Menon, P., Byth, K., Morrison, S. & Vucic, S. Rate of disease progression: a prognostic biomarker in ALS. J. Neurol. Neurosurg. Psychiatry 87, 628â632 (2016).
ArticleÂ Google ScholarÂ
Paganoni, S. et al. Trial of sodium phenylbutyrateâtaurursodiol for amyotrophic lateral sclerosis. N. Engl. J. Med. 383, 919â930 (2020).
ArticleÂ Google ScholarÂ
Cudkowicz, M. E. et al. Safety and efficacy of ceftriaxone for amyotrophic lateral sclerosis: a multi-stage, randomised, double-blind, placebo-controlled trial. Lancet Neurol. 13, 1083â1091 (2014).
ArticleÂ Google ScholarÂ
Abe, K. et al. Safety and efficacy of edaravone in well defined patients with amyotrophic lateral sclerosis: a randomised, double-blind, placebo-controlled trial. Lancet Neurol. 16, 505â512 (2017).
ArticleÂ Google ScholarÂ
Kueffner, R. et al. Stratification of amyotrophic lateral sclerosis patients: a crowdsourcing approach. Sci. Rep. 9, 690 (2019).
ArticleÂ Google ScholarÂ
KÃ¼ffner, R. et al. Crowdsourced analysis of clinical trial data to predict amyotrophic lateral sclerosis progression. Nat. Biotechnol. 33, 51â57 (2015).
ArticleÂ Google ScholarÂ
Gordon, P. H. et al. Progression in ALS is not linear but is curvilinear. J. Neurol. 257, 1713â1717 (2010).
ArticleÂ Google ScholarÂ
Thakore, N. J., Lapin, B. R., Pioro, E. P. & Pooled Resource Open-Access ALS Clinical Trials Consortium Trajectories of impairment in amyotrophic lateral sclerosis: insights from the Pooled Resource Open-Access ALS Clinical Trials cohort. Muscle Nerve 57, 937â945 (2018).
ArticleÂ Google ScholarÂ
Ackrivo, J. et al. Classifying patients with amyotrophic lateral sclerosis by changes in FVC. A group-based trajectory analysis. Am. J. Respir. Crit. Care Med. 200, 1513â1521 (2019).
ArticleÂ Google ScholarÂ
Poesen, K. et al. Neurofilament markers for ALS correlate with extent of upper and lower motor neuron disease. Neurology 88, 2302â2309 (2017).
ArticleÂ Google ScholarÂ
Steinbach, R. et al. Applying the D50 disease progression model to gray and white matter pathology in amyotrophic lateral sclerosis. NeuroImage Clin. 25, 102094 (2020).
ArticleÂ Google ScholarÂ
Gomeni, R. & Fava, M. Consortium TPROAACT. Amyotrophic lateral sclerosis disease progression model. Amyotroph. Lateral Scler. Frontotemporal Degener. 15, 119â129 (2014).
ArticleÂ Google ScholarÂ
Ong, M. L., Tan, P. F. & Holbrook, J. D. Predicting functional decline and survival in amyotrophic lateral sclerosis. PLoS ONE 12, e0174925 (2017).
ArticleÂ Google ScholarÂ
Tang, M. et al. Model-based and model-free techniques for amyotrophic lateral sclerosis diagnostic prediction and patient clustering. Neuroinformatics 17, 407â421 (2019).
ArticleÂ Google ScholarÂ
Bell, M. L. & Rabe, B. A. The mixed model for repeated measures for cluster randomized trials: a simulation study investigating bias and type I error with missing continuous data. Trials 21, 148 (2020).
ArticleÂ Google ScholarÂ
Berry, J. D. et al. Improved stratification of ALS clinical trials using predicted survival. Ann. Clin. Transl. Neurol. 5, 474â485 (2018).
ArticleÂ Google ScholarÂ
Rasmussen, C. E. & Williams, C. K. I. Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) (MIT Press, 2005).
Rasmussen, C. E. & Ghahramani, Z. Infinite Mixtures of Gaussian Process Experts. Adv. Neural Inf. Process. Syst. 14, 881â888 (2002).
Google ScholarÂ
Lo, A. Y. On a class of Bayesian nonparametric estimates: I. Density estimates. Ann. Stat. 12, 351â357. (1984).
ArticleÂ MathSciNetÂ MATHÂ Google ScholarÂ
Escobar, M. D. & West, M. Bayesian density estimation and inference using mixtures. J. Am. Stat. Assoc. 90, 577â588 (1995).
ArticleÂ MathSciNetÂ MATHÂ Google ScholarÂ
Tresp, V. Mixtures of Gaussian processes. Adv. Neural Inf. Process. Syst. 13, 7 (2000).
Google ScholarÂ
Ross, J. C. & Dy, J. G. Nonparametric mixture of Gaussian processes with constraints. Proc. Mach. Learn. Res. 28, 1346â1354 (2013).
Google ScholarÂ
Center for Innovation and Bioinformatics (CIB) (ALS/MND Natural History Consortium, accessed 9 April 2022); https://www.data4cures.org/natural-history-consortium
Atassi, N. et al. The PRO-ACT database. Neurology. 83, 1719â1725 (2014).
ArticleÂ Google ScholarÂ
Pooled Resource Open-Access ALS Clinical Trials Database (PRO-ACT) Data Sets (ALS Association and Neurological Clinical Research Institute, accessed 4 December 2020); https://ncri1.partners.org/ProACT/Data/Index/1
MD MEC. Clinical trial ceftriaxone in subjects with amyotrophic lateral sclerosis (ALS). ClinicalTrials.gov https://clinicaltrials.gov/ct2/show/NCT00349622 (Accessed 7 April 2022).
Karanevich, A., He, J. & Gajewski, B. J. Using an anchor to improve linear predictions with application to predicting disease progression. Rev. Colomb. Estad. 41, 137â155 (2018).
ArticleÂ MathSciNetÂ MATHÂ Google ScholarÂ
Czaplinski, A., Yen, A. A. & Appel, S. H. Forced vital capacity (FVC) as an indicator of survival and disease progression in an ALS clinic population. J. Neurol. Neurosurg. Psychiatry 77, 390â392 (2006).
ArticleÂ Google ScholarÂ
Umoh, M. E. et al. Comparative analysis of C9orf72 and sporadic disease in an ALS clinic population. Neurology 87, 1024â1030 (2016).
ArticleÂ Google ScholarÂ
Irwin, D. J. et al. Cognitive decline and reduced survival in C9orf72 expansion frontotemporal degeneration and amyotrophic lateral sclerosis. J. Neurol. Neurosurg. Psychiatry 84, 163â169 (2013).
ArticleÂ Google ScholarÂ
Floeter, M. K. et al. Disease progression in C9orf72 mutation carriers. Neurology 89, 234â241 (2017).
ArticleÂ Google ScholarÂ
Kueper, J. K., Speechley, M. & Montero-Odasso, M. The Alzheimerâs Disease Assessment ScaleâCognitive Subscale (ADAS-Cog): modifications and responsiveness in pre-dementia populations. A narrative review. J. Alzheimeras Dis. 63, 423â444 (2018).
ArticleÂ Google ScholarÂ
Mohs, R. C. et al. Development of cognitive instruments for use in clinical trials of antidementia drugs: additions to the Alzheimerâs Disease Assessment Scale that broaden its scope. The Alzheimerâs Disease Cooperative Study. Alzheimer Dis. Assoc. Disord. 11, S13âS21 (1997).
ArticleÂ Google ScholarÂ
Armstrong, M. J. & Okun, M. S. Diagnosis and treatment of Parkinson disease: a review. J. Am. Med. Assoc. 323, 548â560 (2020).
ArticleÂ Google ScholarÂ
Goetz, C. G. et al. Movement Disorder Society-sponsored revision of the Unified Parkinsonâs Disease Rating Scale (MDS-UPDRS): process, format, and clinimetric testing plan. Mov. Disord. 22, 41â47 (2007).
ArticleÂ MathSciNetÂ Google ScholarÂ
Stebbins, G. T. et al. How to identify tremor dominant and postural instability/gait difficulty groups with the movement disorder society unified Parkinsonâs disease rating scale: comparison with the unified Parkinsonâs disease rating scale. Mov. Disord. 28, 668â670 (2013).
ArticleÂ Google ScholarÂ
Holden, S. K., Finseth, T., Sillau, S. H. & Berman, B. D. Progression of MDSâUPDRS scores over five years in de novo Parkinson disease from the Parkinsonâs Progression Markers Initiative cohort. Mov. Disord. Clin. Pract. 5, 47â53 (2017).
ArticleÂ Google ScholarÂ
Andres, P. L. et al. Fixed dynamometry is more sensitive than vital capacity or ALS rating scale. Muscle Nerve 56, 710â715 (2017).
ArticleÂ Google ScholarÂ
Fournier, C. N. et al. Development and validation of the Rasch-Built Overall Amyotrophic Lateral Sclerosis Disability Scale (ROADS). JAMA Neurol. 77, 480â488 (2020).
ArticleÂ Google ScholarÂ
Simon, N. G. et al. Quantifying disease progression in amyotrophic lateral sclerosis. Ann. Neurol. 76, 643â657 (2014).
ArticleÂ Google ScholarÂ
Huang, F. et al. Longitudinal biomarkers in amyotrophic lateral sclerosis. Ann. Clin. Transl. Neurol. 7, 1103â1116 (2020).
ArticleÂ Google ScholarÂ
Taylor, A. A. et al. Predicting disease progression in amyotrophic lateral sclerosis. Ann. Clin. Transl. Neurol. 3, 866â875 (2016).
ArticleÂ Google ScholarÂ
Kimura, F. et al. Progression rate of ALSFRS-R at time of diagnosis predicts survival time in ALS. Neurology 66, 265â267 (2006).
ArticleÂ Google ScholarÂ
Peterson, K., Rudovic, O., Guerrero, R. & Picard, R. W. Personalized Gaussian processes for future prediction of Alzheimerâs disease progression. Preprint at http://arxiv.org/abs/1712.00181 (2018).
Zhao, Y., Chitnis, T., Healy, B. C., Dy, J. G. & Brodley, C. E. Domain induced Dirichlet mixture of Gaussian processes: an application to predicting disease progression in multiple sclerosis patients. In 2015 IEEE International Conference on Data Mining (Eds. Aggarwal, C. et al.) 1129â1134 (IEEE, 2015).
Cheng, L. et al. An additive Gaussian process regression model for interpretable non-parametric analysis of longitudinal data. Nat. Commun. 10, 1798 (2019).
ArticleÂ Google ScholarÂ
Weiner, M. W. et al. Impact of the Alzheimerâs Disease Neuroimaging Initiative, 2004 to 2014. Alzheimer's Dement. 11, 865â884 (2015).
ArticleÂ Google ScholarÂ
Parkinson Progression Marker Initiative The Parkinson Progression Marker Initiative (PPMI). Prog. Neurobiol. 95, 629â635 (2011).
ArticleÂ Google ScholarÂ
MoGP: Mixture of Gaussian Processes Model. Zenodo https://doi.org/10.5281/zenodo.6744399 (2022).

Download references

Acknowledgements

Data used in the preparation of this article were obtained from the PRO-ACT database, the ALS/MND Natural History Consortium, the Parkinsonâs Progression Markers Initiative database and the ADNI database. This research includes the National Institute of Neurologic Disease and Strokeâs Archived Clinical Research data (Clinical Trial of Ceftriaxone in ALS, M. Cudkowicz, Massachusetts General Hospital) obtained from the NINDS Archived Clinical Research Datasets webpage. Additional information about the studies can be found in Supplementary Acknowledgements. The Answer ALS organization, ALS Finding a Cure and Packard Foundation supported the collection of the Answer ALS clinical dataset used in the manuscript. The Muscular Dystrophy Association contributed funding to the Emory ALS Clinic database that was included in this research. C.N.F. received funding from the Department of Veterans Affairs of Research and Development (IK2CX001595-02) and the Department of Defense (AL200156). K. Sachs received funding from the Muscular Dystrophy Association (award 574137). D.R. received funding from the NSF Gradate Research Fellowship Program (GRFP) and Siebel Scholars Fellowship. E.F. and D.R. received funding from Answer ALS, MITâIBM Watson AI Lab (W1771646), the United States Army Medical Research Acquisition Activity (W81XWH-21-1-0245) and NIH (U54NS091046). T.M.H. received funding from the NIH/NINDS (K23NS099380). None of the organizations had any influence on the writing of the manuscript or the decision to submit it for publication.

Author information

A list of authors and their affiliations appears at the end of the paper.

Authors and Affiliations

Department of Biological Engineering, MIT, Cambridge, MA, USA
Divya Ramamoorthy,Â Karen Sachs,Â Jonathan Li,Â Aneesh Donde,Â Nhan Huynh,Â Miriam Adam,Â Brook T. Wassie,Â Alex Lenail,Â Natasha Leanna Patel-Murray,Â Yogindra Raghav,Â Karen Sachs,Â Velina Kozareva,Â Stanislav Tsitkov,Â Tobias EhrenbergerÂ &Â Ernest Fraenkel
Center for Computational Health and MITâIBM Watson AI Lab, IBM Research, Cambridge, MA, USA
Kristen Severson,Â Soumya GhoshÂ &Â Kenney Ng
Next Generation Analytics, Palo Alto, CA, USA
Karen Sachs
Department of Neurology, Emory University School of Medicine, Atlanta, GA, USA
Arish Jamil,Â Jonathan D. GlassÂ &Â Christina N. Fournier
Department of Neurology, Massachusetts General Hospital, Boston, MA, USA
Sara Thrower,Â Sarah Luppino,Â Alanna Farrar,Â Lindsay Pothier,Â Hong Yu,Â Ervin Sinani,Â Prasha Vigneswaran,Â Alexander V. Sherman,Â Merit E. Cudkowicz,Â James Berry,Â Alexander Sherman,Â Kenneth Faulconer,Â Ervin Sanani,Â Alex Berger,Â Julia Mirochnick,Â Todd M. HerringtonÂ &Â James D. Berry
Department of Neurology, Harvard Medical School, Boston, MA, USA
Todd M. Herrington
Brain Science Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Emily G. Baxi,Â Alyssa N. CoyneÂ &Â Jeffrey D. Rothstein
Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Emily G. Baxi,Â Alyssa N. Coyne,Â Elizabeth Mosmiller,Â Lindsey Hayes,Â Aianna Cerezo,Â Omar Ahmad,Â Promit Roy,Â Steven Zeiler,Â John W. Krakauer,Â Nicholas MaragakisÂ &Â Jeffrey D. Rothstein
Center for Systems and Therapeutics and the Taube/Koret Center for Neurodegenerative Disease, Gladstone Institutes and the Departments of Neurology and Physiology, University of California San Francisco, San Francisco, CA, USA
Julia A. Kaye,Â Leandro Lima,Â Stacia Wyman,Â Edward Vertudes,Â Naufa Amirani,Â Krishna Raja,Â Reuben ThomasÂ &Â Steven Finkbeiner
UCI MIND, University of California Irvine, Irvine, CA, USA
Ryan G. Lim,Â Ricardo MiramontesÂ &Â Leslie M. Thompson
Department of Biological Chemistry, University of California Irvine, Irvine, CA, USA
Jie WuÂ &Â Leslie M. Thompson
Advanced Clinical Biosystems Research Institute, The Barbra Streisand Heart Center, The Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
Vineet Vaibhav,Â Andrea Matlock,Â Vidya Venkatraman,Â Ronald Holewenski,Â Niveda Sundararaman,Â Rakhi Pandey,Â Danica-Mae ManaloÂ &Â Jennifer E. Van Eyk
Cedars-Sinai Biomanufacturing Center, Cedars-Sinai Medical Center, Los Angeles, CA, USA
Aaron Frank,Â Loren Ornelas,Â Lindsey Panther,Â Emilda Gomez,Â Erick Galvez,Â Daniel Perez,Â Imara Meepe,Â Susan Lei,Â Louis Pinedo,Â Chunyan Liu,Â Ruby Moran,Â Dhruv SareenÂ &Â Clive N. Svendsen
The Board of Governors Regenerative Medicine Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
Dhruv Sareen,Â Berhan Mandefro,Â Hannah Trost,Â Maria G. Banuelos,Â Veronica Garcia,Â Michael Workman,Â Richie Ho,Â Robert Baloh,Â Carolyn PrinaÂ &Â Clive N. Svendsen
Technome LLC, Herndon, VA, USA
Barry Landin
Computational Biology Center, IBM T.J. Watson Research Center, Yorktown, NY, USA
Carla Agurto,Â Guillermo CecchiÂ &Â Raquel Norel
Zofia Consulting, Reston, VA, USA
S. Michelle Farr
Department of Neurology and Genetics, Ohio State University Wexner Medical Center, Columbus, OH, USA
Jennifer Roggenbuck,Â Sarah HeintzmanÂ &Â Stephen Kolb
Department of Neurology, Columbia University, New York, NY, USA
Matthew B. HarmsÂ &Â Leslie M. Thompson
Department of Psychiatry and Human Behavior and Sue and Bill Gross Stem Cell Center, University of California Irvine, Irvine, CA, USA
Jennifer StocksdaleÂ &Â Keona Wang
Texas Neurology, Dallas, TX, USA
Todd MorganÂ &Â Daragh Heitzman
Department of Neurology, Washington University, St. Louis, MO, USA
Jennifer Jockel-Balsarotti,Â Elizabeth Karanja,Â Jesse Markway,Â Molly McCallumÂ &Â Tim Miller
Department of Neurology, Northwestern University, Chicago, IL, USA
Ben Joslin,Â Deniz AlibazogluÂ &Â Senda Ajroud-Driss
Microsoft Research, Microsoft Corporation, Redmond, WA, USA
Jay C. Beavers
Microsoft University Relations, Microsoft Corporation, Redmond, WA, USA
Mary BellardÂ &Â Elizabeth Bruce
On Point Scientific Inc., San Diego, CA, USA
Terri Thompson
Department of Neurobiology and Behavior, University of California Irvine, Irvine, CA, USA
Leslie M. Thompson
Centro Clinico NeMO, Milan, Italy
Christian Lunetta
University of Minnesota, Minneapolis, MN, USA
David Walk
St. Louis University, St. Louis, MO, USA
Ghazala Hayat
University of FloridaâGainesville, Gainesville, FL, USA
James Wymer
Virginia Commonwealth University, Richmond, VA, USA
Kelly Gwathmey
Providence Brain and Spine Institute, Portland, OR, USA
Nicholas Olney
Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Senda Ajroud-Driss
Temple University, Philadelphia, PA, USA
Terry Heiman-Patterson
Henry Ford Hospital, Detroit, MI, USA
Ximena Arcila-Londono

Authors

Divya Ramamoorthy
View author publications
You can also search for this author in PubMedÂ Google Scholar
Kristen Severson
View author publications
You can also search for this author in PubMedÂ Google Scholar
Soumya Ghosh
View author publications
You can also search for this author in PubMedÂ Google Scholar
Karen Sachs
View author publications
You can also search for this author in PubMedÂ Google Scholar
Jonathan D. Glass
View author publications
You can also search for this author in PubMedÂ Google Scholar
Christina N. Fournier
View author publications
You can also search for this author in PubMedÂ Google Scholar
Todd M. Herrington
View author publications
You can also search for this author in PubMedÂ Google Scholar
James D. Berry
View author publications
You can also search for this author in PubMedÂ Google Scholar
Kenney Ng
View author publications
You can also search for this author in PubMedÂ Google Scholar
Ernest Fraenkel
View author publications
You can also search for this author in PubMedÂ Google Scholar

Consortia

Answer ALS

Emily G. Baxi
,Â Alyssa N. Coyne
,Â Elizabeth Mosmiller
,Â Lindsey Hayes
,Â Aianna Cerezo
,Â Omar Ahmad
,Â Promit Roy
,Â Steven Zeiler
,Â John W. Krakauer
,Â Divya Ramamoorthy
,Â Jonathan Li
,Â Aneesh Donde
,Â Nhan Huynh
,Â Miriam Adam
,Â Brook T. Wassie
,Â Alex Lenail
,Â Natasha Leanna Patel-Murray
,Â Yogindra Raghav
,Â Karen Sachs
,Â Velina Kozareva
,Â Stanislav Tsitkov
,Â Tobias Ehrenberger
,Â Julia A. Kaye
,Â Leandro Lima
,Â Stacia Wyman
,Â Edward Vertudes
,Â Naufa Amirani
,Â Krishna Raja
,Â Reuben Thomas
,Â Ryan G. Lim
,Â Ricardo Miramontes
,Â Jie Wu
,Â Vineet Vaibhav
,Â Andrea Matlock
,Â Vidya Venkatraman
,Â Ronald Holewenski
,Â Niveda Sundararaman
,Â Rakhi Pandey
,Â Danica-Mae Manalo
,Â Aaron Frank
,Â Loren Ornelas
,Â Lindsey Panther
,Â Emilda Gomez
,Â Erick Galvez
,Â Daniel Perez
,Â Imara Meepe
,Â Susan Lei
,Â Louis Pinedo
,Â Chunyan Liu
,Â Ruby Moran
,Â Dhruv Sareen
,Â Barry Landin
,Â Carla Agurto
,Â Guillermo Cecchi
,Â Raquel Norel
,Â Sara Thrower
,Â Sarah Luppino
,Â Alanna Farrar
,Â Lindsay Pothier
,Â Hong Yu
,Â Ervin Sinani
,Â Prasha Vigneswaran
,Â Alexander V. Sherman
,Â S. Michelle Farr
,Â Berhan Mandefro
,Â Hannah Trost
,Â Maria G. Banuelos
,Â Veronica Garcia
,Â Michael Workman
,Â Richie Ho
,Â Robert Baloh
,Â Jennifer Roggenbuck
,Â Matthew B. Harms
,Â Carolyn Prina
,Â Sarah Heintzman
,Â Stephen Kolb
,Â Jennifer Stocksdale
,Â Keona Wang
,Â Todd Morgan
,Â Daragh Heitzman
,Â Arish Jamil
,Â Jennifer Jockel-Balsarotti
,Â Elizabeth Karanja
,Â Jesse Markway
,Â Molly McCallum
,Â Tim Miller
,Â Ben Joslin
,Â Deniz Alibazoglu
,Â Senda Ajroud-Driss
,Â Jay C. Beavers
,Â Mary Bellard
,Â Elizabeth Bruce
,Â Jonathan D. Glass
,Â Nicholas Maragakis
,Â Merit E. Cudkowicz
,Â James Berry
,Â Terri Thompson
,Â Ernest Fraenkel
,Â Steven Finkbeiner
,Â Leslie M. Thompson
,Â Jennifer E. Van Eyk
,Â Clive N. Svendsen
Â &Â Jeffrey D. Rothstein

Pooled Resource Open-Access ALS Clinical Trials Consortium

Alexander Sherman

ALS/MND Natural History Consortium

Christian Lunetta
,Â David Walk
,Â Ghazala Hayat
,Â James Wymer
,Â Kelly Gwathmey
,Â Nicholas Olney
,Â Senda Ajroud-Driss
,Â Terry Heiman-Patterson
,Â Ximena Arcila-Londono
,Â Alexander Sherman
,Â Kenneth Faulconer
,Â Ervin Sanani
,Â Alex Berger
Â &Â Julia Mirochnick

Contributions

D.R., K. Severson and S.G. contributed to model development and analyzed data. D.R., K. Severson, K.N. and E.F. contributed to project design. D.R. wrote the manuscript with input and revisions from all authors.

Corresponding author

Correspondence to Ernest Fraenkel.

Ethics declarations

Competing interests

K.N., K. Severson and S.G. were employed by IBM Research during this project. K. Sachs consults for Modulo Bio Inc.

Peer review

Peer review information

Nature Computational Science thanks Mamede de Carvalho, Cassie S. Mitchell and Henk-Jan Westeneng for their contribution to the peer review of this work. This article has been peer reviewed as part of Springer Natureâs Guided Open Access initiative. Primary Handling Editor: Ananya Rastogi, in collaboration with the Nature Computational Science team. Peer reviewer reports are available.

Additional information

Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.