Abstract
There is increasing use of digital tools to monitor people with psychosis and schizophrenia remotely, but using this type of data is challenging. This systematic review aimed to summarise how studies processed and analysed data collected through digital devices. In total, 203 articles collecting passive data through smartphones or wearable devices from participants with psychosis or schizophrenia were included in the review. Accelerometers were the most common device (n = 115 studies), followed by smartphones (n = 46). The most commonly derived features were sleep duration (n = 50) and time spent sedentary (n = 41). Thirty studies assessed data quality and another 69 applied data quantity thresholds. Mixed effects models were used in 21 studies and time-series and machine-learning methods were used in 18 studies. Reporting of methods to process and analyse data was inconsistent, highlighting a need to improve the standardisation of methods and reporting in this area of research.
Introduction
Psychosis is a collection of experiences that involve an individual losing some contact with reality. It can involve seeing or hearing things others cannot see or hear (hallucinations), believing things that are not shared by others (delusions), and/or confused thinking and speech1. Schizophrenia is considered a severe mental illness and is a diagnostic label given to an individual when symptoms of psychosis occur over a certain period of time, at a certain frequency, and cause significant impairment. The experience of psychosis can be costly to health services and have long-lasting effects on individuals2,3,4,5. Relapse of psychosis is common6. Although there is no universally accepted definition of relapse, it typically refers to the return or exacerbation of psychotic symptoms following a period of improvement or stability that results in a significant change in clinical management7.
Monitoring people with psychosis and schizophrenia and identifying relapse in a timely manner is challenging as contact with mental health services can be infrequent, and retrospective recall of thoughts, feelings and symptoms can lack specificity and accuracy8. With advancements in digital technologies, there is increasing focus on using sensors in digital tools such as mobile phones and wearable devices to support real-time passive monitoring of people with psychosis and schizophrenia. Digital technology encompasses a wide range of devices and applications that process, transmit, and store data. Examples include smartphones, computers, and smartwatches. For many years, research-grade devices, such as accelerometers and actigraphs (which measure movement), have been used in studies to collect physiological and behavioural data from people with psychosis remotely9. More recently, there has been a shift to using emerging internet-enabled technologies, including smartphones and wearable devices, for monitoring symptoms and behaviour, commonly referred to as either remote monitoring or ambulatory assessment10. This can be done using active symptom monitoring (ASM), for example reporting symptoms using a smartphone app, or through collecting passive sensor data such as physical activity levels and sleep data. These data could be used to spot early signs of relapse and provide opportunities for earlier intervention from services11,12.
The types of sensors used for passive data collection in smartphones and wearable devices include accelerometers, GPS sensors, environmental light and sound sensors, and photoplethysmogram (PPG) sensors. The raw data collected through these sensors can then be processed into features or variables, for example, distance travelled, resting heart rate, sleep duration or amount of sedentary behaviour. These features can then be grouped into different behaviours, for example, mobility, physical activity or sociability. This hierarchical framework is described in detail by Mohr et al.13. A glossary of technical terms can be found in the supplementary materials.
Collecting data from people continuously via technology produces huge volumes of data that can be used to identify an individual's usual behavioural patterns. However, there are challenges when working with these data. First, if people do not carry or wear devices consistently, there may be a large amount of missing data and the accuracy and quality of the data collected through the devices are limited14,15. Second, reproducibility is a concern if researchers do not have access to the raw data and rely on the devices' proprietary algorithms to generate features for analysis14,16. Finally, the amount of data can be challenging to analyse, and it is important to use methods that are appropriate for longitudinal data17.
A 2020 systematic review by Benoit et al.18 identified 51 studies using digital phenotyping in psychosis, focussing on describing the machine learning techniques used for analysis in 16 of those studies. Another review by De Angel et al.19 assessed digital monitoring tools in depression and identified key features associated with depression. Both reviews reported a large variation in the types of data and analysis methods used and highlighted inconsistencies in the reporting of methodology and the handling of missing data. However, neither of these reviews examined the methods of pre-processing raw passive data or how features and behaviours were derived. There are currently no guidelines for reporting, nor any comprehensive summaries of the methods available for pre-processing and analysis, for studies using passive data. Given the complexities and challenges of working with this type of data and the increasing use of emerging technologies12, a review of existing studies would be beneficial to researchers and aid in improving consistency and reproducibility of studies using passive data.
Therefore, this systematic review aimed to summarise how previous studies have collected, processed and analysed passive data collected through smartphones, wearable devices or research-grade devices to infer symptoms and other health-related information from people with psychosis or schizophrenia. Specific research questions were: (i) what sensors have been used in studies utilising digital data in psychosis or schizophrenia and what features have been derived?; (ii) how were the features derived from raw sensor data?; (iii) did the studies set thresholds for the amount of usable data for analysis and how were these defined?; (iv) what statistical methods have been used to analyse the data collected?
Results
Screening
Figure 1 displays the PRISMA flow diagram, including the reasons for exclusion at the full paper screening stage. A total of 15,508 records were identified through searching the four databases, including 6016 duplicates which were removed. After screening titles and abstracts of 9492 records, 8971 were excluded, with 521 remaining. Of those, there were 22 whose full reports were not available or not written in the English language; therefore, 499 had their full text screened for eligibility against the inclusion and exclusion criteria. There were 296 studies excluded at this stage, leaving 203 papers to be included in the review. The full list of papers included in the review can be found in Supplementary Table 3.
The flowchart shows the number of articles identified by the searches across the four databases and the number removed at the title and abstract screening and full text screening stages. Of the 9492 unique articles identified in the searches there were 203 included in the review. The template for the diagram was taken from the PRISMA 2020 guidelines100.
Study characteristics
Table 1 summarises the characteristics of all included studies. The median total sample size for all the studies was 60 (IQR 29–100) and the median for the schizophrenia or psychosis sample only was 36 (IQR 20–66). There were 73 studies (36.0%) that included a healthy or population-based control group and 47 studies (23.2%) that included a mixed severe mental illness (SMI) clinical sample. For those mixed sample studies, the median percentage of the sample with schizophrenia or psychosis was 57.6% (IQR 48.8–74.1%) and the most frequently included other diagnoses were bipolar disorder (n = 30), depressive disorders (n = 26), mood disorders (n = 6), anxiety disorders (n = 6) and personality disorders (n = 4). The median age of participants across all studies was 40 years (IQR 35–46) and the median percentage of females was 40.0% (IQR 27.8–50.0%) (n = 19 did not report any demographic characteristics). Only 74 studies reported the ethnicity of the participants. The median years of illness duration was 14.0 (IQR 9.7–18.0, n = 64) and the median age at onset was 24.4 years (IQR 23.6–24.6, n = 13).
Over half of the studies (n = 111, 55.0%) collected passive data for less than 7 days, 39 studies (19.3%) collected for between 8 and 28 days, 50 studies (24.8%) collected for between 29 and 265 days and two studies collected data for over 1 year. Most studies only collected data from a single period (n = 179, 88.2%), with the remaining studies repeating passive data collection multiple times. For example, Gomes et al.20 collected 7 days of passive data at baseline which was then repeated at the end of a 16-week follow-up period.
Of the 203 studies included in the review, 123 reported details of how frequently data had been sampled and the duration of each sample (e.g. Torous et al.21 collected data for 1 min, every 10 min). Most of these studies that were collecting data through accelerometers specified the epoch duration for movement counts, which ranged from 1 s22 to 2 min23. For studies collecting GPS data, the sampling frequency was up to 30 min when stationary and as little as every 10 s when moving24. Three studies25,26,27 collected GPS coordinates every 10 min or when the individual moved more than 10 m. Collection of audio data also varied, with sampling schedules ranging from every 2 min to 90 min.
Devices and sensors
In 185 studies (91.1%), a single device was used for passive data collection, 16 studies (7.9%) used 2 devices and there were 2 studies where 3 devices were used. The most frequently used type of devices were research-grade accelerometers (n = 115, 51.6%), followed by smartphones (n = 46, 20.6%), smartwatches or commercial fitness bands (n = 21, 9.4%), and pedometers (n = 12, 5.4%). Most devices only used 1 sensor for data collection (n = 136, 62.1%), the maximum number used on a single device was 6 (n = 2, 0.9%) and there were four studies that did not specify what type of sensors the devices used.
Figure 2 shows the different sensors used by smartphones, smartwatches or commercial fitness bands, accelerometers and other wearables. Overall, the accelerometer was the most frequently utilised sensor, being used by 80.9% of devices (n = 178). Accelerometer sensors were the most frequently used in research-grade accelerometer devices, smartwatches or commercial fitness bands, and other wearables. For smartphones, the most frequently used sensors were GPS (78.3%, n = 36), accelerometer (56.5%, n = 26) and phone use (47.7%, n = 21).
The circular bar plots show the types of sensors used by each device group, with the bar representing the percentage of devices in each group using that sensor. The horizontal bar plots show the number of studies that used each feature, coloured by behaviour type (phone use, sleep, location/mobility, physiology, physical activity, and circadian rhythm). Figure a shows results for the smartphone device group, b for accelerometers, c for smartwatches & commercial fitness bands and d for other wearables. Findings for the pedometer device group are not shown. TST total sleep time, EDA electrodermal activity, ECG electrocardiogram.
Features and behaviours
There were 65 features derived from sensor data that were used in at least two different studies, of which the top five most commonly derived were sleep duration or total sleep time (n = 50 studies), time spent still or sedentary (n = 41), step count (n = 41), sleep efficiency (n = 32) and mean acceleration or activity count (n = 31). Figure 2 displays the top 10 features used for smartphones, smartwatches or commercial fitness bands, accelerometers and other wearables. For smartphones, the top three most common features were all in the location and mobility category, with time spent in primary location, distance travelled and number of unique locations being used 20 (43.5% of studies using a smartphone), 19 (41.3%) and 17 (37.0%) times, respectively.
Grouping these features by behaviour type, the most frequently observed was physical activity, with 19 features, followed by sleep (13 features), phone use (10 features) and location or mobility (10 features). Physical activity and sleep features were the most frequent in studies using accelerometers, whilst for the smartwatches and other wearable devices physical activity and physiological features (e.g. heart rate) were common (see Supplementary Tables 5 and 6 for a full list of features and behaviours).
Data quality and quantity assessment
Ten studies reported technical issues with the devices, including defective or damaged devices28,29,30, malfunctioning devices31 or software32, syncing issues33, compatibility problems34 or unspecified technical problems35,36,37. The proportion of participants in the studies excluded for these reasons ranged from 0.5%29 to 14.8%37 (one study did not report the number affected33). Where studies used statistical methods (e.g. imputation) to handle missing data these are discussed in the ‘Analysis methods’ section.
Thirty studies assessed data quality and excluded data in a variety of different ways. These are summarised in Table 2. Nine studies utilising actigraphs for data collection either visually inspected the data for errors or missingness, or compared the data to a written sleep log and assessed inconsistencies. There were five studies that assessed the quality of GPS data, including removing coordinates whose accuracy was below a certain threshold (ranging from 100 feet38 to 50 m39) and excluding participants whose travelling distance was greater than a threshold (e.g. travelled >80 miles per day40). Studies using physiological data identified outliers based on what is physiologically feasible; for example, skin temperature values <20 or >40 °C25, skin conductance values <0.1 or >39.95 µS25, and heart rate values <20 or >160 beats per minute (BPM)41,42,43. Outliers in accelerometer data were identified through visual inspection44 and by applying other criteria, including values >30 m/s²45 or outside the range of the mean ± 3 standard deviations46. Of the 30 studies that assessed data quality, 13 (43.3%) reported how much data had been excluded or retained.
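The range-based and standard-deviation-based outlier rules described above can be illustrated with a minimal sketch. The heart rate limits (20 and 160 BPM) are those reported by the cited studies; the function names and the sample readings are hypothetical, and the individual studies may have implemented these checks differently.

```python
from statistics import mean, stdev

def flag_out_of_range(values, low, high):
    """Return True for each value outside the plausible range [low, high]."""
    return [v < low or v > high for v in values]

def flag_beyond_k_sd(values, k=3.0):
    """Return True for each value more than k sample SDs from the mean."""
    m = mean(values)
    sd = stdev(values)
    return [abs(v - m) > k * sd for v in values]

# Hypothetical heart rate readings in beats per minute.
heart_rate = [62, 75, 18, 90, 171, 80]
hr_flags = flag_out_of_range(heart_rate, low=20, high=160)
clean_hr = [v for v, bad in zip(heart_rate, hr_flags) if not bad]
```

Note that the mean ± 3 SD rule is sensitive to the outliers it is trying to detect, because extreme values inflate the standard deviation; this is one reason reporting the exact rule used matters for reproducibility.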
There were 69 studies that applied a threshold of data quantity, either to periods of data collected or to study participants. Eight of these studies did not specify the exact threshold used. For example, Wulff et al.23 stated days with missing data of several hours were excluded and Dennison et al.47 excluded individuals with insufficient device wear time; however, neither defined their thresholds explicitly. Table 3 summarises the thresholds applied in the remaining 61 studies. There were 26 studies, all using accelerometers, that defined both the number of hours of data for a day to be considered ‘valid’ and the number of valid days within the study period needed for a participant to be included in the analysis. The threshold for a valid day ranged from 6 h to 16 h of data, and the number of minimum valid days ranged from 2 to 10 days.
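The two-stage rule described above (a minimum number of hours for a valid day, then a minimum number of valid days per participant) might be sketched as follows. The default thresholds of 10 h and 4 days are illustrative values chosen from within the reported ranges, not a recommendation, and the function names and data are hypothetical.

```python
def valid_days(hours_by_day, min_hours=10):
    """Return the days with at least min_hours of recorded data."""
    return [day for day, hours in hours_by_day.items() if hours >= min_hours]

def include_participant(hours_by_day, min_hours=10, min_valid_days=4):
    """Two-stage rule: enough hours per day, then enough valid days."""
    return len(valid_days(hours_by_day, min_hours)) >= min_valid_days

# Hypothetical hours of accelerometer data recorded on each study day.
wear_hours = {"day1": 14.5, "day2": 9.0, "day3": 11.0, "day4": 16.0, "day5": 10.0}
kept = valid_days(wear_hours)
included = include_participant(wear_hours)
```

Making both thresholds explicit parameters means the values used can be reported alongside the number of days and participants excluded.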
Six studies using data collected through smartphones in the CrossCheck study48 applied thresholds. Four of these studies24,48,49,50 applied a quantity threshold of 19 hours of data for a day to be included, whilst the other two studies specified thresholds of either 7 days51 or 10 days52 of data for participants to be included in analysis. There were three studies that used Fitbit devices to collect data and used step count to define valid days. Two of these used a threshold of 300 steps31,53 whilst another removed days from analysis where zero steps had been recorded54.
Seventeen studies specified how non-wear time had been defined, with the majority using actigraphs (15/16). Twelve of these used an activity count of zero for a specified timeframe (between 10 and 90 minutes depending on the study) to identify non-wear time periods. Two studies stated non-wear had been flagged by device software55 or an analysis package56 and another identified non-wear as periods where the standard deviation in 2 out of the 3 axes was less than 13 mg, or the value range was less than 50 mg57. Algorithms developed by Troiano et al.58 and Choi et al.59 were cited as methods used for defining non-wear time in actigraphy data. The study by Martanto et al.60 collecting data through Fitbits used a lack of heart rate data to indicate non-wear time.
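The most common rule above, a run of zero activity counts lasting at least a specified duration, can be sketched as below. This is a simplified illustration only: it is not the Troiano or Choi algorithm, and the function name, the 60-minute default and the example counts are assumptions.

```python
def nonwear_mask(counts, epoch_minutes=1, min_zero_minutes=60):
    """Mark epochs inside a run of zero counts lasting at least
    min_zero_minutes. Returns a list of booleans (True = non-wear)."""
    min_epochs = -(-min_zero_minutes // epoch_minutes)  # ceiling division
    mask = [False] * len(counts)
    run_start = None
    # A non-zero sentinel is appended so a trailing zero run is also closed.
    for i, c in enumerate(list(counts) + [1]):
        if c == 0:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start >= min_epochs:
                for j in range(run_start, i):
                    mask[j] = True
            run_start = None
    return mask

# Hypothetical 1-minute activity counts, using a 3-minute rule for brevity.
example = nonwear_mask([5, 0, 0, 0, 2, 0, 0, 7], epoch_minutes=1, min_zero_minutes=3)
```

In this example only the three consecutive zero epochs are marked as non-wear; the later run of two zeros is too short to qualify.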
Finally, four studies collecting data through smartphones defined data quality as the quantity of recorded data as a proportion of the expected quantity according to the sampling frequency. Cohen et al.61 reported overall average passive data quality of 57.4%, Lakhtakia et al.62 reported mean GPS data quality of over 50% at each of their study sites and Henson et al.63 reported collection rates of 72% and 60% for GPS and accelerometer data, respectively.
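The definition of data quality used in these smartphone studies, recorded data as a proportion of the quantity expected from the sampling schedule, can be written as a short sketch. The function name, the sampling schedule and the sample counts are hypothetical; capping the value at 1.0 is also an assumption, made here for devices that occasionally record more often than scheduled.

```python
def completeness(n_recorded, duration_hours, samples_per_hour):
    """Recorded samples as a proportion of those expected from the schedule."""
    expected = duration_hours * samples_per_hour
    if expected <= 0:
        raise ValueError("expected number of samples must be positive")
    return min(n_recorded / expected, 1.0)

# Hypothetical example: one sample every 10 min (6 per hour) over 24 h,
# with 83 samples actually recorded out of the 144 expected.
daily_quality = completeness(n_recorded=83, duration_hours=24, samples_per_hour=6)
```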
Data processing methods
For processing the raw data, there were 39 actigraphy studies that stated the software and version used. ActiLife (Actigraph) was the most frequently used software (n = 19), followed by Actiwatch (Cambridge Neurotechnology, n = 13) and Actiware (Philips, n = 7). For studies using other wearables, there were three studies that explicitly stated they had used the Fitbit algorithms to derive features, five that had used the SenseWear software and one that had used the Empatica algorithms. Some studies collecting data through smartphones referred to both the Google Activity Recognition API (n = 4) and Android proprietary algorithms for location and activity features (n = 3).
Several other methods were cited as being used, including papers by Barnett et al.64 for deriving location and mobility features from GPS data with missingness, Brond et al.65 and Bai et al.66 for processing accelerometer data, and Menghini et al.67 and Cole et al.56,68 for sleep features. The DBSCAN and Haversine algorithms were both used multiple times for location clustering and calculating the distance between locations, respectively. Open-source software was also utilised, including Kubios and Ledalab software for analysing HRV data, the Cortex platform for processing smartphone data and DPSleep69 for sleep data. Three R packages were cited, which were rVAD70 for voice detection, GGIR56,71 for calculating sleep and activity features and the nparACT72 package for deriving circadian rhythm features.
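For readers unfamiliar with the Haversine formula mentioned above, a minimal sketch of how it gives the great-circle distance between two GPS fixes, and how a distance-travelled feature could be built from it, is shown below. The function names and the Earth radius constant are choices made for this illustration; the reviewed studies may have used different implementations or applied filtering first.

```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; an approximation

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two points given in degrees."""
    phi1, phi2 = radians(lat1), radians(lat2)
    dphi = phi2 - phi1
    dlambda = radians(lon2 - lon1)
    a = sin(dphi / 2) ** 2 + cos(phi1) * cos(phi2) * sin(dlambda / 2) ** 2
    # min() guards against rounding pushing sqrt(a) fractionally above 1.
    return 2 * EARTH_RADIUS_KM * asin(min(1.0, sqrt(a)))

def distance_travelled_km(points):
    """Sum distances between consecutive (lat, lon) fixes."""
    return sum(haversine_km(*p, *q) for p, q in zip(points, points[1:]))
```

Summing distances between consecutive fixes will overstate travel when GPS noise is present, which is one reason accuracy filtering and location clustering (for example with DBSCAN) are often applied before features are derived.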
Analysis methods
As well as applying thresholds for data usability, as described above, there were several additional approaches used to handle missing data. Eleven studies imputed missing data, replacing the missing values with either the average value23,52,73,74,75,76,77 for the recording period or the nearest value that was recorded52,53,78,79. For example, one study recorded environmental light using an actigraph, and if a 2-minute epoch was missing a light value then it was substituted with the value from the closest recorded epoch79. Other methods used to impute missing data were a regularised iterative principal components algorithm80 and multiple imputation by chained equations (MICE)81, where the missing value is estimated multiple times by a model using the available data. Three studies24,82,83 used the amount of missingness as a feature in their analysis (e.g. number of days or minutes of missing data). Liebenthal et al.82 found a significant association between the number of days where phone data was missing (<60 mins recorded) and the PANSS84 P2 domain (conceptual disorganisation), but the other two24,83 did not report results specific to the missingness features. An additional study used a logistic regression model with missing data as the outcome and baseline variables as possible predictors of missingness, then used any variables found to be predictive in their mixed-effects model85. Reinertsen et al.42 applied an algorithm classifying participants with schizophrenia from healthy controls using contiguous sliding windows of daily heart rate and accelerometer data. If the data for a given day was missing (<50 data points), no features were derived, and no prediction was made for that day. The 2020 study by Adler et al.52 categorised missing data as either type 1 (where data from one sensor was missing but other data had been collected) or type 2 (where data from all sensors was missing for the same period). Type 1 missing data was replaced with zeroes whilst type 2 data was substituted with the average for that hour, unless it was location data, in which case the last recorded location was used. Only three studies52,63,86 reported the amount of data that was missing, which ranged from 19%86 to 72%63.
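The two simplest imputation strategies described above, substituting the average of the recording period or the nearest recorded value, might look like the following sketch. Missing epochs are represented as None; the function names, the tie-breaking rule (the earlier epoch wins) and the light readings are assumptions made for illustration.

```python
from statistics import mean

def impute_mean(values):
    """Replace None with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    if not observed:
        return list(values)  # nothing to impute from
    fill = mean(observed)
    return [fill if v is None else v for v in values]

def impute_nearest(values):
    """Replace None with the closest observed value in time
    (the earlier epoch wins when two are equally close)."""
    observed_idx = [i for i, v in enumerate(values) if v is not None]
    if not observed_idx:
        return list(values)
    out = []
    for i, v in enumerate(values):
        if v is None:
            nearest = min(observed_idx, key=lambda j: (abs(j - i), j))
            v = values[nearest]
        out.append(v)
    return out

# Hypothetical light readings per epoch, with three missing epochs.
light = [120, None, None, 80, None]
by_mean = impute_mean(light)
by_nearest = impute_nearest(light)
```

Both approaches ignore uncertainty in the imputed values, which is part of the motivation for model-based methods such as MICE.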
Most of the reviewed studies (n = 142) either did not specify how they had utilised the repeated measurements or had calculated a single value (sum or average) of the features across the whole of their data collection period. For studies that utilised longitudinal information, the most frequently used methods were mixed effects models, where random effects were used to distinguish within-person and between-person variation. In the 21 studies that used mixed effects models some used daily summaries of variables in their model, whilst others divided up the follow-up period into multiple time windows and aggregated features over these windows. For example, Kalisperakis et al.78 initially calculated daily features and then used monthly summaries (mean and standard deviation) of those features in the mixed effects models. There were eight studies that used time series modelling methods, including generalised estimating equations87, periodogram analysis23, partial autocorrelation functions88, graph/network algorithms43,76 and anomaly detection methods51,61. Other studies used more comprehensive machine learning techniques which can be applied to more granular data than other methods such as mixed effects models. Examples include neural networks83,89,90, support vector machines41,42 and an unsupervised clustering algorithm91.
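As an illustration of how a mixed effects model separates within-person from between-person variation, a simple random-intercept model for repeated measurements could be written as follows. The notation is introduced here for illustration and is not taken from any of the reviewed studies, which may have used more complex specifications (for example random slopes or non-Gaussian outcomes).

```latex
y_{ij} = \beta_0 + \beta_1 x_{ij} + u_i + \varepsilon_{ij}, \qquad
u_i \sim \mathcal{N}(0, \sigma_u^2), \quad
\varepsilon_{ij} \sim \mathcal{N}(0, \sigma_\varepsilon^2)
```

Here $y_{ij}$ is the outcome for participant $i$ at time window $j$ (e.g. a day or a month), $x_{ij}$ is a passive feature summarised over that window, $u_i$ is a participant-specific deviation capturing between-person variation with variance $\sigma_u^2$, and $\varepsilon_{ij}$ is the residual capturing within-person variation with variance $\sigma_\varepsilon^2$.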
Study aims and outcomes
Most of the studies (116 out of 203) included in the review assessed the correlation or association between passive data and either clinical assessment scores, diagnostic status, medication adherence or active symptom monitoring data. There were 39 studies using passive monitoring to assess the effectiveness of a medication or other intervention (e.g. an exercise programme intervention), 16 developing or validating methods, tools or devices for use in passive monitoring, and 15 assessing the feasibility or acceptability of an intervention and/or of passive monitoring.
There were 20 studies that developed models either predicting clinical outcomes or classifying participant groups; these are summarised in Table 4 and discussed further in the following section. Seven of these studies aimed to distinguish people with schizophrenia from healthy controls or from people with other serious mental illnesses including mood disorder and major depressive disorder. Five studies used passive data to predict EMA/ASM scores and there were six studies aiming to identify or predict behaviour anomalies in the period near to a relapse. Two studies developed models to predict the risk of relapse within the following week. An additional study92 initially aimed to identify signatures of relapse, however, there were not enough occurrences of relapse during their study period so instead assessed the feasibility of passive data collection.
Prognostic factor and prognostic model studies
The majority of prognostic studies collected data through smartphones (12 out of 20), with the others using smartwatches (n = 2), actigraphy devices (n = 4) and wearable adhesive patches (n = 3). All seven studies aiming to classify people with psychosis or schizophrenia from other groups used physical activity metrics amongst others. For the 12 studies collecting data through smartphones, the most frequent behaviours used were location/mobility and phone use, whilst for other devices the most frequent behaviours were physical activity, physiology and sleep. Most used machine learning methods for their analysis, including random forests, anomaly detection algorithms, neural networks and support vector machines. Non-machine learning methods included generalised estimating equations and mixed effects regression. Where appropriate, the risk of bias for each study was assessed using either the PROBAST or QUIPS tools. Most studies were deemed to have a moderate or high risk of bias in relation to the measurement of confounders and description of the study sample. The key findings from each study are displayed in Supplementary Table 7, and the quality assessment can be found in Supplementary Tables 8 and 9.
Discussion
The use of smartphones for passive data collection is increasing, with smartwatches and commercial bands less commonly used (see Supplementary Fig. 1). Research-grade accelerometers, including actigraphy devices, were the most frequently used devices. The most frequently measured features were those related to physical activity and sleep, however, this did vary by device type. Most of the studies did not report if they had assessed the data quality or applied a data quantity threshold, and of those that did there were only a handful that reported the amount of data that was excluded or available for analysis. Some studies, particularly those using actigraphy devices, relied on the features derived by the device algorithms whilst others did not state whether they had access to raw data or did not sufficiently describe the methods used to pre-process the data.
Few studies reported the amount of missing data or whether statistical methods had been used (e.g. imputation) to handle missingness. For passive data, the level of missingness can be significant due to potential technical issues with devices, problems transferring data, or participants not wearing devices consistently. Whilst some studies did apply thresholds based on the quantity of data collected, the disadvantage of this is that it could lead to a reduction in sample size and removal of some data that still could be useful despite high levels of missingness. There was also little justification for why certain thresholds (e.g. 10 h of data per day) had been chosen. Some studies had used methods to impute missing data (e.g. imputing by average or multiple imputation) that are commonly used in statistical modelling; however, it is not clear whether these methods are suitable for this type of high frequency data and there is currently no guidance for researchers on what is the best approach to use. A method developed by Barnett et al.64, which was used in a few studies in the review, derives features from GPS data where missingness is present. Given the variation in types of data collected through passive sensors, it may be that separate methods are needed for each type. However, this adds an additional complexity to the processing of the data. Another approach used in three studies was to use the amount of missing data as a feature in itself. The study by Liebenthal et al.82 reported an association between the number of days of missing phone data and PANSS scores for conceptual disorganisation, indicating that the absence or presence of passive data could potentially be used as a marker of an individual's health.
Most studies used a single value (e.g. mean or total) to summarise repeated observations over the follow-up period, therefore, not utilising the amount of data to its potential and not capturing the longitudinal patterns of behaviour. Generalised linear mixed-effects models (GLMMs) were used to account for repeated measurements in a few studies and machine learning methods are being increasingly used. GLMMs can model the variation in observations within each individual and between individuals, utilising more of the data collected to model a pattern of behaviour. However, the studies using these models still summarised variables over a period (either daily or weekly). Whilst machine learning methods, such as neural networks and LSTMs, are capable of utilising the granular, high-frequency data, they do usually require large sample sizes for training the models to avoid overfitting (where a model is not generalisable and underperforms when applied to new data). Most of the studies in this review were in relatively small samples (median 60). There is guidance for sample size calculations when developing clinical prediction models, however, there are not any specifically for machine learning algorithms. The other potential drawback of using machine learning models is they can be harder to interpret and explain. This is important for researchers who are interested in identifying the specific passive features or behaviours that are associated with an outcome, e.g. relapse. Some machine learning models (e.g. random forests) are more explainable than others and there are methods to evaluate the importance of features in the models.
The lack of reporting on methodology found in the articles included in this review is consistent with similar reviews by Benoit et al.18, De Angel et al.19 and Rohani et al.93. There were some open-source methods and software used in the studies, for example the GGIR package, but few instances of researchers making their analysis code available. The inconsistencies in how passive data is collected and processed make it difficult to compare findings from any studies or reproduce them.
There are some limitations to our review. Due to the differences in study design in the included articles we did not assess the quality of all studies, only those that were assessing potential prognostic factors or developing prediction models. As there was significant variation in the passive data features used, outcomes assessed, and modelling strategies used we did not attempt to perform meta-analysis with any of the studies. We included studies that collected passive data from people with psychosis and schizophrenia. Passive data collection through wearable devices and smartphone sensors are being used in a range of clinical areas, such as cardiovascular disease94 and rheumatoid arthritis95. By limiting our search to mental health, we may have missed some studies that are using relevant methods. Similarly, there may be newer methods proposed that have not yet been applied to any particular clinical setting and will also have been missed. We also did not check whether included studies had published or pre-registered their protocols, which may have included more detail of their data processing and analysis plans.
Some of the articles included in the review were using data collected in the same study, for example, the data from the CrossCheck48 project was used in at least 9 studies and there were 3 studies using an open source dataset called Psykose96. They were included as separate studies in the review as they did not necessarily use the same methods or the same features in their analyses. This does mean, however, that our estimates of how frequently some devices are being used in these types of study are overestimated. Some of the studies using the CrossCheck data appeared to be using the raw data, whilst others were using the freely available pre-processed version consisting of hourly and daily summaries. Whilst the publishing of open-source datasets is, in general, a positive step forward in terms of open science, it is still important that researchers using this data are provided with sufficient information about how the data were pre-processed. It is also important that the methods in studies that have re-used data make it clear whether or not the raw data has been used, whether any additional exclusion criteria have been applied, or if there has been any additional processing, feature extraction or feature selection performed. Additionally, it should be clearly stated which other published papers have used the same dataset.
To increase the reproducibility and validity of studies in this area of research, there is a need to significantly improve the reporting of methodologies used, including specifics about the type and models of devices used, the pre-processing and any quality assessment of data, and the handling of missing data. Additionally, to improve the consistency of passive data analysis, the availability and use of open-source or standardised methods should be encouraged. This is particularly important given the range of devices currently available for remote monitoring and the emergence of new technologies, such as smart rings97. These devices and the data they collect could enable remote monitoring across a range of health conditions. However, for this to be successful, the way these data are currently being used needs to improve. As well as the methodologies, other aspects of these types of study, for example, the occurrence of adverse events, have also been found to be under-reported98,99. Although creating full guidelines for standardised analysis and reporting of passive sensing data would require input from experts from across the field (e.g. through a Delphi study) and is outside the scope of this paper, Table 4 provides a brief checklist of reporting recommendations based on the findings of the current systematic review.
To conclude, the collection of passive data from people with psychosis and schizophrenia using wearable devices is increasing. However, the reporting of methods used to process and analyse the data is inconsistent, making reproducible research difficult. There is a need, therefore, to improve the standardisation of methods and reporting in this area of research.
Methods
The review protocol was registered prospectively on PROSPERO (CRD 469868). Reporting for the review is in line with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement100 (see Supplementary Table 4 for the completed checklist).
Literature search and study selection
Four databases (Embase, PsycINFO, Medline through Ovid and CINAHL through EBSCO) were searched to identify articles for inclusion in the review. The searches were first performed on 13th June 2023 and then updated on 24th April 2024. The search strategy combined terms relating to psychosis and schizophrenia with terms relating either to smartphones and wearable devices or to actigraphy and accelerometers. The search containing terms relating to smartphones and wearables was limited to articles published from 2007 onwards, as that is the point when smartphones and commercial wearables became available. The search containing terms related to accelerometers and actigraphs was not limited by year of publication. The full search strategy, including the exact terms used, can be seen in Supplementary Tables 1 and 2.
Articles were eligible for inclusion if they were published, peer-reviewed articles collecting any type of passive data through smartphones or wearable devices. We defined wearable devices as any device worn on any part of the body, including research-grade devices (such as accelerometers, pedometers and actigraphs) and commercial devices (such as smartwatches and fitness trackers). We defined passive data as data that had been collected for a period of at least 24 h in uncontrolled settings and when participants had not been given a specific activity to perform. For example, a study where participants in a laboratory wore an accelerometer for 1 h whilst performing specific tasks was not considered to have collected passive data. Included studies must have had at least 50% of participants with psychosis or schizophrenia in their clinical sample or, if lower than 50%, must have reported results separately for the schizophrenia and psychosis group. Studies were excluded if they used smartphones to collect active symptom monitoring data only, had collected passive data using sensors in a non-wearable device, were not written in English, were non-empirical studies (literature reviews, commentary or opinion pieces, and editorials), were qualitative studies, were published as a pre-print only or if the full text could not be accessed.
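The quantitative parts of these criteria can be expressed as a simple screening rule. The following Python sketch is illustrative only and was not part of the review workflow: the `StudyRecord` fields and function names are hypothetical, and only the device, passive-data and sample-composition criteria are encoded (language, publication type and full-text availability are omitted).

```python
from dataclasses import dataclass

MIN_PASSIVE_HOURS = 24         # passive data: collected for at least 24 h
MIN_TARGET_PROPORTION = 0.5    # at least 50% of the clinical sample


@dataclass
class StudyRecord:
    """Hypothetical screening record; field names are illustrative only."""
    collection_hours: float            # duration of sensor data collection
    uncontrolled_setting: bool         # collected outside a controlled setting
    task_prescribed: bool              # participants given a specific activity
    device_is_phone_or_wearable: bool  # smartphone or body-worn device
    proportion_psychosis: float        # share of sample with psychosis/schizophrenia
    reports_subgroup_results: bool     # results reported separately for that group


def is_passive(study: StudyRecord) -> bool:
    """Apply the definition of passive data given in the text."""
    return (
        study.collection_hours >= MIN_PASSIVE_HOURS
        and study.uncontrolled_setting
        and not study.task_prescribed
    )


def meets_sample_criterion(study: StudyRecord) -> bool:
    """At least 50% target population, or separate subgroup results."""
    return (
        study.proportion_psychosis >= MIN_TARGET_PROPORTION
        or study.reports_subgroup_results
    )


def is_eligible(study: StudyRecord) -> bool:
    """Combine the device, passive-data and sample criteria."""
    return (
        study.device_is_phone_or_wearable
        and is_passive(study)
        and meets_sample_criterion(study)
    )
```

Under this sketch, the laboratory example above (1 h of task-based accelerometer wear) fails `is_passive`, whereas a week of free-living wear in a sample with 60% of participants with schizophrenia passes all three checks.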
All identified articles were exported to EndNote Desktop (version 20.4.1). Duplicates were removed, and all abstracts were screened by one reviewer (SBl) against the inclusion and exclusion criteria. The remaining full-text articles were screened by one reviewer (SBl), with 20% second screened by one of three other reviewers (SBu, EE and SF), and disagreements were discussed to reach a consensus (inter-rater reliability coefficient 0.81). The screening was managed using a Microsoft Excel proforma, with reasons for inclusion and exclusion during the full article screening recorded.
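The text does not state which inter-rater reliability coefficient was calculated. As an illustration only, the sketch below assumes Cohen's kappa, a common choice when two raters make include/exclude decisions on the same set of articles; the function name and the example labels are hypothetical and do not reproduce the review's screening data.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement
    and p_e is the agreement expected by chance from each rater's
    marginal label frequencies.
    """
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("Both raters must label the same non-empty set of items")

    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    freq_a = Counter(rater_a)
    freq_b = Counter(rater_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)

    if expected == 1.0:
        # Both raters used one identical label throughout; agreement is total.
        return 1.0
    return (observed - expected) / (1 - expected)


# Hypothetical decisions for four double-screened articles.
first_reviewer = ["include", "include", "exclude", "exclude"]
second_reviewer = ["include", "exclude", "exclude", "exclude"]
kappa = cohens_kappa(first_reviewer, second_reviewer)
```

Unlike raw percentage agreement, kappa discounts the agreement that two reviewers would reach by chance, which matters when most articles fall into one decision category.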
Data extraction and synthesis
A data extraction form was piloted with five studies before being revised by reviewers SBl, SBu, SF and EE. Data extraction for each study was carried out by either SBl, SF, EE or AO. Data extracted for each study included: (i) study sample characteristics, (ii) data collection, including types of devices and sensors utilised, (iii) features and behaviours derived from the passive data, (iv) methods used to pre-process and analyse the data, including whether data quality was assessed, (v) outcome of interest in the study and (vi) analysis methods used, including methods to handle missing data and repeated/longitudinal measurements.
Following data extraction, devices were categorised into the following groups: (i) smartphones; (ii) smartwatches and commercial fitness bands; (iii) research-grade accelerometers (including actigraphy devices); (iv) pedometers; and (v) other wearables (including research-grade wrist/arm bands, heart rate monitors and adhesive patch sensors). For each device category, the types of sensors used were summarised (e.g. GPS sensor), as well as the features or variables derived from the raw sensor data and the behaviours that can be inferred from the features. Features were grouped into the following behaviour types: (i) physical activity, (ii) sleep, (iii) phone use13, (iv) location or mobility, (v) environment, (vi) sociability, (vii) physiology, and (viii) circadian rhythm.
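The device grouping described above amounts to a lookup from the device type recorded at extraction to one of five categories, followed by a tally. A minimal Python sketch is given below; the free-text keys in the lookup table and the function names are hypothetical, and in the review itself categorisation was carried out by the reviewers rather than by code.

```python
from collections import Counter
from enum import Enum


class DeviceCategory(Enum):
    SMARTPHONE = "smartphone"
    SMARTWATCH_OR_FITNESS_BAND = "smartwatch or commercial fitness band"
    RESEARCH_ACCELEROMETER = "research-grade accelerometer"
    PEDOMETER = "pedometer"
    OTHER_WEARABLE = "other wearable"


# Hypothetical mapping from extracted device type to review category.
DEVICE_TYPE_TO_CATEGORY = {
    "smartphone": DeviceCategory.SMARTPHONE,
    "smartwatch": DeviceCategory.SMARTWATCH_OR_FITNESS_BAND,
    "fitness band": DeviceCategory.SMARTWATCH_OR_FITNESS_BAND,
    "accelerometer": DeviceCategory.RESEARCH_ACCELEROMETER,
    "actigraph": DeviceCategory.RESEARCH_ACCELEROMETER,
    "pedometer": DeviceCategory.PEDOMETER,
    "research-grade wristband": DeviceCategory.OTHER_WEARABLE,
    "heart rate monitor": DeviceCategory.OTHER_WEARABLE,
    "adhesive patch sensor": DeviceCategory.OTHER_WEARABLE,
}


def categorise_device(device_type: str) -> DeviceCategory:
    """Map an extracted device type to a category, ignoring case and padding."""
    key = device_type.strip().lower()
    if key not in DEVICE_TYPE_TO_CATEGORY:
        # Unknown devices are flagged for manual review rather than guessed.
        raise ValueError(f"Unrecognised device type: {device_type!r}")
    return DEVICE_TYPE_TO_CATEGORY[key]


def count_by_category(device_types) -> Counter:
    """Tally how many extracted devices fall into each category."""
    return Counter(categorise_device(d) for d in device_types)
```

Raising an error for unrecognised device types, rather than defaulting them to "other wearable", is a deliberate choice in this sketch so that ambiguous devices are resolved by a reviewer.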
Quality assessment
For longitudinal observational studies conducting prediction model101 or prognostic factor102 research, additional information was extracted, including methods of variable selection, modelling and validation. These studies were also assessed for quality using either the Quality In Prognosis Studies (QUIPS) tool103 or the Prediction model Risk Of Bias Assessment Tool (PROBAST)104 for prognostic factor and prediction model studies, respectively.
Data availability
The datasets used and/or analysed during the current study are available from the corresponding author upon reasonable request.
References
NHS. Psychosis. https://www.nhs.uk/mental-health/conditions/psychosis/overview/ (2023).
Almond, S., Knapp, M., Francois, C., Toumi, M. & Brugha, T. Relapse in schizophrenia: costs, clinical outcomes and quality of life. Br. J. Psychiatry 184, 346–351 (2004).
Lin, C., Zhang, X. & Jin, H. The societal cost of schizophrenia: an updated systematic review of cost-of-illness studies. PharmacoEconomics 41, 139–153 (2023).
Alvarez-Jimenez, M. et al. Risk factors for relapse following treatment for first episode psychosis: a systematic review and meta-analysis of longitudinal studies. Schizophr. Res. 139, 116–128 (2012).
Ascher-Svanum, H. et al. The cost of relapse and the predictors of relapse in the treatment of schizophrenia. BMC Psychiatry 10, 7 (2010).
Phahladira, L. et al. Early recovery in the first 24 months of treatment in first-episode schizophrenia-spectrum disorders. npj Schizophrenia https://doi.org/10.1038/s41537-019-0091-y (2020).
Gleeson, J. F. et al. Systematic review of early warning signs of relapse and behavioural antecedents of symptom worsening in people living with schizophrenia spectrum disorders. Clin. Psychol. Rev. https://doi.org/10.1016/j.cpr.2023.102357 (2024).
Lewis, S. et al. Smartphone-enhanced symptom management in psychosis: open, randomized controlled trial. J. Med. Internet Res. 22, e17019 (2020).
Wee, Z. Y. et al. Actigraphy studies and clinical and biobehavioural correlates in schizophrenia: a systematic review. J. Neural Transmiss. 126, 531–558 (2019).
Trull, T. J. & Ebner-Priemer, U. Ambulatory assessment. Annu. Rev. Clin. Psychol. 9, 151–176 (2013).
Bucci, S., Schwannauer, M. & Berry, N. The digital revolution and its impact on mental health care. Psychol. Psychother. 92, 277–297 (2019).
Torous, J. et al. The growing field of digital psychiatry: current evidence and the future of apps, social media, chatbots, and virtual reality. World Psychiatry 20, 318–335 (2021).
Mohr, D. C., Zhang, M. & Schueller, S. M. Personal sensing: understanding mental health using ubiquitous sensors and machine learning. Annu. Rev. Clin. Psychol. 13, 23–47 (2017).
Renn, B. N., Pratap, A., Atkins, D. C., Mooney, S. D. & Areán, P. A. Smartphone-based passive assessment of mobility in depression: challenges and opportunities. Ment. Health Phys. Act. 14, 136–139 (2018).
Kiang, M. V. et al. Sociodemographic characteristics of missing data in digital phenotyping. Sci. Rep. https://doi.org/10.1038/s41598-021-94516-7 (2021).
Dixon, W. G. et al. Charting a course for smartphones and wearables to transform population health research. J. Med. Internet Res. 25, e42449 (2023).
Barnett, I., Torous, J., Staples, P., Keshavan, M. & Onnela, J.-P. Beyond smartphones and sensors: choosing appropriate statistical methods for the analysis of longitudinal data. J. Am. Med. Inform. Assoc. 25, 1669–1674 (2018).
Benoit, J., Onyeaka, H., Keshavan, M. & Torous, J. Systematic review of digital phenotyping and machine learning in psychosis spectrum illnesses. Harvard Rev. Psychiatry https://doi.org/10.1097/HRP.0000000000000268 (2020).
De Angel, V. et al. Digital health tools for the passive monitoring of depression: a systematic review of methods. npj Digital Med. https://doi.org/10.1038/s41746-021-00548-8 (2022).
Gomes, E. et al. Effects of a group physical activity program on physical fitness and quality of life in individuals with schizophrenia. Ment. Health Phys. Act. 7, 155–162 (2014).
Torous, J. et al. Characterizing the clinical relevance of digital phenotyping data quality with applications to a cohort with schizophrenia. npj Digital Med. 1, 15 (2018).
Afonso, P., Brissos, S., Figueira, M. L. & Paiva, T. Schizophrenia patients with predominantly positive symptoms have more disturbed sleep-wake cycles measured by actigraphy. Psychiatry Res. 189, 62–66 (2011).
Wulff, K., Dijk, D. J., Middleton, B., Foster, R. G. & Joyce, E. M. Sleep and circadian rhythm disruption in schizophrenia. Br. J. Psychiatry 200, 308–316 (2012).
Adler, D. A., Wang, F., Mohr, D. C. & Choudhury, T. Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies. PLoS ONE 17, e0266516 (2022).
Raugh, I. M. et al. Digital phenotyping adherence, feasibility, and tolerability in outpatients with schizophrenia. J. Psychiatr. Res. 138, 436–443 (2021).
Narkhede, S. M. et al. Machine learning identifies digital phenotyping measures most relevant to negative symptoms in psychotic disorders: implications for clinical trials. Schizophr. Bull. 48, 425–436 (2022).
Raugh, I. M. et al. Geolocation as a digital phenotyping measure of negative symptoms and functional outcome. Schizophr. Bull. https://doi.org/10.1093/schbul/sbaa121 (2020).
Fang, S.-H. et al. Associations between sleep quality and inflammatory markers in patients with schizophrenia. Psychiatry Res. 246, 154–160 (2016).
Deenik, J. et al. Changes in physical and psychiatric health after a multidisciplinary lifestyle enhancing treatment for inpatients with severe mental illness: The MULTI study I. Schizophr. Res. 204, 360–367 (2019).
Deenik, J., Tenback, D. E., Tak, E. C. P. M., Hendriksen, I. J. M. & van Harten, P. N. Improved psychosocial functioning and quality of life in inpatients with severe mental illness receiving a multidisciplinary lifestyle enhancing treatment. The MULTI study II. Ment. Health Phys. Act. 15, 145–152 (2018).
Browne, J. et al. Targeting physical health in schizophrenia: results from the Physical Activity Can Enhance Life (PACE-Life) 24-week open trial. Ment. Health Phys. Act. 20, 100393 (2021).
Mow, J. L. et al. Smartphone-based mobility metrics capture daily social motivation and behavior in schizophrenia. Schizophr. Res. 250, 13–21 (2022).
Thonon, B., Levaux, M.-N., van Aubel, E. & Laroi, F. A group intervention for motivational deficits: preliminary investigation of a blended care approach using ambulatory assessment. Behav. Modif. 46, 1167–1197 (2022).
Jongs, N. et al. A framework for assessing neuropsychiatric phenotypes by using smartphone-based location data. Transl. Psychiatry 10, 211 (2020).
Fowler, J. C. et al. Hummingbird study: results from an exploratory trial assessing the performance and acceptance of a digital medicine system in adults with schizophrenia, schizoaffective disorder, or first-episode psychosis. Neuropsychiatr. Dis. Treat. 17, 483–492 (2021).
Bueno-Antequera, J., Oviedo-Caro, M. A. & Munguia-Izquierdo, D. Ideal cardiovascular health and its association with sedentary behaviour and fitness in psychiatric patients. The PsychiActive project. Nutr., Metab. Cardiovasc. Dis. 28, 900–908 (2018).
Shamir, E. et al. Melatonin improves sleep quality of patients with chronic schizophrenia. J. Clin. Psychiatry 61, 373–377 (2000).
Depp, C. A. et al. GPS mobility as a digital biomarker of negative symptoms in schizophrenia: a case control study. npj Digital Med. 2, 108 (2019).
Henson, P., Pearson, J. F., Keshavan, M. & Torous, J. Impact of dynamic greenspace exposure on symptomatology in individuals with schizophrenia. PLoS ONE 15, e0238498 (2020).
Parrish, E. M. et al. Emotional determinants of life-space through GPS and ecological momentary assessment in schizophrenia: What gets people out of the house? Schizophr. Res. 224, 67–73 (2020).
Osipov, M., Behzadi, Y., Kane, J. M., Petrides, G. & Clifford, G. D. Objective identification and analysis of physiological and behavioral signs of schizophrenia. J. Ment. Health 24, 276–282 (2015).
Reinertsen, E. et al. Continuous assessment of schizophrenia using heart rate and accelerometer data. Physiol. Meas. 38, 1456–1471 (2017).
Reinertsen, E., Shashikumar, S. P., Shah, A. J., Nemati, S. & Clifford, G. D. Multiscale network dynamics between heart rate and locomotor activity are altered in schizophrenia. Physiol. Meas. 39, 115001 (2018).
Bengtsson, J., Olsson, E., Igelstrom, H., Persson, J. & Boden, R. Ambulatory heart rate variability in schizophrenia or depression: impact of anticholinergic burden and other factors. J. Clin. Psychopharmacol. 41, 121–128 (2021).
Strauss, G. P. et al. Validation of accelerometry as a digital phenotyping measure of negative symptoms in schizophrenia. Schizophrenia (Heidelb., Ger.) 8, 37 (2022).
Mayeli, A. et al. Shared and distinct abnormalities in sleep-wake patterns and their relationship with the negative symptoms of Schizophrenia Spectrum Disorder patients. Mol. Psychiatry https://doi.org/10.1038/s41380-023-02050-x (2023).
Dennison, C. A. et al. Association of genetic liability for psychiatric disorders with accelerometer-assessed physical activity in the UK Biobank. PLoS ONE 16, e0249189 (2021).
Ben-Zeev, D. et al. CrossCheck: integrating self-report, behavioral sensing, and smartphone use to identify digital indicators of psychotic relapse. Psychiatr. Rehabilit. J. 40, 266–275 (2017).
Buck, B. et al. Capturing behavioral indicators of persecutory ideation using mobile technology. J. Psychiatr. Res. 116, 112–117 (2019).
He-Yueya, J. et al. Assessing the relationship between routine and schizophrenia symptoms with passively sensed measures of behavioral stability. npj Schizophrenia 6, 35 (2020).
Barnett, I. et al. Relapse prediction in schizophrenia through digital phenotyping: a pilot study. Neuropsychopharmacology 43, 1660–1666 (2018).
Adler, D. A. et al. Predicting early warning signs of psychotic relapse from passive sensing data: an approach using encoder-decoder neural networks. JMIR mHealth uHealth 8, e19962 (2020).
Browne, J. et al. Virtual group-based walking intervention for persons with schizophrenia: a pilot randomized controlled trial. Ment. Health Phys. Act. 24, 100515 (2023).
Diamond, R. et al. The physical activity profiles of patients with persecutory delusions. Ment. Health Phys. Act. 23, 100462 (2022).
von Kanel, S. et al. Measuring catatonia motor behavior with objective instrumentation. Front. Psychiatry 13, 880747 (2022).
Zarbo, C. et al. Ecological monitoring of physical activity, emotions and daily life activities in schizophrenia: the DiAPAson study. BMJ Mental Health https://doi.org/10.1136/bmjment-2023-300836 (2023).
Holt, R. I. G. et al. Structured lifestyle education for people with schizophrenia, schizoaffective disorder and first-episode psychosis (STEPWISE): randomised controlled trial. Br. J. Psychiatry 214, 63–73 (2019).
Troiano, R. P. et al. Physical activity in the United States measured by accelerometer. Med. Sci. Sports Exerc. 40, 181–188 (2008).
Choi, L., Liu, Z., Matthews, C. E. & Buchowski, M. S. Validation of accelerometer wear and nonwear time classification algorithm. Med. Sci. Sports Exerc. 43, 357–364 (2011).
Martanto, W. et al. Association between wrist wearable digital markers and clinical status in Schizophrenia. Gen. Hosp. Psychiatry 70, 134–136 (2021).
Cohen, A. et al. Relapse prediction in schizophrenia with smartphone digital phenotyping during COVID-19: a prospective, three-site, two-country, longitudinal study. Schizophrenia 9, 6 (2023).
Lakhtakia, T. et al. Smartphone digital phenotyping, surveys, and cognitive assessments for global mental health: Initial data and clinical correlations from an international first episode psychosis study. Digital Health 8, 1–18 (2022).
Henson, P., Barnett, I., Keshavan, M. & Torous, J. Towards clinically actionable digital phenotyping targets in schizophrenia. npj Schizophrenia 6, 13 (2020).
Barnett, I. & Onnela, J. P. Inferring mobility measures from GPS traces with missing data. Biostatistics 21, e98–e112 (2020).
Brønd, J. C., Andersen, L. B. & Arvidsson, D. Generating ActiGraph counts from raw acceleration recorded by an alternative monitor. Med. Sci. Sports Exerc. 49, 2351–2360 (2017).
Bai, J. et al. An activity index for raw accelerometry data and its comparison with other activity metrics. PLoS ONE 11, e0160644 (2016).
Menghini, L., Cellini, N., Goldstone, A., Baker, F. C. & De Zambotti, M. A standardized framework for testing the performance of sleep-tracking technology: step-by-step guidelines and open-source code. Sleep https://doi.org/10.1093/sleep/zsaa170 (2021).
Cole, R. J., Kripke, D. F., Gruen, W., Mullaney, D. J. & Gillin, J. C. Automatic sleep/wake identification from wrist activity. Sleep 15, 461–469 (1992).
Rahimi-Eichi, H. et al. Open-source Longitudinal Sleep Analysis From Accelerometer Data (DPSleep): algorithm development and validation. JMIR mHealth uHealth 9, e29849 (2021).
Tan, Z.-H., Sarkar, A. K. & Dehak, N. rVAD: an unsupervised segment-based robust voice activity detection method. Comput. Speech Lang. 59, 1–21 (2020).
Migueles, J. H., Rowlands, A. V., Huber, F., Sabia, S. & Van Hees, V. T. GGIR: a research community–driven open source R package for generating physical activity and sleep outcomes from multi-day raw accelerometer data. J. Meas. Phys. Behav. 2, 188–196 (2019).
Blume, C., Santhi, N. & Schabus, M. 'nparACT' package for R: a free software tool for the non-parametric analysis of actigraphy data. MethodsX 3, 430–435 (2016).
Zlatintsi, A. et al. E-Prevention: Advanced Support System for Monitoring and Relapse Prevention in Patients with Psychotic Disorders Analyzing Long-Term Multimodal Data from Wearables and Video Captures. Sensors (Basel, Switzerland) https://doi.org/10.3390/s22197544 (2022).
Juda, M., Pater, J., Mistlberger, R. E. & Schutz, C. G. Sleep and rest-activity rhythms in recovering patients with severe concurrent mental and substance use disorder: a pilot study. J. Dual Diagn. 19, 26–39 (2023).
Bromundt, V. et al. Sleep-wake cycles and cognitive functioning in schizophrenia. Br. J. Psychiatry 198, 269–276 (2011).
Fasmer, E. E., Fasmer, O. B., Berle, J. O., Oedegaard, K. J. & Hauge, E. R. Graph theory applied to the analysis of motor activity in patients with schizophrenia and depression. PLoS ONE 13, e0194791 (2018).
Kas, M. J. H. et al. Digital behavioural signatures reveal trans-diagnostic clusters of Schizophrenia and Alzheimer's disease patients: Trans-diagnostic clustering of digital biotypes. Eur. Neuropsychopharmacol. 78, 3–12 (2024).
Kalisperakis, E. et al. Smartwatch digital phenotypes predict positive and negative symptom variation in a longitudinal monitoring study of patients with psychotic disorders. Front. Psychiatry 14, 1024965 (2023).
Skeldon, A. C., Dijk, D.-J., Meyer, N. & Wulff, K. Extracting circadian and sleep parameters from longitudinal data in schizophrenia for the design of pragmatic light interventions. Schizophr. Bull. 48, 447–456 (2022).
Kuula, L., Halonen, R., Lipsanen, J. & Pesonen, A.-K. Adolescent circadian patterns link with psychiatric problems: a multimodal approach. J. Psychiatr. Res. 150, 219–226 (2022).
Pieters, L. E., Deenik, J., Tenback, D. E., Oort, J. V. & Harten, P. N. V. Exploring the relationship between movement disorders and physical activity in patients with schizophrenia: an actigraphy study. Schizophr. Bull. 47, 906–914 (2021).
Liebenthal, E. et al. Linguistic and non-linguistic markers of disorganization in psychotic illness. Schizophr. Res. https://doi.org/10.1016/j.schres.2022.12.003 (2022).
Mandel, F., Ghosh, R. P. & Barnett, I. Neural networks for clustered and longitudinal data using mixed effects models. Biometrics https://doi.org/10.1111/biom.13615 (2021).
Kay, S. R., Fiszbein, A. & Opler, L. A. The Positive and Negative Syndrome Scale (PANSS) for schizophrenia. Schizophr. Bull. 13, 261–276 (1987).
Cella, M. et al. Evaluating the mechanisms of social cognition intervention in schizophrenia: a proof-of-concept trial. Psychiatry Res. 319, 114963 (2022).
Orleans-Pobee, M. et al. Physical Activity Can Enhance Life (PACE-Life): results from a 10-week walking intervention for individuals with schizophrenia spectrum disorders. J. Ment. Health 31, 357–365 (2022).
Buck, B. et al. Relationships between smartphone social behavior and relapse in schizophrenia: a preliminary report. Schizophr. Res. 208, 167–172 (2019).
Walther, S., Ramseyer, F., Horn, H., Strik, W. & Tschacher, W. Less structured movement patterns predict severity of positive syndrome, excitement, and disorganization. Schizophr. Bull. 40, 585–591 (2014).
Lamichhane, B., Zhou, J. & Sano, A. Psychotic relapse prediction in schizophrenia patients using a personalized mobile sensing-based supervised deep learning model. IEEE J. Biomed. Health Inform. https://doi.org/10.1109/JBHI.2023.3265684 (2023).
Nguyen, D.-K., Chan, C.-L., Li, A.-H. A., Phan, D.-V. & Lan, C.-H. Decision support system for the differentiation of schizophrenia and mood disorders using multiple deep learning models on wearable devices data. Health Inform. J. 28, 14604582221137537 (2022).
Price, G. D. et al. An unsupervised machine learning approach using passive movement data to understand depression and schizophrenia. J. Affect. Disord. 316, 132–139 (2022).
Lahti, A. C., Wang, D., Pei, H., Baker, S. & Narayan, V. A. Clinical utility of wearable sensors and patient-reported surveys in patients with schizophrenia: noninterventional, observational study. JMIR Ment. Health 8, e26234 (2021).
Rohani, D. A., Faurholt-Jepsen, M., Kessing, L. V. & Bardram, J. E. Correlations between objective behavioral features collected from mobile and wearable devices and depressive mood symptoms in patients with affective disorders: systematic review. JMIR Mhealth Uhealth 6, e165 (2018).
Bayoumy, K. et al. Smart wearable devices in cardiovascular care: where we are and how to move forward. Nat. Rev. Cardiol. 18, 581–599 (2021).
Stradford, L. et al. Wearable activity tracker study exploring rheumatoid arthritis patients' disease activity using patient-reported outcome measures, clinical measures, and biometric sensor data (the wear study). Contemp. Clin. Trials Commun. 38, 101272 (2024).
Jakobsen, P. et al. in 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS). (IEEE).
Asgari Mehrabadi, M. et al. Sleep tracking of a commercially available smart ring and smartwatch against medical-grade actigraphy in everyday settings: instrument validation study. JMIR Mhealth Uhealth 8, e20465 (2020).
Allan, S. et al. Adverse events reporting in digital interventions evaluations for psychosis: a systematic literature search and individual level content analysis of adverse event reports. Schizophr. Bull. https://doi.org/10.1093/schbul/sbae031 (2024).
Eisner, E. et al. Measurement of adverse events in studies of digital health interventions for psychosis: guidance and recommendations based on a literature search and framework analysis of standard operating procedures. Schizophr. Bull. https://doi.org/10.1093/schbul/sbae048 (2024).
Page, M. J. et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. Br. Med. J. https://doi.org/10.1136/bmj.n71 (2021).
Steyerberg, E. W. et al. Prognosis Research Strategy (PROGRESS) 3: prognostic model research. PLoS Med. https://doi.org/10.1371/journal.pmed.1001381 (2013).
Riley, R. D. et al. Prognosis Research Strategy (PROGRESS) 2: prognostic factor research. PLoS Med. 10, e1001380 (2013).
Hayden, J. A., van der Windt, D. A., Cartwright, J. L., Côté, P. & Bombardier, C. Assessing bias in studies of prognostic factors. Ann. Intern. Med. 158, 280–286 (2013).
Wolff, R. F. et al. PROBAST: a tool to assess the risk of bias and applicability of prediction model studies. Ann. Intern. Med. 170, 51–58 (2019).
Savage, C. L. G., Orth, R. D., Jacome, A. M., Bennett, M. E. & Blanchard, J. J. Assessing the psychometric properties of the PROMIS sleep measures in persons with psychosis. Sleep https://doi.org/10.1093/sleep/zsab140 (2021).
Reeve, S., Sheaves, B. & Freeman, D. Sleep disorders in early psychosis: incidence, severity, and association with clinical symptoms. Schizophr. Bull. 45, 287–295 (2019).
Baandrup, L. & Jennum, P. J. A validation of wrist actigraphy against polysomnography in patients with schizophrenia or bipolar disorder. Neuropsychiatr. Dis. Treat. 11, 2271–2277 (2015).
Walther, S., Horn, H., Koschorke, P., Muller, T. J. & Strik, W. Increased motor activity in cycloid psychosis compared to schizophrenia. World J. Biol. Psychiatry 10, 746–751 (2009).
Walther, S., Koschorke, P., Horn, H. & Strik, W. Objectively measured motor activity in schizophrenia challenges the validity of expert ratings. Psychiatry Res. 169, 187–190 (2009).
Walther, S. et al. Higher motor activity in schizophrenia patients treated with olanzapine versus risperidone. J. Clin. Psychopharmacol. 30, 181–184 (2010).
Wichniak, A. et al. Actigraphic monitoring of activity and rest in schizophrenic patients treated with olanzapine or risperidone. J. Psychiatr. Res. 45, 1381–1386 (2011).
Wang, J. et al. Both physical activity and food intake are associated with metabolic risks in patients with schizophrenia. Schizophr. Res. 140, 260–261 (2012).
Abel, D. B., Salyers, M. P., Wu, W., Monette, M. A. & Minor, K. S. Quality versus quantity: determining real-world social functioning deficits in schizophrenia. Psychiatry Res. 301, 113980 (2021).
Abel, D. B. & Minor, K. S. Social functioning in schizophrenia: comparing laboratory-based assessment with real-world measures. J. Psychiatr. Res. 138, 500–506 (2021).
Abplanalp, S. J. et al. Feasibility of using smartphones to capture speech during social interactions in schizophrenia. Schizophr. Res. 228, 51–52 (2021).
Reinertsen, E. & Clifford, G. D. A review of physiological and behavioral monitoring with digital sensors for neuropsychiatric illnesses. Physiol. Meas. 39, 05TR01 (2018).
Wainberg, M. et al. Association of accelerometer-derived sleep measures with lifetime psychiatric diagnoses: a cross-sectional study of 89,205 participants from the UK Biobank. PLoS Med. 18, e1003782 (2021).
Firth, J. et al. The validity and value of self-reported physical activity and accelerometry in people with schizophrenia: a population-scale study of the UK biobank. Schizophr. Bull. 44, 1293–1300 (2018).
Deenik, J. et al. Physical activity and quality of life in long-term hospitalized patients with severe mental illness: A cross-sectional study. BMC Psychiatry 17, 298 (2017).
Smit, M. M. C., Waal, E. D., Tenback, D. E. & Deenik, J. Evaluating the implementation of a multidisciplinary lifestyle intervention for people with severe mental illness in sheltered housing: effectiveness-implementation hybrid randomised controlled trial. BJPsych Open 8, e201 (2022).
Kruisdijk, F. et al. Accelerometer-measured sedentary behaviour and physical activity of inpatients with severe mental illness. Psychiatry Res. 254, 67–74 (2017).
Gomes, E. et al. Quality of life and physical activity levels in outpatients with schizophrenia. Rev. Bras. Psiquiatr. 38, 157–160 (2016).
Andersen, E. et al. Physical activity pattern and cardiorespiratory fitness in individuals with schizophrenia compared with a population-based sample. Schizophr. Res. 201, 98–104 (2018).
Andersen, E. et al. Effect of high-intensity interval training on cardiorespiratory fitness, physical activity and body composition in people with schizophrenia: a randomized controlled trial. BMC Psychiatry 20, 425 (2020).
Engh, J. A. et al. Objectively assessed daily steps-not light intensity physical activity, moderate-to-vigorous physical activity and sedentary time-is associated with cardiorespiratory fitness in patients with schizophrenia. Front. Psychiatry 10, 82 (2019).
Holmen, T. L. et al. The association between cardiorespiratory fitness and cognition appears neither related to current physical activity nor mediated by brain-derived neurotrophic factor in a sample of outpatients with schizophrenia. Front. Psychiatry 10, 785 (2019).
Janney, C. A. et al. Sedentary behavior and psychiatric symptoms in overweight and obese adults with schizophrenia and schizoaffective disorders (WAIST Study). Schizophr. Res. 145, 63â68 (2013).
Janney, C. A. et al. Physical activity and sedentary behavior measured objectively and subjectively in overweight and obese adults with schizophrenia or schizoaffective disorders. J. Clin. Psychiatry 76, e1277âe1284 (2015).
Lindamer, L. A. et al. Assessment of physical activity in middle-aged and older adults with schizophrenia. Schizophr. Res. 104, 294â301 (2008).
Duncan, M. J., Arbour-Nicitopoulos, K., Subramaniapillai, M., Remington, G. & Faulkner, G. Revisiting the International Physical Activity Questionnaire (IPAQ): assessing sitting time among individuals with schizophrenia. Psychiatry Res. 271, 311â318 (2019).
Berry, A., Drake, R. J., Butcher, I. & Yung, A. R. Examining the feasibility, acceptability, validity and reliability of physical activity, sedentary behaviour and sleep measures in people with schizophrenia. Ment. Health Phys. Act. 21, 100415 (2021).
Brobakken, M. F. et al. A comprehensive cardiovascular disease risk profile in patients with schizophrenia. Scand. J. Med. Sci. Sports 29, 575â585 (2019).
Jerome, G. J. et al. Physical activity levels of persons with mental illness attending psychiatric rehabilitation programs. Schizophr. Res. 108, 252â257 (2009).
Duncan, M. J., Arbour-Nicitopoulos, K., Subramanieapillai, M., Remington, G. & Faulkner, G. Revisiting the International Physical Activity Questionnaire (IPAQ): Assessing physical activity among individuals with schizophrenia. Schizophr. Res. 179, 2â7 (2017).
Gorczynski, P., Faulkner, G., Cohn, T. & Remington, G. Examining the efficacy and feasibility of exercise counseling in individuals with schizophrenia: a single-case experimental study. Ment. Health Phys. Act. 7, 191–197 (2014).
Gorczynski, P., Faulkner, G., Cohn, T. & Remington, G. Examining strategies to improve accelerometer compliance for individuals living with schizophrenia. Psychiatr. Rehabilit. J. 37, 333–335 (2014).
Grassmann, V., Subramaniapillai, M., Duncan, M., Arbour-Nicitopoulos, K. & Faulkner, G. E. The relationship between moderate-to-vigorous physical activity and executive function among individuals with schizophrenia: differences by illness duration. Rev. Bras. Psiquiatr. 39, 309–315 (2017).
Oliva, V. et al. Patterns of antipsychotic prescription and accelerometer-based physical activity levels in people with schizophrenia spectrum disorders: a multicenter, prospective study. Int. Clin. Psychopharmacol. 38, 28–39 (2023).
Chen, L. J., Steptoe, A., Chung, M. S. & Ku, P. W. Association between actigraphy-derived physical activity and cognitive performance in patients with schizophrenia. Psychol. Med. 46, 2375–2384 (2016).
Faulkner, G., Cohn, T. & Remington, G. Validation of a physical activity assessment tool for individuals with schizophrenia. Schizophr. Res. 82, 225–231 (2006).
Bueno-Antequera, J., Oviedo-Caro, M. & Munguia-Izquierdo, D. Sedentary behaviour, physical activity, cardiorespiratory fitness and cardiometabolic risk in psychosis: the PsychiActive project. Schizophr. Res. 195, 142–148 (2018).
Afonso, P., Figueira, M. L. & Paiva, T. Sleep-wake patterns in schizophrenia patients compared to healthy controls. World J. Biol. Psychiatry 15, 517–524 (2014).
Walther, S. et al. Quantitative motor activity differentiates schizophrenia subtypes. Neuropsychobiology 60, 80–86 (2009).
Beebe, L. H. et al. A pilot study describing physical activity in persons with schizophrenia spectrum disorders (SSDs) after an exercise program. Issues Ment. Health Nurs. 34, 214–219 (2013).
Williams, J. et al. ‘Walk this way’: results from a pilot randomised controlled trial of a health coaching intervention to reduce sedentary behaviour and increase physical activity in people with serious mental illness. BMC Psychiatry 19, 287 (2019).
Scheewe, T. W. et al. Low physical activity and cardiorespiratory fitness in people with schizophrenia: a comparison with matched healthy controls and associations with mental and physical health. Front. Psychiatry 10, 87 (2019).
Vancampfort, D. et al. Lower cardiorespiratory fitness is associated with more time spent sedentary in first episode psychosis: a pilot study. Psychiatry Res. 253, 13–17 (2017).
Vancampfort, D. et al. Validity and correlates of the International Physical Activity Questionnaire in first-episode psychosis. Early Interv. Psychiatry 13, 562–567 (2019).
Cella, M. et al. Using wearable technology to detect the autonomic signature of illness severity in schizophrenia. Schizophr. Res. 195, 537–542 (2018).
Acknowledgements
This study was supported by the NIHR Manchester Biomedical Research Centre (NIHR 203308) and a National Institute for Health and Care Research research professorship (Bucci: NIHR300794). The views expressed are those of the author(s) and not necessarily those of the NIHR or the Department of Health and Social Care.
Author information
Contributions
Sandra Bucci (S.Bu.), Siân Bladon (S.Bl.), J.A., G.M. and M.S. devised the study. S.Bu., S.Bl., G.M., M.S., E.E. and S.F. devised the search terms and inclusion/exclusion criteria. S.Bl. conducted the searches. S.Bl., S.Bu., S.F. and E.E. screened the abstracts and full papers. S.Bl., S.F., E.E. and A.O. extracted the data and did the quality appraisals. S.Bl. wrote the paper, which was reviewed and approved by all authors.
Ethics declarations
Competing interests
S.Bu. and J.A. are Directors and shareholders of CareLoop Health Ltd., a spin-off from the University of Manchester that develops and markets digital solutions for remote monitoring of mental health conditions using smartphones, currently for schizophrenia and postnatal depression. The remaining authors declare no competing interests.
Additional information
Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bladon, S., Eisner, E., Bucci, S. et al. A systematic review of passive data for remote monitoring in psychosis and schizophrenia. npj Digit. Med. 8, 62 (2025). https://doi.org/10.1038/s41746-025-01451-2
DOI: https://doi.org/10.1038/s41746-025-01451-2