1. Introduction
Wearable devices are becoming increasingly popular, with smartwatches representing one of the most popular consumer devices on the market. Since 2016, smartwatches have held the third largest market share of wearable devices. This growth rate suggests that the smartwatch market is going to be bigger than other wearable devices and has the potential to grow to the second biggest market by 2022, with an estimated 109.2 million units being shipped worldwide by 2023 [
1] (however, it is important to note that this statistic was produced before the Covid-19 outbreak; since the outbreak, the market interest in personal health monitoring devices, such as smartwatches, has significantly increased).
The ability to capture physiological signals (e.g., heart rate) conveniently and continuously and to have quick access to notifications are some of the key features of smartwatches that make them appealing to the public. However, in addition to its advantages [
2], every new technology is associated with limitations, and smartwatches are no exception to this. Two well-known smartwatch constraints are the small screen size and limited battery capacity [
3]. The small screen makes interaction with smartwatches less attractive compared to other ubiquitous devices, such as smartphones [
4]. Furthermore, the size of smartwatches is also directly related to their hardware capabilities, as a small device cannot hold a large battery. Additionally, unlike traditional watches, users have to charge their devices frequently; i.e., every day [
5]. As such, the limited battery capability of smartwatches is the biggest challenge restricting its widespread adoption in the market [
6]. This issue also affects the usability of these devices; for example, continuously monitoring physiological signals significantly decreases battery power [
7]. Moreover, newer versions can work relatively independently of smartphones [
3], which requires even more computing power, as the smartphone cannot piggyback on the computational costs of the smartwatch. Therefore, battery optimization is a critical issue for the independent smartwatches that are emerging into the market.
The smartwatch market is expected to grow to
$22 billion by 2022. The growth rate of smartwatches has shown the highest increase compared with smartphones, tablets, and laptop ownership, which illustrates the uptake of the smartwatch and penetration of these devices into daily life [
8]. A more detailed understanding of smartwatch battery usage could help developers, manufacturers and researchers to design or change their software and hardware for optimal battery utilization. We believe it is therefore necessary to analyze the use of smartwatches in the real world, including different brands and the use of a diverse group of users to gain a holistic overview of their usage. Although promising initial research has been carried out in this field, further research is still required to investigate the significant challenges of this area [
5,
9,
10,
11,
12]. In particular, research into human–computer interaction (HCI) is a challenge when research goes beyond the laboratory and is carried out in the real world [
13]. One issue is that human behavior is multifaceted and complicated, and so more complex models are required to analyze collected data [
14,
15]. As such, as the complexity of the model increases, our control and accessibility to the model and its functionality decreases [
13]. Another challenge is that smartwatches are equipped with low-cost sensors and thus are prone to missing data and artifacts [
16], which makes it more difficult to accurately model user behavior. In this paper, we mitigate these challenges with Algorithm A1 for the alignment of sensor data with our deep learning model, which is capable of further exploring the inputs of the model using its first layer filters.
In this work, we benefit from the “insight for wear” (
http://insight4wear.com) smartwatch application, which has been designed based on the UbiqLog architecture [
17] and has been deployed in the real-world market. This application collects smartwatch data from a number of sensors, including screen usage, heart rate, Bluetooth, light, fitness activity, notifications and battery usage. The dataset, which has Institutional Review Board (IRB) approval, includes different smartwatch brands in different countries with more than 832 users (until July 2019). The analysis is comprised of three segments. The first part utilizes 671 users who have more than 10 days of data for all sensors. The second part focuses on 67 users who have collected data utilizing all sensors for more than 30 days. The third part focuses on 62 users who have more than 6 months of battery data available. Even though not all smartwatch brands include all of the sensors that we have used in our analysis, our method is scalable and appropriate for analyzing all battery-powered wearable devices, including fitness trackers, extended reality (XR) glasses, etc. Within this research, the following questions and related contributions have been posited and discussed:
RQ1: “What are the common charging and discharging patterns that users display?” Identifying these patterns can help application developers and device manufacturers to manage smartwatch resources with the aim of less battery utilization. Additionally, these results can help end-users to become more informed about their battery usage and to avoid events that can cause a sudden discharge of the battery. For example, a prediction of the battery consumption rate can be exposed to the end-user at an appropriate time for them to charge their smartwatch before the battery runs out completely [
18].
RQ2: “What are the discharge peaks, and when do users usually charge their smartwatches (weekday/weekend, time of the day)?” By answering this question, application developers or cloud vendors can optimize their background services or libraries to achieve energy-efficient communication with the network and its applications. By using a pipeline of clustering algorithms, we analyze the battery consumption of devices to identify common patterns of battery utilization based on the time of day. These patterns show the times of the day that users are more likely to charge their devices.
RQ3: “What are the drainage symptoms of a smartwatch battery?” We propose a low-parameter fully convolutional neural network (FCNN) model that can be used to identify the correlation among different sensors and their impact on battery usage, using a binary classification of low and high slopes of battery usage. In other words, based on the behavior of the users and sensors, we have designed a deep learning model to investigate if there is any correlation among specific sensors and activities, in terms of battery utilization. By using a simple change in the common architecture of CNNs, our model becomes more interpretable and can be used to extract symptoms from triads, filters and the inputs and outputs of filters. It can also be used in multi-sensor processing (e.g., mobile device sensors) and for different multivariate datasets. The results of our deep learning model also identify the most effective sensors regarding battery consumption and the correlation of sensor data for battery consumption. Furthermore, our model is portable and can be implemented on mobile devices. This is because it is low-parameter and so uses a variety of low-level to high-level features, as well as raw data, as inputs. As such, these features make the model resource-efficient, and so it can be used in on-device data processing.
RQ4: “Which brands and operating system versions contribute to a more sustainable battery life?” We benefit from the longitudinal data available in our dataset, which have enabled us to develop an indexing approach that ranks smartwatch battery deterioration based on the brand and operating system version.
5. Discussion
In this work, we have overcome the limitations of previous research [
18,
28,
37,
39,
41,
60] that has mainly focused on a specific user group (e.g., students) [
28] or one brand of smartwatch only [
28,
35]. In addition, none of the previous works considered the impact of all smartwatch sensors on the analysis of the battery. In comparison to previous research, our work utilizes a significantly larger amount of users and a longer duration of recording data. Below, we report the advantages of introducing our novel deep learning model and summarize our main findings.
Identifying common charging and discharging patterns can help application developers and device manufacturers to manage smartwatch resources with the aim of less battery utilization. Additionally, identifying discharge peaks and common charging patterns of smartwatches can aid application developers or Cloud vendors in the optimization of their background services or libraries, obtaining more energy-efficient communication with the network and its applications. Our findings, which are summarized later in this section, are based on the longitudinal data available in our dataset, which have enabled us to develop a method that ranks smartwatch battery deterioration based on the brand and operating system version for application producers.
5.1. Advantages of a Deep Learning Model
We have proposed a novel model for analyzing the battery drain in smartwatches. Our model, FCNN, uses all available smartwatch sensors together, which is important because all of the sensors are correlated with battery utilization. As an example of sensor correlation, when a notification arrives, the screen turns on; therefore, we have to consider all sensors together, as investigating sensors individually would change and bias the results. Three reasons to justify our use of the FCNN model are listed as follows:
First, we need to have an automatic feature extractor based on battery usage, and the most common method is to use CNN. The lower number of parameters of FCNN versus the CNN and multilayer perceptron (MLP) pair lead us to favor FCNN.
To extract information for the model to learn, we use limited filters of the first and last layers of our model. This functionality cannot be achieved for other models (e.g., RF) that have similar performance in their final decision.
Residual connections help us to improve the performance of the model by mixing different levels of features to make the final decision.
By using a binary classification, we can identify whether the extracted information is related to high or low peaks of battery usage. However, other classification methods, such as regression or multiclass classification, cannot identify such a relation. For example, if we use 10-class classification and choose 1 to be the lowest battery usage and 10 to be the highest battery usage, features that are extracted might be related to two low battery usage classes (e.g., 1, 2 and 3). However, we need only high and low battery users (binary selection) and not multiple classes for the selection.
There is promising research (e.g., Min et al. [
6]) that mitigates unbalanced data by randomly upsampling the data. However, it has been reported [
6] that random upsampling can cause overfitting. We have solved this issue using clustering, which balances the data and reduces the risk of overfitting. Thus, the interpretation of the model is not correct; there is high accuracy but unbalanced data in the classification.
Our model outperforms other classifiers while also providing several advantages that are specific to our model, including the following. (i) In contrast to other deep learning models, our analysis-filter increases the transparency of the models. This is the first step towards solving the trade-off design challenge in HCI [
13]. Previous works (reviewed in
Section 2.3) have mostly been constructed for image datasets and suffer from excessive parameters and complex computations. (ii) By using a 2D layer for the first layer, we utilize multivariate inputs, which is another advantage of our model. This enables the model to consider the interaction between sensors, and its result is more trustworthy than analyzing each sensor’s impact individually on battery usage. (iii) By using 1D layers in our deep learning model and using the 1 × 1 convolution layer as a classifier, the number of trainable parameters of our deep learning model is very low (4928 trainable parameters). To our knowledge, previous works on smartwatches and smartphones do not have an automatic model to extract information (e.g., the CNN model [
57]) to analyze the behavior of users based on the device sensors.
5.2. Findings and Recommendations
In the following, we summarize our novel findings that can help device manufacturers, vendors and application developers, as well as end-users, to improve smartwatch battery utilization.
Devices with newer versions (than 7.1.1) of the Wear OS operating system drain the battery faster than previous versions.
Table 10 shows that installing newer versions of Wear OS on the same device causes a deterioration in the battery quality of the device. This could be due to the advances and increase in the number of applications available for smartwatches. All smartwatch devices—without exception—receive updates both from their vendors and Google as an operating system provider. Besides, the number of installed applications does not significantly increase or decrease. Therefore, the number of applications do not play a role in battery discharge.
Motorola and Sony smartwatch software updates improve battery life, while the battery life of other brands in our dataset deteriorates when updated. Based on the results in
Table 10, Sony and Motorola smartwatches have the longest sustainable battery discharge rate, which improved during their lifetime. This illustrates that these two particular brands most likely invest in improvements to their background services and smartwatch skins with the aim of better energy efficiency. Our dataset does not include all existing brands; thus, we cannot claim that this finding is generalizable among all brands.
The highest peak of smartwatch battery drain is during working hours, from 9:00 to 17:00. On the other hand, sleep time, 22:00 to 6:00, is the best time for batch operations on devices, as most users plug their device in to charge for the longest period of time. Cluster 5 and cluster 2 in
Figure 4, which account for more than half of users, show that battery usage increases sharply from morning to evening. The behavior of 15.70% of users (cluster 5 in
Figure 4) is similar to cluster 2 but with steeper slopes. Other clusters (1, 3 and 4), which account for 29.09% of all users, have different patterns caused by non-routine days. On the other hand, cluster 6 in
Figure 4 shows that approximately 19.93% of users charge their device during the last hours of the day and the first hours of the next day and unplug their device before sleeping. Clusters 3, 4 and 6, which represent about 40% of users, show that most smartwatch users connect their device to a charger at bedtime (i.e., at the end of the day or during the first hours of the day) and remove their device from the charger when they wake-up.
Figure 5 shows that most users prefer to plug-in their device during the last hours (before sleeping) and unplug their device in the morning (after waking up). This confirms the result of
Figure 3, which illustrates the charging patterns of most users. Based on this pattern, we can recommend that batch jobs, such as backing up data to the Cloud or application updates, which require a great deal of battery power, can be performed during sleeping hours.
Different charging behaviors are observed on weekends compared to weekdays.
Figure 4 shows that there is a difference between Saturday and other days of the week. To evaluate this result, we utilize the results of all users’ data who plugged-in for 7 days of the week. By applying a one-way ANOVA statistical test [
66], we found that there is a significant difference between Saturday and other days of the week with a
p-value < 0.05. A justification for why only Saturday and not Sunday was considered could be due to countries that count Sunday as a working day, such as some countries in the Middle East and North Africa, which biases our results toward Saturday.
Interaction with the screen and notifications are the most common causes of battery drain. Based on the results presented in
Table 8, we identify that screen usage and notification sensors had more of an impact on battery discharge in comparison to other sensors. The heat map presented in
Figure 8 shows that this filter extracts features with most focus on physical activity and Bluetooth, then on heart-rate and then on connection to the charger (which turns on the screen automatically).
When users are physically inactive, there is more battery utilization than during physical activities or when inside a vehicle. The effect of turning the charger on/off can be seen in column 8 of
Figure 8. When the charger is connected, the model does not consider any other condition. The screen sensor is on average in its maximum state. The Bluetooth sensor is “on” for all of these periods. The heart rate is constant and at an average which is natural. The light sensor increases to its average of all data of light sensors. In addition to the effect of the charger, the activity type is also correlated; the screen, light, heart rate, and Bluetooth show no related changes. The notification sensor is not important for filter 8 based on
Figure 9. Among the four types of activities that the Google service can identify, when the user is inactive, more battery consumption is present than in other states. This finding might appear obvious, but it is important to note that earlier versions of smartwatches have strong false positives with wrist movement and automatically turn on the screen [
68]; for example, while the user is driving, moving the steering wheel causes the smartwatch to turn on the screen and thus to drain the battery. Our analyses reveals that this problem has been resolved (More details are available in
Appendix C).
6. Conclusions and Future Work
In this work, we benefit from a large dataset to identify the patterns and symptoms of smartwatch battery utilization in the real world. We have used a dataset that is composed of 832 smartwatch users, including different geographical regions and brands. In the first stage, we employed the k-means clustering method to identify general smartwatch battery discharge patterns. Then, to delve deeper into the battery analysis, we designed a novel low-parameter FCNN, which allowed us to identify the impact that multivariate data (other smartwatch sensors except battery) has on a single source of data (battery). Next, we proposed an indexing approach that enabled us to justify longitudinal battery changes based on the version of the operating system and brand of the device.
We report six novel findings of smartwatch battery utilization patterns, which could help device makers, developers and service providers to improve the battery utilization of their products. In future work, we intend to integrate our findings into a battery manager application on a smartwatch or other battery-powered device, such as a robot, and observe the impacts on the behavior of the battery.