Abstract
Recognizing human physical activities using wireless sensor networks has attracted significant research interest due to its broad range of applications, such as healthcare, rehabilitation, athletics, and senior monitoring. There are critical challenges inherent in designing a sensor-based activity recognition system operating in and around a lossy medium such as the human body to gain a trade-off among power consumption, cost, computational complexity, and accuracy. We introduce an innovative wireless system based on magnetic induction for human activity recognition to tackle these challenges and constraints. The magnetic induction system is integrated with machine learning techniques to detect a wide range of human motions. This approach is successfully evaluated using synthesized datasets, laboratory measurements, and deep recurrent neural networks.
Similar content being viewed by others
Introduction
Human activity recognition (HAR) aims to provide information on human physical activity and to detect simple or complex actions in a real-world setting. It allows computer systems to assist users with their tasks and to improve the quality of life in areas such as senior care, rehabilitation, daily life-logging, personal fitness, and assistance for people with cognitive disorders1,2,3,4,5,6. Two main approaches for deployment of HAR systems are external and wearable sensors7. In the external approach, the monitoring devices are set at fixed points, and users are expected to interact with them8. The vision-based technique, for example, is one of the well-known external methods that has been extensively studied for human activity analysis9,10. However, it faces many challenges in terms of coverage, accuracy, privacy, and cost. It requires infrastructure support, such as the installation of video cameras in surveillance areas, which is usually costly. Additionally, cameras cannot capture any data if the user performs out of their reach11,12. In the second approach, on-body sensors, such as accelerometers, gyroscopes, and magnetometers, are used to translate human motion into signal patterns for activity recognition13,14,15. Recent advances in embedded sensor technology have made it feasible to monitor the userâs activity using smart devices. Several research studies have reported the use of smartwatches and smartphones in human activity monitoring, and have presented a satisfactory performance16,17,18,19. Although these devices provide a privacy-aware alternative solution that overcomes many disadvantages of the external approach, they still might not be able to address the requirements of a diverse range of applications. A single wearable cannot cover the entire body and therefore fails to obtain adequate information about the mobility of all body segments20,21,22. For example, inertial sensors embedded in a smartwatch cannot capture the movement of legs, which restricts the capability of the system in classifying activities. Additionally, in systems relying on data from a single device, variations in position can have a significant effect on the performance or lead to the failure of the monitoring system20,23,24.
Wireless body area network (WBAN) consisting of wearable devices operating around the human body can tackle these problems21,25. In WBANs, sensors are spatially distributed over the human body and collect data from the user. Then data are transmitted wirelessly to a central processing unit for detection. This approach can provide comprehensive information on the mobility of body segments and potentially improve system accuracy. However, WBAN design is challenging as many constraining, and often conflicting, requirements have to be taken into account26,27,28. For example, the system has to be inexpensive, accessible to the general public, and meet ergonomic constraints and health requirements. It has to operate under proper guidelines limiting the power exposure to the user since the energy absorption may lead to temperature elevation in biological tissues. To ensure usersâ safety, it has to satisfy specific absorption ratio (SAR) constraints, while providing a reliable wireless link29. Moreover, the system should guarantee the security and privacy of the userâs data. Wearable devices must be small and lightweight, which puts a restriction on the battery size and longevity. On the other hand, frequent battery recharging may not be practical for sensor networks with multiple sensors in applications such as senior monitoring7. Due to the limitation of energy resources, the power management has become a critical issue in designing a WBAN. Since wireless communication consumes a considerable portion of the energy30, numerous studies have proposed and investigated low-power solutions31,32,33,34. The conventional state-of-the-art wireless sensor networks working in the vicinity of the human body adopt radio-wave propagation for signal transmission. This technique is susceptible to the characteristics of the environment, and its signal experiences a high attenuation around a lossy medium, such as the human body. It results in higher power consumption, shorter battery life, and lower reliability33,35,36. Moreover, radio-wave propagation technologies are prone to interference with adjacent communication links since most of them, such as Bluetooth, operate at the busy 2.4âGHz, the industrial, scientific, and medical (ISM) band37,38. They also have potential security problems as their signal cannot be stopped from propagating into free-space. Therefore it can be intercepted even distant from the transmitter39.
We introduce the magnetic induction-based HAR (MI-HAR) system that effectively detects physical movements by magnetic induction (MI) signals. This system represents the motion of human body parts via variations in the MI signals transmitted from transmitter to the receiver during physical action, instead of spatial data measured by the inertial sensors. This approach can overcome several problems associated with conventional sensor-based HAR systems, such as eliminating the need for an extra wireless module, reducing power consumption, and the required bandwidth by combining data collection and wireless signal transmission steps. Moreover, it has other features that are inherited from the MI-based communication system. Here we verify the capability of the proposed method in identifying human actions. We first synthesize MI motion data corresponding to several physical activities. Then we apply machine learning-based classifiers and deep recurrent neural networks to classify human movements. The results indicate that the MI signals are informative descriptors for the motion of human body parts.
Results
System principle
The MI-based communication system is a short-range wireless physical layer that transmits signals by coupling non-propagating magnetic field between the wire coils rather than radiating as conventional methods. The main component of each node is a coil, which is lightweight, portable, inexpensive, simple, and can be worn as accessories such as belts, wristbands, and jewelry33,40. The manufacturing cost of an MI module is approximately less than $20, while a Bluetooth IMU costs more than $100 (refs. 41,42,43). The MI coils have a small radiation resistance, which means that the energy propagated to the far-field is negligible. As a result, multipath fading is not an issue, and the MI system can offer a much better quality of service (QoS) compared to Bluetooth-type systems33,44,45. The non-propagating magnetic field produced by the coils falls off proportional to râ3 instead of râ1 for radiating fields at a transmission distance r. Although the rapid decay limits the coverage range, it can be favorable in short-range applications such as WBANs46. It allows the signal to remain in a âbubbleâ around the coil, which provides a personalized space for the user. It also minimizes the leakage outside the targeted coverage range, reduces interference, increases security, and enables bandwidth reuse44,47. One of the main notable advantages of the MI system is that it works well in lossy dielectric media, such as the human body48. In these environments, the MI system experiences much less energy absorption compared to conventional radio-wave propagation technologies49. It results in lower SAR for applications working around the human body. Due to smaller path loss, the MI system can transmit a signal with much less power for the same range. This system can be up to six times more efficient in terms of battery power compared to other short-range communication systems (e.g., Bluetooth)47. This characteristic enables a large variety of novel and demanding applications in harsh environments such as underwater monitoring of scuba divers39,49,50.
The signal generated by an MI coil attenuates as a function of frequency, channel medium, coilsâ geometry, location, and alignment (see Methods section)33. The non-propagating magnetic field is mainly affected by the permeability of the medium, which is close to the air for non-ferrous materials. The MI channel condition remains constant even in an inhomogeneous lossy medium, such as around the human body33,49. For the frequency of up to 30âMHz, the dimension of the human body is relatively small compared to the wavelength, which makes the propagation and scattering effects insignificant33. The immunity of signal in this frequency range to the environment makes the forward voltage gain, S21, of the MI system only a function of coilsâ locations and alignments for a predefined coil geometry and operating frequency. The gain varies by changing the distance and alignment between the MI coils, and therefore, relative motion between the MI coils yields patterns in the received MI signal. This unique characteristic of the MI system is the fundamental principle of the proposed MI-HAR system.
System framework
The activity recognition process steps are different depending on the application. The framework used in this paper has two main stages: data acquisition and detection. For the first stage, an MI-based communication system is employed, which enables the integration of sensing and wireless data transfer into a single step. The user wears the receiver (RX) coil, for example, as a belt around the waist, and transmitter (TX) coils can be placed around the other skeleton bones, such as wrists, arms, and legs. The human body bones are spatially translated and oriented during a physical activity, which changes the relative location and alignment of the MI coils around them. Collecting the received MI signals transmitted from the coils enclosing skeleton bones can model the relative motion of human bones to represent motion. Since the spatial variations of skeleton bones over time are discriminative descriptors of human actions51, the vector of samples observed by the MI coils over time can be considered as the set of inputs for the activity detection algorithm. Increasing the number of coils around the skeleton bones results in a broader set of input data. It consequently enhances the accuracy of the MI-HAR system in detecting the relative motion of body parts. In the next step, a classification method is applied to the MI motion data for detecting human action.
MI system setup
The MI transceivers adopted in the experiments consist of a coil and L-reversed impedance matching network52. The matching network is used to maximize the transmission efficiency of the overall system52. The coils are identical, air-cored, single layer copper with 5âcm radius, 10 AWG wire diameter, and the user can wear them as accessories. The coilâs radius can change depending on the size of the body part that they are designed to be placed around. The source and load impedances are 50âΩ, and the resonance frequency is 13.56âMHz. As the operating frequency is lower than 30âMHz, the human body effect is neglected33, and the effect of the background medium is considered to be the same as that of air. The reversed L-matching networks consist of a series inductor of 5380ânH and a parallel capacitor of 600âpF.
Synthetic MI motion data
In this study, we have synthesized MI motion data to evaluate the proposed MI-HAR system capability in motion detection. The circuit model of the MI system (see Methods section) is used to calculate the forward voltage gain, which is the scaled version of the received MI signal. As the pattern is the same, we used the generated voltage gain patterns of the system as the input features for the detection algorithm. Figure 1 shows the measured and simulated forward voltage gain of two coils during their movement. Since the distance and misalignment between two coils are required as inputs for the model, their location and alignment are captured using video object tracking (see Methods section). Results show that the simulated signal is consistent with the measured data, which is an indication of a valid model for generating time-series MI data. We have performed experiments for 20 different motions that involve both geo-translation and misalignment of coils. The average normalized root-mean-squared error (NRMSE) of the synthesized and measured S21 for these experiments is less than 10.3%. The reported NRSME not only takes into account model error but also includes the error associated with the motion tracking algorithm using video and vector network analyzer (VNA) measurements.
To synthesize MI motion data during different human actions, we considered a receiver and eight transmitter coils around the torso, hands, arms, legs, and thighs, respectively (see Methods section). For spatial translation and rotation of human body bones, 3D motion capture (MoCap) datasets are employed. Each pair of markers placed at the joints can define a bone. Hence, the location and alignment of MI coils placed around the body parts can be derived and provided as inputs to the model for synthesizing the corresponding MI motion data. Two publicly available experimental datasets: Biological Motion Library (BML)53 with 4 activities and Berkeley Multimodal Human Action Database (MHAD)54 with 11 activities are used here. A brief description of these datasets is presented in the Methods section. The generated synthetic forward voltage gain of the MI transceivers corresponds to these datasets is presented in Fig. 2. A point to consider is that we have extended the single-transmitter/single-receiver model to a multi-transmitter/single-receiver scenario, assuming the interferences such as cross-coupling between coils are negligible, because the interference mitigation techniques such as time-division multiplexing55 or frequency splitting56 can be applied to reduce or ideally eliminate interference between inductive systems. Moreover, interference protocols (e.g., RFID interference protocols) can control communication between transceivers while preventing their interference with one other. Therefore, the model can provide a reasonably accurate estimation of multi-coil system performance.
Performance
Tracking the motion of body parts during physical activity is critical in characterizing an individualâs movement, and collecting data that provide a more accurate representation of these motions results in better activity detection. The MI signals express a strong relationship with the geo-translation of body segments since the system gain is directly affected by distance and misalignment between coils. Distributing more coils around the human body provides comprehensive information about the userâs body movements and results in a better distinction between similar actions. We used the MHAD dataset to compare the capability of the MI signal and accelerometer data in estimating the location of a body part during physical activity. The accelerometer is considered here as a benchmark because it is the most frequently used wearable sensor modality for human activity monitoring. Six markers placed close to the accelerometers are considered as target points. Then the similarity between the 3D location of each target point and data of its corresponding accelerometer and MI transceiver is calculated. We used R2 as the similarity metric, and the average values over the whole dataset are presented in Fig. 3. The results show that, on average, the MI signal has a stronger relationship with the 3D location of markers compared to the accelerometer data. This characteristic can be useful not only in classifying human activities but also in reconstructing the motion trajectories of body segments. Many studies have adopted IMUs to reconstruct the trajectories of movements for motion analysis in different applications. Examples include handwritten digit recognition57, monitoring trunk kinematics during standing up to sitting down58, and tracking the motion of body parts on patients who have been affected by neurological conditions for rehabilitation purposes59. In inertial sensor-based recognition systems, the velocity and positions are computed indirectly by the integration over sensor measurements. It makes the estimation errors caused by the intrinsic noise/drift grow unbounded with time. For example, the average displacement error of Xsens IMU after 1âmin is about 152âm59. On the other hand, the MI motion signal is directly affected by the location and orientation of coils. As a result, the trajectory reconstruction using MI signals does not require integration over measured data, which removes the problem of the cumulative error.
To assess the performance of the proposed MI-HAR system in recognizing human activities, we implemented deep recurrent neural networks (RNNs) based on long short-term memory (LSTM) units due to their strong performance in human activity detection, and their capability in learning complex representations of the motion data60,61. We compared the results of this method with several commonly used classifiers for activity detection using generated synthetic MI motion data. Table 1 summarizes the performance results of LSTM with methods including support vector machines (SVM), K-nearest neighbors (KNN), decision trees (DT), random forests (RF), and logistic regression (LR). The confusion matrix of each classification method on BML and MHAD datasets are also presented in Figs. 4 and 5, respectively. The results are compared to other previously introduced methods using different modalities for activity detection. We employed accuracy as an evaluation metric for comparison, as datasets used in this paper are balanced and have an equal number of samples for each activity. The results presented in ref. 62 show that SVM and Multi-Task Conditional Restricted Boltzmann Machines (MT-CRBMs) classifiers have achieved an accuracy of 41.3% and 54.5% using BML motion capture data, respectively. For the MHAD dataset63, has reported an accuracy of 98% by applying SVM on accelerometer data. The random forest classifier has also achieved an accuracy of 96% and 68.2% using MHAD motion capture and audio data64. The accuracy of LSTM using camera RGB image for human activity classification is stated as 92.4%65. Our results indicate that the deep LSTM model with optimum hyperparameters outperforms other classifiers by a considerable margin on the generated synthetic MI motion data. The recurrent neural networks can capture sequential and time dependencies between input data that results in a strong performance. The LSTM cells let the model capture even longer dependencies compared to vanilla cells. A deep architecture with an optimal number of layers enables the neural network to extract useful discriminative features from the set of input data and to improve the performance of the model. It should be noted that the datasets used in this paper are diverse, which proves the classifier models are valid for a broad range of activity recognition tasks. Moreover, the actions recorded in the BML dataset, including knocking, lifting, and throwing, are very similar as only one hand is moving. The same movement of human body parts in these activities makes it difficult to distinguish and categorize them. Despite these challenges, the deep LSTM model has achieved high accuracy, and it indicates that the recurrent model is capable of classifying human actions by using MI motion signals.
The results correspond to a support vector machines (SVM), b K-nearest neighbors (KNN), c decision trees (DT), d random forests (RF), e logistic regression (LR), and f deep long short-term memory (LSTM) recurrent neural network (RNN) classifiers. The rows and columns represent the percentage of true activity labels and the predicted activity labels, respectively.
The results correspond to a support vector machines (SVM), b K-nearest neighbors (KNN), c decision trees (DT), d random forests (RF), e logistic regression (LR), and f deep long short-term memory (LSTM) recurrent neural network (RNN) classifiers. The rows and columns represent the percentage of true activity labels and the predicted activity labels, respectively.
Discussion
HAR is a powerful technology with a wide range of applications such as healthcare, rehabilitation, sports training, and senior monitoring. We proposed a new wearable-based HAR system using MI for motion capture and wireless signal transmission. This method can tackle existing issues with conventional HAR systems in various aspects, including power consumption, the complexity of implementation, and cost. It can also provide a suitable infrastructure for new applications working in harsh environments, such as underwater. The proposed system is a new sensing approach for capturing human motions, which can also be integrated with other monitoring modalities to provide a more comprehensive HAR system.
To show the capability of the MI-HAR system in detecting human movements, we generated synthetic MI motion data received from MI transmitters around the userâs body during different activities by the MI system model. As mentioned before, the model used for synthesizing MI motion data does not consider cross-coupling between transmitter coils. However, this cross-coupling is not necessarily destructive and can even provide further information regarding the location and alignment of all coils relative to each other. In this scenario, each received signal is not only a function of the transmitter and receiver coils but also the arrangement of all other coils affects it. Therefore, the movement of even a single body part results in a different signal pattern and can make the system more accurate in detecting actions similar to each other. In the future, we plan to build a realistic deployment-ready prototype of the MI system for capturing MI motion signals during various human activities. Such a system would allow us to perform experiments on real-world MI motion data to demonstrate the accuracy of our method and study the effect of cross-coupling interference on the MI-HAR system. The proposed system can also be integrated with other modalities and monitoring techniques to provide a more comprehensive system for human motion tracking.
We employed several commonly used machine leaning-based classifiers and deep recurrent neural networks for the detection step. We empirically evaluated the proposed MI-HAR system by conducting experiments on the generated synthetic MI motion dataset and discussed the outcomes in detail. Experimental results reveal that the proposed deep LSTM model shows outstanding performance compared to other approaches. One of the benefits of using the deep recurrent neural network for sequence classification is that it can support multiple parallel temporal input data from different sensor modalities such as MI sensors, accelerometers, and gyroscopes. The model can learn complex features directly from raw data and map them to activities. It removes the need for manual feature engineering by experts while it achieves a comparable performance to models with the feature handcrafting step. Besides, the neural network model enables an interactive learning system when the user provides training data even after the initial training step. It allows the user to fine-tune a pre-trained neural network model with their personal data. However, the neural network complexity should be assessed where models have to be implemented in embedded systems with limited processing capability. It highlights the importance of trade-off between computational cost and detection accuracy to ensure real-time feedback.
Methods
Theoretic circuit modeling of the MI system
The MI system consisting of two coils can be modeled as a two-port network shown in Fig. 6. Coils are attached to impedance matching networks, called input and output matching networks, to maximize the transmission efficiency of the overall system52. The closed-form expressions of these circuit parameters are reported in ref. 33 to facilitate performance analysis of the MI-based communication system around the human body. The model is validated by simulations and measurements performed for various coils in different locations and alignments relative to each other33. The average error of all experiments compared with the simulated signal attenuation results is lower than 10% in the frequency range below 30âMHz. The more advanced version of the expressions without any simplification is also calculated and reported in this work.
Assume that the transmitter coil with number of turns NTX, area STX, and current ITX is centered at CTX, and its surface normal is \({\widehat{{\bf{n}}}}_{\mathrm{TX}}\). The receiver coil with number of turns NRX and area SRX is centered at CRX, and its surface normal is \({\widehat{{\bf{n}}}}_{\mathrm{RX}}\). The mutual inductance between the coils in a linear, homogeneous, isotropic background medium with permeability μ and complex propagation constant γ can be calculated from \(M=\frac{\mu {N}_{\mathrm{TX}}}{{I}_{\mathrm{TX}}}{\int }_{{S}_{\mathrm{RX}}}{{\bf{H}}}_{\mathrm{TX}}.d{{\bf{S}}}_{\mathrm{RX}}\)33,66. By using the exact expressions for the magnetic field generated by the TX coil HTX and applying a procedure similar to ref. 33, one can derive the mutual inductance without any simplification as follows:
where r is the distance between the origin and the observation point and can be defined in the cylindrical coordinates as follows:
The parameters used in the above expressions are calculated from location and alignment of TX/RX coils as follows:
where Rx(θx), Ry(θy), Rz(θz) are rotation matrices that rotate vectors by an angle θx, θy, θz about the x-, y-, or z-axis using the right-hand rule.
The self-inductance and resistance, which comprises DC resistivity, skin depth δw, and proximity effects, of a coil with radius a, length b, number of turns N, circular cross-section wire, core-material permeability μ, wire diameter Ïw, and wire resistivity of Ïw can be expressed as follows33,67:
There are multiple parameters for analyzing and determining the performance of a two-port network. As the proposed MI-based communication system is a cascaded connection of two-port networks, the ABCD parameters are the best candidate. These parameters, which are also known as transmission, chain, or cascade parameters, relate the input current and voltage at port-1 to the output. The ABCD parameters of the MI system are equivalent to the product of ABCD matrices corresponding to the input matching network, the MI system, and the output matching network, accordingly68. The scattering matrix S is another set of two-port parameters defined in terms of incident and reflected waves at ports. One of the matrix elements is forward voltage gain S21, which shows the voltage of the network at port two divided by the voltage at port-1. Converting the ABCD parameters to S-parameters, the forward voltage gain of the MI system can be determined as follows68:
where A, B, C, D are the ABCD parameters of the overall MI system including the MI coils and the matching circuits.
Measurement
The forward voltage gain of two coils is measured for 30âs via a VNA with 1800 points resolution. The corresponding synthetic S21 is also generated by using the system model for comparison. All parameters of the model are predefined based on the MI system setup except the distance and misalignment between coils, which are variable during the movement. Hence, two coils are labeled with red markers and placed in front of a green screen. The motion of coils is captured via an iPhoneâs built-in camera with 30âfps, and the videos are processed offline to extract markers, their center, and alignment, as shown in Fig. 7. Since only one camera is used, without loss of generality, coils only move in 2D such that the camera can capture their motion. The extracted pixel-wise movement of coils is then converted to the spatial translation using a predefined length âcalibration labelâ. The ratio of the calibration labelâs length to its size extracted from video provides a meter to pixel ratio. As the camera is fixed during the experiment, this ratio remains constant for all frames of the video. The recorded distance between coils covers up to 60âcm range. The generated synthetic MI data are synchronized with measured data by minimizing the NRMSE. The code used to track coils and calculate the forward voltage gain of the system based on the circuit model reported in this work is implemented in MATLAB.
Simulation
 Figure 8a depicts the location of coils considered around the human body for generating synthetic MI motion data. The location of markers required to track coils motion is also displayed in Fig. 8b. Assuming that the coils are located at the midpoint of bones, we can calculate their center by averaging the location of corresponding paired markers. For example, Fig. 8c shows the right leg, its corresponding transmitter coil, and markers. The center of the transmitter coil TX8 can be calculated as \({c}_{\mathrm{T}{\mathrm{X}}_{8}}=({M}_{14}+{M}_{15})/2\). The coils are around the human bones, which indicates that the alignment of the line passing through the markers is the same as the surface normal of its corresponding coil. Therefore, the surface normal of the transmitter TX8 can be written as \({\hat{n}}_{\mathrm{T}{\mathrm{X}}_{8}}=({M}_{15}-{M}_{14})/| {M}_{15}-{M}_{14}|\). The code used to calculate the forward voltage gain of the system based on the circuit model reported in this work is implemented in MATLAB.
a Location of MI transceivers (TXi, RX) on the human body and the laboratory version of an MI transceiver. b Location of markers (Mj) on the body. The marker pairs of (2,9), (3,4), (4,5), (10,11), (11,12), (6,7), (7,8), (13,14), (14,15) define two ends of the torso, left arm, left hand, left thigh, left leg, right arm, right hand, right thigh, and right leg, respectively. Consequently, these pairs can be utilized to calculate the location of coils (\({{\bf{C}}}_{\mathrm{T{X}}_{i}}\), CRX) and their alignment (\({\widehat{{\bf{n}}}}_{\mathrm{T{X}}_{i}}\), \({\widehat{{\bf{n}}}}_{\mathrm{RX}}\)). c The center and alignment of a bone and its corresponding coil can be calculated using markers locations.
Datasets of human activities
The experimental datasets considered in this work contain diverse movement data to verify the applicability of MI-HAR in detecting a wide range of activities. The BML dataset53 contains a full-body movement dataset for walking, knocking, lifting, and throwing performed by 15 male and 15 female actors in a neutral, angry, happy, and sad style. The dataset is balanced and has the same number of records performed by actors for each action. The total number of samples is 1028, with a sampling rate of 60âHz. For walking action, the data are captured for 30âs of walking in a triangle turning rightward (clockwise), and turning leftward (counterclockwise). For the knocking, lifting, and throwing actions, five repetitions of a single action unit are obtained for each data record, which is approximately 20âs in duration. The MHAD dataset54 contains the data for jumping, jumping jacks, bending, punching, waving two hands, waving one hand, clapping, throwing, sit down/stand up, sit down, stand up. The number of records for each action is the same, and each action is performed by seven male and five female subjects, yielding about 659 data sequences. Except for sitting down, stand up, and throwing, all records include five repetitions of a single action. The approximate recording length of activities varies from 2 to 15âs. The dataset consists of data from four microphones with a sampling rate of 48âkHz; six accelerometers fixed on wrists, hips, and ankles with a sampling rate of 30âHz, the optical motion capture system with a sampling rate of 480âHz, cameras with a sampling rate of 22âHz, and depth sensors with a sampling rate of 30âHz. In our experiments, we used the down-sampled MoCap data to 60âHz.
Data preprocessing
In our experiments, we have used the magnitude of MI signals as input for the classifiers. Data samples are processed before fetching into the classification models. The processing methods are implemented using Python 3.6. For data cleaning, the missing values are substituted with previous non-missing values, and a 5-point quadratic (order 4) polynomial Savitzky-Golay filter is applied for denoising. Then the baseline offset is removed from time-series data. In the MHAD dataset, 3% of the signals are removed from the end of each data sample as the reported experiments show improvement in the accuracy63.
Classification
The classifier models are implemented using Python 3.6. They are trained and evaluated on the generated synthetic motion datasets of eight bones using the leave-one-subject-out cross-validation (LOSO-CV) method. For the experiments on the BML and MHAD dataset, respectively, six and two subjects are used for validation and the rest for training.
Machine learning-based classifiers:Â The machine learning-based classifiers are implemented using python library Sklearn69. The multi-class models are non-linear SVM with a polynomial kernel, KNN, decision trees, random forests, and logistic regression. We used the bag-of-words (BoW) representation to characterize the time-series data with different lengths. First, the synthetic MI motion data are divided into fixed-length segments of 1 second using the sliding window technique with 0.8 second overlap. Attributes are then computed for the time domain, frequency domain, and time-frequency domain of each window segment. Frequency domain and time-frequency domain representations of the signal are calculated by the fast Fourier transform (FFT), and single-level discrete Wavelet transform (DWT) based on the Daubechies2 wavelet filter, respectively. The attributes considered here are extremes, mean, median, standard deviation, lower quartile, upper quartile, skewness, kurtosis, and the correlation between each pair of signals. As each action is associated with eight data samples, the resulting feature vector for each segment is generated by the concentration of eight feature sets. Features are also scaled using the min-max scaling method to bound values in the range of 0â1. The scaling makes the weight of all features equal in the process of classification. Next, the feature vectors from the training data are clustered using k-means clustering to define a codebook that contains the cluster centers, which are called codewords. Then, each window segment is assigned the closest codeword, and a time-series is represented as a histogram of codewords. The bag-of-words representations of synthetic MI motion data are used as inputs for the machine learning-based classification models. In our experiments, we quantized the training data of BML and MHAD datasets to 100 and 20 codewords, respectively.
Recurrent neural network: A schematic diagram of the neural network structure is summarized in Fig. 9. The deep LSTM model is implemented in the TensorFlow framework. We used the mean cross-entropy between the ground truth labels and the predicted class membership probability vector as the loss function, and the network parameters are updated by minimizing this loss function. The model is trained using batch gradient descent with the RMSprop updating rule. In each epoch of training, the entire training set is passed through the neural network model to update the model with an exponentially decaying learning rate. The dropout regularization technique is also applied to all nodes in the network to avoid overfitting. The dropout keep-probability determines the probability of keeping a node during training. After each epoch, the performance of the model is evaluated on the validation set. We evaluated the influence of several hyperparameters related to the network architecture and learning process using grid-search. These hyperparameters and their range of values explored for tuning during training are: number of layers in the range of {1, 2,3, 5, 10}, number of units in the range of {5, 10, 15, 20, 30, 40, 50}, keep probability in the range of {0.2, 0.5, 0.8, 1}, optimizer decay rate in the range of {0.8, 0.85, 0.9, 0.95, 0.98}, optimizer momentum in the range of {0, 10â3, 10â2, 10â1}, initial learning rate in the range of {10â3, 10â2, 10â1}, exponential decay rate in the range of {0.85, 0.9, 0.95, 0.98}, and exponential decay step in the range of {50, 100, 200, 300}. We implemented a five-layer network with 20 and 40 units for BML and MHAD datasets, respectively. Both datasets are trained with the optimizer decay rate of 0.95, the initial learning rate of 0.01, the exponential decay rate of 0.98, exponential decay step of 100, and keep the probability of 0.8.
The set of magnetic induction (MI) signals observed by the coils at time t is considered as the input vector xt. A time window of 1âs (Tâ=â1âs) is sliding over the data with 0.5âs overlap, and feeding the truncated subsequences of input data within the window to the batch normalization layer. Then the normalized input data (xtâT+1, . . . , xtâ1, xt) is fetched to the deep long short-term memory (LSTM) model. The network outputs sequences of vectors \(({{\bf{y}}}_{t-T+1}^{L},\ ...,\ {{\bf{y}}}_{t-1}^{L},\ {{\bf{y}}}_{t}^{L})\), where each output vector shows the prediction score of its corresponding input sample. Assuming the input signals are sequenced to N samples, the overall score of the entire window can be calculated by averaging all of the scores within the window into a single prediction vector of scores \({\widehat{{\bf{y}}}}_{t}\)70. Then the prediction scores are converted into class membership probabilities \({\widehat{{\bf{O}}}}_{t}\) by applying a softmax layer. The predicted class membership probability vector contains the probability of every class generated by our model. Then the most probable class is selected as the predicted activity label for the given input data within the time window.
Data availability
The data that support the findings of this study can be reproduced using the codes developed in this work and are also available on Figshare (https://doi.org/10.6084/m9.figshare.c.4844517). The raw data that our synthetic MI motion data were derived from are available in the public domain: BML dataset (http://paco.psy.gla.ac.uk); MHAD dataset (http://tele-immersion.citris-uc.org/berkeley_mhad).
Code availability
Computer code supporting the findings of this study are available on GitHub: synthesizing MI data (https://github.com/negargolestani/Synthesize_MI_data); Activity detection (https://github.com/negargolestani/Activity_Detection).
Change history
03 June 2020
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
References
Yin, J., Yang, Q. & Pan, J. J. Sensor-based abnormal human-activity detection. IEEE Trans. Knowl. Data Eng. 20, 1082â1090 (2008).
Kim, J. et al. Wearable smart sensor systems integrated on soft contact lenses for wireless ocular diagnostics. Nat. Commun. 8, 1â8 (2017).
Mukhopadhyay, S. C. Wearable sensors for human activity monitoring: a review. IEEE Sens. J. 15, 1321â1330 (2014).
Jovanov, E., Milenkovic, A., Otto, C. & De Groen, P. C. A wireless body area network of intelligent motion sensors for computer assisted physical rehabilitation. J. Neuroeng. Rehabil. 2, 6 (2005).
Jones, S. E. et al. Genetic studies of accelerometer-based sleep measures yield new insights into human sleep behaviour. Nat. Commun. 10, 1â12 (2019).
Kim, J., Campbell, A. S., de Ãvila, B. E.-F. & Wang, J. Wearable biosensors for healthcare monitoring. Nat. Biotechnol. 37, 389â406 (2019).
Lara, O. D. & Labrador, M. A. A survey on human activity recognition using wearable sensors. IEEE Commun. Surveys Tuts 15, 1192â1209 (2012).
Wang, W., Liu, A. X., Shahzad, M., Ling, K. & Lu, S. Understanding and modeling of wifi signal based human activity recognition. In Proc. 21st Annual International Conference on Mobile Computing and Networking, 65â76 (Paris, France, 2015).
Poppe, R. A survey on vision-based human action recognition. Image Vis. Comput. 28, 976â990 (2010).
Zhang, H.-B. et al. A comprehensive survey of vision-based human action recognition methods. Sensors 19, 1005 (2019).
Bodor, R., Jackson, B. & Papanikolopoulos, N. Vision-based human tracking and activity recognition. In 11th Mediterranean Conference on Control and Automation, Vol. 1 (Rhodes, Greece, 2003).
Moeslund, T. B., Hilton, A. & Krüger, V. A survey of advances in vision-based human motion capture and analysis. Comput. Vis. Image Und. 104, 90â126 (2006).
Kumari, P., Mathew, L. & Syal, P. Increasing trend of wearables and multimodal interface for human activity monitoring: a review. Biosens. Bioelectron. 90, 298â307 (2017).
Hua, A. et al. Accelerometer-based predictive models of fall risk in older women: a pilot study. NPJ Digit. Med. 1, 1â8 (2018).
Nweke, H. F., Teh, Y. W., Al-Garadi, M. A. & Alo, U. R. Deep learning algorithms for human activity recognition using mobile and wearable sensor networks: State of the art and research challenges. Expert Syst. Appl. 105, 233â261 (2018).
Lu, Y. et al. Towards unsupervised physical activity recognition using smartphone accelerometers. Multimed. Tools Appl. 76, 10701â10719 (2017).
CvetkoviÄ, B., Szeklicki, R., Janko, V., Lutomski, P. & LuÅ¡trek, M. Real-time activity monitoring with a wristband and a smartphone. Inf. Fusion 43, 77â93 (2018).
Hassan, M. M., Uddin, M. Z., Mohamed, A. & Almogren, A. A robust human activity recognition system using smartphone sensors and deep learning. Future Gener. Comp. Syst. 81, 307â313 (2018).
Wang, J., Chen, Y., Hao, S., Peng, X. & Hu, L. Deep learning for sensor-based activity recognition: a survey. Pattern Recogn. Lett. 119, 3â11 (2019).
Allahbakhshi, H., Hinrichs, T., Huang, H. & Weibel, R. The key factors in physical activity type detection using real-life data: a systematic review. Front. Physiol. 10, 75 (2019).
Gao, L., Bourke, A. & Nelson, J. Evaluation of accelerometer based multi-sensor versus single-sensor activity recognition systems. Med. Eng. Phys. 36, 779â785 (2014).
Chen, L., Hoey, J., Nugent, C. D., Cook, D. J. & Yu, Z. Sensor-based activity recognition. IEEE Trans. Syst., Man, Cybern. Syst. 42, 790â808 (2012).
Sztyler, T., Stuckenschmidt, H. & Petrich, W. Position-aware activity recognition with wearable devices. Pervasive Mob. Comput. 38, 281â295 (2017).
Cornacchia, M., Ozcan, K., Zheng, Y. & Velipasalar, S. A survey on activity detection and classification using wearable sensors. IEEE Sens. J. 17, 386â403 (2016).
ErtuÄrul, Ã. F. & Kaya, Y. Determining the optimal number of body-worn sensors for human activity recognition. Soft Comput. 21, 5053â5060 (2017).
Sharma, S., Tripathi, M. & Mishra, V. Survey paper on sensors for body area network in health care. In International Conference on Emerging Trends in Computing and Communication Technologies (ICETCCT), 1â6 (IEEE, Dehradun, India, 2017).
Mosenia, A., Sur-Kolay, S., Raghunathan, A. & Jha, N. K. Wearable medical sensor-based system design: a survey. IEEE Trans. Multi-Scale Comput. Syst. 3, 124â138 (2017).
Gravina, R., Alinia, P., Ghasemzadeh, H. & Fortino, G. Multi-sensor fusion in body sensor networks: state-of-the-art and research challenges. Inf. Fusion 35, 68â80 (2017).
Movassaghi, S., Abolhasan, M., Lipman, J., Smith, D. & Jamalipour, A. Wireless body area networks: a survey. IEEE Commun. Surveys Tuts. 16, 1658â1686 (2014).
Aroganam, G., Manivannan, N. & Harrison, D. Review on wearable technology sensors used in consumer sport applications. Sensors 19, 1983 (2019).
Cavallari, R., Martelli, F., Rosini, R., Buratti, C. & Verdone, R. A survey on wireless body area networks: technologies and design challenges. IEEE Commun. Surv. Tuts 16, 1635â1657 (2014).
Negra, R., Jemili, I. & Belghith, A. Wireless body area networks: applications and technologies. Procedia Comput. Sci. 83, 1274â1281 (2016).
Golestani, N. & Moghaddam, M. Theoretical modeling and analysis of magnetic induction communication in wireless body area networks (wban). IEEE J. Electromagn. RF Microw. Med. Biol. 2, 48â55 (2018).
Hasan, K., Biswas, K., Ahmed, K., Nafi, N. S. & Islam, M. S. A comprehensive review of wireless body area network. J. Netw. Comput. Appl. 143, 178â198 (2019).
Bae, J., Cho, H., Song, K., Lee, H. & Yoo, H.-J. The signal transmission mechanism on the surface of human body for body channel communication. IEEE Trans. Microw. Theory Techn. 60, 582â593 (2012).
Werner, D. H. & Jiang, Z. H. Electromagnetics of Body Area Networks: Antennas, Propagation, and RF Systems (John Wiley and Sons, 2016).
Hayajneh, T., Almashaqbeh, G., Ullah, S. & Vasilakos, A. V. A survey of wireless technologies coexistence in wban: analysis and open research issues. Wirel. Netw. 20, 2165â2199 (2014).
Natarajan, R., Zand, P. & Nabi, M. Analysis of coexistence between IEEE 802.15. 4, BLE and IEEE 802.11 in the 2.4 GHz ISM band. In IECON 2016â42nd Annual Conference of the IEEE Industrial Electronics Society, 6025â6032 (IEEE, Florence, Italy, 2016).
Sharma, A. K. et al. Magnetic induction-based non-conventional media communications: a review. IEEE Sens. J. 17, 926â940 (2016).
Golestani, N. & Moghaddam, M. Communication system design for magnetic induction-based wireless body area network. In USNC-URSI Radio Science Meeting (Joint with AP-S Symposium), 49â50 (IEEE, 2017).
Vicon. https://www.vicon.com/hardware/blue-trident. Accessed 29 Jan 2020.
Zenshin Technology. https://zenshin-tech.com. Accessed 29 Jan 2020.
Prayudi, I. & Kim, D. Design and implementation of imu-based human arm motion capture system. In IEEE International Conference on Mechatronics and Automation, 670â675 (IEEE, Chengdu, China, 2012).
Tal, N., Morag, Y., Shatz, L. & Levron, Y. Design optimization of transmitting antennas for weakly coupled magnetic induction communication systems. PLoS ONE 12, e0171982 (2017).
Bansal, R. Near-field magnetic communication. IEEE Antennas Propag. Mag. 46, 114â115 (2004).
Sun, Z. & Akyildiz, I. F. Magnetic induction communications for wireless underground sensor networks. IEEE Trans. Antennas Propag. 58, 2426â2435 (2010).
Agbinya, J. I. & Masihpour, M. Power equations and capacity performance of magnetic induction communication systems. Wirel. Pers. Commun. 64, 831â845 (2012).
Sun, Z., Akyildiz, I. F., Kisseleff, S. & Gerstacker, W. Increasing the capacity of magnetic induction communications in rf-challenged environments. IEEE Trans. Commun. 61, 3943â3952 (2013).
Kim, H.-J. et al. Review of near-field wireless power and communication for biomedical applications. IEEE Access 5, 21264â21285 (2017).
Domingo, M. C. Magnetic induction for underwater wireless communication networks. IEEE Trans. Antenn. Propag. 60, 2929â2939 (2012).
Vemulapalli, R., Arrate, F. & Chellappa, R. Human action recognition by representing 3d skeletons as points in a lie group. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, 588â595 (Columbus, Ohio, 2014).
Golestani, N. & Moghaddam, M. Improving the efficiency of magnetic induction-based wireless body area network. In IEEE International Microwave Biomedical Conference (IMBioC), 166â168 (IEEE, Philadelphia, Pennsylvania, 2018).
Ma, Y., Paterson, H. M. & Pollick, F. E. A motion capture library for the study of identity, gender, and emotion perception from biological motion. Behav. Res. Methods 38, 134â141 (2006).
Ofli, F., Chaudhry, R., Kurillo, G., Vidal, R. & Bajcsy, R. Berkeley mhad: a comprehensive multimodal human action database. In IEEE Workshop on Applications of Computer Vision (WACV), 53â60 (IEEE, Tampa, Florida, 2013).
Johari, R., Krogmeier, J. V. & Love, D. J. Analysis and practical considerations in implementing multiple transmitters for wireless power transfer via coupled magnetic resonance. IEEE Trans. Ind. Electron. 61, 1774â1783 (2013).
Cannon, B. L., Hoburg, J. F., Stancil, D. D. & Goldstein, S. C. Magnetic resonant coupling as a potential means for wireless power transfer to multiple small receivers. IEEE Trans. Power Electron 24, 1819â1825 (2009).
Pan, T.-Y., Kuo, C.-H., Liu, H.-T. & Hu, M.-C. Handwriting trajectory reconstruction using low-cost imu. IEEE Trans. Emerg. Top. Comput. Intell. 3, 261â270 (2018).
Giansanti, D., Maccioni, G., Benvenuti, F. & Macellari, V. Inertial measurement units furnish accurate trunk trajectory reconstruction of the sit-to-stand manoeuvre in healthy subjects. Med. Biol. Eng. Comput. 45, 969â976 (2007).
Suvorova, S., Vaithianathan, T. & Caelli, T. Action trajectory reconstruction from inertial sensor measurements. In 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA), 989â994 (IEEE, Montreal, Quebec, 2012).
Ordóñez, F. J. & Roggen, D. Deep convolutional and lstm recurrent neural networks for multimodal wearable activity recognition. Sensors 16, 115 (2016).
Hammerla, N. Y., Halloran, S. & Plotz, T. Deep, convolutional, and recurrent models for human activity recognition using wearables. In 25th Int. Joint Conf. Artif. Intell. (IJCAI), 1533â1540 (New York, New York, 2016).
Shields, T. J., Amer, M. R., Ehrlich, M. & Tamrakar, A. Action-affect-gender classification using multi-task representation learning. In IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2249â2258 (IEEE, Honolulu, Hawaii, 2017).
Mimouna, A., Khalifa, A. B. & Amara, N. E. B. Human action recognition using triaxial accelerometer data: selective approach. In 15th International Multi-Conference on Systems, Signals and Devices (SSD), 491â496 (IEEE, Hammamet, Tunisia, 2018).
Garcia-Ceja, E., Galván-Tejada, C. E. & Brena, R. Multi-view stacking for activity recognition with sound and accelerometer data. Inf. Fusion 40, 45â56 (2018).
Noori, F. M., Wallace, B., Uddin, M. Z. & Torresen, J. A robust human activity recognition approach using openpose, motion features, and deep recurrent neural network. In Scandinavian Conference on Image Analysis, 299â310 (Springer, Norrköping, Sweden, 2019).
Cheng, D. K. et al. Field and Wave Electromagnetics. Pearson Education India (Addison-Wesley, 1989).
Rosa, E. B. Calculation of the Self-Inductance of Single-Layer Coils. 31 (US Government Printing Office, 1906).
Frickey, D. A. Conversions between s, z, y, h, abcd, and t parameters which are valid for complex source and load impedances. IEEE Trans. Microw. Theory Techn. 42, 205â211 (1994).
Pedregosa, F. et al. Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 12, 2825â2830 (2011).
Murad, A. & Pyun, J.-Y. Deep recurrent neural networks for human activity recognition. Sensors 17, 2556 (2017).
Acknowledgements
This work was supported in part by a grant (80NSSC17K0283) from the National Aeronautics and Space Administration (NASA) Earth Science Technology Office (ESTO) Advanced Information Systems Technology (AIST) program.
Author information
Authors and Affiliations
Contributions
N.G. was the main contributor to this work and was responsible for developing and implementing the methods, data generation, and analysis; M.M. supervised the research.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work.
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Golestani, N., Moghaddam, M. Human activity recognition using magnetic induction-based motion signals and deep recurrent neural networks. Nat Commun 11, 1551 (2020). https://doi.org/10.1038/s41467-020-15086-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-020-15086-2