2019-Feature Selection Framework For XGBoost Based On Electrodermal Activity in Stress Detection
2019-Feature Selection Framework For XGBoost Based On Electrodermal Activity in Stress Detection
2019-Feature Selection Framework For XGBoost Based On Electrodermal Activity in Stress Detection
TABLE IV
P ROPOSED F EATURE E XTRACTION /S ELECTION P ERFORMANCE
Fig. 4. F1 score and computation time along different feature size for chest-
WESAD [5] Time Frequency Entropy Wavelet Fusion based signal in fusion modality. The performance was compared with different
Chest feature selection methods. Blue line: proposed selection f1 score. Red line:
Extraction 89.29 (14) 92.35 (85) 87.87 (30) 78.87 (24) 91.44 (42) 92.27 (195)
Correlation 89.91 (9) 89.74 (37) 87.45 (15) 74.61 (9) 89.89 (29) 91.53 (94) RFE f1 score. Black bar: proposed selection computation time. Grey bar: RFE
Importance 90.08 (5) 89.92 (9) 88.53(5) 76.51 (4) 90.31 (10) 92.38 (9) computation time.
Wrist
Extraction 87.67 (14) 89.73 (85) 84.95 (30) 78.60 (24) 88.42 (42) 89.48 (195)
Correlation 87.02 (9) 88.47 (37) 83.11 (15) 78.36 (9) 89.62 (29) 88.94 (94) sensitive to stress level on time scale and suitable for time-
Importance 88.40 (5) 88.76 (9) 83.70 (5) 77.95 (4) 89.92 (10) 88.78 (9)
Extraction=Extracted features split into time, frequency, entropy, wavelet
frequency analysis.
Correlation=Dominant features without high correlation
Importance=Dominant features important for XGBoost without high correlation
3) Proposed Feature Selection Performance: With lots of
signal processing features, we proposed two selection methods
A. Classification Results to select the dominant features based on their importance
We have three goals for the classification experiment, in- and correlation. Furthermore, we compare our methods with
cluding performance improvement with less overlapped seg- Recursive Feature Elimination (RFE) considering F1-score and
mentation, various signal processing features, and selection computation time, as shown in Fig. 4. The results suggest our
for dominant features. We use XGBoost as our experiment methods with enough feature information can have comparable
model and apply grid search for model parameters tuning. The F1 score to RFE. However, it is considerably less computation
classification performance is evaluated in terms of mean F1- time for our methods during training. By removing correlated
score, which is the harmonic mean of precision and recall. We features and selecting important features for XGBoost, our
employ leave-one-subject-out as our cross-validation scheme, methods can have more efficient performance.
where the classification models are trained using all data but After feature selection for XGBoost, we can obtain the dom-
a previously unseen subject which is used in testing. inant features with similar high classification scores comparing
1) Proposed Segmentation Performance: The stress detec- to all features, as shown in Table IV. Correlation selection can
tion performance with different segmentation methods com- reduce more than half the original features while importance
pared to WESAD is shown in Table III. The extracted features selection can acquire the most dominant features in XGBoost,
and the five machine learning algorithms except XGBoost e.g. features in fusion modality can reduce from 195 features
are all applied the same as WESAD. The results show that to 9 features. The best performance results on XGBoost are
segmentation can have a great influence on model performance 92.38% and 89.92% using chest- and wrist-based EDA signal
with +15% improvement on the testing set, which we suggest respectively. The values indicate that XGBoost is able to
that lots of segments may lead to over-fitting on the similar identify the stress and non-stress class with the proposed
training samples. dominant features. Most importantly, our selection methods
Comparing the performance of the employed algorithms, have the ability to select the features useful for stress detection.
it is obvious that XGBoost has higher classification scores
B. Dominant Feature Analysis
than other ensemble-based or common-used models on both
chest- and wrist-based EDA signals. Therefore, we carry out Dominant features in terms of the feature selection of
more experiments on XGBoost to explore its ability for stress fusion modality are shown in Table V and the features we
detection. propose are highlighted in boldface. The tonic component
2) Proposed Feature Extraction Performance: As for our captures the mean level and the signal range while the phasic
enhanced results on XGBoost, we analyze more features on component emphasizes the entropy and difference. The result
different feature extraction domain as shown in Table IV. shows that the low frequency magnitude and the high fre-
Slight improvements compared to WESAD features are +3% quency complexity of EDA signal contain the most significant
and +2% for chest- and wrist-based signal, respectively, which information in stress detection. Moreover, the property of these
indicate that additional signal processing features of EDA sig- features indicate the positive correlation with stress level,
nal are useful for stress detection. Regarding more information which suggests that stress can lead to high level and high
on different extraction domains, the results suggest that good disorder electrodermal activity.
features are derived more from time and wavelet domain. In addition to the discussion on the information of features,
These processing features show that the EDA signal wave is we take the dominant features as a nine-dimensional vector
TABLE V [4] W. Boucsein, Electrodermal activity. New York: Plenum, 1992.
D OMINANT F EATURES OF F USION M ODALITY [5] P. Schmidt, A. Reiss, R. Duerichen, C. Marberger, and K. Van Laer-
hoven, “Introducing wesad, a multimodal dataset for wearable stress and
No. Dominant features
affect detection,” in Proceedings of the 2018 on International Conference
1 mean of the tonic
on Multimodal Interaction, 2018, pp. 400–408.
2 number of local maximum in 1st difference of phasic
[6] J. Healey, R. W. Picard et al., “Detecting stress during real-world driving
3 approximate entropy of high-frequency wavelet of tonic
tasks using physiological sensors,” IEEE Transactions on Intelligent
4 information entropy of high frequency wavelet of tonic
Transportation Systems, vol. 6, no. 2, pp. 156–166, 2005.
5 1-2Hz spectral power of phasic
[7] M. Gjoreski, H. Gjoreski, M. Luštrek, and M. Gams, “Continuous stress
6 sum of phasic startle magnitudes
detection using a wrist device: in laboratory and real life,” in Proceedings
7 standard deviation of phasic
of the 2016 ACM International Joint Conference on Pervasive and
8 area under the phasic responses
Ubiquitous Computing: Adjunct, 2016, pp. 1185–1193.
9 range of the EDA
[8] A. Anusha, J. Jose, S. Preejith, J. Jayaraj, and S. Mohanasankar,
“Physiological signal based work stress detection using unobtrusive
sensors,” Biomedical Physics & Engineering Express, vol. 4, no. 6, p.
065001, 2018.
[9] V. Sharma, N. R. Prakash, and P. Kalra, “Audio-video emotional
response mapping based upon electrodermal activity,” Biomedical Signal
Processing and Control, vol. 47, pp. 324–333, 2019.
[10] T. Chen and C. Guestrin, “Xgboost: A scalable tree boosting system,”
in Proceedings of the 22nd ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining, 2016, pp. 785–794.
[11] K. Tung, P.-K. Liu, Y.-C. Chuang, S.-H. Wang, and A.-Y. A. Wu,
“Entropy-assisted multi-modal emotion recognition framework based on
physiological signals,” in 2018 IEEE-EMBS Conference on Biomedical
Engineering and Sciences (IECBES), 2018, pp. 22–26.
[12] C. Setz, B. Arnrich, J. Schumm, R. La Marca, G. Tröster, and U. Ehlert,
“Discriminating stress from cognitive load using a wearable eda device,”
IEEE Transactions on Information Technology in Biomedicine, vol. 14,
Fig. 5. t-SNE visualization for dominant features based on EDA signal. Red: no. 2, pp. 410–417, 2010.
non-stress. Blue: stress. [13] H. Kurniawan, A. V. Maslov, and M. Pechenizkiy, “Stress detection from
speech and galvanic skin response signals,” in Proceedings of the 26th
among all data segments. These vectors are projected on a two- IEEE International Symposium on Computer-Based Medical Systems,
2013, pp. 209–214.
dimensional space by t-SNE [21], which we can distinguish [14] H. F. Posada-Quintero, J. P. Florian, A. D. Orjuela-Cañón, T. Aljama-
the data distribution easily, as shown in Fig. 5. The visualiza- Corrales, S. Charleston-Villalobos, and K. H. Chon, “Power spectral
tion result vindicates the ability of the dominant features to density analysis of electrodermal activity for sympathetic function
assessment,” Annals of Biomedical Engineering, vol. 44, no. 10, pp.
detect the non-stress class and stress class while the non-stress 3124–3135, 2016.
class contains baseline and amusement affect states. [15] S. Taylor, N. Jaques, W. Chen, S. Fedor, A. Sano, and R. Picard,
“Automatic identification of artifacts in electrodermal activity data,” in
VI. C ONCLUSION 2015 37th Annual International Conference of the IEEE Engineering in
Medicine and Biology Society (EMBC), 2015, pp. 1934–1937.
In this paper, we propose an enhanced framework for stress [16] Z. Visnovcova, M. Mestanik, M. Gala, A. Mestanikova, and I. Ton-
detection based on EDA signal. The proposed improvements hajzerova, “The complexity of electrodermal activity is altered in mental
for higher performance include less overlapped segmentation, cognitive stressors,” Computers in Biology and Medicine, vol. 79, pp.
123–129, 2016.
more signal processing features, and extreme gradient boosting [17] C. Kirschbaum, K.-M. Pirke, and D. H. Hellhammer, “The trier social
classifier. Moreover, we obtain the most dominant features stress test–a tool for investigating psychobiological stress responses in
related to stress based on the selection of feature importance a laboratory setting,” Neuropsychobiology, vol. 28, no. 1-2, pp. 76–81,
1993.
and correlation. The analysis of dominant feature suggests [18] J. Kim and E. André, “Emotion recognition based on physiological
that the magnitude of low frequency and the complexity of changes in music listening,” IEEE Transactions on Pattern Analysis and
high frequency of EDA signal are useful for stress detection. Machine Intelligence, vol. 30, no. 12, pp. 2067–2083, 2008.
[19] R. Zangróniz, A. Martı́nez-Rodrigo, J. Pastor, M. López, and
According to our studies, we can acquire 92.38% and 89.92% A. Fernández-Caballero, “Electrodermal activity sensor for classification
f1-scores with 9 dominant features in XGBoost classifier based of calm/distress condition,” Sensors, vol. 17, no. 10, p. 2324, 2017.
on chest- and wrist-based EDA signals, which is the state of [20] J. Biesiada and W. Duch, “Feature selection for high-dimensional data-
a pearson redundancy based filter,” in Computer recognition systems 2.
the art on WESAD using EDA for stress detection. Springer, 2007, pp. 242–249.
[21] J. Birjandtalab, D. Cogan, M. B. Pouyan, and M. Nourani, “A non-
R EFERENCES eeg biosignals dataset for assessment and visualization of neurological
[1] G. P. Chrousos and P. W. Gold, “The concepts of stress and stress status,” in 2016 IEEE International Workshop on Signal Processing
system disorders: overview of physical and behavioral homeostasis,” Systems (SiPS), 2016, pp. 110–114.
The Journal of the American Medical Association, vol. 267, no. 9, pp.
1244–1252, 1992.
[2] H. Gao, A. Yüce, and J.-P. Thiran, “Detecting emotional stress from
facial expressions for driving safety,” in 2014 IEEE International Con-
ference on Image Processing (ICIP), 2014, pp. 5961–5965.
[3] H. Lu, D. Frauendorfer, M. Rabbi, M. S. Mast, G. T. Chittaranjan, A. T.
Campbell, D. Gatica-Perez, and T. Choudhury, “Stresssense: Detecting
stress in unconstrained acoustic environments using smartphones,” in
Proceedings of the 2012 ACM Conference on Ubiquitous Computing,
2012, pp. 351–360.