Abstract
Facial paralysis is a syndrome that causes difficulty in the movement of facial muscles on either one or both sides of the face. In this paper, the quantitative assessment for facial paralysis is proposed using dynamic kernels to detect facial paralysis and its various effect level’s on a person’s face by modeling different facial expressions. Initially, the movements of facial muscles are captured locally by spatio-temporal features for each video. Using the extracted spatio-temporal features from all the videos, a large Gaussian mixture model (GMM) is trained to learn the dynamics of facial muscles globally. In order to handle these local and global features in variable-length patterns like videos, we propose to use a dynamic kernel modeling approach. Dynamic kernels are generally known for handling variable-length data patterns like speech, videos, etc., either by mapping them into fixed-length data patterns or by creating new kernels for example by selecting discriminative sets of representations obtained from GMM statistics. In the proposed work, we explore three different kinds of dynamic kernels, namely, explicit mapping kernels, probability-based kernels, and intermediate matching kernels for the modeling of facial expressions. These kernels are then used as feature vectors for classification using the support vector machine (SVM) to detect severity levels of facial paralysis. The efficacy of the proposed dynamic kernel modeling approach for the quantitative assessment of facial paralysis is demonstrated on a self-collected facial paralysis video dataset of 39 facially paralyzed patients of different severity levels. The dataset collected contains patients from different age groups and gender, further, the videos are recorded from seven different view angles for making the proposed model robust to subject and view variations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Yang, C., et al.: Automatic degree evaluation of facial nerve paralysis based on triple-stream long short term memory. In: Proceedings of the Third International Symposium on Image Computing and Digital Medicine, ISICDM 2019, New York, NY, USA, pp. 7–11. Association for Computing Machinery (2019). https://doi.org/10.1145/3364836.3364838
House, J.W., Brackmann, D.E.: Facial nerve grading system. Otolaryngol. Head Neck Surg. 93(2), 146–147 (1985). https://doi.org/10.1177/019459988509300202. pMID: 3921901
Hato, N., Fujiwara, T., Gyo, K., Yanagihara, N.: Yanagihara facial nerve grading system as a prognostic tool in Bell’s palsy. Otol. Neurotol. 35(9), 1669–1672 (2014)
Xu, P., et al.: Automatic evaluation of facial nerve paralysis by dual-path LSTM with deep differentiated network. Neurocomputing 388, 70–77 (2020). https://doi.org/10.1016/j.neucom.2020.01.014. http://www.sciencedirect.com/science/article/pii/S0925231220300412
Ngo, T.H., Chen, Y.W., Seo, M., Matsushiro, N., Xiong, W.: Quantitative analysis of facial paralysis based on three-dimensional features. In: 2016 IEEE International Conference on Image Processing (ICIP), pp. 1319–1323, September 2016. https://doi.org/10.1109/ICIP.2016.7532572
Ngo, T.H., Chen, Y.W., Matsushiro, N., Seo, M.: Quantitative assessment of facial paralysis based on spatiotemporal features. IEICE Trans. Inf. Syst. E99.D(1), 187–196 (2016). https://doi.org/10.1587/transinf.2015EDP7082
Wang, H., Oneata, D., Verbeek, J., Schmid, C.: A robust and efficient video representation for action recognition. Int. J. Comput. Vis. 119(3), 219–238 (2015). https://doi.org/10.1007/s11263-015-0846-5. https://hal.inria.fr/hal-01145834
Perveen, N., Mohan, C.K., Chen, Y.W.: Quantitative analysis of facial paralysis using GMM and dynamic kernels. In: Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISAPP, vol. 5, pp. 173–184. INSTICC, SciTePress (2020). https://doi.org/10.5220/0009104801730184
He, S., Soraghan, J.J., O’Reilly, B.F., Xing, D.: Quantitative analysis of facial paralysis using local binary patterns in biomedical videos. IEEE Trans. Biomed. Eng. 56(7), 1864–1870 (2009). https://doi.org/10.1109/TBME.2009.2017508
Guo, Z., et al.: Deep assessment process: objective assessment process for unilateral peripheral facial paralysis via deep convolutional neural network. In: 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017), pp. 135–138, April 2017. https://doi.org/10.1109/ISBI.2017.7950486
Jiang, C., et al.: Automatic facial paralysis assessment via computational image analysis. J. Healthc. Eng. 2020, 2398542 (2020). https://doi.org/10.1155/2020/2398542
Banks, C.A., Bhama, P.K., Park, J., Hadlock, C.R., Hadlock, T.A.: Clinician-graded electronic facial paralysis assessment: the eFACE. Plast. Reconstr. Surg. 136(2), 223e–230e (2015)
Ridha, A., Shehieb, W., Yacoub, P., Al-Balawneh, K., Arshad, K.: Smart prediction system for facial paralysis, pp. 321–327, April 2020. https://doi.org/10.1109/ICEEE49618.2020.9102600
Gupta, A.: StrokeSave: a novel, high-performance mobile application for stroke diagnosis using deep learning and computer vision. CoRR abs/1907.05358 (2019). http://arxiv.org/abs/1907.05358
Park, E., Han, T., Nam, H.S.: mFAST: automatic stoke evaluation system for time-critical treatment with multimodal feature collection and machine learning classification. In: Proceedings of the 2020 12th International Conference on Computer and Automation Engineering, ICCAE 2020, New York, NY, USA, pp. 38–41. Association for Computing Machinery (2020). https://doi.org/10.1145/3384613.3384653
Wang, T., Zhang, S., Dong, J., Liu, L., Yu, H.: Automatic evaluation of the degree of facial nerve paralysis. Multimed. Tools Appl. 75(19), 11893–11908 (2016). https://doi.org/10.1007/s11042-015-2696-0
Tran, D., Bourdev, L.D., Fergus, R., Torresani, L., Paluri, M.: C3D: generic features for video analysis. CoRR abs/1412.0767 (2014). http://arxiv.org/abs/1412.0767
Wachtman, G., et al.: Measurement of asymmetry in persons with facial paralysis. In: Combined Annual Conference of the Robert H. Ivy and Ohio Valley Societies of Plastic and Reconstructive Surgeons, June 2002
Ngo, T.H., Seo, M., Chen, Y.W., Matsushiro, N.: Quantitative assessment of facial paralysis using local binary patterns and Gabor filters. In: Proceedings of the Fifth Symposium on Information and Communication Technology, SoICT 2014, New York, NY, USA, pp. 155–161. ACM (2014). https://doi.org/10.1145/2676585.2676607. http://doi.acm.org/10.1145/2676585.2676607
van der Maaten, L., Hinton, G.: Viualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Perveen, N., Mohan, C.K., Chen, Y.W. (2022). Expression Modeling Using Dynamic Kernels for Quantitative Assessment of Facial Paralysis. In: Bouatouch, K., et al. Computer Vision, Imaging and Computer Graphics Theory and Applications. VISIGRAPP 2020. Communications in Computer and Information Science, vol 1474. Springer, Cham. https://doi.org/10.1007/978-3-030-94893-1_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-94893-1_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-94892-4
Online ISBN: 978-3-030-94893-1
eBook Packages: Computer ScienceComputer Science (R0)