Abstract
Facial expressions, especially spontaneous micro-expressions, are an intuitive reflection of human emotions and have attracted considerable attention alongside recent rapid advances in computer vision. Micro-expressions are small in amplitude, short in duration, and often appear together with macro-expressions, which makes spotting them in long videos a challenging task. In this article, we propose an intersection-over-minimum labelling method combined with a Lite General Network and MagFace CNN (LGNMNet) model to predict the probability that a video frame belongs to a micro-expression interval; the approach balances easy and difficult samples to improve learning during training. Experimental results show that our method achieves state-of-the-art performance in spotting micro-expressions in long videos on both the CAS(ME)2 and SAMM-LV datasets (with F1-scores of 0.2474 and 0.2555, respectively). Additionally, a new pair-merge method that combines nearby detected apex frames to construct micro-expression intervals in the post-processing stage has been devised and analysed, providing a feasible solution for macro- and micro-expression spotting in long videos.
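To make the intersection-over-minimum idea concrete, the following is a minimal Python sketch of the generic measure between two frame intervals. It is an illustration under stated assumptions, not the labelling procedure of the article: the abstract does not spell out the exact rule, so the function names `intersection_over_minimum` and `window_label`, the inclusive `(start, end)` interval convention, and the choice to take the maximum over ground-truth intervals are all hypothetical.

```python
from typing import Iterable, Tuple

Interval = Tuple[int, int]  # (start_frame, end_frame), both inclusive (assumed convention)


def intersection_over_minimum(a: Interval, b: Interval) -> float:
    """Overlap length divided by the length of the shorter interval."""
    a_start, a_end = a
    b_start, b_end = b
    # Number of frames shared by the two inclusive intervals.
    overlap = min(a_end, b_end) - max(a_start, b_start) + 1
    if overlap <= 0:
        return 0.0
    shorter = min(a_end - a_start + 1, b_end - b_start + 1)
    return overlap / shorter


def window_label(window: Interval, ground_truth: Iterable[Interval]) -> float:
    """Hypothetical soft label: best IoM of a window against any annotated interval."""
    return max(
        (intersection_over_minimum(window, gt) for gt in ground_truth),
        default=0.0,
    )


if __name__ == "__main__":
    # A 10-frame window overlapping an annotated interval on 5 frames.
    print(window_label((10, 19), [(15, 40), (100, 112)]))
```

Unlike intersection over union, dividing by the shorter interval gives a short interval that lies entirely inside a longer one a score of 1.0, which may be one reason such a measure suits brief micro-expressions.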
Data availability
The SAMM and CAS(ME)2 datasets that support the findings of this study are available at http://www2.docm.mmu.ac.uk/STAFF/M.Yap/dataset.php and http://fu.psych.ac.cn/CASME/casme2-en.php, respectively.
Author information
Contributions
SY and TY: wrote the main manuscript text. Q-LG: discussed key methods and experiments. All authors reviewed the manuscript.
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Communicated by B. Bao.
About this article
Cite this article
Gu, QL., Yang, S. & Yu, T. Lite general network and MagFace CNN for micro-expression spotting in long videos. Multimedia Systems 29, 3521–3530 (2023). https://doi.org/10.1007/s00530-023-01145-3
DOI: https://doi.org/10.1007/s00530-023-01145-3