Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3503161.3548185acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Mimicking the Annotation Process for Recognizing the Micro Expressions

Published: 10 October 2022 Publication History

Abstract

Micro-expression recognition (MER) has recently become a popular research topic due to its wide applications, e.g., movie rating and recognizing the neurological disorder. By virtue of deep learning techniques, the performance of MER has been significantly improved and reached unprecedented results. This paper proposes a novel architecture to mimic how the expressions are annotated. Specifically, during the annotation process in several datasets, the AU labels are first obtained with FACS, and the expression labels are then decided based on the combinations of the AU labels. Meanwhile, these AU labels describe either the eyes or mouth movements (mutually-exclusive). Following this idea, we design a dual-branch structure with a new augmentation method to separately capture the eyes and mouth features and teach the model what the general expressions should be. Moreover, to adaptively fuse the area features for different expressions, we propose Area Weighted Module to assign different weights to each region. Additionally, we set up an auxiliary task to align the AU similarity scores to help our model capture facial patterns further with AU labels. The proposed approach outperforms other state-of-the-art methods in terms of accuracy on the CASME II and SAMM datasets. Moreover, we provide a new visualization approach to show the relationship between the facial regions and AU features.

Supplementary Material

MP4 File (MM22-fp1805.mp4)
This presentation video covers our ideas and proposed methods in our paper. The contents include how we design our dual-branch model and how we design to visualize the relations between AUs and different facial areas. We also compare our methods with other state-of-the-art to show that our proposed methods are powerful.

References

[1]
Shaojie Bai, J Zico Kolter, and Vladlen Koltun. 2018. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv preprint arXiv:1803.01271 (2018).
[2]
Aditya Chattopadhay, Anirban Sarkar, Prantik Howlader, and Vineeth N Balasubramanian. 2018. Grad-cam: Generalized gradient-based visual explanations for deep convolutional networks. In IEEE Winter Conference on Applications of Computer Vision. 839--847.
[3]
Adrian K Davison, Cliff Lansley, Nicholas Costen, Kevin Tan, and Moi Hoon Yap. 2016. Samm: A spontaneous micro-facial movement dataset. IEEE Transactions on Affective Computing, Vol. 9, 1 (2016), 116--129.
[4]
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. 2021. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In International Conference on Learning Representations.
[5]
Paul Ekman and Wallace V Friesen. 1978. Facial action coding system. Environmental Psychology & Nonverbal Behavior (1978).
[6]
Vida Esmaeili, Mahmood Mohassel Feghhi, and Seyed Omid Shahdi. 2021. Micro-Expression Recognition Using Histogram of Image Gradient Orientation on Diagonal Planes. In International Conference on Pattern Recognition and Image Analysis. 1--5.
[7]
Yiyang Gan, Ruize Han, Liqiang Yin, Wei Feng, and Song Wang. 2021. Self-supervised Multi-view Multi-Human Association and Tracking. In ACM International Conference on Multimedia. 282--290.
[8]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 770--778.
[9]
Miho Iwasaki and Yasuki Noguchi. 2016. Hiding true emotions: micro-expressions in eyes retrospectively concealed by mouth movements. Scientific reports, Vol. 6, 1 (2016), 1--10.
[10]
Huai-Qian Khor, John See, Sze-Teng Liong, Raphael CW Phan, and Weiyao Lin. 2019. Dual-stream shallow networks for facial micro-expression recognition. In IEEE International Conference on Image Processing. 36--40.
[11]
Prannay Khosla, Piotr Teterwak, Chen Wang, Aaron Sarna, Yonglong Tian, Phillip Isola, Aaron Maschinot, Ce Liu, and Dilip Krishnan. 2020. Supervised contrastive learning. In Neural Information Processing Systems, Vol. 33, 18661--18673.
[12]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Neural Information Processing Systems, Vol. 25.
[13]
Ankith Jain Rakesh Kumar and Bir Bhanu. 2021. Micro-expression classification based on landmark relations with graph attention convolutional network. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1511--1520.
[14]
Ling Lei, Tong Chen, Shigang Li, and Jianfeng Li. 2021. Micro-Expression Recognition Based on Facial Graph Representation Learning and Facial Action Unit Fusion. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1571--1580.
[15]
Ling Lei, Jianfeng Li, Tong Chen, and Shigang Li. 2020. A novel graph-tcn with a graph structured representation for micro-expression recognition. In ACM International Conference on Multimedia. 2237--2245.
[16]
Xiaobai Li, Tomas Pfister, Xiaohua Huang, Guoying Zhao, and Matti Pietik"ainen. 2013. A spontaneous micro-expression database: Inducement, collection and baseline. In IEEE International Conference on Automatic face and Gesture Recognition, 1--6.
[17]
Yuchi Liu, Heming Du, Liang Zheng, and Tom Gedeon. 2019. A neural micro-expression recognizer. In IEEE International Conference on Automatic Face and Gesture Recognition. 1--4.
[18]
Yong-Jin Liu, Jin-Kai Zhang, Wen-Jing Yan, Su-Jing Wang, Guoying Zhao, and Xiaolan Fu. 2015. A main directional mean optical flow feature for spontaneous micro-expression recognition. In IEEE Transactions on Affective Computing, Vol. 7, 4 (2015), 299--310.
[19]
Ling Lo, Hong-Xia Xie, Hong-Han Shuai, and Wen-Huang Cheng. 2020. MER-GCN: Micro-expression recognition based on relation modeling with graph convolutional networks. In IEEE Conference on Multimedia Information Processing and Retrieval. 79--84.
[20]
Ilya Loshchilov and Frank Hutter. 2017. SGDR: Stochastic gradient descent with warm restarts. In International Conference on Learning Representations.
[21]
Ilya Loshchilov and Frank Hutter. 2019. Decoupled Weight Decay Regularization. In International Conference on Learning Representations.
[22]
Tae-Hyun Oh, Ronnachai Jaroensri, Changil Kim, Mohamed Elgharib, Fr'edo Durand, William T Freeman, and Wojciech Matusik. 2018. Learning-based video motion magnification. In European Conference on Computer Vision. 633--648.
[23]
Michel Owayjan, Ahmad Kashour, Nancy Al Haddad, Mohamad Fadel, and Ghinwa Al Souki. 2012. The design and development of a lie detection system using facial micro-expressions. In International Conference on Advances in Computational Tools for Engineering Applications. 33--38.
[24]
Ramprasaath R Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, and Dhruv Batra. 2017. Grad-cam: Visual explanations from deep networks via gradient-based localization. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 618--626.
[25]
Chidanand Shetty, Arooz Khan, Tanya Singh, and Keerti Kharatmol. 2021. Movie Review Prediction System by Real Time Analysis of Facial Expression. In International Conference on Communication and Electronics Systems. 1109--1113.
[26]
Baolin Song, Ke Li, Yuan Zong, Jie Zhu, Wenming Zheng, Jingang Shi, and Li Zhao. 2019. Recognizing spontaneous micro-expression using a three-stream convolutional neural network. IEEE Access, Vol. 7 (2019), 184537--184551.
[27]
Aaron Van den Oord, Yazhe Li, and Oriol Vinyals. 2018. Representation learning with contrastive predictive coding. arXiv e-prints (2018), arXiv--1807.
[28]
Nguyen Van Quang, Jinhee Chun, and Takeshi Tokuyama. 2019. CapsuleNet for micro-expression recognition. In IEEE International Conference on Automatic Face and Gesture Recognition. 1--7.
[29]
Haofan Wang, Zifan Wang, Mengnan Du, Fan Yang, Zijian Zhang, Sirui Ding, Piotr Mardziel, and Xia Hu. 2020. Score-CAM: Score-weighted visual explanations for convolutional neural networks. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 24--25.
[30]
Su-Jing Wang, Wen-Jing Yan, Xiaobai Li, Guoying Zhao, and Xiaolan Fu. 2014b. Micro-expression recognition using dynamic textures on tensor independent color space. In IEEE International Conference on Pattern Recognition. 4678--4683.
[31]
Yandan Wang, John See, Raphael C-W Phan, and Yee-Hui Oh. 2014a. LBP with six intersection points: Reducing redundant information in LBP-TOP for micro-expression recognition. In Asian Conference on Computer Vision. 525--537.
[32]
Hao-Yu Wu, Michael Rubinstein, Eugene Shih, John Guttag, Frédo Durand, and William T. Freeman. 2012. Eulerian Video Magnification for Revealing Subtle Changes in the World. In ACM Transactions on Graphics, Vol. 31, 4 (2012), 1--8.
[33]
Bin Xia and Shangfei Wang. 2021. Micro-Expression Recognition Enhanced by Macro-Expression from Spatial-Temporal Domain. In International Joint Conference on Artificial Intelligence. 1186--1193.
[34]
Bin Xia, Weikang Wang, Shangfei Wang, and Enhong Chen. 2020. Learning from macro-expression: A micro-expression recognition framework. In ACM International Conference on Multimedia. 2936--2944.
[35]
Zhaoqiang Xia, Xiaopeng Hong, Xingyu Gao, Xiaoyi Feng, and Guoying Zhao. 2019. Spatiotemporal recurrent convolutional networks for recognizing spontaneous micro-expressions. In IEEE Transactions on Multimedia, Vol. 22, 3 (2019), 626--640.
[36]
Hong-Xia Xie, Ling Lo, Hong-Han Shuai, and Wen-Huang Cheng. 2020. Au-assisted graph attention convolutional network for micro-expression recognition. In ACM International Conference on Multimedia. 2871--2880.
[37]
Jingwei Yan, Jingjing Wang, Qiang Li, Chunmao Wang, and Shiliang Pu. 2021. Self-Supervised Regional and Temporal Auxiliary Tasks for Facial Action Unit Recognition. In ACM International Conference on Multimedia. 1038--1046.
[38]
Wen-Jing Yan, Xiaobai Li, Su-Jing Wang, Guoying Zhao, Yong-Jin Liu, Yu-Hsin Chen, and Xiaolan Fu. 2014. CASME II: An improved spontaneous micro-expression database and the baseline evaluation. In PloS ONE, Vol. 9, 1 (2014), e86041.
[39]
Wen-Jing Yan, Qi Wu, Yu-Hsin Chen, Jing Liang, and Xiaolan Fu. 2013. How Fast Are the Leaked Facial Expressions: The Duration of Micro-Expressions. In Journal of Nonverbal Behavior, Vol. 37, 4 (2013), 217--230.
[40]
Qing-Long Zhang and Yu-Bin Yang. 2021. SA-Net: Shuffle attention for deep convolutional neural networks. In IEEE International Conference on Acoustics, Speech and Signal Processing. 2235--2239.
[41]
Guoying Zhao and Matti Pietikainen. 2007. Dynamic texture recognition using local binary patterns with an application to facial expressions. In IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 29, 6 (2007), 915--928.
[42]
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba. 2016. Learning deep features for discriminative localization. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2921--2929.
[43]
Hongyuan Zhu, Ye Niu, Di Fu, and Hao Wang. 2021. MusicBERT: A Self-supervised Learning of Music Representation. In ACM International Conference on Multimedia. 3955--3963.

Cited By

View all
  • (2024)Demystifying Mental Health by Decoding Facial Action Unit SequencesBig Data and Cognitive Computing10.3390/bdcc80700788:7(78)Online publication date: 9-Jul-2024
  • (2024)Facial Micro-Motion-Aware Mixup for Micro-Expression RecognitionICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10446492(8060-8064)Online publication date: 14-Apr-2024
  • (2023)Data Leakage and Evaluation Issues in Micro-Expression AnalysisIEEE Transactions on Affective Computing10.1109/TAFFC.2023.326506315:1(186-197)Online publication date: 6-Apr-2023

Index Terms

  1. Mimicking the Annotation Process for Recognizing the Micro Expressions

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '22: Proceedings of the 30th ACM International Conference on Multimedia
    October 2022
    7537 pages
    ISBN:9781450392037
    DOI:10.1145/3503161
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 October 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. AU-feature learning
    2. micro-expression recognition

    Qualifiers

    • Research-article

    Funding Sources

    • Ministry of Science and Technology Taiwan

    Conference

    MM '22
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 995 of 4,171 submissions, 24%

    Upcoming Conference

    MM '24
    The 32nd ACM International Conference on Multimedia
    October 28 - November 1, 2024
    Melbourne , VIC , Australia

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)93
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 12 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Demystifying Mental Health by Decoding Facial Action Unit SequencesBig Data and Cognitive Computing10.3390/bdcc80700788:7(78)Online publication date: 9-Jul-2024
    • (2024)Facial Micro-Motion-Aware Mixup for Micro-Expression RecognitionICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10446492(8060-8064)Online publication date: 14-Apr-2024
    • (2023)Data Leakage and Evaluation Issues in Micro-Expression AnalysisIEEE Transactions on Affective Computing10.1109/TAFFC.2023.326506315:1(186-197)Online publication date: 6-Apr-2023

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media