research-article

Mimicking the Annotation Process for Recognizing the Micro Expressions

Authors:

Hong-Han Shuai,

Wen-Huang ChengAuthors Info & Claims

MM '22: Proceedings of the 30th ACM International Conference on Multimedia

Pages 228 - 236

https://doi.org/10.1145/3503161.3548185

Published: 10 October 2022 Publication History

Abstract

Micro-expression recognition (MER) has recently become a popular research topic due to its wide applications, e.g., movie rating and recognizing the neurological disorder. By virtue of deep learning techniques, the performance of MER has been significantly improved and reached unprecedented results. This paper proposes a novel architecture to mimic how the expressions are annotated. Specifically, during the annotation process in several datasets, the AU labels are first obtained with FACS, and the expression labels are then decided based on the combinations of the AU labels. Meanwhile, these AU labels describe either the eyes or mouth movements (mutually-exclusive). Following this idea, we design a dual-branch structure with a new augmentation method to separately capture the eyes and mouth features and teach the model what the general expressions should be. Moreover, to adaptively fuse the area features for different expressions, we propose Area Weighted Module to assign different weights to each region. Additionally, we set up an auxiliary task to align the AU similarity scores to help our model capture facial patterns further with AU labels. The proposed approach outperforms other state-of-the-art methods in terms of accuracy on the CASME II and SAMM datasets. Moreover, we provide a new visualization approach to show the relationship between the facial regions and AU features.

Supplementary Material

MP4 File (MM22-fp1805.mp4)

This presentation video covers our ideas and proposed methods in our paper. The contents include how we design our dual-branch model and how we design to visualize the relations between AUs and different facial areas. We also compare our methods with other state-of-the-art to show that our proposed methods are powerful.

Download
783.89 MB

References

[1]

Shaojie Bai, J Zico Kolter, and Vladlen Koltun. 2018. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv preprint arXiv:1803.01271 (2018).

[2]

Aditya Chattopadhay, Anirban Sarkar, Prantik Howlader, and Vineeth N Balasubramanian. 2018. Grad-cam: Generalized gradient-based visual explanations for deep convolutional networks. In IEEE Winter Conference on Applications of Computer Vision. 839--847.

[3]

Adrian K Davison, Cliff Lansley, Nicholas Costen, Kevin Tan, and Moi Hoon Yap. 2016. Samm: A spontaneous micro-facial movement dataset. IEEE Transactions on Affective Computing, Vol. 9, 1 (2016), 116--129.

Digital Library

[4]

Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. 2021. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In International Conference on Learning Representations.

[5]

Paul Ekman and Wallace V Friesen. 1978. Facial action coding system. Environmental Psychology & Nonverbal Behavior (1978).

[6]

Vida Esmaeili, Mahmood Mohassel Feghhi, and Seyed Omid Shahdi. 2021. Micro-Expression Recognition Using Histogram of Image Gradient Orientation on Diagonal Planes. In International Conference on Pattern Recognition and Image Analysis. 1--5.

[7]

Yiyang Gan, Ruize Han, Liqiang Yin, Wei Feng, and Song Wang. 2021. Self-supervised Multi-view Multi-Human Association and Tracking. In ACM International Conference on Multimedia. 282--290.

[8]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 770--778.

[9]

Miho Iwasaki and Yasuki Noguchi. 2016. Hiding true emotions: micro-expressions in eyes retrospectively concealed by mouth movements. Scientific reports, Vol. 6, 1 (2016), 1--10.

[10]

Huai-Qian Khor, John See, Sze-Teng Liong, Raphael CW Phan, and Weiyao Lin. 2019. Dual-stream shallow networks for facial micro-expression recognition. In IEEE International Conference on Image Processing. 36--40.

[11]

Prannay Khosla, Piotr Teterwak, Chen Wang, Aaron Sarna, Yonglong Tian, Phillip Isola, Aaron Maschinot, Ce Liu, and Dilip Krishnan. 2020. Supervised contrastive learning. In Neural Information Processing Systems, Vol. 33, 18661--18673.

[12]

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Neural Information Processing Systems, Vol. 25.

Digital Library

[13]

Ankith Jain Rakesh Kumar and Bir Bhanu. 2021. Micro-expression classification based on landmark relations with graph attention convolutional network. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1511--1520.

[14]

Ling Lei, Tong Chen, Shigang Li, and Jianfeng Li. 2021. Micro-Expression Recognition Based on Facial Graph Representation Learning and Facial Action Unit Fusion. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1571--1580.

[15]

Ling Lei, Jianfeng Li, Tong Chen, and Shigang Li. 2020. A novel graph-tcn with a graph structured representation for micro-expression recognition. In ACM International Conference on Multimedia. 2237--2245.

Digital Library

[16]

Xiaobai Li, Tomas Pfister, Xiaohua Huang, Guoying Zhao, and Matti Pietik"ainen. 2013. A spontaneous micro-expression database: Inducement, collection and baseline. In IEEE International Conference on Automatic face and Gesture Recognition, 1--6.

[17]

Yuchi Liu, Heming Du, Liang Zheng, and Tom Gedeon. 2019. A neural micro-expression recognizer. In IEEE International Conference on Automatic Face and Gesture Recognition. 1--4.

Digital Library

[18]

Yong-Jin Liu, Jin-Kai Zhang, Wen-Jing Yan, Su-Jing Wang, Guoying Zhao, and Xiaolan Fu. 2015. A main directional mean optical flow feature for spontaneous micro-expression recognition. In IEEE Transactions on Affective Computing, Vol. 7, 4 (2015), 299--310.

Digital Library

[19]

Ling Lo, Hong-Xia Xie, Hong-Han Shuai, and Wen-Huang Cheng. 2020. MER-GCN: Micro-expression recognition based on relation modeling with graph convolutional networks. In IEEE Conference on Multimedia Information Processing and Retrieval. 79--84.

[20]

Ilya Loshchilov and Frank Hutter. 2017. SGDR: Stochastic gradient descent with warm restarts. In International Conference on Learning Representations.

[21]

Ilya Loshchilov and Frank Hutter. 2019. Decoupled Weight Decay Regularization. In International Conference on Learning Representations.

[22]

Tae-Hyun Oh, Ronnachai Jaroensri, Changil Kim, Mohamed Elgharib, Fr'edo Durand, William T Freeman, and Wojciech Matusik. 2018. Learning-based video motion magnification. In European Conference on Computer Vision. 633--648.

[23]

Michel Owayjan, Ahmad Kashour, Nancy Al Haddad, Mohamad Fadel, and Ghinwa Al Souki. 2012. The design and development of a lie detection system using facial micro-expressions. In International Conference on Advances in Computational Tools for Engineering Applications. 33--38.

[24]

Ramprasaath R Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, and Dhruv Batra. 2017. Grad-cam: Visual explanations from deep networks via gradient-based localization. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 618--626.

[25]

Chidanand Shetty, Arooz Khan, Tanya Singh, and Keerti Kharatmol. 2021. Movie Review Prediction System by Real Time Analysis of Facial Expression. In International Conference on Communication and Electronics Systems. 1109--1113.

[26]

Baolin Song, Ke Li, Yuan Zong, Jie Zhu, Wenming Zheng, Jingang Shi, and Li Zhao. 2019. Recognizing spontaneous micro-expression using a three-stream convolutional neural network. IEEE Access, Vol. 7 (2019), 184537--184551.

[27]

Aaron Van den Oord, Yazhe Li, and Oriol Vinyals. 2018. Representation learning with contrastive predictive coding. arXiv e-prints (2018), arXiv--1807.

[28]

Nguyen Van Quang, Jinhee Chun, and Takeshi Tokuyama. 2019. CapsuleNet for micro-expression recognition. In IEEE International Conference on Automatic Face and Gesture Recognition. 1--7.

Digital Library

[29]

Haofan Wang, Zifan Wang, Mengnan Du, Fan Yang, Zijian Zhang, Sirui Ding, Piotr Mardziel, and Xia Hu. 2020. Score-CAM: Score-weighted visual explanations for convolutional neural networks. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 24--25.

[30]

Su-Jing Wang, Wen-Jing Yan, Xiaobai Li, Guoying Zhao, and Xiaolan Fu. 2014b. Micro-expression recognition using dynamic textures on tensor independent color space. In IEEE International Conference on Pattern Recognition. 4678--4683.

Digital Library

[31]

Yandan Wang, John See, Raphael C-W Phan, and Yee-Hui Oh. 2014a. LBP with six intersection points: Reducing redundant information in LBP-TOP for micro-expression recognition. In Asian Conference on Computer Vision. 525--537.

[32]

Hao-Yu Wu, Michael Rubinstein, Eugene Shih, John Guttag, Frédo Durand, and William T. Freeman. 2012. Eulerian Video Magnification for Revealing Subtle Changes in the World. In ACM Transactions on Graphics, Vol. 31, 4 (2012), 1--8.

Digital Library

[33]

Bin Xia and Shangfei Wang. 2021. Micro-Expression Recognition Enhanced by Macro-Expression from Spatial-Temporal Domain. In International Joint Conference on Artificial Intelligence. 1186--1193.

[34]

Bin Xia, Weikang Wang, Shangfei Wang, and Enhong Chen. 2020. Learning from macro-expression: A micro-expression recognition framework. In ACM International Conference on Multimedia. 2936--2944.

Digital Library

[35]

Zhaoqiang Xia, Xiaopeng Hong, Xingyu Gao, Xiaoyi Feng, and Guoying Zhao. 2019. Spatiotemporal recurrent convolutional networks for recognizing spontaneous micro-expressions. In IEEE Transactions on Multimedia, Vol. 22, 3 (2019), 626--640.

[36]

Hong-Xia Xie, Ling Lo, Hong-Han Shuai, and Wen-Huang Cheng. 2020. Au-assisted graph attention convolutional network for micro-expression recognition. In ACM International Conference on Multimedia. 2871--2880.

Digital Library

[37]

Jingwei Yan, Jingjing Wang, Qiang Li, Chunmao Wang, and Shiliang Pu. 2021. Self-Supervised Regional and Temporal Auxiliary Tasks for Facial Action Unit Recognition. In ACM International Conference on Multimedia. 1038--1046.

[38]

Wen-Jing Yan, Xiaobai Li, Su-Jing Wang, Guoying Zhao, Yong-Jin Liu, Yu-Hsin Chen, and Xiaolan Fu. 2014. CASME II: An improved spontaneous micro-expression database and the baseline evaluation. In PloS ONE, Vol. 9, 1 (2014), e86041.

[39]

Wen-Jing Yan, Qi Wu, Yu-Hsin Chen, Jing Liang, and Xiaolan Fu. 2013. How Fast Are the Leaked Facial Expressions: The Duration of Micro-Expressions. In Journal of Nonverbal Behavior, Vol. 37, 4 (2013), 217--230.

[40]

Qing-Long Zhang and Yu-Bin Yang. 2021. SA-Net: Shuffle attention for deep convolutional neural networks. In IEEE International Conference on Acoustics, Speech and Signal Processing. 2235--2239.

[41]

Guoying Zhao and Matti Pietikainen. 2007. Dynamic texture recognition using local binary patterns with an application to facial expressions. In IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 29, 6 (2007), 915--928.

Digital Library

[42]

Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba. 2016. Learning deep features for discriminative localization. In IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2921--2929.

[43]

Hongyuan Zhu, Ye Niu, Di Fu, and Hao Wang. 2021. MusicBERT: A Self-supervised Learning of Music Representation. In ACM International Conference on Multimedia. 3955--3963.

Digital Library

Cited By

Sharma DSingh JSehra SSehra S(2024)Demystifying Mental Health by Decoding Facial Action Unit SequencesBig Data and Cognitive Computing10.3390/bdcc80700788:7(78)Online publication date: 9-Jul-2024
https://doi.org/10.3390/bdcc8070078
Gu ZPang MXing ZTan WJiang XYan B(2024)Facial Micro-Motion-Aware Mixup for Micro-Expression RecognitionICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10446492(8060-8064)Online publication date: 14-Apr-2024
https://doi.org/10.1109/ICASSP48485.2024.10446492
Varanka TLi YPeng WZhao G(2023)Data Leakage and Evaluation Issues in Micro-Expression AnalysisIEEE Transactions on Affective Computing10.1109/TAFFC.2023.326506315:1(186-197)Online publication date: 6-Apr-2023
https://dl.acm.org/doi/10.1109/TAFFC.2023.3265063

Index Terms

Mimicking the Annotation Process for Recognizing the Micro Expressions
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
        Image representations

Recommendations

AU-assisted Graph Attention Convolutional Network for Micro-Expression Recognition
MM '20: Proceedings of the 28th ACM International Conference on Multimedia

Micro-expressions (MEs) are important clues for reflecting the real feelings of humans, and micro-expression recognition (MER) can thus be applied in various real-world applications. However, it is difficult to perceive and interpret MEs correctly. With ...
Recognizing spontaneous micro-expression from eye region

Micro-expression is a kind of spontaneous facial expression, which is with short duration and low intensity. Because of its involuntary feature, it is helpful to reveal one's true emotion when someone tries to conceal. Therefore, it has attracted a ...
Recognising spontaneous facial micro-expressions
ICCV '11: Proceedings of the 2011 International Conference on Computer Vision

Facial micro-expressions are rapid involuntary facial expressions which reveal suppressed affect. To the best knowledge of the authors, there is no previous work that successfully recognises spontaneous facial micro-expressions. In this paper we show ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '22: Proceedings of the 30th ACM International Conference on Multimedia

October 2022

7537 pages

ISBN:9781450392037

DOI:10.1145/3503161

General Chairs:
João Magalhães
NOVA University of Lisbon, Portugal
,
Alberto del Bimbo
University of Florence, Italy
,
Shin'ichi Satoh
National Institute of Informatics, Japan
,
Nicu Sebe
University of Trento, Italy
,
Program Chairs:
Xavier Alameda-Pineda
Inria, Grenoble, France
,
Qin Jin
Renmin University of China, China
,
Vincent Oria
New Jersey Institute of Technology, USA
,
Laura Toni
University College London, UK

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 October 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Ministry of Science and Technology Taiwan

Conference

MM '22

Sponsor:

SIGMM

MM '22: The 30th ACM International Conference on Multimedia

October 10 - 14, 2022

Lisboa, Portugal

Acceptance Rates

Overall Acceptance Rate 995 of 4,171 submissions, 24%

Upcoming Conference

MM '24

Sponsor:
sigmm

The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
247
Total Downloads

Downloads (Last 12 months)93
Downloads (Last 6 weeks)7

Reflects downloads up to 12 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Sharma DSingh JSehra SSehra S(2024)Demystifying Mental Health by Decoding Facial Action Unit SequencesBig Data and Cognitive Computing10.3390/bdcc80700788:7(78)Online publication date: 9-Jul-2024
https://doi.org/10.3390/bdcc8070078
Gu ZPang MXing ZTan WJiang XYan B(2024)Facial Micro-Motion-Aware Mixup for Micro-Expression RecognitionICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10446492(8060-8064)Online publication date: 14-Apr-2024
https://doi.org/10.1109/ICASSP48485.2024.10446492
Varanka TLi YPeng WZhao G(2023)Data Leakage and Evaluation Issues in Micro-Expression AnalysisIEEE Transactions on Affective Computing10.1109/TAFFC.2023.326506315:1(186-197)Online publication date: 6-Apr-2023
https://dl.acm.org/doi/10.1109/TAFFC.2023.3265063

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents