Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3340555.3355712acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesicmi-mlmiConference Proceedingsconference-collections
research-article

Exploring Regularizations with Face, Body and Image Cues for Group Cohesion Prediction

Published: 14 October 2019 Publication History

Abstract

This paper presents our approach for the group cohesion prediction sub-challenge in the EmotiW 2019. The task is to predict group cohesiveness in images. We mainly explore several regularizations with three types of visual cues, namely face, body,and global image. Our main contribution is two-fold. First, we jointly train the group cohesion prediction task and group emotion recognition task using multi-task learning strategy with all visual cues. Second, we elaborately design two regularizations, namely a rank loss and a hourglass loss, where the former aims to give a margin between the distance of distant categories and near categories and the later to avoid centralization predictions with only MSE loss. With careful evaluations, we finally achieve the second place in this sub-challenge with MSE of 0.43821 on the testing set. https://github.com/DaleAG/Group_Cohesion_Prediction

References

[1]
Daniel J Beal, Robin R Cohen, Michael J Burke, and Christy L McLendon. 2003. Cohesion and performance in groups: a meta-analytic clarification of construct relations.Journal of applied psychology 88, 6 (2003), 989.
[2]
Zhe Cao, Tomas Simon, Shih-En Wei, and Yaser Sheikh. 2017. Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields. In CVPR.
[3]
Albert V Carron and Kevin S Spink. 1995. The group size-cohesion relationship in minimal groups. Small group research 26, 1 (1995), 86–105.
[4]
Abhinav Dhall, Roland Goecke, Shreya Ghosh, Jyoti Joshi, Jesse Hoey, and Tom Gedeon. 2017. From individual to group-level emotion recognition: EmotiW 5.0. In Proceedings of the 19th ACM international conference on multimodal interaction. ACM, 524–528.
[5]
Shreya Ghosh, Abhinav Dhall, and Nicu Sebe. 2018. Predicting Group Cohesiveness in Images. arXiv preprint arXiv:1812.11771(2018).
[6]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Deep Residual Learning for Image Recognition. CoRR abs/1512.03385(2015). http://arxiv.org/abs/1512.03385
[7]
Jie Hu, Li Shen, and Gang Sun. 2017. Squeeze-and-Excitation Networks. CoRR abs/1709.01507(2017). arxiv:1709.01507http://arxiv.org/abs/1709.01507
[8]
Hayley Hung and Daniel Gatica-Perez. 2010. Estimating cohesion in small groups using audio-visual nonverbal behavior. IEEE Transactions on Multimedia 12, 6 (2010), 563–575.
[9]
Sunan Li, Wenming Zheng, Yuan Zong, Cheng Lu, Chuangao Tang, Xingxun Jiang, Jiateng Liu, and Wanchuang Xia. 2019. Bi-modality Fusion for Emotion Recognition in the Wild. In Proceedings of the 21th ACM International Conference on Multimodal Interaction (in press). ACM.
[10]
Albert E Myers. 1961. Team competition, success, and the adjustment of group members.Technical Report. ILLINOIS UNIV URBANA GROUP EFFECTIVENESS RESEARCH LAB.
[11]
Sara Sabour, Nicholas Frosst, and Geoffrey E Hinton. 2017. Dynamic routing between capsules. In Advances in neural information processing systems. 3856–3866.
[12]
Tomas Simon, Hanbyul Joo, Iain Matthews, and Yaser Sheikh. 2017. Hand Keypoint Detection in Single Images using Multiview Bootstrapping. In CVPR.
[13]
Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jon Shlens, and Zbigniew Wojna. 2016. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2818–2826.
[14]
Henri Tajfel. 2010. Social identity and intergroup relations. Vol. 7. Cambridge University Press.
[15]
Kai Wang, Jianfei Yang, Da Guo, Kaipeng Zhang, Xiaojiang Peng, and Yu Qiao. 2019. Bootstrap Model Ensemble and Rank Loss for Engagement Intensity Regression. In Proceedings of the 21th ACM International Conference on Multimodal Interaction (in press). ACM.
[16]
Kai Wang, Xiaoxing Zeng, Jianfei Yang, Debin Meng, Kaipeng Zhang, Xiaojiang Peng, and Yu Qiao. 2018. Cascade attention networks for group emotion recognition with face, body and image cues. In Proceedings of the 2018 on International Conference on Multimodal Interaction. ACM, 640–645.
[17]
Shih-En Wei, Varun Ramakrishna, Takeo Kanade, and Yaser Sheikh. 2016. Convolutional pose machines. In CVPR.
[18]
Stephen J Zaccaro and M Catherine McCoy. 1988. The effects of task and interpersonal cohesiveness on performance of a disjunctive group task 1. Journal of applied social psychology 18, 10 (1988), 837–851.
[19]
Kaipeng Zhang, Zhanpeng Zhang, Zhifeng Li, and Yu Qiao. 2016. Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Processing Letters 23, 10 (2016), 1499–1503.
[20]
Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, and Stan Z Li. 2017. S3fd: Single shot scale-invariant face detector. In Proceedings of the IEEE International Conference on Computer Vision. 192–201.
[21]
Zhanpeng Zhang, Ping Luo, Chen Change Loy, and Xiaoou Tang. 2014. Facial landmark detection by deep multi-task learning. In European conference on computer vision. Springer, 94–108.

Cited By

View all
  • (2024)Modeling the Interplay Between Cohesion Dimensions: A Challenge for Group Affective Emergent StatesIEEE Transactions on Affective Computing10.1109/TAFFC.2024.334991015:3(1526-1538)Online publication date: Jul-2024
  • (2023)A Comparative Analysis of Deep Learning Convolutional Neural Network Architectures for Fault Diagnosis of Broken Rotor Bars in Induction MotorsSensors10.3390/s2319819623:19(8196)Online publication date: 30-Sep-2023
  • (2022)Group Emotion Detection Based on Social Robot PerceptionSensors10.3390/s2210374922:10(3749)Online publication date: 14-May-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICMI '19: 2019 International Conference on Multimodal Interaction
October 2019
601 pages
ISBN:9781450368605
DOI:10.1145/3340555
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 October 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Convolutional Neural Networks
  2. Deep Learning
  3. Group Cohesion Prediction

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICMI '19

Acceptance Rates

Overall Acceptance Rate 453 of 1,080 submissions, 42%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)13
  • Downloads (Last 6 weeks)1
Reflects downloads up to 16 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Modeling the Interplay Between Cohesion Dimensions: A Challenge for Group Affective Emergent StatesIEEE Transactions on Affective Computing10.1109/TAFFC.2024.334991015:3(1526-1538)Online publication date: Jul-2024
  • (2023)A Comparative Analysis of Deep Learning Convolutional Neural Network Architectures for Fault Diagnosis of Broken Rotor Bars in Induction MotorsSensors10.3390/s2319819623:19(8196)Online publication date: 30-Sep-2023
  • (2022)Group Emotion Detection Based on Social Robot PerceptionSensors10.3390/s2210374922:10(3749)Online publication date: 14-May-2022
  • (2022)Automatic Prediction of Group Cohesiveness in ImagesIEEE Transactions on Affective Computing10.1109/TAFFC.2020.302609513:3(1677-1690)Online publication date: 1-Jul-2022
  • (2021)Using Valence Emotion to Predict Group Cohesion’s Dynamics: Top-down and Bottom-up Approaches2021 9th International Conference on Affective Computing and Intelligent Interaction (ACII)10.1109/ACII52823.2021.9597429(1-8)Online publication date: 28-Sep-2021
  • (2021)D2C-Based Hybrid Network for Predicting Group Cohesion ScoresIEEE Access10.1109/ACCESS.2021.30883409(84356-84363)Online publication date: 2021
  • (2021)Efficient Group-Based Cohesion Prediction in Images Using Facial DescriptorsRecent Trends in Analysis of Images, Social Networks and Texts10.1007/978-3-030-71214-3_12(140-148)Online publication date: 25-Mar-2021
  • (2020)Modeling Dynamics of Task and Social Cohesion from the Group Perspective Using Nonverbal Motion Capture-based FeaturesCompanion Publication of the 2020 International Conference on Multimodal Interaction10.1145/3395035.3425963(182-190)Online publication date: 25-Oct-2020
  • (2020)LDNN: Linguistic Knowledge Injectable Deep Neural Network for Group Cohesiveness UnderstandingProceedings of the 2020 International Conference on Multimodal Interaction10.1145/3382507.3418830(343-350)Online publication date: 21-Oct-2020
  • (2020)Implicit Knowledge Injectable Cross Attention Audiovisual Model for Group Emotion RecognitionProceedings of the 2020 International Conference on Multimodal Interaction10.1145/3382507.3417960(827-834)Online publication date: 21-Oct-2020
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media