Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3323873.3326593acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Image Emotion Distribution Learning with Graph Convolutional Networks

Published: 05 June 2019 Publication History

Abstract

Recently, with the rapid progress of techniques in visual analysis, a lot of attention has been paid to affective computing due to its wide potential applications. Traditional affective analysis mainly focus on single label image emotion classification. But a single image may invoke different emotions for different persons, even for one person. So emotion distribution learning is proposed to capture the underlying emotion distribution for images. Currently, state-of-the-art works model the distribution by deep convolutional networks equipped with distribution specific loss. However, the correlation among different emotions is ignored in these works. Some emotions usually co-appear, while some are hardly invoked at the same time. Properly modeling the correlation is important for image emotion distribution learning. Graph convolutional networks have shown great performance in capturing the underlying relationship in graph, and have been successfully applied in vision problems, such as zero-shot image classification. So, in this paper, we propose to apply graph convolutional networks for emotion distribution learning, termed EmotionGCN, which captures the correlation among emotions. The EmotionGCN can make use of correlation either mined from data, or directly from psychological models, such as Mikels' wheel. Extensive experiments are conducted on the FlickrLDL and TwitterLDL datasets, and the results on seven evaluation metrics demonstrate the superiority of the proposed method.

References

[1]
Damian Borth, Rongrong Ji, Tao Chen, Thomas M. Breuel, and Shih-Fu Chang. 2013. Large-scale visual sentiment ontology and detectors using adjective noun pairs. In ACM International Conference on Multimedia .
[2]
Tao Chen, Felix X. Yu, Jiawei Chen, Yin Cui, Yan-Ying Chen, and Shih-Fu Chang. 2014. Object-Based Visual Sentiment Concept Analysis and Application. In ACM International Conference on Multimedia. 367--376.
[3]
Bin-Bin Gao, Chao Xing, Chen-Wei Xie, Jianxin Wu, and Xin Geng. 2017. Deep Label Distribution Learning With Label Ambiguity. IEEE Transactions on Image Processing, Vol. 26 (2017), 2825--2838.
[4]
Xin Geng, Chao Yin, and Zhi-Hua Zhou. 2013. Facial age estimation by learning from label distributions. IEEE transactions on pattern analysis and machine intelligence, Vol. 35, 10 (2013), 2401--2412.
[5]
Xin Geng and Quan Zhao. 2013. Label Distribution Learning. IEEE Transactions on Knowledge and Data Engineering, Vol. 28 (2013), 1734--1748.
[6]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. IEEE Conference on Computer Vision and Pattern Recognition (2016), 770--778.
[7]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. CoRR, Vol. abs/1412.6980 (2015).
[8]
Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. CoRR, Vol. abs/1609.02907 (2017).
[9]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. Conference on Neural Information Processing Systems, Vol. 141, 5 (2012), 1097--1105.
[10]
Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research, Vol. 9, Nov (2008), 2579--2605.
[11]
George A. Miller. 1995. WORDNET: A Lexical Database for English. Commun. ACM, Vol. 38 (1995), 39--41.
[12]
Kuan-Chuan Peng, Tsuhan Chen, Amir Sadovnik, and Andrew C Gallagher. 2015a. A mixed bag of emotions: Model, predict, and transfer emotion distributions. In IEEE conference on computer vision and pattern recognition. 860--868.
[13]
Kuan-Chuan Peng, Tsuhan Chen, Amir Sadovnik, and Andrew C. Gallagher. 2015b. A mixed bag of emotions: Model, predict, and transfer emotion distributions. IEEE Conference on Computer Vision and Pattern Recognition (2015), 860--868.
[14]
Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. Glove: Global vectors for word representation. In Conference on Empirical Methods in Natural Language Processing. 1532--1543.
[15]
Robert Plutchik. 1980. A general psychoevolutionary theory of emotion. In Theories of emotion . Elsevier, 3--33.
[16]
Anthony Rios and Ramakanth Kavuluru. 2018. Few-Shot and Zero-Shot Multi-Label Learning for Structured Label Spaces. Conference on Empirical Methods in Natural Language Processing (2018), 3132--3142.
[17]
Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
[18]
Xiaolong Wang, Yufei Ye, and Abhinav Gupta. 2018. Zero-Shot Recognition via Semantic Embeddings and Knowledge Graphs. IEEE Conference on Computer Vision and Pattern Recognition (2018), 6857--6866.
[19]
Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and Philip S. Yu. 2019. A Comprehensive Survey on Graph Neural Networks. CoRR, Vol. abs/1901.00596 (2019).
[20]
Jufeng Yang, Dongyu She, and Ming Sun. 2017a. Joint Image Emotion Classification and Distribution Learning via Deep Convolutional Neural Network. In International Joint Conferences on Artificial Intelligence. 3266--3272.
[21]
Jufeng Yang, Ming Sun, and Xiaoxiao Sun. 2017b. Learning visual sentiment distributions via augmented conditional probability neural network. In AAAI Conference on Artificial Intelligence .
[22]
Ziwei Zhang, Peng Cui, and Wenwu Zhu. 2018. Deep Learning on Graphs: A Survey. CoRR, Vol. abs/1812.04202 (2018).
[23]
Sicheng Zhao, Guiguang Ding, Yue Gao, and Jungong Han. 2017a. Approximating discrete probability distribution of image emotions by multi-modal features fusion. International Joint Conference on Artificial Intelligence (2017), 4669--4675.
[24]
Sicheng Zhao, Guiguang Ding, Qingming Huang, Tat-Seng Chua, Björn W Schuller, and Kurt Keutzer. 2018a. Affective Image Content Analysis: A Comprehensive Survey. In International Joint Conferences on Artificial Intelligence. 5534--5541.
[25]
Sicheng Zhao, Yue Gao, Xiaolei Jiang, Hongxun Yao, Tat-Seng Chua, and Xiaoshuai Sun. 2014. Exploring principles-of-art features for image emotion recognition. In ACM International Conference on Multimedia. ACM, 47--56.
[26]
Sicheng Zhao, Hongxun Yao, Yue Gao, Rongrong Ji, and Guiguang Ding. 2017b. Continuous probability distribution prediction of image emotions via multitask shared sparse regression. IEEE Transactions on Multimedia, Vol. 19, 3 (2017), 632--645.
[27]
Sicheng Zhao, Hongxun Yao, Yue Gao, Rongrong Ji, Wenlong Xie, Xiaolei Jiang, and Tat-Seng Chua. 2016. Predicting personalized emotion perceptions of social images. In ACM International Conference on Multimedia. ACM, 1385--1394.
[28]
Sicheng Zhao, Hongxun Yao, Xiaolei Jiang, and Xiaoshuai Sun. 2015. Predicting discrete probability distribution of image emotions. In IEEE International Conference on Image Processing (ICIP). IEEE, 2459--2463.
[29]
Sicheng Zhao, Xin Zhao, Guiguang Ding, and Kurt Keutzer. 2018b. EmotionGAN: unsupervised domain adaptation for learning discrete probability distributions of image emotions. In ACM International Conference on Multimedia. ACM, 1319--1327.
[30]
Jie Zhou, Ganqu Cui, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, and Maosong Sun. 2018. Graph Neural Networks: A Review of Methods and Applications. CoRR, Vol. abs/1812.08434 (2018).

Cited By

View all
  • (2024)Emotional Video Captioning With Vision-Based Emotion Interpretation NetworkIEEE Transactions on Image Processing10.1109/TIP.2024.335904533(1122-1135)Online publication date: 2024
  • (2024)MASANet: Multi-Aspect Semantic Auxiliary Network for Visual Sentiment AnalysisIEEE Transactions on Affective Computing10.1109/TAFFC.2023.333177615:3(1439-1450)Online publication date: Jul-2024
  • (2024)Zero-Shot Visual Sentiment Prediction via Cross-Domain Knowledge DistillationIEEE Open Journal of Signal Processing10.1109/OJSP.2023.33440795(177-185)Online publication date: 2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICMR '19: Proceedings of the 2019 on International Conference on Multimedia Retrieval
June 2019
427 pages
ISBN:9781450367653
DOI:10.1145/3323873
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 June 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. affective computing
  2. graph convolutional networks
  3. label distribution learning

Qualifiers

  • Research-article

Conference

ICMR '19
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)32
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Emotional Video Captioning With Vision-Based Emotion Interpretation NetworkIEEE Transactions on Image Processing10.1109/TIP.2024.335904533(1122-1135)Online publication date: 2024
  • (2024)MASANet: Multi-Aspect Semantic Auxiliary Network for Visual Sentiment AnalysisIEEE Transactions on Affective Computing10.1109/TAFFC.2023.333177615:3(1439-1450)Online publication date: Jul-2024
  • (2024)Zero-Shot Visual Sentiment Prediction via Cross-Domain Knowledge DistillationIEEE Open Journal of Signal Processing10.1109/OJSP.2023.33440795(177-185)Online publication date: 2024
  • (2024)Object aroused emotion analysis network for image sentiment analysisKnowledge-Based Systems10.1016/j.knosys.2024.111429286(111429)Online publication date: Feb-2024
  • (2023)Label enhancement via joint implicit representation clusteringProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/447(4019-4027)Online publication date: 19-Aug-2023
  • (2023)Robust Local-Global Feature Representation for Image Emotion Distribution LearningProceedings of the 15th International Conference on Digital Image Processing10.1145/3604078.3604092(1-7)Online publication date: 19-May-2023
  • (2023)StyleEDL: Style-Guided High-order Attention Network for Image Emotion Distribution LearningProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612040(853-861)Online publication date: 26-Oct-2023
  • (2023)Graph-Based Facial Affect Analysis: A ReviewIEEE Transactions on Affective Computing10.1109/TAFFC.2022.321591814:4(2657-2677)Online publication date: 1-Oct-2023
  • (2023)Sample Topology Exploration for Label Distribution Learning2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA60987.2023.10302621(1-9)Online publication date: 9-Oct-2023
  • (2023)Doubled coupling for image emotion distribution learningKnowledge-Based Systems10.1016/j.knosys.2022.110107260(110107)Online publication date: Jan-2023
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media