research-article

Target-guided Emotion-aware Chat Machine

Authors:

Yuchong Hu, and

Shanshan Feng

ACM Transactions on Information Systems (TOIS), Volume 39, Issue 4

Article No.: 43, Pages 1 - 24

https://doi.org/10.1145/3456414

Published: 17 August 2021

Abstract

Consistency between a response and its post, at both the semantic and the emotional level, is essential for a dialogue system to deliver humanlike interactions. However, this challenge is not well addressed in the literature, because most approaches neglect the emotional information conveyed by a post when generating responses. This article addresses the problem by proposing a unified end-to-end neural architecture that simultaneously encodes the semantics and the emotions in a post and leverages target information to generate more intelligent responses with appropriately expressed emotions. Extensive experiments on real-world data demonstrate that the proposed method outperforms state-of-the-art methods in terms of both content coherence and emotion appropriateness.

Index Terms

Target-guided Emotion-aware Chat Machine
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Discourse, dialogue and pragmatics
      2. Natural language generation

Published In

ACM Transactions on Information Systems Volume 39, Issue 4

October 2021

482 pages

ISSN:1046-8188

EISSN:1558-2868

DOI:10.1145/3477247

Editor:
Min Zhang
Tsinghua University, China

Copyright © 2021 Association for Computing Machinery.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 August 2021

Accepted: 01 March 2021

Revised: 01 January 2021

Received: 01 May 2020

Published in TOIS Volume 39, Issue 4

Qualifiers

Research-article
Refereed

Funding Sources

National Natural Science Foundation of China
Equipment Pre-Research Fund
