Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Target-guided Emotion-aware Chat Machine

Published: 17 August 2021 Publication History
  • Get Citation Alerts
  • Abstract

    The consistency of a response to a given post at the semantic level and emotional level is essential for a dialogue system to deliver humanlike interactions. However, this challenge is not well addressed in the literature, since most of the approaches neglect the emotional information conveyed by a post while generating responses. This article addresses this problem and proposes a unified end-to-end neural architecture, which is capable of simultaneously encoding the semantics and the emotions in a post and leveraging target information to generate more intelligent responses with appropriately expressed emotions. Extensive experiments on real-world data demonstrate that the proposed method outperforms the state-of-the-art methods in terms of both content coherence and emotion appropriateness.

    References

    [1]
    Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. In ICLR.
    [2]
    Zoraida Callejas, David Griol, and Ramón López-Cózar. 2011. Predicting user mental states in spoken dialogue systems. EURASIP J. Adv. Sig. Process. 2011, 1 (2011), 1–21.
    [3]
    Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In EMNLP.
    [4]
    Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. NAACL-HLT.
    [5]
    John Duchi, Elad Hazan, and Yoram Singer. 2011. Adaptive subgradient methods for online learning and stochastic optimization. JMLR 12, 7 (2011), 257–269.
    [6]
    Joseph L. Fleiss and Jacob Cohen. 1973. The equivalence of weighted kappa and the intraclass correlation coefficient as measures of reliability. Educ. Psychol. Measure. 33, 3 (1973), 613–619.
    [7]
    Sayan Ghosh, Mathieu Chollet, Eugene Laksana, Louis-Philippe Morency, and Stefan Scherer. 2017. Affect-lm: A neural language model for customizable affective text generation. In ACL. 634–642.
    [8]
    Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Comput. 9, 8 (1997), 1735–1780.
    [9]
    Baotian Hu, Zhengdong Lu, Hang Li, and Qingcai Chen. 2014. Convolutional neural network architectures for matching natural language sentences. In NIPS. 2042–2050.
    [10]
    Zhiting Hu, Zichao Yang, Xiaodan Liang, Ruslan Salakhutdinov, and Eric P. Xing. 2017. Toward controlled generation of text. In ICML. 1587–1596.
    [11]
    Chenyang Huang, Osmar Zaiane, Amine Trabelsi, and Nouha Dziri. 2018. Automatic dialogue generation with expressed emotions. In NAACL. 49–54.
    [12]
    Sébastien Jean, Kyunghyun Cho, Roland Memisevic, and Yoshua Bengio. 2014. On using very large target vocabulary for neural machine translation. In ACL. 1–10.
    [13]
    Zongcheng Ji, Zhengdong Lu, and Hang Li. 2014. An information retrieval approach to short text conversation (unpublished).
    [14]
    Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan. 2015. A diversity-promoting objective function for neural conversation models. In NAACL. 110–119.
    [15]
    Jiwei Li, Michel Galley, Chris Brockett, Georgios P. Spithourakis, Jianfeng Gao, and Bill Dolan. 2016. A persona-based neural conversation model. In ACL. 994–1003.
    [16]
    Jiwei Li, Will Monroe, and Dan Jurafsky. 2016. A simple, fast diverse decoding algorithm for neural generation. arXiv (2016).
    [17]
    Qintong Li, Hongshen Chen, Zhaochun Ren, Zhumin Chen, Zhaopeng Tu, and Jun Ma. 2020. EmpDG: Multi-resolution interactive empathetic dialogue generation. In COLING. 4454–4466.
    [18]
    Zhouhan Lin, Minwei Feng, Cicero Nogueira dos Santos, Mo Yu, Bing Xiang, Bowen Zhou, and Yoshua Bengio. 2017. A structured self-attentive sentence embedding (unpublished).
    [19]
    Chia-Wei Liu, Ryan Lowe, Iulian V. Serban, Michael Noseworthy, Laurent Charlin, and Joelle Pineau. 2016. How not to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation. In EMNLP. 2122–2132.
    [20]
    Ryan Lowe, Nissan Pow, Iulian Serban, and Joelle Pineau. 2015. The ubuntu dialogue corpus: A large dataset for research in unstructured multi-turn dialogue systems. In Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue. 285–294.
    [21]
    Zhengdong Lu and Hang Li. 2013. A deep architecture for matching short texts. In NIPS. 1367–1375.
    [22]
    Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. 2015. Effective approaches to attention-based neural machine translation. In EMNLP. 1412–1421.
    [23]
    Bilyana Martinovski and David Traum. 2003. Breakdown in human-machine interaction: the error is the clue. In ISCA Tutorial and Research Workshop. 11–16.
    [24]
    John D. Mayer, Richard D. Roberts, and Sigal G. Barsade. 2008. Human abilities: Emotional intelligence. Annu. Rev. Psychol. 59 (2008), 507–536.
    [25]
    Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. (unpublished).
    [26]
    Lili Mou, Yiping Song, Rui Yan, Ge Li, Lu Zhang, and Zhi Jin. 2016. Sequence to backward and forward sequences: A content-introducing approach to generative short-text conversation. In COLING. 3349–3358.
    [27]
    Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: A method for automatic evaluation of machine translation. In ACL. 311–318.
    [28]
    Yehong Peng, Yizhen Fang, Zhiwen Xie, and Guangyou Zhou. 2019. Topic-enhanced emotional conversation generation with attention mechanism. Knowl.-Bas. Syst. 163 (2019), 429–437.
    [29]
    Robert Plutchik. 1980. A general psychoevolutionary theory of emotion. In Theories of Emotion. Elsevier, 3–33.
    [30]
    Robert Plutchik. 2001. The nature of emotions: Human emotions have deep evolutionary roots, a fact that may explain their complexity and provide tools for clinical practice. Am. Sci. 89, 4 (2001), 344–350.
    [31]
    Helmut Prendinger, Junichiro Mori, and Mitsuru Ishizuka. 2005. Using human physiology to evaluate subtle expressivity of a virtual quizmaster in a mathematical game. Int. J. Hum.-Comput. Stud. (2005), 231–245.
    [32]
    Qiao Qian, Minlie Huang, Haizhou Zhao, Jingfang Xu, and Xiaoyan Zhu. 2017. Assigning personality/identity to a chatting machine for coherent conversation generation. (unpublished).
    [33]
    Alan Ritter, Colin Cherry, and William B. Dolan. 2011. Data-driven response generation in social media. In EMNLP. 583–593.
    [34]
    Iulian Vlad Serban, Alessandro Sordoni, Yoshua Bengio, Aaron C. Courville, and Joelle Pineau. 2016. Building end-to-end dialogue systems using generative hierarchical neural network models. In AAAI. 3776–3784.
    [35]
    Iulian Vlad Serban, Alessandro Sordoni, Ryan Lowe, Laurent Charlin, Joelle Pineau, Aaron Courville, and Yoshua Bengio. 2017. A hierarchical latent variable encoder-decoder model for generating dialogues. In AAAI. 3295–3301.
    [36]
    Lifeng Shang, Zhengdong Lu, and Hang Li. 2015. Neural responding machine for short-text conversation. In ACL.
    [37]
    Yan Song, Shuming Shi, Jing Li, and Haisong Zhang. 2018. Directional skip-gram: Explicitly distinguishing left and right context for word embeddings. In NAACL. 175–180.
    [38]
    Zhenqiao Song, Xiaoqing Zheng, Lu Liu, Mu Xu, and Xuan-Jing Huang. 2019. Generating responses with a specific emotion in dialog. In ACL. 3685–3695.
    [39]
    Alessandro Sordoni, Michel Galley, Michael Auli, Chris Brockett, Yangfeng Ji, Margaret Mitchell, Jian-Yun Nie, Jianfeng Gao, and Bill Dolan. 2015. A neural network approach to context-sensitive generation of conversational responses. North American Chapter of ACL-HLT, 196–205.
    [40]
    Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In NIPS. 3104–3112.
    [41]
    Duyu Tang, Furu Wei, Nan Yang, Ming Zhou, Ting Liu, and Bing Qin. 2014. Learning sentiment-specific word embedding for twitter sentiment classification. In ACL. 1555–1565.
    [42]
    Chongyang Tao, Wei Wu, Can Xu, Wenpeng Hu, Dongyan Zhao, and Rui Yan. 2019. Multi-representation fusion network for multi-turn response selection in retrieval-based chatbots. In WSDM. 267–275.
    [43]
    Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In NIPS. 5998–6008.
    [44]
    Ashwin K. Vijayakumar, Michael Cogswell, Ramprasath R. Selvaraju, Qing Sun, Stefan Lee, David Crandall, and Dhruv Batra. 2018. Diverse beam search: Decoding diverse solutions from neural sequence models. In AAAI. 7371–7379.
    [45]
    Oriol Vinyals, Łukasz Kaiser, Terry Koo, Slav Petrov, Ilya Sutskever, and Geoffrey Hinton. 2015. Grammar as a foreign language. In NIPS. 2773–2781.
    [46]
    Oriol Vinyals and Quoc Le. 2015. A neural conversational model. In ICML Deep Learning Workshop.
    [47]
    Richard Wallace. 2003. The Elements of AIML Style. Alice AI Foundation (2003).
    [48]
    Hao Wang, Zhengdong Lu, Hang Li, and Enhong Chen. 2013. A dataset for research on short-text conversations. In EMNLP. 935–945.
    [49]
    Wei Wei, Gao Cong, Xiaoli Li, See-Kiong Ng, and Guohui Li. 2011. Integrating community question and answer archives. In AAAI. 1255–1260.
    [50]
    Wei Wei, Jiayi Liu, Xianling Mao, Guibing Guo, Feida Zhu, Pan Zhou, and Yuchong Hu. 2019. Emotion-aware chat machine: Automatic emotional response generation for human-like emotional interaction. In CIKM. 1401–1410.
    [51]
    Wei Wei, ZhaoYan Ming, Liqiang Nie, Guohui Li, Jianjun Li, Feida Zhu, Tianfeng Shang, and Changyin Luo. 2016. Exploring heterogeneous features for query-focused summarization of categorized community answers. Inf. Sci. 330 (2016), 403–423.
    [52]
    Joseph Weizenbaum. 1966. ELIZA: A computer program for the study of natural language communication between man and machine. Commun. ACM 9, 1 (1966), 36–45.
    [53]
    Bruce Wilcox. 2011. Beyond Façade: Pattern Matching for Natural Language Applications. Retreived from GamaSutra.com (2011).
    [54]
    Yu Wu, Wei Wu, Chen Xing, Ming Zhou, and Zhoujun Li. 2016. Sequential matching network: A new architecture for multi-turn response selection in retrieval-based chatbots. In ACL. 496–505.
    [55]
    Chen Xing, Wei Wu, Yu Wu, Jie Liu, Yalou Huang, Ming Zhou, and Wei-Ying Ma. 2017. Topic aware neural response generation. In AAAI. 3351–3357.
    [56]
    Rui Yan, Yiping Song, and Hua Wu. 2016. Learning to respond with deep neural networks for retrieval-based human-computer conversation system. In SIGIR. 55–64.
    [57]
    Hainan Zhang, Yanyan Lan, Jiafeng Guo, Jun Xu, and Xueqi Cheng. 2018. Tailored sequence to sequence models to different conversation scenarios. In ACL. 1479–1488.
    [58]
    Hao Zhou, Minlie Huang, Tianyang Zhang, Xiaoyan Zhu, and Bing Liu. 2018. Emotional chatting machine: Emotional conversation generation with internal and external memory. In AAAI. 730–738.
    [59]
    Hao Zhou, Tom Young, Minlie Huang, Haizhou Zhao, Jingfang Xu, and Xiaoyan Zhu. 2018. Commonsense knowledge aware conversation generation with graph attention. In IJCAI. 4623–4629.
    [60]
    Li Zhou, Jianfeng Gao, Di Li, and Heung-Yeung Shum. 2020. The design and implementation of xiaoice, an empathetic social chatbot. Comput. Ling. 46, 1 (2020), 53–93.
    [61]
    Xiangyang Zhou, Daxiang Dong, Hua Wu, Shiqi Zhao, Dianhai Yu, Hao Tian, Xuan Liu, and Rui Yan. 2016. Multi-view response selection for human-computer conversation. In EMNLP. 372–381.
    [62]
    Xiangyang Zhou, Lu Li, Daxiang Dong, Yi Liu, Ying Chen, Wayne Xin Zhao, Dianhai Yu, and Hua Wu. 2018. Multi-turn response selection for chatbots with deep attention matching network. In ACL. 1118–1127.
    [63]
    Xianda Zhou and William Yang Wang. 2017. Mojitalk: Generating emotional responses at scale. In ACL. 1128–1137.

    Cited By

    View all
    • (2024)Label-semantics enhanced multi-layer heterogeneous graph convolutional network for Aspect Sentiment Quadruplet ExtractionExpert Systems with Applications10.1016/j.eswa.2024.124523255(124523)Online publication date: Dec-2024
    • (2024)Twain-GCN: twain-syntax graph convolutional networks for aspect-based sentiment analysisKnowledge and Information Systems10.1007/s10115-024-02135-1Online publication date: 30-May-2024
    • (2023)A Topic-Enhanced Approach for Emotion Distribution Forecasting in ConversationsICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP49357.2023.10096414(1-5)Online publication date: 4-Jun-2023
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Information Systems
    ACM Transactions on Information Systems  Volume 39, Issue 4
    October 2021
    482 pages
    ISSN:1046-8188
    EISSN:1558-2868
    DOI:10.1145/3477247
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 August 2021
    Accepted: 01 March 2021
    Revised: 01 January 2021
    Received: 01 May 2020
    Published in TOIS Volume 39, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Dialogue generation
    2. emotional conversation
    3. emotional chatbot

    Qualifiers

    • Research-article
    • Refereed

    Funding Sources

    • National Natural Science Foundation of China
    • Equipment Pre-Research Fund

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)58
    • Downloads (Last 6 weeks)6

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Label-semantics enhanced multi-layer heterogeneous graph convolutional network for Aspect Sentiment Quadruplet ExtractionExpert Systems with Applications10.1016/j.eswa.2024.124523255(124523)Online publication date: Dec-2024
    • (2024)Twain-GCN: twain-syntax graph convolutional networks for aspect-based sentiment analysisKnowledge and Information Systems10.1007/s10115-024-02135-1Online publication date: 30-May-2024
    • (2023)A Topic-Enhanced Approach for Emotion Distribution Forecasting in ConversationsICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP49357.2023.10096414(1-5)Online publication date: 4-Jun-2023
    • (2023)A novel adaptive marker segmentation graph convolutional network for aspect-level sentiment analysisKnowledge-Based Systems10.1016/j.knosys.2023.110559270:COnline publication date: 21-Jun-2023
    • (2023)DialogueINAB: an interaction neural network based on attitudes and behaviors of interlocutors for dialogue emotion recognitionThe Journal of Supercomputing10.1007/s11227-023-05439-179:18(20481-20514)Online publication date: 15-Jun-2023
    • (2022)Emotionally Intelligent Chatbots: A Systematic Literature ReviewHuman Behavior and Emerging Technologies10.1155/2022/96016302022(1-23)Online publication date: 26-Sep-2022
    • (2022)Leveraging hierarchical semantic‐emotional memory in emotional conversation generationCAAI Transactions on Intelligence Technology10.1049/cit2.121438:3(824-835)Online publication date: 2-Oct-2022
    • (2022)Live support by chatbots with artificial intelligence: A future research agendaService Business10.1007/s11628-022-00513-917:1(61-80)Online publication date: 10-Nov-2022
    • (2022)A response generator with response-aware encoder for generating specific and relevant responsesSoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-022-07664-x27:7(3721-3732)Online publication date: 28-Nov-2022
    • (2021)Conversational Search and Recommendation: Introduction to the Special IssueACM Transactions on Information Systems10.1145/346527239:4(1-6)Online publication date: 1-Sep-2021

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media