CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization

Haitao Lin; Liqun Ma; Junnan Zhu; Lu Xiang; Yu Zhou; Jiajun Zhang; Chengqing Zong

doi:10.18653/v1/2021.emnlp-main.365

CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization

Haitao Lin, Liqun Ma, Junnan Zhu, Lu Xiang, Yu Zhou, Jiajun Zhang, Chengqing Zong

Abstract

Dialogue summarization has drawn much attention recently. Especially in the customer service domain, agents could use dialogue summaries to help boost their works by quickly knowing customer’s issues and service progress. These applications require summaries to contain the perspective of a single speaker and have a clear topic flow structure, while neither are available in existing datasets. Therefore, in this paper, we introduce a novel Chinese dataset for Customer Service Dialogue Summarization (CSDS). CSDS improves the abstractive summaries in two aspects: (1) In addition to the overall summary for the whole dialogue, role-oriented summaries are also provided to acquire different speakers’ viewpoints. (2) All the summaries sum up each topic separately, thus containing the topic-level structure of the dialogue. We define tasks in CSDS as generating the overall summary and different role-oriented summaries for a given dialogue. Next, we compare various summarization methods on CSDS, and experiment results show that existing methods are prone to generate redundant and incoherent summaries. Besides, the performance becomes much worse when analyzing the performance on role-oriented summaries and topic structures. We hope that this study could benchmark Chinese dialogue summarization and benefit further studies.

Anthology ID:: 2021.emnlp-main.365
Volume:: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2021
Address:: Online and Punta Cana, Dominican Republic
Editors:: Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 4436–4451
Language:
URL:: https://aclanthology.org/2021.emnlp-main.365
DOI:: 10.18653/v1/2021.emnlp-main.365
Bibkey:
Cite (ACL):: Haitao Lin, Liqun Ma, Junnan Zhu, Lu Xiang, Yu Zhou, Jiajun Zhang, and Chengqing Zong. 2021. CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 4436–4451, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
Cite (Informal):: CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization (Lin et al., EMNLP 2021)
Copy Citation:
PDF:: https://aclanthology.org/2021.emnlp-main.365.pdf
Video:: https://aclanthology.org/2021.emnlp-main.365.mp4
Code: xiaolinAndy/CSDS + additional community code

PDF Cite Search Code Video