An Experimental Approach for Information Extraction in Multi-party Dialogue Discourse
Pages 180 - 192
Abstract
In this paper, we address the task of information extraction for transcript meetings. Meeting documents are not usually well structured and are lacking formatting and punctuations. In addition, the information are distributed over multiple sentences. We experimentally investigate the usefulness of numerical statistics and topic modelling methods on a real dataset containing multi-part dialogue texts. Such information extraction can be used for different tasks, of which we consider two: contrasting thematically related but distinct meetings from each other, and contrasting meetings involving the same participants from those involving other. In addition to demonstrating the difference between counting and topic modeling results, we also evaluate our experiments with respect to the gold standards provided for the dataset.
References
[1]
Augmented multi-party interaction (2010). http://www.amiproject.org
[2]
Carletta J Unleashing the killer corpus: experiences in creating the multi-everything AMI meeting corpus Lang. Resour. Eval. 2007 41 181-190
[3]
Fernández, R., Frampton, M., Dowding, J., Adukuzhiyil, A., Ehlen, P., Peters, S.: Identifying relevant phrases to summarize decisions in spoken meetings. In: Proceedings of Interspeech 2008, Brisbane (2008)
[4]
Fernández, R., Frampton, M., Ehlen, P., Purver, M., Peters, S.: Modelling and detecting decisions in multi-party dialogue. In: Proceedings of the 9th SIGdial Workshop on Discourse and Dialogue, SIGdial 2008, pp. 156–163. Association for Computational Linguistics, Stroudsburg, PA, USA (2008)
[5]
Galley, M., McKeown, K., Fosler-Lussier, E., Jing, H.: Discourse segmentation of multi-party conversation. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, ACL 2003, vol. 1. Association for Computational Linguistics, Stroudsburg, PA, USA (2003)
[6]
Georgescul, M., Clark, A., Armstrong, S.: Exploiting structural meeting-specific features for topic segmentation. In: TALN/RECITAL, Toulouse (France), pp. 15–24 (2007)
[7]
Gurin, Y., Szymanski, T., Keane, M.T.: Discovering news events that move markets. In: Intelligent Systems Conference 2017 (IntelliSys2017), London, United Kingdom, 7–8 Sept 2017 (2017)
[8]
He, Q., Chang, K., Lim, E.P.: Analyzing feature trajectories for event detection. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2007, pp. 207–214. ACM, New York (2007)
[9]
Kleinberg, J.M.: Bursty and hierarchical structure in streams. Data Min. Knowl. Discov. 7(4), 373–397 (2003).
[10]
Lau JH, Collier N, and Baldwin T On-line trend analysis with topic models:#twitter trends detection topic model online Proc. COLING 2012 2012 1519-1534
[11]
Lee DD and Seung HS Learning the parts of objects by nonnegative matrix factorization Nature 1999 401 788-791
[12]
Purver, M., Dowding, J., Niekrasz, J., Ehlen, P., Noorbaloochi, S., Peters, S.: Detecting and summarizing action items in multi-party dialogue. In: In Proceedings of the 9th SIGdial Workshop on Discourse and Dialogue (2007)
[13]
Purver, M., Griffiths, T.L., Körding, K.P., Tenenbaum, J.B.: Unsupervised topic modelling for multi-party spoken discourse. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, ACL-44, pp. 17–24. Association for Computational Linguistics, Stroudsburg, PA, USA (2006)
[14]
Ramage, D., Dumais, S.T., Liebling, D.J.: Characterizing microblogs with topic models. In: ICWSM, vol. 10(1), pp. 16 (2010)
[15]
Řehůřek, R., Sojka, P.: Software Framework for Topic Modelling with Large Corpora. In: Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, pp. 45–50. ELRA, Valletta, Malta (2010). http://is.muni.cz/publication/884893/en
[16]
Riedhammer, K., Favre, B., Hakkani-Tür, D.: Packing the Meeting Summarization Knapsack. In: Interspeech, Brisbane (Australia). Unknown, Unknown or Invalid Region (2008). https://hal-amu.archives-ouvertes.fr/hal-01194290
[17]
Sayyadi, H., Hurst, M., Maykov, A.: Event detection and tracking in social streams. In: In Proceedings of the International Conference on Weblogs and Social Media (ICWSM 2009). AAAI (2009)
[18]
Tur, G., et al.: The CALO meeting speech recognition and understanding system. In: 2008 IEEE Spoken Language Technology Workshop, pp. 69–72 (2008)
[19]
Tur G et al. The calo meeting assistant system IEEE Trans. Audio Speech Lang. Process. 2010 18 1601-1611
[20]
Weng, J., Lee, B.S.: Event detection in twitter. In: ICWSM, vol. 11, pp. 401–408 (2011)
Index Terms
- An Experimental Approach for Information Extraction in Multi-party Dialogue Discourse
Index terms have been assigned to the content through auto-classification.
Recommendations
Matching Intentions for Discourse Parsing in Multi-party Dialogues
Advances and Trends in Artificial Intelligence. Theory and ApplicationsAbstractDiscourse parsing in multi-party dialogues is a fundamental task in dialogue systems, which involves identifying relations between elementary discourse units (EDUs), where an EDU is an utterance in a dialogue. Despite the variety of approaches ...
Comments
Information & Contributors
Information
Published In
© Springer Nature Switzerland AG 2023.
Publisher
Springer-Verlag
Berlin, Heidelberg
Publication History
Published: 26 February 2023
Author Tags
Qualifiers
- Article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 0Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Reflects downloads up to 22 Sep 2024
Other Metrics
Citations
View Options
View options
Get Access
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in