Abstract
This work explores a novel approach for conversation detection in email mailboxes. This approach clusters messages into coherent conversations by using a similarity function among messages that takes into consideration all relevant email attributes, such as message subject, participants, date of submission, and message content. The detection algorithm is evaluated against a manual partition of two email mailboxes into conversations. Experimental results demonstrate the superiority of our detection algorithm over several other alternative approaches.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Aaron, H., Jen-Yuan, Y.: Email thread reassembly using similarity matching. In: Proceedings of the Third Conference on Email and Anti-Spam (CEAS) (2006)
Gabor, C., Keno, A., Roger, W.: BuzzTrack: Topic Detection and Tracking in Email. In: Proceedings of the 12th international conference on Intelligent user interfaces IUI 2007, ACM Press, New York (2007)
Kalman, Y.M., Rafaeli, S.: Email Chronemics: Unobtrusive Profiling of Response Times. In: Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS 2005), vol. 04, pp. 108.2 (2005)
Kerr, B.: THREAD ARCS: An Email Thread Visualization. In: Proceedings of IEEE InfoVis, Seattle, WA, pp. 211–218 (2003)
Klimt, B., Yang, Y.: Introducing the Enron Corpus. In: Proceedings of the First Conference on Email and Anti-Spam (CEAS), Mountain View, CA (2004)
Lam, D., Rohall, S.L., Schmandt, C., Stern, M.K.: Exploiting e-mail structure to improve summarization. In: ACM 2002 Conference on Computer Supported Cooperative Work (CSCW2002), New Orlenes, LA (2002)
Lewis, D.D., Gale, A.W.: A Sequential Algorithm for Training Text Classifiers. In: Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, Dublin, Ireland, pp. 3–12 (1994)
Lewis, D.D., Knowels, K.A.: Threading Electronic Mail: a preliminary study. In Information Processing and Management 33(2), 209–217 (1997)
Rudy, I.A.: A Critical Review of Research on Electronic Mail. European Journal of Information Systems 4, 198–213 (1996)
The Internet Society. RFC 2822 – Internet Message Format (2001), http://www.faqs.org/rfcs/rfc2822.html
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Erera, S., Carmel, D. (2008). Conversation Detection in Email Systems. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds) Advances in Information Retrieval. ECIR 2008. Lecture Notes in Computer Science, vol 4956. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78646-7_48
Download citation
DOI: https://doi.org/10.1007/978-3-540-78646-7_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78645-0
Online ISBN: 978-3-540-78646-7
eBook Packages: Computer ScienceComputer Science (R0)