Abstract
The natural way to model a news corpus is as a directed graph where stories are linked to one another through a variety of relationships. We formalize this notion by viewing each news story as a set of actors, and by viewing links between stories as transformations these actors go through. We propose and model a simple and comprehensive set of transformations: create, merge, split, continue, and cease. These transformations capture evolution of a single actor and interactions among multiple actors. We present algorithms to rank each transformation and show how ranking helps us to infer important relationships between actors and stories in a corpus. We demonstrate the effectiveness of our notions by experimenting on large news corpora.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Allan, J., Carbonell, J., Doddington, G., Yamron, J., Yang, Y.: Topic detection and tracking pilot study: Final report. In: DARPA Broadcast News Transcription and Understanding Workshop, pp. 194–218 (2006)
Choudhary, R., Mehta, S., Bagchi, A., Balakrishna, R.: A framework for exploring news corpora by actor evolution and interaction. IBM Research Report- RI07004 (2007)
Mei, Q., Zhai, C.: Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In: KDD 2005: 11th ACM SIGKDD international conference on Knowledge Discovery and data mining, pp. 198–207 (2005)
Mei, Q., Zhai, C.: A mixture model for contextual text mining. In: KDD 2006: 12th ACM SIGKDD international conference on Knowledge Discovery and data mining, pp. 649–655 (2006)
Nallapati, R., Feng, A., Peng, F., Allan, J.: Event threading within news topics. In: CIKM 2004: 13th ACM International Conference on Information and Knowledge Management, pp. 446–453 (2004)
Silver, D., Wang, X.: Volume tracking. In: VIS 1996: 7th conference on Visualization, pp. 157–164 (1996)
Spiliopoulou, M., Ntoutsi, I., Theodoridis, Y., Schult, R.: Monic: modeling and monitoring cluster transitions. In: KDD 2006: 12th ACM SIGKDD international conference on Knowledge Discovery and data mining, pp. 706–711 (2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Choudhary, R., Mehta, S., Bagchi, A., Balakrishnan, R. (2008). Towards Characterization of Actor Evolution and Interactions in News Corpora. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds) Advances in Information Retrieval. ECIR 2008. Lecture Notes in Computer Science, vol 4956. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78646-7_39
Download citation
DOI: https://doi.org/10.1007/978-3-540-78646-7_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78645-0
Online ISBN: 978-3-540-78646-7
eBook Packages: Computer ScienceComputer Science (R0)