Abstract
This work presents a survey of methods that visualize text streams. Existing methods are classified and compared from the aspect of visualization process. We introduce new aspects of method comparison: data type, text representation, and the temporal drawing approach. The subjectivity of visualization is described, and evaluation methodologies are explained. Related research areas are discussed and some future trends in the field anticipated.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Scott Owen, G., Domik, G., Rhyne, T.M., Brodlie, K.W., Santos, B.S.: Definitions and rationale for visualization, http://www.siggraph.org/education/materials/HyperVis/visgoals/visgoal2.htm (accessed in February 2010)
Friendly, M., Denis, D.: Milestones in the history of thematic cartography, statistical graphics, and data visualization, vol. 9 (2008)
Thomas, J.J., Cook, K.A.: Illuminating the Path: The Research and Development Agenda for Visual Analytics. National Visualization and Analytics Ctr (2005)
Risch, J., Kao, A., Poteet, S., Wu, Y.: Text Visualization for Visual Text Analytics. In: Simoff, S.J., Böhlen, M.H., Mazeika, A. (eds.) Visual Data Mining. LNCS, vol. 4404, pp. 154–171. Springer, Heidelberg (2008)
Keim, D.A., Mansmann, F., Thomas, J.: Visual Analytics: How Much Visualization and How Much Analytics? In: ACM SIGKDD Workshop on Visual Analytics and Knowledge Discovery - VAKD 2009 (2009)
Tukey, J.W.: Exploratory Data Analysis. Addison-Wesley, Reading (1977)
Allan, J.: Tracking, Event- based Information Organization. Kluwer Academic Publishers, Dordrecht (2002)
Luo, D., Yang, J., Fan, J., Ribarsky, W., Luo, H.: EventRiver: Interactive Visual Exploration of Streaming Text. EG/IEEE EuroVis 2009 (2009) (to be published)
Carpendale, S.: Evaluating information visualizations. In: Information Visualization: Human-Centered Issues and Perspectives, pp. 19–45. Springer, Heidelberg (2008)
Tufte, E.R.: Visual Explanations. Graphics Press (1997)
Benson, J., Crist, D., Lafleur, P.: Agent-based visualization of streaming text. In: Proc. IEEE Info. Vis. Conf., Raleigh (2008)
Weskamp, M.: (2004), http://marumushi.com/projects/newsmap (acc. in Apr. 2010)
Albrecht-Buehler, C., Watson, B., Shamma, D.A.: Visualizing live text streams using motion and temporal pooling. IEEE Comp. Graph. App. 25(3), 52–59 (2005)
Mao, Y., Dillon, J., Lebanon, G.: Sequential document visualization. IEEE Transactions on Visualization and Computer Graphics 13(6), 1208–1215 (2007)
Linguistic Data Consortium: The New York Times Annotated Corpus, http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2008T19
Leskovec, J., Backstrom, L., Kleinberg, J.M.: Meme-tracking and the dynamics of the news cycle. In: Proc. of the 15th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pp. 497–506 (2009)
Wise, J.A., Thomas, J.J., Pennock, K., Lantrip, D., Pottier, M., Schur, A., Crow, V.: Visualizing the non-visual: Spatial analysis and interaction with information from text documents. In: Proc. IEEE Symp. Info. Vis., pp. 51–58 (1995)
Miller, N.E., Wong, P.C., Brewster, M., Foote, H.: TOPIC ISLANDS - a wavelet-based text visualization system. In: Proc. 9th IEEE Conf. on Vis., pp. 189–196 (1998)
Berendt, B., Subasic, I.: STORIES in time: A graph-based interface for news tracking and discovery. In: Web Intel./IAT Workshops, pp. 531–534. IEEE, Los Alamitos (2009)
Kontostathis, A., Galitsky, L., Pottenger, W.M., Roy, S., Phelps, D.J.: A Survey of Emerging Trend Detection in Textual Data Mining (2003)
Yang, Y., Akers, L., Klose, T., Yang, C.B.: Text mining and visualization tools - impressions of emerging capabilities. World Patent Info. 30(4), 280–293 (2008)
Salton, G., Wong, A., Yang, A.C.S.: A vector space model for automatic indexing. Communications of the ACM 18, 229–237 (1975)
Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1), 1–47 (2002)
Mei, Q., Zhai, C.: Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In: Grossman, R., Bayardo, R.J., Bennett, K.P. (eds.) Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, Illinois, USA, August 21-24, pp. 198–207. ACM, New York (2005)
Moens, M.F.: Information Extraction, Algorithms and Prospects in a Retrieval Context. Springer, Heidelberg (2006)
Gregory, M., Chinchor, N., Whitney, P.: User-directed sentiment analysis: Visualizing the affective content of documents. In: Proc. of the Workshop on Sentiment and Subjectivity in Text. Association for Computational Linguistics (2006)
Sammon, J.: A nonlinear mapping for data structure analysis. IEEE Transactions on Computing 5(18), 401–409 (1969)
Lin, X., Soergel, D., Marchionini, G.: A Self-organizing semantic map for information retrieval. In: Proc. 14th. Ann. Int. ACM SIGIR Conf. on R&D In Information Retrieval, pp. 262–269 (1991)
Chalmers, M., Chitson, P.: Bead: Explorations in information visualization. In: Proc. of the 15th ACM SIGIR Conf. on R&D in Information Retrieval (1992)
Rennison, E.: Galaxy of news: An approach to visualizing and understanding expansive news landscapes. In: ACM User Interface Soft. and Tech., pp. 3–12 (1994)
Davidson, G.S., Hendrickson, B., Johnson, D.K., Meyers, C.E., Wylie, B.N.: Knowledge mining with VxInsight: Discovery through interaction. J. Intell. Inf. Syst. 11(3), 259–285 (1998)
Kaski, S., Lagus, K., Kohonen, T.: WEBSOM - Self-organizing maps of document collections. Neurocomputing 21, 101–117 (1998)
Risch, J., Rex, D., Dowson, S., Walters, T., May, R., Moon, B.: The starlight information visualization system. In: Proc. IEEE Conf. Info. Vis., pp. 42–49 (1997)
Havre, S., Hetzler, E.G., Nowell, L.T.: Themeriver: Visualizing theme changes over time. In: Proc. IEEE Conf. Info. Vis., pp. 115–124 (2000)
Kabán, A., Girolami, M.: A dynamic probabilistic model to visualise topic evolution in text streams. J. Intelligent Information Systems 18(2-3), 107–125 (2002)
Andrews, K., Kienreich, W., Sabol, V., Becker, J., Droschl, G., Kappe, F., Granitzer, M., Auer, P., Tochtermann, K.: The infosky visual explorer: exploiting hierarchical structure and document similarities. Info. Vis. 1(3-4), 166–181 (2002)
Wong, P.C., Foote, H., Adams, D., Cowley, W., Thomas, J.: Dynamic visualization of transient data streams. In: Proc. IEEE Symp. Info. Vis. (2003)
Fortuna, B., Grobelnik, M., Mladenic, D.: Visualization of text document corpus. Informatica (Slovenia) 29(4), 497–504 (2005)
Paulovich, F.V., Minghim, R.: Text map explorer: a tool to create and explore document maps. In: IV, pp. 245–251. IEEE Computer Society, Los Alamitos (2006)
Don, A., Zheleva, E., Gregory, M., Tarkan, S., Auvil, L., Clement, T., Shneiderman, B., Plaisant, C.: Discovering interesting usage patterns in text collections: integrating text mining with visualization. In: Proc. 16th ACM Conf. Information and Knowledge Management, CIKM, pp. 213–222 (2007)
Ghoniem, M., Luo, D., Yang, J., Ribarsky, W.: NewsLab: Exploratory Broadcast News Video Analysis. In: IEEE Symp. Vis. Analytics Sci. and Tech., pp. 123–130 (2007)
Paulovich, F.V., de Oliveira, M.C.F., Minghim, R.: The projection explorer: A flexible tool for projection-based multidimensional visualization. In: Proc. 20th Brazilian Symp. Comp. Graph. and Image Processing (SIBGRAPI), pp. 27–36 (2007)
Alencar, A.B., de Oliveira, M.C.F., Paulovich, F.V., Minghim, R., Andrade, M.G.: Temporal-PEx: Similarity-based visualization of time series. In: Proc. 20th Brazilian Symp. Comp. Graph. and Image Processing, SIBGRAPI (2007)
Ishikawa, Y., Hasegawa, M.: T-Scroll: Visualizing trends in a time-series of documents for interactive user exploration. In: Kovács, L., Fuhr, N., Meghini, C. (eds.) ECDL 2007. LNCS, vol. 4675, pp. 235–246. Springer, Heidelberg (2007)
Terachi, M., Saga, R., Sheng, Z., Tsuji, H.: Visualized technique for trend analysis of news articles. In: Nguyen, N.T., Borzemski, L., Grzech, A., Ali, M. (eds.) IEA/AIE 2008. LNCS (LNAI), vol. 5027, pp. 659–668. Springer, Heidelberg (2008)
Petrović, S., Dalbelo Bašić, B., Morin, A., Zupan, B., Chauchat, J.H.: Textual features for corpus visualization using correspondence analysis. Intell. Data Anal. 13(5), 795–813 (2009)
Strobelt, H., Oelke, D., Rohrdantz, C., Stoffel, A., Keim, D.A., Deussen, O.: Document cards: A top trumps visualization for documents. IEEE Trans. Vis. Comput. Graph 15(6), 1145–1152 (2009)
Prabowo, R., Thelwall, M., Alexandrov, M.: Generating overview timelines for major events in an RSS corpus. J. Informetrics 1(2), 131–144 (2007)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. American Society for Info. Science 41 (1990)
Jackson, J.E.: A User’s Guide to Principal Components. John Willey, New York (1991)
Greenacre, M.J.: Correspondence analysis in practice. Chapman and Hall, Boca Raton (2007)
Kruskal, J.B., Wish, M.: Multidimensional Scaling. Sage Publications, CA (1978)
York, J., Bohn, S., Pennock, K., Lantrip, D.: Clustering and dimensionality reduction in spire. In: AIPA Steering Group. Proc. Symp. Advanced Intelligence Processing and Analysis. Office of R&D, Washington (1995)
Paulovich, F.V., Nonato, L.G., Minghim, R., Levkowitz, H.: Least square projection: A fast high-precision multidimensional projection technique and its application to document mapping. IEEE T. Vis. Comp. Graph. 14(3), 564–575 (2008)
Kohonen, T.: Self-Organizing Maps. Springer, Berlin (1995)
Fruchterman, T.M.J., Reingold, E.M.: Graph drawing by force-directed placement. Software: Practice and Experience 21(11), 1129–1164 (1991)
Minghim, R., Paulovich, F.V., Lopes, A.A.: Content-based text mapping using multidimensional projections for exploration of document collections. In: IS&T/SPIE Symp. on Elect. Imag. - Vis. and Data Anal., San Jose (2006)
Shneiderman, B.: Treemaps for space-constrained visualization of hierarchies, http://www.cs.umd.edu/hcil/treemap-history/index.shtml (accessed in April 2010)
Berndt, D.J., Clifford, J.: Using dynamic time warping to find patterns in time series. In: AAAI Workshop on Knowledge Discovery in Databases, pp. 359–370 (1994)
Keogh, E.J., Lonardi, S., Ratanamahatana, C.A., Wei, L., Lee, S.H., Handley, J.: Compression-based data mining of sequential data. Data Min. Knowl. Discov. 14(1), 99–129 (2007)
Hearst, M.: User Interfaces and Visualization. Addison-Wesley Longman, Amsterdam (1999)
Eler, D.M., Paulovich, F.V., de Oliveira, M.C.F., Minghim, R.: Coordinated and multiple views for visualizing text collections. In: IEEE 12th Conf. Info. Vis., pp. 246–251 (2008)
van Wijk, J.J.: Views on visualization. IEEE T. Vis. Comp. Graph. 12(4) (2006)
McGrath, J.E.: Methodology matters: doing research in the behavioral and social sciences. Morgan Kaufmann Publishers Inc., San Francisco (1995)
Microsoft: Pivot project, http://www.getpivot.com/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Šilić, A., Bašić, B.D. (2010). Visualization of Text Streams: A Survey. In: Setchi, R., Jordanov, I., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2010. Lecture Notes in Computer Science(), vol 6277. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15390-7_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-15390-7_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15389-1
Online ISBN: 978-3-642-15390-7
eBook Packages: Computer ScienceComputer Science (R0)