Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2254556.2254660acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaviConference Proceedingsconference-collections
research-article

The semantics of clustering: analysis of user-generated spatializations of text documents

Published: 21 May 2012 Publication History

Abstract

Analyzing complex textual datasets consists of identifying connections and relationships within the data based on users' intuition and domain expertise. In a spatial workspace, users can do so implicitly by spatially arranging documents into clusters to convey similarity or relationships. Algorithms exist that spatialize and cluster such information mathematically based on similarity metrics. However, analysts often find inconsistencies in these generated clusters based on their expertise. Therefore, to support sensemaking, layouts must be co-created by the user and the model. In this paper, we present the results of a study observing individual users performing a sensemaking task in a spatial workspace. We examine the users' interactions during their analytic process, and also the clusters the users manually created. We found that specific interactions can act as valuable indicators of important structure within a dataset. Further, we analyze and characterize the structure of the user-generated clusters to identify useful metrics to guide future algorithms. Through a deeper understanding of how users spatially cluster information, we can inform the design of interactive algorithms to generate more meaningful spatializations for text analysis tasks, to better respond to user interactions during the analytics process, and ultimately to allow analysts to more rapidly gain insight.

References

[1]
Marshall, C. C. and Rogers, R. A. Two years before the mist: experiences with Aquanet. Proceedings of the ACM conference on Hypertext (Milan, Italy, 1992). ACM, 53--62.
[2]
Andrews, C., Endert, A., Yost, B. and North, C. Information visualization on large, high-resolution displays: Issues, challenges, and opportunities. Information Visualization, 10, 4 (2011), 341--355.
[3]
Andrews, C., Endert, A. and North, C. Space to Think: Large, High-Resolution Displays for Sensemaking. CHI (2010), 55--64.
[4]
Pirolli, P. and Card, S. Sensemaking Processes of Intelligence Analysts and Possible Leverage Points as Identified Though Cognitive Task Analysis Proceedings of the 2005 International Conference on Intelligence Analysis, McLean, Virginia(2005), 6.
[5]
Thomas, J. J. and Cook, K. A. Illuminating the path. IEEE Computer Society, 2005.
[6]
Endert, A., Fiaux, P. and North, C. Semantic Interaction for Visual Text Analytics. CHI (2012).
[7]
Skupin, A. A Cartographic Approach to Visualizing Conference Abstracts. IEEE Computer Graphics and Applications, 22(2002), 50--58.
[8]
Kaban, A. A Scalable Generative Topographic Mapping for Sparse Data Sequences. International Conference on Information Technology: Coding and Computing (ITCC'05), (2005).
[9]
Xu, R. and Wunsch, D., II Survey of clustering algorithms. IEEE Transactions on Neural Networks, 16, 3 (2005), 645--678.
[10]
Alonso, O., Gertz, M. and Baeza-Yates, R. Clustering and exploring search results using timeline constructions. Proceedings of the 18th ACM conference on Information and knowledge management (Hong Kong, China, 2009). ACM, 97--106.
[11]
Wise, J. A., Thomas, J. J., Pennock, K., Lantrip, D., Pottier, M., Schur, A. and Crow, V. Visualizing the non-visual: spatial analysis and interaction with information for text documents. Readings in information visualization: using vision to think (1999). Morgan Kaufmann Publishers Inc., 442--450.
[12]
Pike, W. A., Stasko, J., Chang, R. and O'Connell, T. A. The science of interaction. Information Visualization, 8, 4, 263--274.
[13]
Yi, J. S., Kang, Y. a., Stasko, J. and Jacko, J. Toward a Deeper Understanding of the Role of Interaction in Information Visualization. IEEE Transactions on Visualization and Computer Graphics, 13, 6 (2007), 1224--1231.
[14]
Card, S. K., Mackinlay, J. D. and Shneiderman, B. Readings in information visualization: using vision to think. Morgan Kaufmann Publishers Inc., 1999.
[15]
Dou, W., Jeong, D. H., Stukes, F., Ribarsky, W., Lipford, H. R. and Chang, R. Recovering Reasoning Processes from User Interactions. IEEE Computer Graphics and Applications, 29(2009), 52--61.
[16]
Chang, R., Ghoniem, M., Kosara, R., Ribarsky, W., Yang, J., Suma, E., Ziemkiewicz, C., Kern, D. and Sudjianto, A. WireVis: Visualization of Categorical, Time-Varying Data From Financial Transactions. Proceedings of the 2007 IEEE Symposium on Visual Analytics Science and Technology (2007). IEEE Computer Society, 155--162.
[17]
Shupp, L., Andrews, C., Dickey-Kurdziolek, M., Yost, B. and North, C. Shaping the Display of the Future: The Effects of Display Size and Curvature on User Performance and Insights. Human--Computer Interaction, 24, 1 (2009), 230--272.
[18]
Hossain, M. S., Andrews, C., Ramakrishnan, N. and North, C. Helping Intelligence Analysis Make Connections. Workshop on Scalable Integration of Analytics and Visualization (San Fransisco, 2011).
[19]
Hossain, M. S., Gresock, J., Edmonds, Y., Helm, R., Potts, M. and Ramakrishnan, N. Connecting the Dots between Abstracts. PLOS One, 7, 1 (2012), e29509.
[20]
Rose, S., Engel, D., Cramer, N. and Cowley, W. Automatic Keyword Extraction from Individual Documents. Text Mining (2010). John Wiley & Sons, Ltd, 1--20.
[21]
Shipman, F. and Marshall, C. Formality Considered Harmful: Experiences, Emerging Themes, and Directions on the Use of Formal Representations in Interactive Systems. Comput. Supported Coop. Work, 8, 4 (1999), 333--352.
[22]
Endert, A., Han, C., Maiti, D., House, L., Leman, S. C. and North, C. Observation-level Interaction with Statistical Models for Visual Analytics. IEEE VAST (2011), 121--130.
[23]
Xing, E. P., Ng, A. Y., Jordan, M. I. and Russell, S. Distance Metric Learning, with Application to Clustering with Side-information. Advances in Neural Information Processing Systems 15 (2002). MIT Press.

Cited By

View all
  • (2024)Understanding Human-AI Workflows for Generating PersonasProceedings of the 2024 ACM Designing Interactive Systems Conference10.1145/3643834.3660729(757-781)Online publication date: 1-Jul-2024
  • (2023)Towards Usable Neural Comment Generation via Code-Comment Linkage Interpretation: Method and Empirical StudyIEEE Transactions on Software Engineering10.1109/TSE.2022.321485949:4(2239-2254)Online publication date: 1-Apr-2023
  • (2022)Towards Immersive Collaborative SensemakingProceedings of the ACM on Human-Computer Interaction10.1145/35677416:ISS(722-746)Online publication date: 14-Nov-2022
  • Show More Cited By

Index Terms

  1. The semantics of clustering: analysis of user-generated spatializations of text documents

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    AVI '12: Proceedings of the International Working Conference on Advanced Visual Interfaces
    May 2012
    846 pages
    ISBN:9781450312875
    DOI:10.1145/2254556
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • Consulta Umbria SRL
    • University of Salerno: University of Salerno

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 May 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. clustering
    2. text analytics
    3. visual analytics
    4. visualization

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    AVI'12
    Sponsor:
    • University of Salerno

    Acceptance Rates

    Overall Acceptance Rate 128 of 490 submissions, 26%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)19
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 09 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Understanding Human-AI Workflows for Generating PersonasProceedings of the 2024 ACM Designing Interactive Systems Conference10.1145/3643834.3660729(757-781)Online publication date: 1-Jul-2024
    • (2023)Towards Usable Neural Comment Generation via Code-Comment Linkage Interpretation: Method and Empirical StudyIEEE Transactions on Software Engineering10.1109/TSE.2022.321485949:4(2239-2254)Online publication date: 1-Apr-2023
    • (2022)Towards Immersive Collaborative SensemakingProceedings of the ACM on Human-Computer Interaction10.1145/35677416:ISS(722-746)Online publication date: 14-Nov-2022
    • (2022)Evaluating the Benefits of Explicit and Semi-Automated Clusters for Immersive Sensemaking2022 IEEE International Symposium on Mixed and Augmented Reality (ISMAR)10.1109/ISMAR55827.2022.00064(479-488)Online publication date: Oct-2022
    • (2022)Designing a Topic-Based Literature Exploration Tool in AR - An exploratory study for neuroscience2022 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct)10.1109/ISMAR-Adjunct57072.2022.00099(471-476)Online publication date: Oct-2022
    • (2021)Semantic Explanation of Interactive Dimensionality Reduction2021 IEEE Visualization Conference (VIS)10.1109/VIS49827.2021.9623322(26-30)Online publication date: Oct-2021
    • (2021)Narrative Sensemaking: Strategies for Narrative Maps Construction2021 IEEE Visualization Conference (VIS)10.1109/VIS49827.2021.9623296(181-185)Online publication date: Oct-2021
    • (2021)An Examination of Grouping and Spatial Organization Tasks for High-Dimensional Data ExplorationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2020.302889027:2(1742-1752)Online publication date: Feb-2021
    • (2019)Sherpa: Leveraging User Attention for Computational Steering in Visual Analytics2019 IEEE Visualization in Data Science (VDS)10.1109/VDS48975.2019.8973384(48-57)Online publication date: Oct-2019
    • (2019)Bridging Text Visualization and Mining: A Task-Driven SurveyIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2018.283434125:7(2482-2504)Online publication date: 1-Jul-2019
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media