short-paper

How to Improve Semantics Understanding of Word Clouds

Authors:

Yan LiAuthors Info & Claims

VINCI '19: Proceedings of the 12th International Symposium on Visual Information Communication and Interaction

Article No.: 13, Pages 1 - 5

https://doi.org/10.1145/3356422.3356449

Published: 20 September 2019 Publication History

Abstract

Word cloud is a text visualization technique which is widely applied in helping improve semantic understanding about target materials. One of the most important features is the font size, which represents words frequencies of a document. As the result, in this paper, we explore how to set font sizes of words, and its influence on semantic understanding through people's performance with qualitative and controlled experiments. Adopting an machine learning algorithm LDA (Latent Dirichlet Allocation) topic model, we quantify semantics of the document and judge participants' accuracy performance. The experimental results show the influence of different font size on semantic understanding performance and provide insights for ways in promoting semantic understanding of word cloud.

References

[1]

Eric Alexander, Chih-Ching Chang, Mariana Shimabukuro, Steven Franconeri, Christopher Collins, and Michael Gleicher. 2018. Perceptual biases in font size as a data encoding. IEEE transactions on visualization and computer graphics 24, 8 (2018), 2397--2410.

[2]

Lukas Barth, Stephen G Kobourov, and Sergey Pupyrev. 2014. Experimental comparison of semantic word clouds. In International Symposium on Experimental Algorithms. Springer, 247--258.

Digital Library

[3]

Scott Bateman, Carl Gutwin, and Miguel Nacenta. 2008. Seeing things in the clouds: the effect of visual features on tag cloud selections. In Proceedings of the nineteenth ACM conference on Hypertext and hypermedia. ACM, 193--202.

Digital Library

[4]

David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of machine Learning research 3, Jan (2003), 993--1022.

Digital Library

[5]

Ming-Te Chi, Shih-Syun Lin, Shiang-Yi Chen, Chao-Hung Lin, and Tong-Yee Lee. 2015. Morphable word clouds for time-varying text data visualization. IEEE transactions on visualization and computer graphics 21, 12 (2015), 1415--1426.

Digital Library

[6]

Weiwei Cui, Yingcai Wu, Shixia Liu, Furu Wei, Michelle X Zhou, and Huamin Qu. 2010. Context preserving dynamic word cloud visualization. In 2010 IEEE Pacific Visualization Symposium (PacificVis). IEEE, 121--128.

[7]

Emden R Gansner, Yifan Hu, and Stephen C North. 2013. Interactive Visualization of Streaming Text Data with Dynamic Maps. J. Graph Algorithms Appl. 17, 4 (2013), 515--540.

[8]

Martin J Halvey and Mark T Keane. 2007. An assessment of tag presentation techniques. In Proceedings of the 16th international conference on World Wide Web. ACM, 1313--1314.

Digital Library

[9]

Florian Heimerl, Steffen Lohmann, Simon Lange, and Thomas Ertl. 2014. Word cloud explorer: Text analytics based on word clouds. In 2014 47th Hawaii International Conference on System Sciences. IEEE, 1833--1842.

Digital Library

[10]

Kyle Koh, Bongshin Lee, Bohyoung Kim, and Jinwook Seo. 2010. Maniwordle: Providing flexible control over wordle. IEEE Transactions on Visualization and Computer Graphics 16, 6 (2010), 1190--1197.

Digital Library

[11]

Aradhna Krishna. 2012. An integrative review of sensory marketing: Engaging the senses to affect perception, judgment and behavior. Journal of consumer psychology 22, 3 (2012), 332--351.

[12]

Xiaotong Liu, Han-Wei Shen, and Yifan Hu. 2015. Supporting multifaceted viewing of word clouds with focus+ context display. Information Visualization 14, 2 (2015), 168--180.

[13]

Steffen Lohmann, Florian Heimerl, Fabian Bopp, Michael Burch, and Thomas Ertl. 2015. Concentri cloud: Word cloud visualization for multiple text documents. In 2015 19th International Conference on Information Visualisation. IEEE, 114--120.

Digital Library

[14]

Steffen Lohmann, Jürgen Ziegler, and Lena Tetzlaff. 2009. Comparison of tag cloud layouts: Task-related performance and visual exploration. In IFIP Conference on Human-Computer Interaction. Springer, 392--404.

Digital Library

[15]

Anna W Rivadeneira, Daniel M Gruen, Michael J Muller, and David R Millen. 2007. Getting our head in the clouds: toward evaluation studies of tagclouds. In Proceedings of the SIGCHI conference on Human factors in computing systems. ACM, 995--998.

Digital Library

[16]

Elvis Saravia, Carlos Argueta, and Yi-Shin Chen. 2015. Emoviz: Mining the world's interest through emotion analysis. In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015. ACM, 753--756.

Digital Library

[17]

Ji Wang. 2012. Clustered layout word cloud for user generated online reviews. Ph.D. Dissertation. Virginia Tech.

[18]

Ji Wang, Jian Zhao, Sheng Guo, Chris North, and Naren Ramakrishnan. 2014. ReCloud: semantics-based word cloud visualization of user reviews. In Proceedings of Graphics Interface 2014. Canadian Information Processing Society, 151--158.

[19]

Yunhai Wang, Xiaowei Chu, Chen Bao, Lifeng Zhu, Oliver Deussen, Baoquan Chen, and Michael Sedlmair. 2018. Edwordle: Consistency-preserving word cloud editing. IEEE transactions on visualization and computer graphics 24, 1 (2018), 647--656.

[20]

Ho Chung Wu, Robert Wing Pong Luk, Kam Fai Wong, and Kui Lam Kwok. 2008. Interpreting tf-idf term weights as making relevance decisions. ACM Transactions on Information Systems (TOIS) 26, 3 (2008), 13.

Digital Library

[21]

Yingcai Wu, Thomas Provan, Furu Wei, Shixia Liu, and Kwan-Liu Ma. 2011. Semantic-preserving word clouds by seam carving. In Computer Graphics Forum, Vol. 30. Wiley Online Library, 741--750.

[22]

Jin Xu, Yubo Tao, and Hai Lin. 2016. Semantic word cloud generation based on word embeddings. In 2016 IEEE Pacific Visualization Symposium (PacificVis). IEEE, 239--243.

Cited By

Méndez GMoreno OMendoza P(2023)The Landscape of Visual Information Communication and Interaction ResearchProceedings of the 16th International Symposium on Visual Information Communication and Interaction10.1145/3615522.3615523(1-8)Online publication date: 22-Sep-2023
https://dl.acm.org/doi/10.1145/3615522.3615523

Recommendations

Concentri Cloud: Word Cloud Visualization for Multiple Text Documents
IV '15: Proceedings of the 2015 19th International Conference on Information Visualisation

Word clouds provide a simple and effective means to visually communicate the most frequent words of text documents. However, only few word cloud visualizations support the contrastive analysis of multiple texts. This paper introduces Concentri Cloud, a ...
Design and Implementation of Text Understanding System Based on Semantic Tagging Instances
AIEE '23: Proceedings of the 2023 4th International Conference on Artificial Intelligence in Electronics Engineering

There are many challenges in text semantic understanding research, such as sparse semantic features, massive representation forms, complex semantic redundancy, diverse echo phenomena, etc. Based on explicit representation of human thinking responses, ...
Fisheye word cloud for temporal sentiment exploration
CHI EA '13: CHI '13 Extended Abstracts on Human Factors in Computing Systems

This poster abstract presents a new word cloud technique, the Fisheye Word Cloud, for exploring time-series data in a focused+context approach to analyzing word data. Our design has two features: cursor-centric layout and word cloud generation on ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

VINCI '19: Proceedings of the 12th International Symposium on Visual Information Communication and Interaction

September 2019

201 pages

ISBN:9781450376266

DOI:10.1145/3356422

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

East China Normal University

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 September 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper
Research
Refereed limited

Conference

VINCI'2019

VINCI'2019: The 12th International Symposium on Visual Information Communication and Interaction

September 20 - 22, 2019

Shanghai, China

Acceptance Rates

Overall Acceptance Rate 71 of 193 submissions, 37%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
114
Total Downloads

Downloads (Last 12 months)21
Downloads (Last 6 weeks)1

Reflects downloads up to 12 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Méndez GMoreno OMendoza P(2023)The Landscape of Visual Information Communication and Interaction ResearchProceedings of the 16th International Symposium on Visual Information Communication and Interaction10.1145/3615522.3615523(1-8)Online publication date: 22-Sep-2023
https://dl.acm.org/doi/10.1145/3615522.3615523

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents