Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3356422.3356449acmotherconferencesArticle/Chapter ViewAbstractPublication PagesvinciConference Proceedingsconference-collections
short-paper

How to Improve Semantics Understanding of Word Clouds

Published: 20 September 2019 Publication History

Abstract

Word cloud is a text visualization technique which is widely applied in helping improve semantic understanding about target materials. One of the most important features is the font size, which represents words frequencies of a document. As the result, in this paper, we explore how to set font sizes of words, and its influence on semantic understanding through people's performance with qualitative and controlled experiments. Adopting an machine learning algorithm LDA (Latent Dirichlet Allocation) topic model, we quantify semantics of the document and judge participants' accuracy performance. The experimental results show the influence of different font size on semantic understanding performance and provide insights for ways in promoting semantic understanding of word cloud.

References

[1]
Eric Alexander, Chih-Ching Chang, Mariana Shimabukuro, Steven Franconeri, Christopher Collins, and Michael Gleicher. 2018. Perceptual biases in font size as a data encoding. IEEE transactions on visualization and computer graphics 24, 8 (2018), 2397--2410.
[2]
Lukas Barth, Stephen G Kobourov, and Sergey Pupyrev. 2014. Experimental comparison of semantic word clouds. In International Symposium on Experimental Algorithms. Springer, 247--258.
[3]
Scott Bateman, Carl Gutwin, and Miguel Nacenta. 2008. Seeing things in the clouds: the effect of visual features on tag cloud selections. In Proceedings of the nineteenth ACM conference on Hypertext and hypermedia. ACM, 193--202.
[4]
David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of machine Learning research 3, Jan (2003), 993--1022.
[5]
Ming-Te Chi, Shih-Syun Lin, Shiang-Yi Chen, Chao-Hung Lin, and Tong-Yee Lee. 2015. Morphable word clouds for time-varying text data visualization. IEEE transactions on visualization and computer graphics 21, 12 (2015), 1415--1426.
[6]
Weiwei Cui, Yingcai Wu, Shixia Liu, Furu Wei, Michelle X Zhou, and Huamin Qu. 2010. Context preserving dynamic word cloud visualization. In 2010 IEEE Pacific Visualization Symposium (PacificVis). IEEE, 121--128.
[7]
Emden R Gansner, Yifan Hu, and Stephen C North. 2013. Interactive Visualization of Streaming Text Data with Dynamic Maps. J. Graph Algorithms Appl. 17, 4 (2013), 515--540.
[8]
Martin J Halvey and Mark T Keane. 2007. An assessment of tag presentation techniques. In Proceedings of the 16th international conference on World Wide Web. ACM, 1313--1314.
[9]
Florian Heimerl, Steffen Lohmann, Simon Lange, and Thomas Ertl. 2014. Word cloud explorer: Text analytics based on word clouds. In 2014 47th Hawaii International Conference on System Sciences. IEEE, 1833--1842.
[10]
Kyle Koh, Bongshin Lee, Bohyoung Kim, and Jinwook Seo. 2010. Maniwordle: Providing flexible control over wordle. IEEE Transactions on Visualization and Computer Graphics 16, 6 (2010), 1190--1197.
[11]
Aradhna Krishna. 2012. An integrative review of sensory marketing: Engaging the senses to affect perception, judgment and behavior. Journal of consumer psychology 22, 3 (2012), 332--351.
[12]
Xiaotong Liu, Han-Wei Shen, and Yifan Hu. 2015. Supporting multifaceted viewing of word clouds with focus+ context display. Information Visualization 14, 2 (2015), 168--180.
[13]
Steffen Lohmann, Florian Heimerl, Fabian Bopp, Michael Burch, and Thomas Ertl. 2015. Concentri cloud: Word cloud visualization for multiple text documents. In 2015 19th International Conference on Information Visualisation. IEEE, 114--120.
[14]
Steffen Lohmann, Jürgen Ziegler, and Lena Tetzlaff. 2009. Comparison of tag cloud layouts: Task-related performance and visual exploration. In IFIP Conference on Human-Computer Interaction. Springer, 392--404.
[15]
Anna W Rivadeneira, Daniel M Gruen, Michael J Muller, and David R Millen. 2007. Getting our head in the clouds: toward evaluation studies of tagclouds. In Proceedings of the SIGCHI conference on Human factors in computing systems. ACM, 995--998.
[16]
Elvis Saravia, Carlos Argueta, and Yi-Shin Chen. 2015. Emoviz: Mining the world's interest through emotion analysis. In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015. ACM, 753--756.
[17]
Ji Wang. 2012. Clustered layout word cloud for user generated online reviews. Ph.D. Dissertation. Virginia Tech.
[18]
Ji Wang, Jian Zhao, Sheng Guo, Chris North, and Naren Ramakrishnan. 2014. ReCloud: semantics-based word cloud visualization of user reviews. In Proceedings of Graphics Interface 2014. Canadian Information Processing Society, 151--158.
[19]
Yunhai Wang, Xiaowei Chu, Chen Bao, Lifeng Zhu, Oliver Deussen, Baoquan Chen, and Michael Sedlmair. 2018. Edwordle: Consistency-preserving word cloud editing. IEEE transactions on visualization and computer graphics 24, 1 (2018), 647--656.
[20]
Ho Chung Wu, Robert Wing Pong Luk, Kam Fai Wong, and Kui Lam Kwok. 2008. Interpreting tf-idf term weights as making relevance decisions. ACM Transactions on Information Systems (TOIS) 26, 3 (2008), 13.
[21]
Yingcai Wu, Thomas Provan, Furu Wei, Shixia Liu, and Kwan-Liu Ma. 2011. Semantic-preserving word clouds by seam carving. In Computer Graphics Forum, Vol. 30. Wiley Online Library, 741--750.
[22]
Jin Xu, Yubo Tao, and Hai Lin. 2016. Semantic word cloud generation based on word embeddings. In 2016 IEEE Pacific Visualization Symposium (PacificVis). IEEE, 239--243.

Cited By

View all
  • (2023)The Landscape of Visual Information Communication and Interaction ResearchProceedings of the 16th International Symposium on Visual Information Communication and Interaction10.1145/3615522.3615523(1-8)Online publication date: 22-Sep-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
VINCI '19: Proceedings of the 12th International Symposium on Visual Information Communication and Interaction
September 2019
201 pages
ISBN:9781450376266
DOI:10.1145/3356422
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • East China Normal University

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 September 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. LDA method
  2. semantic understanding
  3. text visualization
  4. word cloud

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

VINCI'2019

Acceptance Rates

Overall Acceptance Rate 71 of 193 submissions, 37%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)1
Reflects downloads up to 12 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)The Landscape of Visual Information Communication and Interaction ResearchProceedings of the 16th International Symposium on Visual Information Communication and Interaction10.1145/3615522.3615523(1-8)Online publication date: 22-Sep-2023

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media