VizNet: Towards A Large-Scale Visualization Learning and Benchmarking Repository

Hu, Kevin; Gaikwad, Neil; Bakker, Michiel; Hulsebos, Madelon; Zgraggen, Emanuel; Hidalgo, César; Kraska, Tim; Li, Guoliang; Satyanarayan, Arvind; Demiralp, Çağatay

Computer Science > Human-Computer Interaction

arXiv:1905.04616 (cs)

[Submitted on 12 May 2019]

Title:VizNet: Towards A Large-Scale Visualization Learning and Benchmarking Repository

Authors:Kevin Hu, Neil Gaikwad, Michiel Bakker, Madelon Hulsebos, Emanuel Zgraggen, César Hidalgo, Tim Kraska, Guoliang Li, Arvind Satyanarayan, Çağatay Demiralp

View PDF

Abstract:Researchers currently rely on ad hoc datasets to train automated visualization tools and evaluate the effectiveness of visualization designs. These exemplars often lack the characteristics of real-world datasets, and their one-off nature makes it difficult to compare different techniques. In this paper, we present VizNet: a large-scale corpus of over 31 million datasets compiled from open data repositories and online visualization galleries. On average, these datasets comprise 17 records over 3 dimensions and across the corpus, we find 51% of the dimensions record categorical data, 44% quantitative, and only 5% temporal. VizNet provides the necessary common baseline for comparing visualization design techniques, and developing benchmark models and algorithms for automating visual analysis. To demonstrate VizNet's utility as a platform for conducting online crowdsourced experiments at scale, we replicate a prior study assessing the influence of user task and data distribution on visual encoding effectiveness, and extend it by considering an additional task: outlier detection. To contend with running such studies at scale, we demonstrate how a metric of perceptual effectiveness can be learned from experimental results, and show its predictive power across test datasets.

Comments:	CHI'19
Subjects:	Human-Computer Interaction (cs.HC); Databases (cs.DB); Machine Learning (cs.LG)
Cite as:	arXiv:1905.04616 [cs.HC]
	(or arXiv:1905.04616v1 [cs.HC] for this version)
	https://doi.org/10.48550/arXiv.1905.04616

Submission history

From: Çağatay Demiralp [view email]
[v1] Sun, 12 May 2019 00:47:28 UTC (5,577 KB)

Computer Science > Human-Computer Interaction

Title:VizNet: Towards A Large-Scale Visualization Learning and Benchmarking Repository

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Human-Computer Interaction

Title:VizNet: Towards A Large-Scale Visualization Learning and Benchmarking Repository

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators