Employing a Parametric Model for Analytic Provenance

Published: 01 April 2014

Abstract

We introduce a propagation-based approach to supporting analytic provenance, built on a parametric symbolic model. The approach combines a script language, which captures and encodes the analytic process, with a parametrically controlled symbolic model, which represents the logic of the analysis process and makes it reusable. The approach first appeared in a visual analytics system called CZSaw. Capturing the analyst’s interactions in a script at a meaningful system-action level allows CZSaw to create the symbolic model in the form of a Directed Acyclic Graph (DAG), through which changes can be propagated. Graph nodes correspond to variables in CZSaw scripts; these variables hold the results (data and data visualizations) generated from user interactions. The user interacts with variables representing entities or relations to create the next step’s results. Graph edges represent dependency relationships among nodes. Any change to a variable triggers the propagation mechanism, which updates the downstream dependent variables and, in turn, the data views that display them. The analyst can reuse parts of the analysis process by assigning new values to a node in the graph. We evaluated this symbolic model approach by solving three IEEE VAST Challenge contest problems (from IEEE VAST 2008, 2009, and 2010). In each of these challenges, the analyst first created a symbolic model to explore, understand, analyze, and solve a particular subproblem and then reused the model, via its dependency graph propagation mechanism, to solve similar subproblems. With the script and the model, CZSaw supports analytic provenance by capturing, encoding, and reusing the analysis process. The analyst can recall the chronological states of the analysis process with the CZSaw script and can interpret the underlying rationale of the analysis with the symbolic model.
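
The propagation mechanism described in the abstract can be illustrated with a minimal sketch. It is written in Python rather than in CZSaw's own script language, and it is not CZSaw's implementation: the class `DependencyGraph`, its methods, and the phone-call data are hypothetical names chosen for illustration. Nodes are named variables, edges record which variables a result was derived from, and assigning a new value to a node recomputes only its downstream dependents. The update order is assumed here to be a topological order of the affected subgraph.

```python
from collections import deque
from typing import Callable, Dict, List, Sequence


class DependencyGraph:
    """Toy dependency graph: nodes are variables, edges are dependencies."""

    def __init__(self) -> None:
        self.values: Dict[str, object] = {}
        self.formulas: Dict[str, Callable[..., object]] = {}
        self.inputs: Dict[str, List[str]] = {}
        self.dependents: Dict[str, List[str]] = {}

    def set_value(self, name: str, value: object) -> List[str]:
        """Assign a source variable and return the nodes updated as a result."""
        if name in self.formulas:
            raise ValueError(f"{name!r} is derived; change one of its inputs")
        self.values[name] = value
        self.dependents.setdefault(name, [])
        return self._propagate(name)

    def define(self, name: str, inputs: Sequence[str],
               formula: Callable[..., object]) -> None:
        """Add a derived variable computed from variables that already exist.

        Inputs must already exist, so the graph stays acyclic by construction.
        """
        if name in self.values:
            raise ValueError(f"{name!r} is already defined")
        for parent in inputs:
            if parent not in self.values:
                raise KeyError(f"unknown input {parent!r}")
        for parent in inputs:
            self.dependents[parent].append(name)
        self.inputs[name] = list(inputs)
        self.formulas[name] = formula
        self.dependents[name] = []
        self.values[name] = formula(*[self.values[p] for p in inputs])

    def _propagate(self, changed: str) -> List[str]:
        # Step 1: collect every node downstream of the changed variable.
        affected = set()
        stack = [changed]
        while stack:
            node = stack.pop()
            for child in self.dependents[node]:
                if child not in affected:
                    affected.add(child)
                    stack.append(child)
        # Step 2: recompute them in topological order (Kahn's algorithm),
        # so each node is updated only after all of its affected inputs.
        indegree = {n: sum(1 for p in self.inputs[n] if p in affected)
                    for n in affected}
        queue = deque(sorted(n for n, d in indegree.items() if d == 0))
        order: List[str] = []
        while queue:
            node = queue.popleft()
            args = [self.values[p] for p in self.inputs[node]]
            self.values[node] = self.formulas[node](*args)
            order.append(node)
            for child in self.dependents[node]:
                indegree[child] -= 1
                if indegree[child] == 0:
                    queue.append(child)
        return order


# Build a small analysis once (made-up call records, not VAST Challenge data).
g = DependencyGraph()
g.set_value("person", "Alice")
g.set_value("calls", [("Alice", "Bob"), ("Alice", "Carol"), ("Dave", "Erin")])
g.define("contacts", ["person", "calls"],
         lambda p, c: sorted({b for a, b in c if a == p} |
                             {a for a, b in c if b == p}))
g.define("contact_count", ["contacts"], len)
first_count = g.values["contact_count"]

# Reuse the same analysis for a different person by reassigning one node.
order = g.set_value("person", "Dave")
```

Reassigning `person` leaves `calls` untouched and recomputes only `contacts` and `contact_count`, which mirrors how the abstract describes reusing a model to solve similar subproblems.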

    Published In

    ACM Transactions on Interactive Intelligent Systems, Volume 4, Issue 1
    Special Issue on Interactive Computational Visual Analytics
    April 2014
    181 pages
    ISSN: 2160-6455
    EISSN: 2160-6463
    DOI: 10.1145/2602757

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 April 2014
    Accepted: 01 September 2013
    Revised: 01 July 2013
    Received: 01 May 2012
    Published in TIIS Volume 4, Issue 1

    Author Tags

    1. dependency graph
    2. analytical reasoning
    3. history
    4. user interaction
    5. visual scripting

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Cited By

    • (2022) Reusing Interactive Analysis Workflows. Computer Graphics Forum 41:3 (133-144). DOI: 10.1111/cgf.14528. Online publication date: 12-Aug-2022
    • (2022) Visionary: a framework for analysis and visualization of provenance data. Knowledge and Information Systems 64:2 (381-413). DOI: 10.1007/s10115-021-01645-6. Online publication date: 1-Feb-2022
    • (2020) Survey on the Analysis of User Interactions and Visualization Provenance. Computer Graphics Forum 39:3 (757-783). DOI: 10.1111/cgf.14035. Online publication date: 18-Jul-2020
    • (2017) Interactive design galleries: A general approach to interacting with design alternatives. Design Studies 52 (40-72). DOI: 10.1016/j.destud.2017.05.001. Online publication date: Sep-2017
    • (2016) Using Machine Learning to Infer Reasoning Provenance From User Interaction Log Data. Journal of Cognitive Engineering and Decision Making 11:1 (23-41). DOI: 10.1177/1555343416672782. Online publication date: 25-Oct-2016
