Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2463676.2463683acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
demonstration

Stat!: an interactive analytics environment for big data

Published: 22 June 2013 Publication History

Abstract

Exploratory analysis on big data requires us to rethink data management across the entire stack -- from the underlying data processing techniques to the user experience. We demonstrate Stat! -- a visualization and analytics environment that allows users to rapidly experiment with exploratory queries over big data. Data scientists can use Stat! to quickly refine to the correct query, while getting immediate feedback after processing a fraction of the data. Stat! can work with multiple processing engines in the backend; in this demo, we use Stat! with the Microsoft StreamInsight streaming engine. StreamInsight is used to generate incremental early results to queries and refine these results as more data is processed. Stat! allows data scientists to explore data, dynamically compose multiple queries to generate streams of partial results, and display partial results in both textual and visual form.

References

[1]
B. Babcock et al. Models and Issues in Data Stream Systems. In PODS, 2002.
[2]
J. Hellerstein et al. Interactive Data Analysis: The Control Project. IEEE Computer, August, 1999.
[3]
D. Fisher, I. Popov, S. Drucker, and m c schraefel. "Trust Me, I'm Partially Right: Incremental Visualization Lets Analysts Explore Large Datasets Faster". In CHI, 2012.
[4]
E. Meijer. 2011. "The world according to LINQ". Commun. ACM 54, 10 (October 2011), 45--51.
[5]
S. Melnik et al., "Dremel: Interactive Analysis of Web-Scale Datasets". In VLDB, 2010.
[6]
C. Jermaine et al. Scalable approximate query processing with the DBO engine. In SIGMOD, 2007.
[7]
T. Condie et al. MapReduce Online. In NSDI, 2010.
[8]
A. Hall et al. Processing a trillion cells per mouse click. In VLDB, 2012.
[9]
B. Chandramouli et al. Temporal analytics on big data for Web advertising. In ICDE, 2012.
[10]
Microsoft StreamInsight. http://aka.ms/stream.

Cited By

View all
  • (2022)Semantics and Anomaly Preserving Sampling Strategy for Large-Scale Time Series DataACM/IMS Transactions on Data Science10.1145/35119182:4(1-25)Online publication date: 30-Mar-2022
  • (2020)Iris: Amortized, Resource Efficient Visualizations of Voluminous Spatiotemporal Datasets2020 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT)10.1109/BDCAT50828.2020.00003(47-56)Online publication date: Dec-2020
  • (2019)HillviewProceedings of the VLDB Endowment10.14778/3342263.334227912:11(1442-1457)Online publication date: 1-Jul-2019
  • Show More Cited By

Index Terms

  1. Stat!: an interactive analytics environment for big data

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
    June 2013
    1322 pages
    ISBN:9781450320375
    DOI:10.1145/2463676
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 June 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. analytics
    2. big data
    3. interactive
    4. tool
    5. visualization

    Qualifiers

    • Demonstration

    Conference

    SIGMOD/PODS'13
    Sponsor:

    Acceptance Rates

    SIGMOD '13 Paper Acceptance Rate 76 of 372 submissions, 20%;
    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 09 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Semantics and Anomaly Preserving Sampling Strategy for Large-Scale Time Series DataACM/IMS Transactions on Data Science10.1145/35119182:4(1-25)Online publication date: 30-Mar-2022
    • (2020)Iris: Amortized, Resource Efficient Visualizations of Voluminous Spatiotemporal Datasets2020 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT)10.1109/BDCAT50828.2020.00003(47-56)Online publication date: Dec-2020
    • (2019)HillviewProceedings of the VLDB Endowment10.14778/3342263.334227912:11(1442-1457)Online publication date: 1-Jul-2019
    • (2018)A Session-Based Approach to Fast-But-Approximate Interactive Data Cube ExplorationACM Transactions on Knowledge Discovery from Data10.1145/307064812:1(1-26)Online publication date: 13-Feb-2018
    • (2018)VPL-Based Big Data Analysis System: UDASIEEE Access10.1109/ACCESS.2018.28578456(40883-40897)Online publication date: 2018
    • (2017)Effective Big Data VisualizationProceedings of the 21st International Database Engineering & Applications Symposium10.1145/3105831.3105857(298-303)Online publication date: 12-Jul-2017
    • (2017)Scalable Complex Event Processing on a NotebookProceedings of the 11th ACM International Conference on Distributed and Event-based Systems10.1145/3093742.3095093(327-330)Online publication date: 8-Jun-2017
    • (2017)Gaussian CubesIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2016.259869423:1(681-690)Online publication date: 1-Jan-2017
    • (2016)Interacting with large distributed datasets using sketchProceedings of the 16th Eurographics Symposium on Parallel Graphics and Visualization10.5555/3061436.3061442(31-43)Online publication date: 6-Jun-2016
    • (2016)The Six Pillars for Building Big Data Analytics EcosystemsACM Computing Surveys10.1145/296314349:2(1-36)Online publication date: 2-Aug-2016
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media