Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2463676.2463683acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
demonstration

Stat!: an interactive analytics environment for big data

Published: 22 June 2013 Publication History
  • Get Citation Alerts
  • Abstract

    Exploratory analysis on big data requires us to rethink data management across the entire stack -- from the underlying data processing techniques to the user experience. We demonstrate Stat! -- a visualization and analytics environment that allows users to rapidly experiment with exploratory queries over big data. Data scientists can use Stat! to quickly refine to the correct query, while getting immediate feedback after processing a fraction of the data. Stat! can work with multiple processing engines in the backend; in this demo, we use Stat! with the Microsoft StreamInsight streaming engine. StreamInsight is used to generate incremental early results to queries and refine these results as more data is processed. Stat! allows data scientists to explore data, dynamically compose multiple queries to generate streams of partial results, and display partial results in both textual and visual form.

    References

    [1]
    B. Babcock et al. Models and Issues in Data Stream Systems. In PODS, 2002.
    [2]
    J. Hellerstein et al. Interactive Data Analysis: The Control Project. IEEE Computer, August, 1999.
    [3]
    D. Fisher, I. Popov, S. Drucker, and m c schraefel. "Trust Me, I'm Partially Right: Incremental Visualization Lets Analysts Explore Large Datasets Faster". In CHI, 2012.
    [4]
    E. Meijer. 2011. "The world according to LINQ". Commun. ACM 54, 10 (October 2011), 45--51.
    [5]
    S. Melnik et al., "Dremel: Interactive Analysis of Web-Scale Datasets". In VLDB, 2010.
    [6]
    C. Jermaine et al. Scalable approximate query processing with the DBO engine. In SIGMOD, 2007.
    [7]
    T. Condie et al. MapReduce Online. In NSDI, 2010.
    [8]
    A. Hall et al. Processing a trillion cells per mouse click. In VLDB, 2012.
    [9]
    B. Chandramouli et al. Temporal analytics on big data for Web advertising. In ICDE, 2012.
    [10]
    Microsoft StreamInsight. http://aka.ms/stream.

    Cited By

    View all
    • (2022)Semantics and Anomaly Preserving Sampling Strategy for Large-Scale Time Series DataACM/IMS Transactions on Data Science10.1145/35119182:4(1-25)Online publication date: 30-Mar-2022
    • (2020)Iris: Amortized, Resource Efficient Visualizations of Voluminous Spatiotemporal Datasets2020 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT)10.1109/BDCAT50828.2020.00003(47-56)Online publication date: Dec-2020
    • (2019)HillviewProceedings of the VLDB Endowment10.14778/3342263.334227912:11(1442-1457)Online publication date: 1-Jul-2019
    • Show More Cited By

    Index Terms

    1. Stat!: an interactive analytics environment for big data

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
      June 2013
      1322 pages
      ISBN:9781450320375
      DOI:10.1145/2463676
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 22 June 2013

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. analytics
      2. big data
      3. interactive
      4. tool
      5. visualization

      Qualifiers

      • Demonstration

      Conference

      SIGMOD/PODS'13
      Sponsor:

      Acceptance Rates

      SIGMOD '13 Paper Acceptance Rate 76 of 372 submissions, 20%;
      Overall Acceptance Rate 785 of 4,003 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)10
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 27 Jul 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)Semantics and Anomaly Preserving Sampling Strategy for Large-Scale Time Series DataACM/IMS Transactions on Data Science10.1145/35119182:4(1-25)Online publication date: 30-Mar-2022
      • (2020)Iris: Amortized, Resource Efficient Visualizations of Voluminous Spatiotemporal Datasets2020 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT)10.1109/BDCAT50828.2020.00003(47-56)Online publication date: Dec-2020
      • (2019)HillviewProceedings of the VLDB Endowment10.14778/3342263.334227912:11(1442-1457)Online publication date: 1-Jul-2019
      • (2018)A Session-Based Approach to Fast-But-Approximate Interactive Data Cube ExplorationACM Transactions on Knowledge Discovery from Data10.1145/307064812:1(1-26)Online publication date: 13-Feb-2018
      • (2018)VPL-Based Big Data Analysis System: UDASIEEE Access10.1109/ACCESS.2018.28578456(40883-40897)Online publication date: 2018
      • (2017)Effective Big Data VisualizationProceedings of the 21st International Database Engineering & Applications Symposium10.1145/3105831.3105857(298-303)Online publication date: 12-Jul-2017
      • (2017)Scalable Complex Event Processing on a NotebookProceedings of the 11th ACM International Conference on Distributed and Event-based Systems10.1145/3093742.3095093(327-330)Online publication date: 8-Jun-2017
      • (2017)Gaussian CubesIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2016.259869423:1(681-690)Online publication date: 1-Jan-2017
      • (2016)Interacting with large distributed datasets using sketchProceedings of the 16th Eurographics Symposium on Parallel Graphics and Visualization10.5555/3061436.3061442(31-43)Online publication date: 6-Jun-2016
      • (2016)The Six Pillars for Building Big Data Analytics EcosystemsACM Computing Surveys10.1145/296314349:2(1-36)Online publication date: 2-Aug-2016
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media