Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3210259.3210270acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
demonstration

A graph-based framework for analyzing SQL query logs

Published: 10 June 2018 Publication History

Abstract

Analytical SQL queries are a valuable source of information. Query log analysis can provide insight into the usage of datasets and uncover knowledge that cannot be inferred from source schemas or content alone. To unlock this potential, flexible mechanisms for meta-querying are required. Syntactic and semantic aspects of queries must be considered along with contextual information.
We present an extensible framework for analyzing SQL query logs. Query logs are mapped to a multi-relational graph model and queried using domain-specific traversal expressions. To enable concise and expressive meta-querying, semantic analyses are conducted on normalized relational algebra trees with accompanying schema lineage graphs. Syntactic analyses can be conducted on corresponding query texts and abstract syntax trees. Additional metadata allows to inspect the temporal and social context of each query.
In this demonstration, we show how query log analysis with our framework can support data source discovery and facilitate collaborative data science. The audience can explore an exemplary query log to locate queries relevant to a data analysis scenario, conduct graph analyses on the log and assemble a customized logmonitoring dashboard.

References

[1]
Danaparamita and Gatterbauer. 2011. QueryViz: helping users understand SQL queries and their patterns. In EDBT'11.
[2]
Van den Bussche et al. 2005. Towards Practical Meta-querying. Inf. Syst. 30, 4 (2005).
[3]
Divesh and Velegrakis. 2007. Intensional Associations Between Data and Metadata. In SIGMOD'07.
[4]
Deng et al. 2017. The Data Civilizer System. In CIDR.
[5]
Hu et al. 2008. QueryScope: visualizing queries for repeatable database tuning. PVLDB 1, 2 (2008).
[6]
Halevy et al. 2016. Goods: Organizing Google's Datasets. In SIGMOD '16. 12.
[7]
Khoussainova et al. 2009. A Case for A Collaborative Query Management System. In CIDR'09.
[8]
Koutrika et al. 2010. Explaining structured queries in natural language. In ICDE'10.
[9]
Khoussainova et al. 2010. SnipSuggest: Context-Aware Autocompletion for SQL. PVLDB 4, 1 (2010).
[10]
Khoussainova et al. 2011. Session-Based Browsing for More Effective Query Reuse. In SSDBM'11.
[11]
Kul et al. 2016. Summarizing Large Query Logs in Ettu. CoRR abs/1608.01013 (2016).
[12]
Papastefanatos et al. 2005. Hecataeus: A framework for representing SQL constructs as graphs. In CEUR Workshop Proceedings, Vol. 363.
[13]
Haas. 2017. Leveraging Data and People to Accelerate Data Science. In ICDE'17.
[14]
Hausenblas and Nadeau. 2013. Apache drill: interactive ad-hoc analysis at scale. Big Data 1, 2 (2013), 100--104.
[15]
Rodriguez. 2015. The Gremlin graph traversal machine and language. In DBPL'15.

Cited By

View all
  • (2020)A Framework for DSL-Based Query Classification Using Relational and Graph-Based Data ModelsProceedings of the 3rd Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)10.1145/3398682.3399167(1-5)Online publication date: 14-Jun-2020
  • (2020)Pharos: Query-Driven Schema Inference for the Semantic WebMachine Learning and Knowledge Discovery in Databases10.1007/978-3-030-43887-6_10(112-124)Online publication date: 28-Mar-2020
  • (2018)Crossing an OCEAN of queriesProceedings of the 30th International Conference on Scientific and Statistical Database Management10.1145/3221269.3223025(1-4)Online publication date: 9-Jul-2018

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
GRADES-NDA '18: Proceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)
June 2018
94 pages
ISBN:9781450356954
DOI:10.1145/3210259
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 June 2018

Check for updates

Qualifiers

  • Demonstration

Conference

SIGMOD/PODS '18
Sponsor:

Acceptance Rates

GRADES-NDA '18 Paper Acceptance Rate 10 of 26 submissions, 38%;
Overall Acceptance Rate 29 of 61 submissions, 48%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)1
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2020)A Framework for DSL-Based Query Classification Using Relational and Graph-Based Data ModelsProceedings of the 3rd Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)10.1145/3398682.3399167(1-5)Online publication date: 14-Jun-2020
  • (2020)Pharos: Query-Driven Schema Inference for the Semantic WebMachine Learning and Knowledge Discovery in Databases10.1007/978-3-030-43887-6_10(112-124)Online publication date: 28-Mar-2020
  • (2018)Crossing an OCEAN of queriesProceedings of the 30th International Conference on Scientific and Statistical Database Management10.1145/3221269.3223025(1-4)Online publication date: 9-Jul-2018

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media