Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3211954.3211955acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Contextual Intelligence for Unified Data Governance

Published: 10 June 2018 Publication History
  • Get Citation Alerts
  • Abstract

    Current data governance techniques are very labor-intensive, as teams of data stewards typically rely on best practices to transform business policies into governance rules. As data plays an increasingly key role in today's data-driven enterprises, current approaches do not scale to the complexity and variety present in the data ecosystem of an enterprise as an increasing number of data requirements, use cases, applications, tools and systems come into play. We believe techniques from artificial intelligence and machine learning have potential to improve discoverability, quality and compliance in data governance. In this paper, we propose a framework for 'contextual intelligence', where we argue for (1) collecting and integrating contextual metadata from variety of sources to establish a trusted unified repository of contextual data use across users and applications, and (2) applying machine learning and artificial intelligence techniques over this rich contextual metadata to improve discoverability, quality and compliance in governance practices. We propose an architecture that unifies governance across several systems, with a graph serving as a core repository of contextual metadata, accurately representing data usage across the enterprise and facilitating machine learning, We demonstrate how our approach can enable ML-based recommendations in support of governance best practices.

    References

    [1]
    2018. IBM Information Server 11.7. (2018).
    [2]
    2018. IBM InfoSphere Information Analyzer. (2018). https://www.ibm.com/us-en/marketplace/infosphere-information-analyzer
    [3]
    Gediminas Adomavicius, Ramesh Sankaranarayanan, Shahana Sen, and Alexander Tuzhilin. 2005. Incorporating Contextual Information in Recommender Systems Using a Multidimensional Approach. ACM Trans. Inf. Syst. 23, 1 (Jan. 2005), 103--145.
    [4]
    Javad Akbarnejad, Gloria Chatzopoulou, Magdalini Eirinaki, Suju Koshy, Sarika Mittal, Duc On, Neoklis Polyzotis, and Jothi S. Vindhiya Varman. 2010. SQL QueRIE Recommendations. Proc. VLDB Endow. 3, 1--2 (Sept. 2010), 1597--1600.
    [5]
    Ricardo Baeza-Yates, Carlos Hurtado, and Marcelo Mendoza. 2005. Query Recommendation Using Query Logs in Search Engines. In Current Trends in Database Technology - EDBT 2004 Workshops, Wolfgang Lindner, Marco Mesiti, Can Türker, Yannis Tzitzikas, and Athena I. Vakali (Eds.). Springer, 588--596.
    [6]
    Gloria Chatzopoulou, Magdalini Eirinaki, and Neoklis Polyzotis. 2009. Query Recommendations for Interactive Database Exploration. In Scientific and Statistical Database Management, Marianne Winslett (Ed.). Springer, 3--18.
    [7]
    Scheepers F. Nguyen N. van Kessel R. Chessell, M. and R. van der Starre. 1994. Governing and Managing Big Data for Analytics and Decision Makers. IBM Redguides for Business Leaders (1994). http://www.redbooks.ibm.com/redpapers/pdfs/redp5120.pdf
    [8]
    Christina Christodoulakis, Eser Kandogan, Ignacio G. Terrizzano, and Renée J. Miller. 2017. VIQS: Visual Interactive Exploration of Query Semantics. In Proceedings of the 2017 ACM Workshop on Exploratory Search and Interactive Data Analytics (ESIDA '17). ACM, New York, NY, USA, 25--32.
    [9]
    R. J. DeStefano, L. Tao, and K. Gai. 2016. Improving Data Governance in Large Organizations through Ontology and Linked Data. In 2016 IEEE 3rd International Conference on Cyber Security and Cloud Computing (CSCloud). 279--284.
    [10]
    P. Dourish. 2004. What We Talk About when We Talk About Context. Personal Ubiquitous Comput. 8, 1 (Feb. 2004), 19--30.
    [11]
    Corentin Follenfant, Olivier Corby, Fabien Gandon, and David Trastour. 2012. RDF Modelling and SPARQL Processing of SQL Abstract Syntax Trees. In PSW -1st Workshop on Programming the Semantic Web. Boston, United States.
    [12]
    Bill Howe, Garret Cole, Emad Souroush, Paraschos Koutris, Alicia Key, Nodira Khoussainova, and Leilani Battle. 2011. Database-as-a-Service for Long Tail Science. In SSDBM '11: Proceedings of the 23rd Scientific and Statistical Database Management Conference.
    [13]
    Shrainik Jain and Bill Howe. {n. d.}. Data Cleaning in the Wild: Reusable Curation Idioms from a Multi-Year SQL Workload. ({n. d.}).
    [14]
    Shrainik Jain, Dominik Moritz, Daniel Halperin, Bill Howe, and Ed Lazowska. 2016. SQLShare: Results from a Multi-Year SQL-as-a-Service Experiment. In Proceedings of the 2016 International Conference on Management of Data (SIGMOD '16). ACM, New York, NY, USA, 281--293.
    [15]
    Shrainik Jain, Dominik Moritz, and Bill Howe. 2016. High Variety Cloud Databases. In Proceedings of the 2016 IEEE Cloud Data Management Workshop.
    [16]
    E. Kandogan, A. Balakrishnan, E. M. Haber, and J. S. Pierce. 2014. From Data to Insight: Work Practices of Analysts in the Enterprise. IEEE Computer Graphics and Applications 34, 5 (Sept 2014), 42--50.
    [17]
    E. Kandogan, M. Roth, C. Kieliszewski, F. ÃŰzcan, B. Schloss, and M. T. Schmidt. 2013. Data for All: A Systems Approach to Accelerate the Path from Data to Insight. In 2013 IEEE International Congress on Big Data. 427--428.
    [18]
    E. Kandogan, M. Roth, P. Schwarz, J. Hui, I. Terrizzano, C. Christodoulakis, and R. J. Miller. 2015. LabBook: Metadata-driven social collaborative data analysis. In 2015 IEEE International Conference on Big Data (Big Data). 431--440.
    [19]
    Vijay Khatri and Carol V. Brown. 2010. Designing Data Governance. Commun. ACM 53, 1 (Jan. 2010), 148--152.
    [20]
    Nodira Khoussainova, YongChul Kwon, Magdalena Balazinska, and Dan Suciu. 2010. SnipSuggest: Context-aware Autocompletion for SQL. Proc. VLDB Endow. 4, 1 (Oct. 2010), 22--33.
    [21]
    Hao Ma, Tom Chao Zhou, Michael R. Lyu, and Irwin King. 2011. Improving Recommender Systems by Incorporating Social Contextual Information. ACM Trans. Inf. Syst. 29, 2, Article 9 (April 2011), 23 pages.
    [22]
    Patrick Marcel and Elsa Negre. 2011. A survey of query recommendation techniques for data warehouse exploration. In Actes des 7èmes journées francophones sur les Entrepôts de Données et l'Analyse en ligne, Clermont-Ferrand, France, EDA 2011, Juin 2011. 119--134.
    [23]
    Nicolas Pasquier, Yves Bastide, Rafik Taouil, and Lotfi Lakhal. 1999. Discovering Frequent Closed Itemsets for Association Rules. In Proceedings of the 7th International Conference on Database Theory (ICDT '99). Springer-Verlag, London, UK, UK, 398--416.
    [24]
    Torsten Priebe and Günther Pernul. 2003. Towards Integrative Enterprise Knowledge Portals. In Proceedings of the Twelfth International Conference on Information and Knowledge Management (CIKM '03). ACM, New York, NY, USA, 216--223.
    [25]
    P. P. Tallon. 2013. Corporate Governance of Big Data: Perspectives on Value, Risk, and Cost. Computer 46, 6 (June 2013), 32--38.
    [26]
    Kristin Weber, Boris Otto, and Hubert Österle. 2009. One Size Does Not Fit All---A Contingency Approach to Data Governance. J. Data and Information Quality 1, 1, Article 4 (June 2009), 27 pages.

    Cited By

    View all
    • (2024)CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI SystemsProceedings of the Conference on Governance, Understanding and Integration of Data for Effective and Responsible AI10.1145/3665601.3669846(16-25)Online publication date: 9-Jun-2024
    • (2024)Ethical Framework for Harnessing the Power of AI in Healthcare and BeyondIEEE Access10.1109/ACCESS.2024.336991212(31014-31035)Online publication date: 2024
    • (2023)Control and Data Integrity are Important Factors of Data Governance Technology2023 10th International Conference on ICT for Smart Society (ICISS)10.1109/ICISS59129.2023.10291563(1-6)Online publication date: 6-Sep-2023

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    aiDM'18: Proceedings of the First International Workshop on Exploiting Artificial Intelligence Techniques for Data Management
    June 2018
    34 pages
    ISBN:9781450358514
    DOI:10.1145/3211954
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 June 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Analytics
    2. Context
    3. Data Governance
    4. Graph

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    SIGMOD/PODS '18
    Sponsor:

    Acceptance Rates

    aiDM'18 Paper Acceptance Rate 5 of 8 submissions, 63%;
    Overall Acceptance Rate 19 of 26 submissions, 73%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)43
    • Downloads (Last 6 weeks)2

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI SystemsProceedings of the Conference on Governance, Understanding and Integration of Data for Effective and Responsible AI10.1145/3665601.3669846(16-25)Online publication date: 9-Jun-2024
    • (2024)Ethical Framework for Harnessing the Power of AI in Healthcare and BeyondIEEE Access10.1109/ACCESS.2024.336991212(31014-31035)Online publication date: 2024
    • (2023)Control and Data Integrity are Important Factors of Data Governance Technology2023 10th International Conference on ICT for Smart Society (ICISS)10.1109/ICISS59129.2023.10291563(1-6)Online publication date: 6-Sep-2023

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media