DOI: 10.1145/1516360.1516374

Answering aggregate keyword queries on relational databases using minimal group-bys

Published: 24 March 2009

    Abstract

    Keyword search has recently been extended to relational databases to retrieve information from text-rich attributes. However, existing methods all focus on finding individual tuples that match a set of query keywords, either in one table or in the join of multiple tables. In this paper, we motivate a novel problem, aggregate keyword search: finding minimal group-bys that cover a set of query keywords well, which is useful in many applications. We develop two approaches to tackle the problem and further extend our methods to allow partial matches. An extensive empirical evaluation on both real and synthetic data sets verifies the effectiveness of aggregate keyword search and the efficiency of our methods.
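To make the idea of a minimal group-by concrete, the following is a brute-force sketch in Python, not either of the paper's two approaches. It rests on assumptions that the abstract does not spell out: a group-by is a combination of dimension values with `*` as a wildcard, it covers the query if the text of its matching rows jointly contains every keyword, and it is minimal if no strictly more specific group-by also covers the query. The table, the attribute names (`month`, `city`, `event`) and all function names are hypothetical.

```python
from itertools import combinations

STAR = "*"  # wildcard: the dimension is aggregated away (assumed notation)

# Hypothetical toy table: two dimension attributes and one text attribute.
rows = [
    {"month": "Dec", "city": "Vancouver", "event": "ice hockey game"},
    {"month": "Dec", "city": "Vancouver", "event": "film festival"},
    {"month": "Dec", "city": "Seattle", "event": "jazz festival"},
    {"month": "Jan", "city": "Seattle", "event": "ice sculpture show"},
]


def group_bys(rows, dims):
    """Yield every group-by cell that matches at least one row."""
    seen = set()
    for row in rows:
        for k in range(len(dims) + 1):
            for fixed in combinations(range(len(dims)), k):
                cell = tuple(
                    row[d] if i in fixed else STAR for i, d in enumerate(dims)
                )
                if cell not in seen:
                    seen.add(cell)
                    yield cell


def matches(cell, row, dims):
    """True if the row falls into the group described by the cell."""
    return all(v == STAR or row[d] == v for v, d in zip(cell, dims))


def is_more_specific(h, g):
    """True if h fixes everything g fixes and differs from g."""
    return h != g and all(gv == STAR or gv == hv for gv, hv in zip(g, h))


def minimal_group_bys(rows, dims, text_attr, keywords):
    """Return the covering group-bys that have no covering specialization."""
    wanted = {k.lower() for k in keywords}
    covering = []
    for cell in group_bys(rows, dims):
        words = set()
        for row in rows:
            if matches(cell, row, dims):
                words |= set(row[text_attr].lower().split())
        if wanted <= words:  # all query keywords appear within the group
            covering.append(cell)
    return [
        g for g in covering
        if not any(is_more_specific(h, g) for h in covering)
    ]


answers = minimal_group_bys(rows, ["month", "city"], "event", ["ice", "festival"])
print(sorted(answers))
```

On this toy table no single row contains both `ice` and `festival`, so tuple-level keyword search would find nothing, while the groups `(Dec, Vancouver)` and `(*, Seattle)` each cover both keywords between their rows. The enumeration is exponential in the number of dimensions and is meant only to illustrate the problem statement.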

    Published In

    EDBT '09: Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
    March 2009
    1180 pages
    ISBN:9781605584225
    DOI:10.1145/1516360
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Qualifiers

    • Research-article

    Conference

    EDBT/ICDT '09: EDBT/ICDT '09 joint conference
    March 24 - 26, 2009
    Saint Petersburg, Russia

    Acceptance Rates

    Overall Acceptance Rate 7 of 10 submissions, 70%

    Article Metrics

    • Downloads (last 12 months): 32
    • Downloads (last 6 weeks): 6
    Reflects downloads up to 05 Aug 2024

    Cited By

    • (2015) Towards Integrated Study of Data Management and Data Mining. Procedia Computer Science 55, pages 1331-1339. DOI: 10.1016/j.procs.2015.07.117. Online publication date: 2015
    • (2014) Pocket-cubes, bringing multidimensional data views to mobile platforms. 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pages 3555-3560. DOI: 10.1109/SMC.2014.6974481. Online publication date: Oct-2014
    • (2014) Topical Presentation of Search Results on Database. Database Systems for Advanced Applications, pages 343-360. DOI: 10.1007/978-3-319-05813-9_23. Online publication date: 2014
    • (2013) A graph-theoretic approach to optimize keyword queries in relational databases. Knowledge and Information Systems 41:3, pages 843-870. DOI: 10.1007/s10115-013-0690-2. Online publication date: 16-Oct-2013
    • (2012) Efficient and Effective Aggregate Keyword Search on Relational Databases. International Journal of Data Warehousing and Mining 8:4, pages 41-81. DOI: 10.4018/jdwm.2012100103. Online publication date: 1-Oct-2012
    • (2012) An Automatic Machine Learning Method for the Study of Keyword Suggestion. Machine Learning Algorithms for Problem Solving in Computational Applications, pages 149-165. DOI: 10.4018/978-1-4666-1833-6.ch009. Online publication date: 2012
    • (2012) Computing Structural Statistics by Keywords in Databases. IEEE Transactions on Knowledge and Data Engineering 24:10, pages 1731-1746. DOI: 10.1109/TKDE.2012.78. Online publication date: 1-Oct-2012
    • (2012) Building a term suggestion and ranking system based on a probabilistic analysis model and a semantic analysis graph. Decision Support Systems 53:1, pages 257-266. DOI: 10.1016/j.dss.2012.02.001. Online publication date: 1-Apr-2012
    • (2011) TEXplorer. Proceedings of the 20th ACM international conference on Information and knowledge management, pages 1709-1718. DOI: 10.1145/2063576.2063822. Online publication date: 24-Oct-2011
    • (2011) Efficient Keyword-Based Search for Top-K Cells in Text Cube. IEEE Transactions on Knowledge and Data Engineering 23:12, pages 1795-1810. DOI: 10.1109/TKDE.2011.34. Online publication date: 1-Dec-2011
