Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Summarizing answer graphs induced by keyword queries

Published: 01 September 2013 Publication History
  • Get Citation Alerts
  • Abstract

    Keyword search has been popularly used to query graph data. Due to the lack of structure support, a keyword query might generate an excessive number of matches, referred to as "answer graphs", that could include different relationships among keywords. An ignored yet important task is to group and summarize answer graphs that share similar structures and contents for better query interpretation and result understanding. This paper studies the summarization problem for the answer graphs induced by a keyword query Q. (1) A notion of summary graph is proposed to characterize the summarization of answer graphs. Given Q and a set of answer graphs G, a summary graph preserves the relation of the keywords in Q by summarizing the paths connecting the keywords nodes in G. (2) A quality metric of summary graphs, called coverage ratio, is developed to measure information loss of summarization. (3) Based on the metric, a set of summarization problems are formulated, which aim to find minimized summary graphs with certain coverage ratio. (a) We show that the complexity of these summarization problems ranges from ptime to NP-complete. (b) We provide exact and heuristic summarization algorithms. (4) Using real-life and synthetic graphs, we experimentally verify the effectiveness and the efficiency of our techniques.

    References

    [1]
    C. C. Aggarwal and H. Wang. A survey of clustering algorithms for graph data. In Managing and Mining Graph Data, pages 275-301. 2010.
    [2]
    D. Bustan and O. Grumberg. Simulation-based minimization. TOCL, 4(2):181-206, 2003.
    [3]
    S. Chakrabarti, S. Sarawagi, and S. Sudarshan. Enhancing search with structure. IEEE Data Eng. Bull., 33(1):3-24, 2010.
    [4]
    M. Charikar, S. Guha, É. Tardos, and D. Shmoys. A constant-factor approximation algorithm for the k-median problem. In STOC, pages 1-10, 1999.
    [5]
    Y. Chen, W. Wang, Z. Liu, and X. Lin. Keyword search on structured and semi-structured data. In SIGMOD, 2009.
    [6]
    J. Fan, G. Li, and L. Zhou. Interactive sql query suggestion: Making databases user-friendly. In ICDE, pages 351-362, 2011.
    [7]
    W. Fan, J. Li, J. Luo, Z. Tan, X. Wang, and Y. Wu. Incremental graph pattern matching. In SIGMOD, 2011.
    [8]
    L. Fang, A. D. Sarma, C. Yu, and P. Bohannon. Rex: Explaining relationships between entity pairs. PVLDB, 5(3):241-252, 2011.
    [9]
    H. Fu and K. Anyanwu. Effectively interpreting keyword queries on rdf databases with a rear view. In ISWC, 2011.
    [10]
    M. Garey and D. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness. 1979.
    [11]
    R. Gentilini, C. Piazza, and A. Policriti. From bisimulation to simulation: Coarsest partition problems. J. Automated Reasoning, 2003.
    [12]
    J. Goldstein, V. Mittal, J. Carbonell, and M. Kantrowitz. Multi-document summarization by sentence extraction. In NAACL-ANLP Workshop on Automatic summarization, pages 40-48, 2000.
    [13]
    L. Guo, F. Shao, C. Botev, and J. Shanmugasundaram. Xrank: ranked keyword search over xml documents. In SIGMOD, 2003.
    [14]
    H. He, H. Wang, J. Yang, and P. S. Yu. Blinks: ranked keyword searches on graphs. In SIGMOD, pages 305-316, 2007.
    [15]
    M. R. Henzinger, T. A. Henzinger, and P. W. Kopke. Computing simulations on finite and infinite graphs. In FOCS, 1995.
    [16]
    Y. Huang, Z. Liu, and Y. Chen. Query biased snippet generation in xml search. In SIGMOD, pages 315-326, 2008.
    [17]
    V. Kacholia, S. Pandit, S. Chakrabarti, S. Sudarshan, R. Desai, and H. Karambelkar. Bidirectional expansion for keyword search on graph databases. In VLDB, 2005.
    [18]
    R. Kaushik, P. Shenoy, P. Bohannon, and E. Gudes. Exploiting local similarity for indexing paths in graph-structured data. In ICDE, pages 129-140, 2002.
    [19]
    G. Koutrika, Z. M. Zadeh, and H. Garcia-Molina. Data clouds: summarizing keyword search results over structured data. In EDBT, pages 391-402, 2009.
    [20]
    G. Li, B. Ooi, J. Feng, J. Wang, and L. Zhou. Ease: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. In SIGMOD, 2008.
    [21]
    Z. Liu and Y. Chen. Query results ready, now what? IEEE Data Eng. Bull., 33(1):46-53, 2010.
    [22]
    Z. Liu and Y. Chen. Return specification inference and result clustering for keyword search on xml. TODS, 35(2):10, 2010.
    [23]
    Z. Liu, S. Natarajan, and Y. Chen. Query expansion based on clustered results. PVLDB, 4(6):350-361, 2011.
    [24]
    K. V. Mardia, J. T. Kent, and J. M. Bibby. Multivariate analysis. 1980.
    [25]
    T. Milo and D. Suciu. Index structures for path expressions. In ICDT, 1999.
    [26]
    S. Navlakha, R. Rastogi, and N. Shrivastava. Graph summarization with bounded error. In SIGMOD, 2008.
    [27]
    K. Parthasarathy, S. Kumar, and D. Damien. Algorithm for answer graph construction for keyword queries on rdf data. In WWW, 2011.
    [28]
    D. Petkova, W. B. Croft, and Y. Diao. Refining keyword queries for xml retrieval by combining content and structure. In ECIR, pages 662-669, 2009.
    [29]
    J. Plesník. Complexity of decomposing graphs into factors with given diameters or radii. Mathematica Slovaca, 32(4):379-388, 1982.
    [30]
    N. Sarkas, N. Bansal, G. Das, and N. Koudas. Measure-driven keyword-query expansion. PVLDB, 2(1):121-132, 2009.
    [31]
    S. Shekarpour, S. Auer, A.-C. N. Ngomo, D. Gerber, S. Hellmann, and C. Stadler. Keyword-driven sparql query generation leveraging background knowledge. In Web Intelligence, pages 203-210, 2011.
    [32]
    M. Sydow, M. Pikula, R. Schenkel, and A. Siemion. Entity summarisation with limited edge budget on knowledge graphs. In IMCSIT, pages 513-516, 2010.
    [33]
    S. Tata and G. M. Lohman. Sqak: doing more with keywords. In SIGMOD, 2008.
    [34]
    Y. Tian, R. Hankins, and J. Patel. Efficient aggregation for graph summarization. In SIGMOD, 2008.
    [35]
    T. Tran, H. Wang, S. Rudolph, and P. Cimiano. Top-k exploration of query candidates for efficient keyword search on graph-shaped (rdf) data. In ICDE, 2009.
    [36]
    V. V. Vazirani. Approximation Algorithms. Springer, 2003.
    [37]
    H. Wang and C. Aggarwal. A survey of algorithms for keyword search on graph data. Managing and Mining Graph Data, pages 249-273, 2010.
    [38]
    N. Zhang, Y. Tian, and J. M. Patel. Discovery-driven graph summarization. In ICDE, 2010.

    Cited By

    View all
    • (2022)Multi-relation Graph SummarizationACM Transactions on Knowledge Discovery from Data10.1145/349456116:5(1-30)Online publication date: 9-Mar-2022
    • (2021)Distributed aggregation-based attributed graph summarization for summary-based approximate attributed graph queriesExpert Systems with Applications: An International Journal10.1016/j.eswa.2021.114921176:COnline publication date: 15-Aug-2021
    • (2021)GS4: Graph stream summarization based on both the structure and semanticsThe Journal of Supercomputing10.1007/s11227-020-03290-277:3(2713-2733)Online publication date: 1-Mar-2021
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 6, Issue 14
    September 2013
    384 pages

    Publisher

    VLDB Endowment

    Publication History

    Published: 01 September 2013
    Published in PVLDB Volume 6, Issue 14

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 11 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Multi-relation Graph SummarizationACM Transactions on Knowledge Discovery from Data10.1145/349456116:5(1-30)Online publication date: 9-Mar-2022
    • (2021)Distributed aggregation-based attributed graph summarization for summary-based approximate attributed graph queriesExpert Systems with Applications: An International Journal10.1016/j.eswa.2021.114921176:COnline publication date: 15-Aug-2021
    • (2021)GS4: Graph stream summarization based on both the structure and semanticsThe Journal of Supercomputing10.1007/s11227-020-03290-277:3(2713-2733)Online publication date: 1-Mar-2021
    • (2020)What is Normal, What is Strange, and What is Missing in a Knowledge Graph: Unified Characterization via Inductive SummarizationProceedings of The Web Conference 202010.1145/3366423.3380189(1115-1126)Online publication date: 20-Apr-2020
    • (2020)KvGR: A Graph-Based Interface for Explorative Sequential Question Answering on Heterogeneous Information SourcesAdvances in Information Retrieval10.1007/978-3-030-45439-5_50(760-773)Online publication date: 14-Apr-2020
    • (2018)KlustreeProceedings of the ACM India Joint International Conference on Data Science and Management of Data10.1145/3152494.3152509(265-272)Online publication date: 11-Jan-2018
    • (2018)Coverage-Oriented Diversification of Keyword Search Results on GraphsDatabase Systems for Advanced Applications10.1007/978-3-319-91458-9_10(166-183)Online publication date: 21-May-2018
    • (2017)Summarizing static and dynamic big graphsProceedings of the VLDB Endowment10.14778/3137765.313782510:12(1981-1984)Online publication date: 1-Aug-2017
    • (2017)Graph ExplorationProceedings of the 2017 ACM International Conference on Management of Data10.1145/3035918.3054778(1737-1740)Online publication date: 9-May-2017
    • (2017)Graph Querying Meets HCIProceedings of the 2017 ACM International Conference on Management of Data10.1145/3035918.3054774(1731-1736)Online publication date: 9-May-2017
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media