DOI:10.1145/1142351.1142377
Article

Finding and approximating top-k answers in keyword proximity search

Published: 26 June 2006

Abstract

Various approaches for keyword proximity search have been implemented in relational databases, XML and the Web. Yet, in all of them, an answer is a Q-fragment, namely, a subtree T of the given data graph G, such that T contains all the keywords of the query Q and has no proper subtree with this property. The rank of an answer is inversely proportional to its weight. Three problems are of interest: finding an optimal (i.e., top-ranked) answer, computing the top-k answers and enumerating all the answers in ranked order. It is shown that, under data complexity, an efficient algorithm for solving the first problem is sufficient for solving the other two problems with polynomial delay. Similarly, an efficient algorithm for finding a θ-approximation of the optimal answer suffices for carrying out the following two tasks with polynomial delay, under query-and-data complexity. First, enumerating in a (θ+1)-approximate order. Second, computing a (θ+1)-approximation of the top-k answers. As a corollary, this paper gives the first efficient algorithms, under data complexity, for enumerating all the answers in ranked order and for computing the top-k answers. It also gives the first efficient algorithms, under query-and-data complexity, for enumerating in a provably approximate order and for computing an approximation of the top-k answers.
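
The reduction described in the abstract, from ranked enumeration to repeatedly finding an optimal answer, can be illustrated with a Lawler-style partitioning scheme. The sketch below is not the paper's algorithm and does not operate on Q-fragments. It assumes a toy problem in which an "answer" is an m-item subset of a weighted set and lighter subsets rank higher. The names `best_subset` and `ranked_subsets` are hypothetical. The only component that knows about the problem is the optimizer, which must find the best answer under inclusion and exclusion constraints; everything else is generic. Each output triggers at most m optimizer calls, so if the optimizer runs in polynomial time, the number of optimizer calls between consecutive answers is polynomially bounded.

```python
import heapq
from itertools import count, islice


def best_subset(weights, m, include, exclude):
    """Toy optimizer: lightest m-item subset that contains every item in
    `include` and no item in `exclude`.

    Returns (total_weight, sorted tuple of items), or None if none exists.
    """
    if len(include) > m:
        return None
    free = sorted((w, x) for x, w in weights.items()
                  if x not in include and x not in exclude)
    need = m - len(include)
    if need > len(free):
        return None
    chosen = set(include) | {x for _, x in free[:need]}
    return sum(weights[x] for x in chosen), tuple(sorted(chosen))


def ranked_subsets(weights, m):
    """Yield (weight, subset) for all m-item subsets, lightest first."""
    tie = count()  # unique tie-breaker so the heap never compares sets
    heap = []

    def push(include, exclude):
        # Queue a constrained subspace, keyed by the weight of its best answer.
        best = best_subset(weights, m, include, exclude)
        if best is not None:
            heapq.heappush(heap, (best[0], next(tie), best[1], include, exclude))

    push(frozenset(), frozenset())
    while heap:
        weight, _, subset, include, exclude = heapq.heappop(heap)
        yield weight, subset
        # Split the rest of this subspace into disjoint parts: the i-th part
        # keeps the first i-1 unconstrained items of `subset` and forbids the i-th.
        fixed = set(include)
        for item in subset:
            if item in include:
                continue
            push(frozenset(fixed), exclude | {item})
            fixed.add(item)


# Top-k is a prefix of the ranked enumeration.
weights = {"a": 1, "b": 2, "c": 4, "d": 8}
top3 = list(islice(ranked_subsets(weights, 2), 3))
print(top3)  # [(3, ('a', 'b')), (5, ('a', 'c')), (6, ('b', 'c'))]
```

Replacing `best_subset` with an optimizer for minimum-weight Q-fragments under constraints would be the hard part in the keyword-search setting; the toy optimizer here only stands in for that step.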

Published In

PODS '06: Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
June 2006
382 pages
ISBN:1595933182
DOI:10.1145/1142351

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. approximations of top-k answers
  2. keyword proximity search
  3. top-k answers

Qualifiers

  • Article

Conference

SIGMOD/PODS06

Acceptance Rates

PODS '06 Paper Acceptance Rate 35 of 185 submissions, 19%;
Overall Acceptance Rate 642 of 2,707 submissions, 24%
