Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2452376.2452433acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

Efficient top-k query answering using cached views

Published: 18 March 2013 Publication History
  • Get Citation Alerts
  • Abstract

    Top-k query processing has recently received a significant amount of attention due to its wide application in information retrieval, multimedia search and recommendation generation. In this work, we consider the problem of how to efficiently answer a top-k query by using previously cached query results. While there has been some previous work on this problem, existing algorithms suffer from either limited scope or lack of scalability. In this paper, we propose two novel algorithms for handling this problem. The first algorithm LPTA+ provides significantly improved efficiency compared to the state-of-the-art LPTA algorithm [26] by reducing the number of expensive linear programming problems that need to be solved. The second algorithm we propose leverages a standard space partition-based index structure in order to avoid many of the drawbacks of LPTA-based algorithms, thereby further improving the efficiency of query processing. Through extensive experiments on various datasets, we demonstrate that our algorithms significantly outperform the state of the art.

    References

    [1]
    Auto123 consumer car ratings. http://www.auto123.com/en/car-reviews/consumer-ratings/.
    [2]
    Flickr. http://www.flickr.com.
    [3]
    IMDB. http://imdb.com.
    [4]
    Memcached. http://memcached.org.
    [5]
    Metacritics. http://www.metacritic.com.
    [6]
    Metascore. http://www.metacritic.com/about-metascores.
    [7]
    Nba basketball statistics. http://www.databasebasketball.com.
    [8]
    Rottentomatoes. http://www.rottentomatoes.com.
    [9]
    Twitter. http://twitter.com.
    [10]
    U.s. news best cars. http://usnews.rankingsandreviews.com/cars-trucks/rankings/cars/.
    [11]
    U.s. news best collage rankings. http://www.usnews.com/rankings.
    [12]
    Wikipedia. http://www.wikipedia.org.
    [13]
    World university rankings. http://www.timeshighereducation.co.uk/world-university-rankings/.
    [14]
    Youtube. http://www.youtube.com.
    [15]
    S. Abiteboul and O. M. Duschka. Complexity of answering queries using materialized views. In PODS, pages 254--263, 1998.
    [16]
    R. Akbarinia, E. Pacitti, and P. Valduriez. Best position algorithms for top-k queries. In VLDB, pages 495--506, 2007.
    [17]
    G. J. Badros, A. Borning, and P. J. Stuckey. The Cassowary linear arithmetic constraint solving algorithm. ACM Trans. Comput.-Hum. Interact., 8(4):267--306, 2001.
    [18]
    E. Baikousi and P. Vassiliadis. View usability and safety for the answering of top-k queries via materialized views. In DOLAP, pages 97--104, 2009.
    [19]
    C. Böhm, S. Berchtold, and D. A. Keim. Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases. ACM Comput. Surv., 33(3):322--373, 2001.
    [20]
    S. Börzsönyi, D. Kossmann, and K. Stocker. The skyline operator. In ICDE, pages 421--430, 2001.
    [21]
    A. K. Chandra and P. M. Merlin. Optimal implementation of conjunctive queries in relational data bases. In STOC, pages 77--90, 1977.
    [22]
    Y.-C. Chang, L. D. Bergman, V. Castelli, C.-S. Li, M.-L. Lo, and J. R. Smith. The onion technique: Indexing for linear optimization queries. In SIGMOD, pages 391--402, 2000.
    [23]
    H.-T. Chou and D. J. DeWitt. An evaluation of buffer management strategies for relational database systems. In VLDB, pages 127--141, 1985.
    [24]
    G. Dantzig. Linear Programming and Extensions. Princeton University, 1998.
    [25]
    G. Das, D. Gunopulos, N. Koudas, and N. Sarkas. Ad-hoc top-k query answering for data streams. In VLDB, pages 183--194, 2007.
    [26]
    G. Das, D. Gunopulos, N. Koudas, and D. Tsirogiannis. Answering top-k queries using views. In VLDB, pages 451--462, 2006.
    [27]
    R. Fagin, A. Lotem, and M. Naor. Optimal aggregation algorithms for middleware. J. Comput. Syst. Sci., 66(4):614--656, 2003.
    [28]
    A. Y. Halevy. Answering queries using views: A survey. VLDB J., 10(4):270--294, 2001.
    [29]
    J.-S. Heo, J. Cho, and K.-Y. Whang. The hybrid-layer index: A synergic approach to answering top-k queries in arbitrary subspaces. In ICDE, pages 445--448, 2010.
    [30]
    V. Hristidis, N. Koudas, and Y. Papakonstantinou. PREFER: A system for the efficient execution of multi-parametric ranked queries. In SIGMOD, pages 259--270, 2001.
    [31]
    I. F. Ilyas, W. G. Aref, and A. K. Elmagarmid. Supporting top-k join queries in relational databases. In VLDB, pages 754--765, 2003.
    [32]
    I. F. Ilyas, G. Beskales, and M. A. Soliman. A survey of top-k query processing techniques in relational database systems. ACM Comput. Surv., 40(4), 2008.
    [33]
    Y. E. Ioannidis. The history of histograms. In VLDB, pages 19--30, 2003.
    [34]
    T. Johnson and D. Shasha. 2Q: A low overhead high performance buffer management replacement algorithm. In VLDB, pages 439--450, 1994.
    [35]
    H. Kellerer, U. Pferschy, and D. Pisinger. Knapsack Problems. Springer, 2004.
    [36]
    C. T. Kwok and D. S. Weld. Planning to gather information. In AAAI/IAAI, Vol. 1, pages 32--39, 1996.
    [37]
    J. Lee, H. Cho, and S. won Hwang. Efficient dual-resolution layer indexing for top-k queries. In ICDE, pages 1084--1095, 2012.
    [38]
    A. Y. Levy, A. O. Mendelzon, Y. Sagiv, and D. Srivastava. Answering queries using views. In PODS, pages 95--104, 1995.
    [39]
    C. Li, K. C.-C. Chang, I. F. Ilyas, and S. Song. RankSQL: Query algebra and optimization for relational top-k queries. In SIGMOD, pages 131--142, 2005.
    [40]
    S. J. Russell and P. Norvig. Artificial Intelligence: A Modern Approach. Pearson Education, third edition, 2010.
    [41]
    N. H. Ryeng, A. Vlachou, C. Doulkeridis, and K. Nørvåg. Efficient distributed top-k query processing with caching. In DASFAA, pages 280--295, 2011.
    [42]
    D. Theodoratos and T. K. Sellis. Data warehouse configuration. In VLDB, pages 126--135, 1997.
    [43]
    P. Tsaparas, T. Palpanas, Y. Kotidis, N. Koudas, and D. Srivastava. Ranked join indices. In ICDE, pages 277--288, 2003.
    [44]
    D. Xin, C. Chen, and J. Han. Towards robust indexing for ranked queries. In VLDB, pages 235--246, 2006.
    [45]
    A. Yu, P. K. Agarwal, and J. Yang. Processing a large number of continuous preference top-k queries. In SIGMOD, pages 397--408, 2012.

    Cited By

    View all

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    EDBT '13: Proceedings of the 16th International Conference on Extending Database Technology
    March 2013
    793 pages
    ISBN:9781450315975
    DOI:10.1145/2452376
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 18 March 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. top-k query answering using views
    2. top-k query processing

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    EDBT/ICDT '13

    Acceptance Rates

    Overall Acceptance Rate 7 of 10 submissions, 70%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Query Recombination: To Process a Large Number of Concurrent Top-k Queries towards IoT Data on an Edge Server2022 IEEE 42nd International Conference on Distributed Computing Systems (ICDCS)10.1109/ICDCS54860.2022.00060(559-569)Online publication date: Jul-2022
    • (2022)PSATop-kKnowledge-Based Systems10.1016/j.knosys.2021.107614235:COnline publication date: 10-Jan-2022
    • (2020)Efficient Algorithm for Answering Fuzzy Medical Requests in Pervasive Healthcare Information SystemsData Analytics in Medicine10.4018/978-1-7998-1204-3.ch014(268-287)Online publication date: 2020
    • (2020)Enhanced query processing over semantic cache for cloud based relational databasesJournal of Ambient Intelligence and Humanized Computing10.1007/s12652-020-01943-x14:5(5853-5871)Online publication date: 13-Apr-2020
    • (2019)Efficient main-memory top-K selection for multicore architecturesProceedings of the VLDB Endowment10.14778/3364324.336432713:2(114-127)Online publication date: 1-Oct-2019
    • (2019)Secure Query Processing over Encrypted Data Using a Distributed Index Structure for Outsourcing Sensitive DataEconomics of Grids, Clouds, Systems, and Services10.1007/978-3-030-13342-9_16(187-198)Online publication date: 8-Feb-2019
    • (2017)Efficient Algorithm for Answering Fuzzy Medical Requests in Pervasive Healthcare Information SystemsInternational Journal of Healthcare Information Systems and Informatics10.4018/IJHISI.201704010312:2(46-64)Online publication date: 1-Apr-2017
    • (2017)Bitmap-based distributed index structure and encrypted query processing schemes for outsourcing mobile sensitive data2017 IEEE International Conference on Big Data and Smart Computing (BigComp)10.1109/BIGCOMP.2017.7881680(288-295)Online publication date: Feb-2017
    • (2017)Exploratory product search using top-k join queriesInformation Systems10.1016/j.is.2016.09.00464:C(75-92)Online publication date: 1-Mar-2017
    • (2017)Subscription-based data aggregation techniques for top-k monitoring queriesWorld Wide Web10.1007/s11280-016-0385-120:2(237-265)Online publication date: 1-Mar-2017
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media