Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2588555.2610493acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Towards indexing functions: answering scalar product queries

Published: 18 June 2014 Publication History

Abstract

We consider a broad category of analytic queries, denoted by scalar product queries, which can be expressed as a scalar product between a known function over multiple database attributes and an unknown set of parameters. More specifically, given a set of d-dimensional data points, we retrieve all points x which satisfy an inequality given by a scalar product: <a, f(x)> <= b. We assume that the function f() is application specific and known apriori, while the query parameters a and the inequality parameter b are known only at the time of querying.
Efficiently answering such scalar product queries are essential in a wide range of applications including evaluation of complex SQL functions, time series prediction, scientific simulation, and active learning. Although some specific subclasses of the aforementioned scalar product queries and their applications have been studied in computational geometry, machine learning, and in moving object queries, surprisingly no generalized indexing scheme has been proposed for efficiently computing scalar product queries.
We present a lightweight, yet scalable, dynamic, and generalized indexing scheme, called the planar index, for answering scalar product queries in an accurate manner, which is based on the idea of indexing function f(x) for each data point x using multiple sets of parallel hyperplanes. Planar index has loglinear indexing time and linear space complexity, and the query time ranges from logarithmic to being linear in the number of data points. Based on an extensive set of experiments on several real-world and synthetic datasets, we show that planar index is not only scalable and dynamic, but also effective in various real-world applications including intersection finding between moving objects and active learning.

References

[1]
P. K. Agarwal, L. Arge, J. Erickson, P. G. Franciosa, and J. S. Vitter. Efficient Searching with Linear Constraints. In PODS, 1998.
[2]
S. Arya, D. M. Mount, and J. Xia. Tight Lower Bounds for Halfspace Range Searching. Discrete Comput. Geom., 47(4):711--730, 2012.
[3]
R. Basri, T. Hassner, and L. Zelnik-Manor. Approximate Nearest Subspace Search. TPAMI, 33(2):266--278, 2011.
[4]
S. Börzsönyi, D. Kossmann, and K. Stocker. The Skyline Operator. In ICDE, 2001.
[5]
G. Box and G. Jenkins. Time Series Analysis: Forecasting and Control. San Francisco: Holden-Day, 1970.
[6]
Y.-C. Chang, L. Bergman, V. Castelli, C.-S. Li, M.-L. Lo, and J. R. Smith. The Onion Technique: Indexing for Linear Optimization Queries. In SIGMOD, 2000.
[7]
S. Chen, C. S. Jensen, and D. Lin. A Benchmark for Evaluating Moving Object Indexes. PVLDB, 1(2):1574--1585, 2008.
[8]
S. Chen, B. C. Ooi, K.-L. Tan, and M. A. Nascimento. ST2 B-Tree: a Self-Tunable Spatio-Temporal B+-Tree Index for Moving Objects. In SIGMOD, 2008.
[9]
J. Dittrich, L. Blunschi, and M. A. V. Salles. MOVIES: Indexing Moving Objects by Shooting Index Images. GeoInformatica, 15(4):727--767, 2011.
[10]
R. Fagin, A. Lotem, and M. Naor. Optimal Aggregation Algorithms for Middleware. In PODS, 2001.
[11]
J. Goldstein, R. Ramakrishnan, U. Shaft, and J.-B. Yu. Processing Queries by Linear Constraints. In PODS, 1997.
[12]
V. Hristidis, N. Koudas, Y. Papakonstantinou, Y. Papakonstantinou, and L. J. Ca. PREFER: A System for the Efficient Execution of Multiparametric Ranked Queries. In SIGMOD, 2001.
[13]
I. F. Ilyas, G. Beskales, and M. A. Soliman. A Survey of Top-k Query Processing Techniques in Relational Database Systems. ACM Comput. Surv., 40(4):11:1--11:58, 2008.
[14]
P. Jain, S. Vijayanarasimhan, and K. Grauman. Hashing Hyperplane Queries to Near Points with Applications to Large-Scale Active Learning. In NIPS, 2010.
[15]
C. S. Jensen, D. Lin, and B. C. Ooi. Query and Update Efficient B+ Tree based Indexing of Moving Objects. In VLDB, 2004.
[16]
G. Kollios, D. Gunopulos, and V. J. Tsotras. On Indexing Mobile Objects. In PODS, 1999.
[17]
C. Li, K. C.-C. Chang, I. F. Ilyas, and S. Song. RankSQL: Query Algebra and Optimization for Relational Top-K Queries. In SIGMOD, 2005.
[18]
W. Liu, J. Wang, Y. Mu, S. Kumar, and S.-F. Chang. Compact Hyperplane Hashing with Bilinear Functions. In ICML, 2012.
[19]
J. Matousek. Reporting Points in Halfspaces. Computational Geometry, 2(3):169--186, 1992.
[20]
M. A. Nascimento and J. R. O. Silva. Towards Historical R-Trees. In SAC, 1998.
[21]
Oracle. Oracle Function-based Indexes. In 11g Release.
[22]
P. Ram and A. G. Gray. Maximum Inner-product Search Using Cone Trees. In KDD, 2012.
[23]
S. Saltenis, C. S. Jensen, S. T. Leutenegger, and M. A. Lopez. Indexing the Positions of Continuously Moving Objects. In SIGMOD, 2000.
[24]
H. Samet. Applications of Spatial Data Structures - Computer Graphics, Image Processing, and GIS. Addison-Wesley, 1990.
[25]
H. Samet, J. Sankaranarayanan, and M. Auerbach. Indexing Methods for Moving Object Databases: Games and Other Applications. In SIGMOD, 2013.
[26]
B. Settles. Active Learning Literature Survey. CS Tech. Report, Univ. of Wisconsin-Madison, 2009.
[27]
A. P. Sistla, O. Wolfson, S. Chamberlain, and S. Dao. Modeling and Querying Moving Objects. In ICDE, 1997.
[28]
Y. Tao and D. Papadias. Time-Parameterized Queries in Spatio-Temporal Databases. In SIGMOD, 2002.
[29]
J. Uhlmann. Satisfying General Proximity/Similarity Queries with Metric Trees. Inf. Process. Lett., 40(4):175--179, 1991.
[30]
D. Xin, J. Han, and K. C. Chang. Progressive and Selective Merge: Computing Top-k with Ad-Hoc Ranking Functions. In SIGMOD, 2007.
[31]
P. Y. Yianilos. Data Structures and Algorithms for Nearest Neighbor Search in General Metric Spaces. In SODA, 1993.
[32]
M. Yiu, Y. Tao, and N. Mamoulis. The B dual-Tree: Indexing Moving Objects by Space Filling Curves in the Dual Space. VLDB J., 17(3):379--400, 2008.
[33]
R. Zhang, J. Qi, D. Lin, W. Wang, and R. C.-W. Wong. A Highly Optimized Algorithm for Continuous Intersection Join Queries over Moving Objects. VLDB J., 21(4):561--586, 2012.

Cited By

View all
  • (2022)Verifying the Correctness of Analytic Query ResultsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.303731334:9(4527-4537)Online publication date: 1-Sep-2022
  • (2022)Towards Secure and Efficient Equality Conjunction Search Over Outsourced DatabasesIEEE Transactions on Cloud Computing10.1109/TCC.2020.297517510:2(1445-1461)Online publication date: 1-Apr-2022
  • (2018)Querying a Collection of Continuous FunctionsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2018.280293630:9(1783-1795)Online publication date: 1-Sep-2018
  • Show More Cited By

Index Terms

  1. Towards indexing functions: answering scalar product queries

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '14: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data
    June 2014
    1645 pages
    ISBN:9781450323765
    DOI:10.1145/2588555
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 18 June 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. function indexing
    2. moving object indexing
    3. planar index
    4. scalar product query

    Qualifiers

    • Research-article

    Conference

    SIGMOD/PODS'14
    Sponsor:

    Acceptance Rates

    SIGMOD '14 Paper Acceptance Rate 107 of 421 submissions, 25%;
    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 08 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Verifying the Correctness of Analytic Query ResultsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.303731334:9(4527-4537)Online publication date: 1-Sep-2022
    • (2022)Towards Secure and Efficient Equality Conjunction Search Over Outsourced DatabasesIEEE Transactions on Cloud Computing10.1109/TCC.2020.297517510:2(1445-1461)Online publication date: 1-Apr-2022
    • (2018)Querying a Collection of Continuous FunctionsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2018.280293630:9(1783-1795)Online publication date: 1-Sep-2018
    • (2018)Image Retrieval and Relevance FeedbackEncyclopedia of Database Systems10.1007/978-1-4614-8265-9_1016(1790-1795)Online publication date: 7-Dec-2018
    • (2016)Authentication of function queries2016 IEEE 32nd International Conference on Data Engineering (ICDE)10.1109/ICDE.2016.7498252(337-348)Online publication date: May-2016
    • (2015)Scalable action localization with kernel-space hashing2015 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP.2015.7350799(257-261)Online publication date: Sep-2015

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media