Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2663714.2668047acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

On Efficient Query Processing with the Earth Mover's Distance

Published: 03 November 2014 Publication History

Abstract

The Earth Mover's Distance which is proposed in computer vision as a distance-based similarity model has been widely used and investigated in various domains for similarity search. Although there exists the opportunity to apply this well-known similarity model reflecting the human perceptual similarity both on feature histograms and signatures as feature representation techniques, efficiency improvement approaches towards the Earth Mover's Distance were often investigated on feature histograms. Thus, it can be brought into question how k-nearest-neighbor queries can be processed efficiently by using this distance-based similarity model in a database of feature signatures, such as in a multimedia database. In this paper, the work in progress is presented regarding the new lower bound Independent Minimization for Signatures (IM-Sig) to the Earth Mover's Distance on feature signatures as an efficient filter approximation approach. Furthermore, the problems and challenging issues regarding efficient query processing on feature signatures are presented. The ongoing experimental evaluation on real data points out the high efficiency of the proposed lower bound, contributing to a promising start in the research field of efficient query processing with the Earth Mover's Distance.

References

[1]
R. Agrawal, C. Faloutsos, and A. N. Swami. Efficient similarity search in sequence databases. In FODO, pages 69--84, 1993.
[2]
R. K. Ahuja, T. L. Magnanti, and J. B. Orlin. Network Flows: Theory, Algorithms, and Applications. Pearson Education Limited, England, 2014.
[3]
I. Assent, A. Wenning, and T. Seidl. Approximation techniques for indexing the earth mover's distance in multimedia databases. In ICDE, page 11, 2006.
[4]
I. Assent, M. Wichterich, T. Meisen, and T. Seidl. Efficient similarity search using the earth mover's distance for large multimedia databases. In ICDE, pages 307--316, 2008.
[5]
I. Assent, M. Wichterich, and T. Seidl. Adaptable distance functions for similarity-based multimedia retrieval. Datenbank-Spektrum, 6(19):23--31, 2006.
[6]
M. S. Bazaraa, J. J. Jarvis, and H. D. Sherali. Linear Programming and Network Flows. John Wiley & Sons, USA, 2010.
[7]
C. Beecks. Distance-based similarity models for content-based multimedia retrieval. PhD thesis, RWTH Aachen University, 2013.
[8]
C. Beecks, M. S. Uysal, and T. Seidl. A comparative study of similarity measures for content-based multimedia retrieval. In ICME, pages 1552--1557, July 2010.
[9]
C. Beecks, M. S. Uysal, and T. Seidl. Signature quadratic form distance. CIVR '10, pages 438--445, New York, NY, USA, 2010. ACM.
[10]
C. Bohm, S. Berchtold, and D. A. Keim. Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases. ACM Computing Surveys, 33:322--373, 2001.
[11]
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. Imagenet: A large-scale hierarchical image database. In CVPR, pages 248--255, June 2009.
[12]
C. Faloutsos, M. Ranganathan, and Y. Manolopoulos. Fast subsequence matching in time-series databases. SIGMOD, 23(2):419--429, May 1994.
[13]
F. Hillier and G. Lieberman. Introduction to Linear Programming. McGraw-Hill, 1990.
[14]
M. E. Houle, X. Ma, M. Nett, and V. Oria. Dimensional testing for multi-step similarity search. In ICDM, pages 299--308, 2012.
[15]
F. Korn, N. Sidiropoulos, C. Faloutsos, E. L. Siegel, and Z. P. Fast nearest neighbor search in medical image databases. In VLDB, pages 215--226, 1996.
[16]
H.-P. Kriegel, P. Kroger, P. Kunath, and M. Renz. Generalizing the optimality of multi-step k-nearest neighbor query processing. In SSTD, pages 75--92, 2007.
[17]
Y. Rubner, C. Tomasi, and L. Guibas. A metric for distributions with applications to image databases. In ICCV98, pages 59--66, 1998.
[18]
Y. Rubner, C. Tomasi, and L. J. Guibas. The earth mover's distance as a metric for image retrieval. Int. Journal of Computer Vision, 40(2):99--121, 2000.
[19]
B. E. Ruttenberg and A. K. Singh. Indexing the earth mover's distance using normal distributions. PVLDB, 5(3):205--216, 2011.
[20]
T. Seidl and H.-P. Kriegel. Optimal multi-step k-nearest neighbor search. In SIGMOD, pages 154--165, 1998.
[21]
M. Shishibori, D. Koizumi, and K. Kita. Fast retrieval algorithm for earth mover's distance using emd lower bounds and a skipping algorithm. Adv. MultiMedia, 2011:1:1--1:9, Jan. 2011.
[22]
C. Smith. Facebook users are uploading 350 million new photos each day. (September 18, 2013). Retrieved August 22, 2014 from http://www.businessinsider.com/facebook-350-million-photos-each-day-2013-9.
[23]
H. Tamura, S. Mori, and T. Yamawaki. Textural features corresponding to visual perception. TSMC, 8(6):460--473, 1978.
[24]
Y. Tang, L. H. U, Y. Cai, N. Mamoulis, and R. Cheng. Earth mover's distance based similarity search at scale. PVLDB, 7(4):313--324, 2013.
[25]
M. S. Uysal, C. Beecks, J. Schmucking, and T. Seidl. Efficient filter approximation using the Earth Mover's Distance in very large multimedia databases with feature signatures. In CIKM, 2014, accepted.
[26]
R. J. Vanderbei. Linear Programming: Foundations and Extensions. 1996.
[27]
M. Wichterich, I. Assent, P. Kranen, and T. Seidl. Efficient emd-based similarity search in multimedia databases via exible dimensionality reduction. In SIGMOD, pages 199--212, 2008.
[28]
J. Xu, Z. Zhang, A. K. H. Tung, and G. Yu. Efficient and effective similarity search over probabilistic data based on earth mover's distance. PVLDB, 3(1):758--769, 2010.
[29]
YouTube. Statistics. Retrieved August 22, 2014 from https://www.youtube.com/yt/press/statistics.html.
[30]
P. Zezula, G. Amato, V. Dohnal, and M. Batko. Similarity Search - The Metric Space Approach, volume 32 of Advances in Database Systems. 2006.

Cited By

View all
  • (2015)Large-scale Efficient and Effective Video Similarity SearchProceedings of the 2015 Workshop on Large-Scale and Distributed System for Information Retrieval10.1145/2809948.2809950(3-8)Online publication date: 22-Oct-2015
  • (2015)Efficient similarity search in scientific databases with feature signaturesProceedings of the 27th International Conference on Scientific and Statistical Database Management10.1145/2791347.2791384(1-12)Online publication date: 29-Jun-2015
  • (2015)On efficient content-based near-duplicate video detection2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI)10.1109/CBMI.2015.7153633(1-6)Online publication date: Jun-2015
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
PIKM '14: Proceedings of the 7th Workshop on Ph.D Students
November 2014
70 pages
ISBN:9781450314817
DOI:10.1145/2663714
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. earth mover's distance
  2. filter distance
  3. lower bound
  4. multimedia databases

Qualifiers

  • Research-article

Funding Sources

Conference

CIKM '14
Sponsor:

Acceptance Rates

PIKM '14 Paper Acceptance Rate 4 of 10 submissions, 40%;
Overall Acceptance Rate 25 of 62 submissions, 40%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2015)Large-scale Efficient and Effective Video Similarity SearchProceedings of the 2015 Workshop on Large-Scale and Distributed System for Information Retrieval10.1145/2809948.2809950(3-8)Online publication date: 22-Oct-2015
  • (2015)Efficient similarity search in scientific databases with feature signaturesProceedings of the 27th International Conference on Scientific and Statistical Database Management10.1145/2791347.2791384(1-12)Online publication date: 29-Jun-2015
  • (2015)On efficient content-based near-duplicate video detection2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI)10.1109/CBMI.2015.7153633(1-6)Online publication date: Jun-2015
  • (2015)FELICITYProceedings of the 8th International Conference on Similarity Search and Applications - Volume 937110.1007/978-3-319-25087-8_34(347-350)Online publication date: 12-Oct-2015
  • (2014)PIKM 2014Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management10.1145/2661829.2663543(2098-2099)Online publication date: 3-Nov-2014

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media