Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article
Free access

Using extended feature objects for partial similarity retrieval

Published: 11 November 1997 Publication History

Abstract

In this paper, we introduce the concept of extended feature objects for similarity retrieval. Conventional approaches for similarity search in databases map each object in the database to a point in some high-dimensional feature space and define similarity as some distance measure in this space. For many similarity search problems, this feature-based approach is not sufficient. When retrieving partially similar polygons, for example, the search cannot be restricted to edge sequences, since similar polygon sections may start and end anywhere on the edges of the polygons. In general, inherently continuous problems such as the partial similarity search cannot be solved by using point objects in feature space. In our solution, we therefore introduce extended feature objects consisting of an infinite set of feature points. For an efficient storage and retrieval of the extended feature objects, we determine the minimal bounding boxes of the feature objects in multidimensional space and store these boxes using a spatial access structure. In our concrete polygon problem, sets of polygon sections are mapped to 2D feature objects in high-dimensional space which are then approximated by minimal bounding boxes and stored in an R $^*$-tree. The selectivity of the index is improved by using an adaptive decomposition of very large feature objects and a dynamic joining of small feature objects. For the polygon problem, translation, rotation, and scaling invariance is achieved by using the Fourier-transformed curvature of the normalized polygon sections. In contrast to vertex-based algorithms, our algorithm guarantees that no false dismissals may occur and additionally provides fast search times for realistic database sizes. We evaluate our method using real polygon data of a supplier for the car manufacturing industry.

References

[1]
{AB 92} Alt H, Blömer J (1992): Resemblance and Symmetries of Geometric Patterns. Data Structures and Efficient Algorithms, in: LNCS, Vol. 594, Springer, pp. 1-24.
[2]
{AFS 93} Agrawal R, Faloutsos C, Swami A (1993): Efficient Similarity Search in Sequence Databases. Proc. 4th Int. Conf. on Foundations of Data Organization and Algorithms, LNCS, Vol. 730, Springer, pp. 69-84.
[3]
{ALSS 95} Agrawal R, Lin K, Sawhney H, Shim K (1995): Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases. Proc. 21st Conf. on Very Large Databases, Zürich, Switzerland, pp. 490-501.
[4]
{Ber 97} Berchtold S (1997): Geometry based search of similar parts. (in german), Ph.D. thesis, University of Munich.
[5]
{BKS 93} Brinkhoff T, Kriegel H.-P, Schneider R (1993): Comparison of Approximations of Complex Objects Used for Approximation-based Query Processing in Spatial Database Systems. Proc. 9th Int. Conf. on Data Engineering, Vienna, Austria, pp. 40-49.
[6]
{BKSS 90} Beckmann N, Kriegel H.-P, Schneider R, Seeger B (1990): The R*-tree: An Efficient and Robust Access Method for Points and Rectangles. Proc. ACM SIGMOD Int. Conf. on Management of Data, Atlantic City, NJ, pp. 322-331.
[7]
{FBFH 94} Faloutsos C, Barber R, Flickner M, Hafner J, et al (1994): Efficient and Effective Querying by Image Content. J Intel Inf Syst 3:231-262.
[8]
{Fre 87} Freeston M (1987): The BANG file: A new kind of grid file. Proc. ACM SIGMOD Int. Conf. on Management of Data, San Francisco, CA, pp. 260-269.
[9]
{FRM 94} Faloutsos C, Ranganathan M, Manolopoulos Y (1994): Fast Subsequence Matching in Time-Series Databases. Proc. ACM SIGMOD Int. Conf. on Management of Data, Minneapolis, MN, pp. 419-429.
[10]
{Gar 82} Gargantini I (1982): An Effective Way to Represent Quadtrees. Commun ACM 25 (12) 905-910.
[11]
{Gut 84} Guttman A (1984): R-trees: A Dynamic Index Structure for Spatial Searching. Proc. ACM SIGMOD Int. Conf. on Management of Data, Boston, MA, pp. 47-57.
[12]
{Gue 89} Günther O (1989): The Design of the Cell Tree: An Object-Oriented Index Structure for Geometric Databases. Proc. 5th Int. Conf. on Data Engineering, Los Angeles, CA, pp. 598-605.
[13]
{HB 86} Horn P, Berthold K (1986): Robot vision. MIT Press, Cambridge, MA.
[14]
{Jag 90a} Jagadish HV (1990): Linear Clustering of Objects with Multiple Attributes. Proc. ACM SIGMOD Int. Conf. on Management of Data, Atlantic City, NJ, pp. 332-342.
[15]
{Jag 90b} Jagadish HV (1990): Spatial Search with Polyhedra. Proc. 6th Int. Conf. on Data Engineering, Los Angeles, CA, pp. 311-319.
[16]
{Jag 91} Jagadish HV (1991): A Retrieval Technique for Similar Shapes. Proc. ACM SIGMOD Int. Conf. on Management of Data, Denver, CO, pp. 208-217.
[17]
{KM 95} Kehrer L, Meinecke, C (1995): Perceptual Organization of Visual Patterns: The Segmentation of Textures. In Prinz W, Bridgeman B, (Eds) Handbook of Perception and Action: Vol. 1: Perception, Chapter 2, Academic Press, London.
[18]
{KSP 95} Kauppinen H, Seppänen T, Pietikäinen M (1995): An Experimental Comparison of Autoregressive and Fourier-Based Descriptors in 2D Shape Classification. IEEE Trans. Pattern Anal Mach Intell 17 (2).
[19]
{MG 93} Mehrotra R, Gary J (1993): Feature-Based Retrieval of Similar Shapes. Proc. 9th Int. Conf. on Data Engeneering, Vienna, Austria.
[20]
{MG 95} Mehrotra R, Gary J (1995): Feature-Index-Based Similar Shape Retrieval. Proc. 3rd Working Conf. on Visual Database Systems, pp. 46-65.
[21]
{Mum 87} Mumford D (1987): The Problem of Robust Shape Descriptors. Proc. IEEE 1st International Conf. on Computer Vision.
[22]
{NHS 84} Nievergelt J, Hinterberger H, Sevcik KC (1984): The Grid File: An Adaptable, Symmetric Multikey File Structure. ACM Trans Database Syst 9(1):38-71.
[23]
{Ore 90} Orenstein J, (1990): A Comparison of Spatial Query Processing Techniques for Native and Parameter Spaces. Proc. ACM SIGMOD Int. Conf. on Management of Data, Atlantic City, NJ, pp. 343-352.
[24]
{PF 94} Petrakis E, Faloutsos C (1994): Similarity Searching in Large Image DataBases. Technical Report CS-TR-3388, University of Maryland.
[25]
{RH 92} Rigoutsos I, Hummel R (1992): Massively Parallel Model Matching: Geometric Hashing on the Connection Machine. IEEE Computer 25(2):33-42.
[26]
{SK 90} Seeger B, Kriegel H-P (1990): The Buddy Tree: An Efficient and Robust Access Method for Spatial Data Base Systems. Proc. 16th Int. Conf. on Very Large Data Bases, Brisbane, Australia, pp. 590-601.
[27]
{SM 90} Stein F, Medioni G (1990): Efficient Two Dimensional Object Recognition. 10th. Int. Conf. on Pattern Recognition, Atlantic City, NJ, pp. 13-17.
[28]
{SRF 87} Sellis T, Roussopoulos N, Faloutsos C (1987): The R+-Tree: A Dynamic Index for Multi-Dimensional Objects. Proc. 13th Int. Conf. on Very Large Databases, Brighton, England, pp. 507-518.
[29]
{Wei 80} Weisstein N (1980): The Joy of Fourier Analysis. In Harris CS (Ed) Visual coding and adaptability, Erlbaum, Hillsdale NJ.
[30]
{WW 80} Wallace T, Wintz P (1980): An Efficient Three-Dimensional Aircraft Recognition Algorithm Using Normalized Fourier Descriptors. Computer Graphics and Image Processing 13:99-126.

Cited By

View all
  • (2009)Partial Similarity of Objects, or How to Compare a Centaur to a HorseInternational Journal of Computer Vision10.1007/s11263-008-0147-384:2(163-183)Online publication date: 1-Aug-2009
  • (2006)3D protein structure matching by patch signaturesProceedings of the 17th international conference on Database and Expert Systems Applications10.1007/11827405_52(528-537)Online publication date: 4-Sep-2006
  • (2005)Feature-based similarity search in 3D object databasesACM Computing Surveys10.1145/1118890.111889337:4(345-387)Online publication date: 1-Dec-2005
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image The VLDB Journal — The International Journal on Very Large Data Bases
The VLDB Journal — The International Journal on Very Large Data Bases  Volume 6, Issue 4
November 1997
88 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 11 November 1997

Author Tags

  1. CAD databases
  2. Fourier transformation
  3. Indexing and query processing of spatial objects
  4. Partial similarity retrieval

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)25
  • Downloads (Last 6 weeks)5
Reflects downloads up to 25 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2009)Partial Similarity of Objects, or How to Compare a Centaur to a HorseInternational Journal of Computer Vision10.1007/s11263-008-0147-384:2(163-183)Online publication date: 1-Aug-2009
  • (2006)3D protein structure matching by patch signaturesProceedings of the 17th international conference on Database and Expert Systems Applications10.1007/11827405_52(528-537)Online publication date: 4-Sep-2006
  • (2005)Feature-based similarity search in 3D object databasesACM Computing Surveys10.1145/1118890.111889337:4(345-387)Online publication date: 1-Dec-2005
  • (2004)CartoDrawIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2004.126076110:1(95-110)Online publication date: 1-Jan-2004
  • (2003)Using sets of feature vectors for similarity search on voxelized CAD objectsProceedings of the 2003 ACM SIGMOD international conference on Management of data10.1145/872757.872828(587-598)Online publication date: 9-Jun-2003
  • (2003)Visualizing geographic informationInformation Visualization10.1057/palgrave.ivs.95000392:1(58-67)Online publication date: 1-Mar-2003
  • (2002)Efficient retrieval of similar shapesThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778010005911:1(17-27)Online publication date: 1-Aug-2002
  • (2000)A cost model for query processing in high dimensional data spacesACM Transactions on Database Systems10.1145/357775.35777625:2(129-178)Online publication date: 1-Jun-2000
  • (1999)XZ-OrderingProceedings of the 6th International Symposium on Advances in Spatial Databases10.5555/647226.719087(75-90)Online publication date: 20-Jul-1999
  • (1999)Clustering techniques for large data sets—from the past to the futureTutorial notes of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining10.1145/312179.312189(141-181)Online publication date: 1-Aug-1999
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Full Access

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media