Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/1896300.1896374guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

PCA-SIFT: a more distinctive representation for local image descriptors

Published: 27 June 2004 Publication History

Abstract

Stable local feature detection and representation is a fundamental component of many image registration and object recognition algorithms. Mikolajczyk and Schmid [14] recently evaluated a variety of approaches and identified the SIFT [11] algorithm as being the most resistant to common image deformations. This paper examines (and improves upon) the local image descriptor used by SIFT. Like SIFT, our descriptors encode the salient aspects of the image gradient in the feature point's neighborhood; however, instead of using SIFT's smoothed weighted histograms, we apply Principal Components Analysis (PCA) to the normalized gradient patch. Our experiments demonstrate that the PCAbased local descriptors are more distinctive, more robust to image deformations, and more compact than the standard SIFT representation. We also present results showing that using these descriptors in an image retrieval application results in increased accuracy and faster matching.

References

[1]
Viewpoint change sequences. http://www.inrialpes.fr/ movi/.
[2]
S. Agarwal and D. Roth. Learning a sparse representation for object detection. In Proceedings of European Conference on Computer Vision, pages 113-130, 2002.
[3]
R. Fergus, P. Perona, and A. Zisserman. Object class recognition by unsupervised scale-invariant learning. In Proceedings of Computer Vision and Pattern Recognition, June 2003.
[4]
W. T. Freeman and E. H. Adelson. The design and use of steerable filters. IEEE Trans. Pattern Analysis and Machine Intelligence, 13(9):891-906, 1991.
[5]
K. Fukunaga and W. Koontz. Application of the Karhunen-Loeve expansion to feature selection and ordering. IEEE Trans. Communications, 19(4), 1970.
[6]
C. Harris and M. Stephens. A combined corner and edge detector. In Alvey Vision Conference, pages 147-151, 1988.
[7]
I. T. Joliffe. Principal Component Analysis. Springer-Verlag, 1986.
[8]
J. Karhunen and J. Joutsensalo. Generalization of principal component analysis, optimization problems and neural networks. Neural Networks, 8(4), 1995.
[9]
J. Koenderink and A. van Doorn. Representation of local geometry in the visual system. In Biological Cybernetics, volume 55, pages 367-375, 1987.
[10]
D. Lee and S. Seung. Learning the parts of objects by non-negative matrix factorization. Nature, 401, 1999.
[11]
D. G. Lowe. Object recognition from local scale-invariant features. In Proceedings of International Conference on Computer Vision, pages 1150-1157, 1999.
[12]
D. G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004.
[13]
K. Mikolajczyk and C. Schmid. Indexing based on scale invariant interest points. In Proceedings of International Conference on Computer Vision, pages 525-531, July 2001.
[14]
K. Mikolajczyk and C. Schmid. A performance evaluation of local descriptors. In Proceedings of Computer Vision and Pattern Recognition, June 2003.
[15]
H. Murase and S. Nayar. Detection of 3D objects in cluttered scenes using hierarchical eigenspace. Pattern Recognition Letters, 18(4), April 1997.
[16]
F. Schaffalitzky and A. Zisserman. Multi-view matching for unordered image sets. In Proceedings of European Conference on Computer Vision, volume 1, pages 414-431. Springer-Verlag, 2002.
[17]
M. Turk and A. Pentland. Face recognition using eigenfaces. In Proceedings of Computer Vision and Pattern Recognition, 1991.
[18]
L. Van Gool, T. Moons, and D. Ungureanu. Affine/photometric invariants for planar intensity patterns. In Proceedings of European Conference on Computer Vision, 1996.

Cited By

View all
  • (2019)Deep Spatial-Temporal Neural Network for Classification of EEG-Based Motor ImageryProceedings of the 2019 International Conference on Artificial Intelligence and Computer Science10.1145/3349341.3349414(265-272)Online publication date: 12-Jul-2019
  • (2019)Maximally Correlated Principal Component Analysis Based on Deep Parameterization LearningACM Transactions on Knowledge Discovery from Data10.1145/333218313:4(1-17)Online publication date: 29-Jul-2019
  • (2019)Rotation-Invariant Approach Using Mask to Content-Based Image RetrievalProceedings of the 2019 5th International Conference on Computer and Technology Applications10.1145/3323933.3324066(11-14)Online publication date: 16-Apr-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
CVPR'04: Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
June 2004
1041 pages

Sponsors

  • IEEE-CS\DATC: IEEE Computer Society

Publisher

IEEE Computer Society

United States

Publication History

Published: 27 June 2004

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2019)Deep Spatial-Temporal Neural Network for Classification of EEG-Based Motor ImageryProceedings of the 2019 International Conference on Artificial Intelligence and Computer Science10.1145/3349341.3349414(265-272)Online publication date: 12-Jul-2019
  • (2019)Maximally Correlated Principal Component Analysis Based on Deep Parameterization LearningACM Transactions on Knowledge Discovery from Data10.1145/333218313:4(1-17)Online publication date: 29-Jul-2019
  • (2019)Rotation-Invariant Approach Using Mask to Content-Based Image RetrievalProceedings of the 2019 5th International Conference on Computer and Technology Applications10.1145/3323933.3324066(11-14)Online publication date: 16-Apr-2019
  • (2019)A Local Metric for Defocus Blur Detection Based on CNN Feature LearningIEEE Transactions on Image Processing10.1109/TIP.2018.288183028:5(2107-2115)Online publication date: 1-May-2019
  • (2019)Efficiently Annotating Object Images with Absolute Size Information Using Mobile DevicesInternational Journal of Computer Vision10.1007/s11263-018-1093-3127:2(207-224)Online publication date: 1-Feb-2019
  • (2019)Low-dimensional superpixel descriptor and its application in visual correspondence estimationMultimedia Tools and Applications10.1007/s11042-019-7248-678:14(19457-19472)Online publication date: 1-Jul-2019
  • (2019)Bi-dimensional empirical mode decomposition (BEMD) algorithm based on particle swarm optimization-fractal interpolationMultimedia Tools and Applications10.1007/s11042-018-7097-878:12(17239-17264)Online publication date: 1-Jun-2019
  • (2019)Research of commodity recommendation workflow based on LSH algorithmMultimedia Tools and Applications10.1007/s11042-018-5716-z78:4(4327-4345)Online publication date: 1-Feb-2019
  • (2019)A comparative study of hash based approximate nearest neighbor learning and its application in image retrievalArtificial Intelligence Review10.1007/s10462-017-9591-152:1(323-355)Online publication date: 1-Jun-2019
  • (2019)Object recognition based on critical nodesPattern Analysis & Applications10.1007/s10044-018-00777-w22:1(147-163)Online publication date: 1-Feb-2019
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media