Article

PCA-SIFT: a more distinctive representation for local image descriptors

Authors:

Rahul SukthankarAuthors Info & Claims

CVPR'04: Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

Pages 506 - 513

Published: 27 June 2004 Publication History

Abstract

Stable local feature detection and representation is a fundamental component of many image registration and object recognition algorithms. Mikolajczyk and Schmid [14] recently evaluated a variety of approaches and identified the SIFT [11] algorithm as being the most resistant to common image deformations. This paper examines (and improves upon) the local image descriptor used by SIFT. Like SIFT, our descriptors encode the salient aspects of the image gradient in the feature point's neighborhood; however, instead of using SIFT's smoothed weighted histograms, we apply Principal Components Analysis (PCA) to the normalized gradient patch. Our experiments demonstrate that the PCAbased local descriptors are more distinctive, more robust to image deformations, and more compact than the standard SIFT representation. We also present results showing that using these descriptors in an image retrieval application results in increased accuracy and faster matching.

References

[1]

Viewpoint change sequences. http://www.inrialpes.fr/ movi/.

[2]

S. Agarwal and D. Roth. Learning a sparse representation for object detection. In Proceedings of European Conference on Computer Vision, pages 113-130, 2002.

[3]

R. Fergus, P. Perona, and A. Zisserman. Object class recognition by unsupervised scale-invariant learning. In Proceedings of Computer Vision and Pattern Recognition, June 2003.

[4]

W. T. Freeman and E. H. Adelson. The design and use of steerable filters. IEEE Trans. Pattern Analysis and Machine Intelligence, 13(9):891-906, 1991.

[5]

K. Fukunaga and W. Koontz. Application of the Karhunen-Loeve expansion to feature selection and ordering. IEEE Trans. Communications, 19(4), 1970.

[6]

C. Harris and M. Stephens. A combined corner and edge detector. In Alvey Vision Conference, pages 147-151, 1988.

[7]

I. T. Joliffe. Principal Component Analysis. Springer-Verlag, 1986.

[8]

J. Karhunen and J. Joutsensalo. Generalization of principal component analysis, optimization problems and neural networks. Neural Networks, 8(4), 1995.

[9]

J. Koenderink and A. van Doorn. Representation of local geometry in the visual system. In Biological Cybernetics, volume 55, pages 367-375, 1987.

[10]

D. Lee and S. Seung. Learning the parts of objects by non-negative matrix factorization. Nature, 401, 1999.

[11]

D. G. Lowe. Object recognition from local scale-invariant features. In Proceedings of International Conference on Computer Vision, pages 1150-1157, 1999.

[12]

D. G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004.

[13]

K. Mikolajczyk and C. Schmid. Indexing based on scale invariant interest points. In Proceedings of International Conference on Computer Vision, pages 525-531, July 2001.

[14]

K. Mikolajczyk and C. Schmid. A performance evaluation of local descriptors. In Proceedings of Computer Vision and Pattern Recognition, June 2003.

[15]

H. Murase and S. Nayar. Detection of 3D objects in cluttered scenes using hierarchical eigenspace. Pattern Recognition Letters, 18(4), April 1997.

[16]

F. Schaffalitzky and A. Zisserman. Multi-view matching for unordered image sets. In Proceedings of European Conference on Computer Vision, volume 1, pages 414-431. Springer-Verlag, 2002.

[17]

M. Turk and A. Pentland. Face recognition using eigenfaces. In Proceedings of Computer Vision and Pattern Recognition, 1991.

[18]

L. Van Gool, T. Moons, and D. Ungureanu. Affine/photometric invariants for planar intensity patterns. In Proceedings of European Conference on Computer Vision, 1996.

Cited By

Qiao WBi X(2019)Deep Spatial-Temporal Neural Network for Classification of EEG-Based Motor ImageryProceedings of the 2019 International Conference on Artificial Intelligence and Computer Science10.1145/3349341.3349414(265-272)Online publication date: 12-Jul-2019
https://dl.acm.org/doi/10.1145/3349341.3349414
Chen HLi JGao JSun YHu YYin B(2019)Maximally Correlated Principal Component Analysis Based on Deep Parameterization LearningACM Transactions on Knowledge Discovery from Data10.1145/333218313:4(1-17)Online publication date: 29-Jul-2019
https://dl.acm.org/doi/10.1145/3332183
Chifa NBadri ARuichek Y(2019)Rotation-Invariant Approach Using Mask to Content-Based Image RetrievalProceedings of the 2019 5th International Conference on Computer and Technology Applications10.1145/3323933.3324066(11-14)Online publication date: 16-Apr-2019
https://dl.acm.org/doi/10.1145/3323933.3324066
Show More Cited By

PCA-SIFT: a more distinctive representation for local image descriptors
1. Computing methodologies

Recommendations

B-SIFT: A Binary SIFT Based Local Image Feature Descriptor
ICDH '12: Proceedings of the 2012 Fourth International Conference on Digital Home

Local feature point detection and description are the basis for Computer Vision. SIFT is one of the most efficient local image descriptors and have been well studied in recent years. In this paper, we introduce B-SIFT, a novel binary local image ...
KPB-SIFT: a compact local feature descriptor
MM '10: Proceedings of the 18th ACM international conference on Multimedia

Invariant feature descriptors such as SIFT and GLOH have been demonstrated to be very robust for image matching and object recognition. However, such descriptors are typically of high dimensionality, e.g. 128-dimension in the case of SIFT. This limits ...
SIFT-Based Image Compression
ICME '12: Proceedings of the 2012 IEEE International Conference on Multimedia and Expo

This paper proposes a novel image compression scheme based on the local feature descriptor - Scale Invariant Feature Transform (SIFT). The SIFT descriptor characterizes an image region invariantly to scale and rotation. It is used widely in image ...

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings

CVPR'04: Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

June 2004

1041 pages

Sponsors

IEEE-CS\DATC: IEEE Computer Society

Publisher

IEEE Computer Society

United States

Publication History

Published: 27 June 2004

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

324
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 25 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Qiao WBi X(2019)Deep Spatial-Temporal Neural Network for Classification of EEG-Based Motor ImageryProceedings of the 2019 International Conference on Artificial Intelligence and Computer Science10.1145/3349341.3349414(265-272)Online publication date: 12-Jul-2019
https://dl.acm.org/doi/10.1145/3349341.3349414
Chen HLi JGao JSun YHu YYin B(2019)Maximally Correlated Principal Component Analysis Based on Deep Parameterization LearningACM Transactions on Knowledge Discovery from Data10.1145/333218313:4(1-17)Online publication date: 29-Jul-2019
https://dl.acm.org/doi/10.1145/3332183
Chifa NBadri ARuichek Y(2019)Rotation-Invariant Approach Using Mask to Content-Based Image RetrievalProceedings of the 2019 5th International Conference on Computer and Technology Applications10.1145/3323933.3324066(11-14)Online publication date: 16-Apr-2019
https://dl.acm.org/doi/10.1145/3323933.3324066
Zeng KWang YMao JLiu JPeng WChen N(2019)A Local Metric for Defocus Blur Detection Based on CNN Feature LearningIEEE Transactions on Image Processing10.1109/TIP.2018.288183028:5(2107-2115)Online publication date: 1-May-2019
https://dl.acm.org/doi/10.1109/TIP.2018.2881830
Hofmann MSeeland MMäder P(2019)Efficiently Annotating Object Images with Absolute Size Information Using Mobile DevicesInternational Journal of Computer Vision10.1007/s11263-018-1093-3127:2(207-224)Online publication date: 1-Feb-2019
https://dl.acm.org/doi/10.1007/s11263-018-1093-3
Du SIkenaga T(2019)Low-dimensional superpixel descriptor and its application in visual correspondence estimationMultimedia Tools and Applications10.1007/s11042-019-7248-678:14(19457-19472)Online publication date: 1-Jul-2019
https://dl.acm.org/doi/10.1007/s11042-019-7248-6
An FLiu Z(2019)Bi-dimensional empirical mode decomposition (BEMD) algorithm based on particle swarm optimization-fractal interpolationMultimedia Tools and Applications10.1007/s11042-018-7097-878:12(17239-17264)Online publication date: 1-Jun-2019
https://dl.acm.org/doi/10.1007/s11042-018-7097-8
Liu DHuo CYan H(2019)Research of commodity recommendation workflow based on LSH algorithmMultimedia Tools and Applications10.1007/s11042-018-5716-z78:4(4327-4345)Online publication date: 1-Feb-2019
https://dl.acm.org/doi/10.1007/s11042-018-5716-z
Arulmozhi PAbirami S(2019)A comparative study of hash based approximate nearest neighbor learning and its application in image retrievalArtificial Intelligence Review10.1007/s10462-017-9591-152:1(323-355)Online publication date: 1-Jun-2019
https://dl.acm.org/doi/10.1007/s10462-017-9591-1
Boluk ADemirci M(2019)Object recognition based on critical nodesPattern Analysis & Applications10.1007/s10044-018-00777-w22:1(147-163)Online publication date: 1-Feb-2019
https://dl.acm.org/doi/10.1007/s10044-018-00777-w
Show More Cited By

View Options

View options

Figures

Tables

Media

View Table of Conten