Cross-X Learning for Fine-Grained Visual Categorization

Luo, Wei; Yang, Xitong; Mo, Xianjie; Lu, Yuheng; Davis, Larry S.; Li, Jun; Yang, Jian; Lim, Ser-Nam

Abstract:Recognizing objects from subcategories with very subtle differences remains a challenging task due to the large intra-class and small inter-class variation. Recent work tackles this problem in a weakly-supervised manner: object parts are first detected and the corresponding part-specific features are extracted for fine-grained classification. However, these methods typically treat the part-specific features of each image in isolation while neglecting their relationships between different images. In this paper, we propose Cross-X learning, a simple yet effective approach that exploits the relationships between different images and between different network layers for robust multi-scale feature learning. Our approach involves two novel components: (i) a cross-category cross-semantic regularizer that guides the extracted features to represent semantic parts and, (ii) a cross-layer regularizer that improves the robustness of multi-scale features by matching the prediction distribution across multiple layers. Our approach can be easily trained end-to-end and is scalable to large datasets like NABirds. We empirically analyze the contributions of different components of our approach and demonstrate its robustness, effectiveness and state-of-the-art performance on five benchmark datasets. Code is available at \url{this https URL}.

Comments:	accepted by ICCV 2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1909.04412 [cs.CV]
	(or arXiv:1909.04412v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1909.04412

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-X Learning for Fine-Grained Visual Categorization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators