research-article

Cross-Domain 3D Model Retrieval Based On Contrastive Learning And Label Propagation

Authors:

An-An LiuAuthors Info & Claims

MM '22: Proceedings of the 30th ACM International Conference on Multimedia

Pages 286 - 295

https://doi.org/10.1145/3503161.3548044

Published: 10 October 2022 Publication History

Get Access

Abstract

In this work, we aim to tackle the task of unsupervised image based 3D model retrieval, where we seek to retrieve unlabeled 3D models that are most visually similar to the 2D query image. Due to the challenging modality gap between 2D images and 3D models, existing mainstream methods adopt domain-adversarial techniques to eliminate the gap, which cannot guarantee category-level alignment that is important for retrieval performance. Recent methods align the class centers of 2D images and 3D models to pay attention to the category-level alignment. However, there still exist two main issues: 1) the category-level alignment is too rough, and 2) the category prediction of unlabeled 3D models is not accurate. To overcome the first problem, we utilize contrastive learning for fine-grained category-level alignment across domains, which pulls both prototypes and samples with the same semantic information closer and pushes those with different semantic information apart. To provide reliable semantic prediction for contrastive learning and also address the second issue, we propose the consistent decision for pseudo labels of 3D models based on both the trained image classifier and label propagation. Experiments are carried out on MI3DOR and MI3DOR-2 datasets, and the results demonstrate the effectiveness of our proposed method.

Supplementary Material

MP4 File (MM22-fp1232.mp4)

We propose a novel cross-domain 3D model retrieval method based on contrastive learning and label propagation to tackle the task of unsupervised image based 3D model retrieval. We perform fine grained semantic alignment via category-level and sample-level contrastive learning. We also improve the prediction accuracy for unlabeled 3D models with the consensus of image classifier and label propagation. Experiments are carried out on two commonly used datasets, and the results demonstrate the effectiveness of our proposed method.

Download
25.07 MB

References

[1]

Miao Hu, Xianzhuo Luo, Jiawen Chen, Young Choon Lee, Yipeng Zhou, and Di Wu. Virtual reality: A survey of enabling technologies and its applications in iot. Journal of Network and Computer Applications, page 102970, 2021.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Transductive Multilabel Learning via Label Set Propagation

Deep semi-supervised learning with contrastive learning and partial label propagation for image data

Contrastive learning from label distribution: A case study on text classification

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations