Object Centric Point Sets Feature Learning with Matrix Decomposition

Wang, Zijia; Yang, Wenbin; Liu, Zhisong; Chen, Qiang; Ni, Jiacheng; Jia, Zhen

doi:10.1007/978-3-031-27066-6_18

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13848)

Included in the following conference series:

Asian Conference on Computer Vision

Abstract

Rebuilding a morphable 3D model from a single input image requires learning a representation with the appropriate invariance and equivariance properties. However, current approaches to 3D point clouds depend heavily on large quantities of labeled data, while unsupervised methods need a large number of parameters, which makes both inefficient. Moreover, in 3D morphable model building, the encoding of input images has received little attention. In this paper, we design a framework that strictly respects the permutation invariance of input points. Matrix Decomposition-based Invariant (MDI) learning offers a unified architecture for unsupervised invariant point set feature learning. The key idea is to derive invariance and equivariance properties for a point set via a simple but effective matrix decomposition. Despite its simplicity, MDI is efficient and effective. Empirically, its performance is comparable to or even surpasses the state of the art. In addition, we present a framework for manipulating avatars based on CLIP and TBGAN, and the results indicate that our learnt features may help the model achieve better manipulation outcomes.
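
The abstract does not say which matrix decomposition MDI uses, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a singular value decomposition (SVD) of the point matrix, and the function name `singular_value_descriptor` is hypothetical. For an n-by-3 point matrix, the singular values are unchanged by reordering the points or by applying an orthogonal transform, while the left singular vectors are reordered together with the points (up to a sign per column).

```python
import numpy as np


def singular_value_descriptor(points):
    """Singular values of an (n, d) point matrix, in descending order.

    Assumption for illustration only: SVD stands in for the unspecified
    matrix decomposition mentioned in the abstract.
    """
    return np.linalg.svd(points, compute_uv=False)


rng = np.random.default_rng(0)
points = rng.normal(size=(128, 3))  # synthetic point set, not real data

# Reorder the points (row permutation).
perm = rng.permutation(points.shape[0])
shuffled = points[perm]

# Apply a random orthogonal transform (a rotation, possibly with a reflection).
q, _ = np.linalg.qr(rng.normal(size=(3, 3)))
rotated = points @ q

base = singular_value_descriptor(points)

# Invariance: singular values ignore point order and orthogonal transforms.
perm_invariant = np.allclose(base, singular_value_descriptor(shuffled))
rot_invariant = np.allclose(base, singular_value_descriptor(rotated))

# Equivariance: left singular vectors follow the permutation of the points.
# Columns are only defined up to sign, so absolute values are compared.
u, _, _ = np.linalg.svd(points, full_matrices=False)
u_shuffled, _, _ = np.linalg.svd(shuffled, full_matrices=False)
perm_equivariant = np.allclose(np.abs(u_shuffled), np.abs(u[perm]))

print(perm_invariant, rot_invariant, perm_equivariant)
```

The equivariance check relies on the singular values being distinct, which holds for generic data such as the random points used here.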

Author information

Authors and Affiliations

Dell Technologies OCTO Research Office, Shanghai, China
Zijia Wang, Wenbin Yang, Zhisong Liu, Qiang Chen, Jiacheng Ni & Zhen Jia

Corresponding author

Correspondence to Zijia Wang.

Editor information

Editors and Affiliations

University of Tokyo, Tokyo, Japan
Yinqiang Zheng
Hacettepe University, Ankara, Türkiye
Hacer Yalim Keleş
Data61/CSIRO, Canberra, ACT, Australia
Piotr Koniusz

About this paper

Cite this paper

Wang, Z., Yang, W., Liu, Z., Chen, Q., Ni, J., Jia, Z. (2023). Object Centric Point Sets Feature Learning with Matrix Decomposition. In: Zheng, Y., Keleş, H.Y., Koniusz, P. (eds) Computer Vision – ACCV 2022 Workshops. ACCV 2022. Lecture Notes in Computer Science, vol 13848. Springer, Cham. https://doi.org/10.1007/978-3-031-27066-6_18

DOI: https://doi.org/10.1007/978-3-031-27066-6_18
Published: 09 March 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-27065-9
Online ISBN: 978-3-031-27066-6
eBook Packages: Computer Science, Computer Science (R0)
