Article

Free access

PointCNN: convolution on Χ-transformed points

Authors:

Baoquan ChenAuthors Info & Claims

NIPS'18: Proceedings of the 32nd International Conference on Neural Information Processing Systems

Pages 828 - 838

Published: 03 December 2018 Publication History

PDF eReader Publisher Site

Abstract

We present a simple and general framework for feature learning from point clouds. The key to the success of CNNs is the convolution operator that is capable of leveraging spatially-local correlation in data represented densely in grids (e.g. images). However, point clouds are irregular and unordered, thus directly convolving kernels against features associated with the points will result in desertion of shape information and variance to point ordering. To address these problems, we propose to learn an Χ-transformation from the input points to simultaneously promote two causes: the first is the weighting of the input features associated with the points, and the second is the permutation of the points into a latent and potentially canonical order. Element-wise product and sum operations of the typical convolution operator are subsequently applied on the Χ-transformed features. The proposed method is a generalization of typical CNNs to feature learning from point clouds, thus we call it PointCNN. Experiments show that PointCNN achieves on par or better performance than state-of-the-art methods on multiple challenging benchmark datasets and tasks.

References

[1]

Martín Abadi and et al. TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. Software available from tensorflow.org.

[2]

Iro Armeni, Ozan Sener, Amir R. Zamir, Helen Jiang, Ioannis Brilakis, Martin Fischer, and Silvio Savarese. 3d semantic parsing of large-scale indoor spaces. In CVPR, pages 1534-1543, 2016.

[3]

Matan Atzmon, Haggai Maron, and Yaron Lipman. Point convolutional neural networks by extension operators. ACM Trans. Graph., 37(4):71:1-71:12, July 2018.

Digital Library

[4]

Yizhak Ben-Shabat, Michael Lindenbaum, and Anath Fischer. 3d point cloud classification and segmentation using 3d modified fisher vector representation for convolutional neural networks. arXiv preprint arXiv:1711.08241, 2018.

[5]

Michael M. Bronstein, Joan Bruna, Yann LeCun, Arthur Szlam, and Pierre Vandergheynst. Geometric deep learning: going beyond euclidean data. IEEE Signal Processing Magazine, 34(4):18-42, 2017.

[6]

François Chollet. Xception: Deep learning with depthwise separable convolutions. arXiv preprint arXiv:1610.02357, 2016.

[7]

Djork-Arné Clevert, Thomas Unterthiner, and Sepp Hochreiter. Fast and accurate deep network learning by exponential linear units (elus). In ICLR, 2016.

[8]

Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, and Stephen Gould. Deeppermnet: Visual permutation learning. In CVPR, July 2017.

[9]

Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, and Matthias Nießner. Scannet: Richly-annotated 3d reconstructions of indoor scenes. In CVPR, 2017.

[10]

Sander Dieleman, Jeffrey De Fauw, and Koray Kavukcuoglu. Exploiting cyclic symmetry in convolutional neural networks. In Proceedings of the 33rd International Conference on International Conference on Machine Learning - Volume 48, ICML'16, pages 1889-1898. JMLR.org, 2016.

Digital Library

[11]

Mathias Eitz, James Hays, and Marc Alexa. How do humans sketch objects? ToG, 31(4):44:1-44:10, 2012.

Digital Library

[12]

Benjamin Graham, Martin Engelcke, and Laurens van der Maaten. 3d semantic segmentation with submanifold sparse convolutional networks. arXiv preprint arXiv:1711.10275, 2017.

[13]

Benjamin Graham and Laurens van der Maaten. Submanifold sparse convolutional networks. arXiv preprint arXiv:1706.01307, 2017.

[14]

Fabian Groh, Patrick Wieschollek, and Hendrik P. A. Lensch. Flex-convolution (deep learning beyond grid-worlds). arXiv preprint arXiv:1803.07289, 2018.

[15]

David Ha and Douglas Eck. A neural representation of sketch drawings. arXiv preprint arXiv:1704.03477, 2017.

[16]

Geoffrey E Hinton, Alex Krizhevsky, and Sida D Wang. Transforming auto-encoders. In International Conference on Artificial Neural Networks, pages 44-51. Springer, 2011.

Digital Library

[17]

Binh-Son Hua, Minh-Khoi Tran, and Sai-Kit Yeung. Point-wise convolutional neural network. In CVPR, 2018.

[18]

Qiangui Huang, Weiyue Wang, and Ulrich Neumann. Recurrent slice networks for 3d segmentation on point clouds. In CVPR, 2018.

[19]

Sergey Ioffe and Christian Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning, pages 448-456, 2015.

Digital Library

[20]

Max Jaderberg, Karen Simonyan, Andrew Zisserman, et al. Spatial transformer networks. In Advances in Neural Information Processing Systems, pages 2017-2025, 2015.

Digital Library

[21]

Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization. In ICLR, 2014.

[22]

Roman Klokov and Victor Lempitsky. Escape from cells: Deep kd-networks for the recognition of 3d point cloud models. In ICCV, 2017.

[23]

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, pages 1097-1105, 2012.

Digital Library

[24]

Loïc Landrieu and Martin Simonovsky. Large-scale point cloud semantic segmentation with superpoint graphs. CoRR, abs/1711.09869, 2017.

[25]

Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. Deep learning. Nature, 521(7553):436-444, 2015.

[26]

Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, 1998.

[27]

Jiaxin Li, Ben M. Chen, and Gim Hee Lee. So-net: Self-organizing network for point cloud analysis. In CVPR, 2018.

[28]

Yangyan Li, Sören Pirk, Hao Su, Charles R Qi, and Leonidas J Guibas. Fpnn: Field probing neural networks for 3d data. In NIPS, pages 307-315, 2016.

Digital Library

[29]

Min Lin, Qiang Chen, and Shuicheng Yan. Network in network. In ICLR, 2014.

[30]

Haggai Maron, Meirav Galun, Noam Aigerman, Miri Trope, Nadav Dym, Ersin Yumer, Vladimir G. Kim, and Yaron Lipman. Convolutional neural networks on surfaces via seamless toric covers. ACM Trans. Graph., 36(4):71:1-71:10, July 2017.

Digital Library

[31]

Federico Monti, Davide Boscaini, Jonathan Masci, Emanuele Rodolà, Jan Svoboda, and Michael M. Bronstein. Geometric deep learning on graphs and manifolds using mixture model cnns. In CVPR, July 2017.

[32]

Hyeonwoo Noh, Seunghoon Hong, and Bohyung Han. Learning deconvolution network for semantic segmentation. In ICCV, ICCV '15, pages 1520-1528, Washington, DC, USA, 2015. IEEE Computer Society.

Digital Library

[33]

Charles R. Qi, Hao Su, Kaichun Mo, and Leonidas J. Guibas. Pointnet: Deep learning on point sets for 3d classification and segmentation. In CVPR, pages 77-85, July 2017.

[34]

Charles R. Qi, Hao Su, Matthias Nießner, Angela Dai, Mengyuan Yan, and Leonidas J. Guibas. Volumetric and multi-view cnns for object classification on 3d data. In CVPR, pages 5648-5656, 2016.

[35]

Charles R Qi, Li Yi, Hao Su, and Leonidas J Guibas. Pointnet++: Deep hierarchical feature learning on point sets in a metric space. In NIPS, pages 5105-5114, 2017.

Digital Library

[36]

Siamak Ravanbakhsh, Jeff Schneider, and Barnabas Poczos. Deep learning with sets and point clouds. arXiv preprint arXiv:1611.04500, 2016.

[37]

Gernot Riegler, Ali Osman Ulusoys, and Andreas Geiger. Octnet: Learning deep 3d representations at high resolutions. In CVPR, 2017.

[38]

Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-net: Convolutional networks for biomedical image segmentation. In Nassir Navab, Joachim Hornegger, William M. Wells, and Alejandro F. Frangi, editors, MICCAI, pages 234-241, Cham, 2015. Springer International Publishing.

[39]

David E. Rumelhart, Geoffrey E. Hinton, and Ronald J. Williams. Learning internal representations by error propagation. In David E. Rumelhart, James L. McClelland, and CORPORATE PDP Research Group, editors, Parallel Distributed Processing: Explorations in the Microstructure of Cognition, Vol. 1, pages 318-362. MIT Press, Cambridge, MA, USA, 1986.

Digital Library

[40]

Sara Sabour, Nicholas Frosst, and Geoffrey E. Hinton. Dynamic routing between capsules. In NIPS, pages 3859-3869, 2017.

Digital Library

[41]

Tianjia Shao, Yin Yang, Yanlin Weng, Qiming Hou, and Kun Zhou. H-CNN: spatial hashing based CNN for 3d shape analysis. arXiv preprint arXiv:1803.11385, 2018.

[42]

Yiru Shen, Chen Feng, Yaoqing Yang, and Dong Tian. Mining point cloud local structures by kernel correlation and graph pooling. In CVPR, 2018.

[43]

Hang Su, Varun Jampani, Deqing Sun, Subhransu Maji, Evangelos Kalogerakis, Ming-Hsuan Yang, and Jan Kautz. Splatnet: Sparse lattice networks for point cloud processing. In CVPR, 2018.

[44]

Maxim Tatarchenko, Jaesik Park, Vladlen Koltun, and Qian-Yi Zhou. Tangent convolutions for dense prediction in 3d. In CVPR, 2018.

[45]

Lyne P. Tchapmi, Christopher B. Choy, Iro Armeni, JunYoung Gwak, and Silvio Savarese. Segcloud: Semantic segmentation of 3d point clouds. In 3DV, 2017.

[46]

Chu Wang, Babak Samari, and Kaleem Siddiqi. Local spectral graph convolution for point set feature learning. arXiv preprint arXiv:1803.05827, 2018.

[47]

Peng-Shuai Wang, Yang Liu, Yu-Xiao Guo, Chun-Yu Sun, and Xin Tong. O-cnn: Octree-based convolutional neural networks for 3d shape analysis. ACM Trans. Graph., 36(4):72:1-72:11, July 2017.

Digital Library

[48]

Shenlong Wang, Simon Suo, Wei-Chiu Ma, Andrei Pokrovsky, and Raquel Urtasun. Deep parametric continuous convolutional neural networks. In CVPR, 2018.

[49]

Weiyue Wang, Ronald Yu, Qiangui Huang, and Ulrich Neumann. SGPN: similarity group proposal network for 3d point cloud instance segmentation. In CVPR, 2018.

[50]

Yue Wang, Yongbin Sun, Ziwei Liu, Sanjay E. Sarma, Michael M. Bronstein, and Justin M. Solomon. Dynamic graph cnn for learning on point clouds. arXiv preprint arXiv:1801.07829, 2018.

[51]

Shihao Wu, Hui Huang, Minglun Gong, Matthias Zwicker, and Daniel Cohen-Or. Deep points consolidation. ToG, 34(6):176:1-176:13, October 2015.

Digital Library

[52]

Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 3d shapenets: A deep representation for volumetric shapes. In CVPR, pages 1912-1920, 2015.

[53]

Yifan Xu, Tianqi Fan, Mingye Xu, Long Zeng, and Yu Qiao. Spidercnn: Deep learning on point sets with parameterized convolutional filters. arXiv preprint arXiv:1803.11527, 2018.

[54]

Li Yi, Vladimir G. Kim, Duygu Ceylan, I-Chao Shen, Mengyan Yan, Hao Su, Cewu Lu, Qixing Huang, Alla Sheffer, and Leonidas Guibas. A scalable active framework for region annotation in 3d shape collections. ToG, 35(6):210:1-210:12, November 2016.

Digital Library

[55]

Li Yi, Hao Su, Xingwen Guo, and Leonidas Guibas. Syncspeccnn: Synchronized spectral cnn for 3d shape segmentation. In CVPR, pages 6584-6592, July 2017.

[56]

Li Yi, Hao Su, Lin Shao, Manolis Savva, Haibin Huang, Yang Zhou, Benjamin Graham, Martin Engelcke, Roman Klokov, Victor Lempitsky, et al. Large-scale 3d shape reconstruction and segmentation from shapenet core55. arXiv preprint arXiv:1710.06104, 2017.

[57]

Qian Yu, Yongxin Yang, Feng Liu, Yi-Zhe Song, Tao Xiang, and Timothy M. Hospedales. Sketch-a-net: A deep neural network that beats humans. IJCV, 122(3):411-425, May 2017.

Digital Library

[58]

Manzil Zaheer, Satwik Kottur, Siamak Ravanbakhsh, Barnabas Poczos, Ruslan R. Salakhutdinov, and Alexander J Smola. Deep sets. In I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, editors, NIPS, pages 3394-3404, 2017.

Digital Library

Cited By

Tang KShi YWu JPeng WKhan AZhu PGu Z(2022)NormalAttackSecurity and Communication Networks10.1155/2022/11866332022Online publication date: 1-Jan-2022
https://dl.acm.org/doi/10.1155/2022/1186633
Himeur CLejemble TPellegrini TPaulin MBarthe LMellado N(2021)PCEDNet: A Lightweight Neural Network for Fast and Interactive Edge Detection in 3D Point CloudsACM Transactions on Graphics10.1145/348180441:1(1-21)Online publication date: 10-Nov-2021
https://dl.acm.org/doi/10.1145/3481804
Liu YGuo JBenes BDeussen OZhang XHuang H(2021)TreePartNetACM Transactions on Graphics10.1145/3478513.348048640:6(1-16)Online publication date: 10-Dec-2021
https://dl.acm.org/doi/10.1145/3478513.3480486
Show More Cited By

Recommendations

PointCNNVis: A visual analysis system for the interpretability of PointCNN
ICCAI '22: Proceedings of the 8th International Conference on Computing and Artificial Intelligence

The proposal of PointCNN enables CNN to be applied to point cloud segmentation and classification tasks. While using CNN, the local spatial information of the point cloud is also considered, which greatly improves the disorder of the point cloud. ...
Undecimated wavelet shrinkage estimate of the 1D and 2D spectra
ICASSP '00: Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 04

We study the problem of estimating the log-spectrum of a stationary Gaussian time series by thresholding the wavelet coefficients. We propose the use of the undecimated wavelet transform to denoise the log-periodogram. For this, we review a denoising ...
LMAE

We propose a large margin Auto-encoders algorithm that boost the discriminability of classifications.We provide the optimization of the proposed LMAE algorithm.We conduct extensive experiments to verify the proposed algorithm in comparison with the ...

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings

NIPS'18: Proceedings of the 32nd International Conference on Neural Information Processing Systems

December 2018

11021 pages

Publisher

Curran Associates Inc.

Red Hook, NY, United States

Publication History

Published: 03 December 2018

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

11
Total Citations
View Citations
469
Total Downloads

Downloads (Last 12 months)104
Downloads (Last 6 weeks)15

Reflects downloads up to 15 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Tang KShi YWu JPeng WKhan AZhu PGu Z(2022)NormalAttackSecurity and Communication Networks10.1155/2022/11866332022Online publication date: 1-Jan-2022
https://dl.acm.org/doi/10.1155/2022/1186633
Himeur CLejemble TPellegrini TPaulin MBarthe LMellado N(2021)PCEDNet: A Lightweight Neural Network for Fast and Interactive Edge Detection in 3D Point CloudsACM Transactions on Graphics10.1145/348180441:1(1-21)Online publication date: 10-Nov-2021
https://dl.acm.org/doi/10.1145/3481804
Liu YGuo JBenes BDeussen OZhang XHuang H(2021)TreePartNetACM Transactions on Graphics10.1145/3478513.348048640:6(1-16)Online publication date: 10-Dec-2021
https://dl.acm.org/doi/10.1145/3478513.3480486
Liao YLi XTong ZZhao YLim AKuang ZMidoglu CShen HZhuang YSmith JYang YCesar PMetze FPrabhakaran B(2021)Reproducibility Companion PaperProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3477934(3610-3614)Online publication date: 17-Oct-2021
https://dl.acm.org/doi/10.1145/3474085.3477934
Du BGao XHu WLi XShen HZhuang YSmith JYang YCesar PMetze FPrabhakaran B(2021)Self-Contrastive Learning with Hard Negative Sampling for Self-supervised Point Cloud LearningProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475458(3133-3142)Online publication date: 17-Oct-2021
https://dl.acm.org/doi/10.1145/3474085.3475458
Xia YXia YLi WSong RCao KStilla UShen HZhuang YSmith JYang YCesar PMetze FPrabhakaran B(2021)ASFM-Net: Asymmetrical Siamese Feature Matching Network for Point CompletionProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475348(1938-1947)Online publication date: 17-Oct-2021
https://dl.acm.org/doi/10.1145/3474085.3475348
Li LYuan CShen HZhuang YSmith JYang YCesar PMetze FPrabhakaran B(2021)VQMG: Hierarchical Vector Quantised and Multi-hops Graph Reasoning for Explicit Representation LearningProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475224(5029-5037)Online publication date: 17-Oct-2021
https://dl.acm.org/doi/10.1145/3474085.3475224
Metzer GHanocka RGiryes RCohen-Or D(2021)Self-Sampling for Neural Point Cloud ConsolidationACM Transactions on Graphics10.1145/347064540:5(1-14)Online publication date: 24-Sep-2021
https://dl.acm.org/doi/10.1145/3470645
Shinohara TXiu HMatsuoka M(2020)Semantic Segmentation for Full-Waveform LiDAR Data Using Local and Hierarchical Global Feature ExtractionProceedings of the 28th International Conference on Advances in Geographic Information Systems10.1145/3397536.3422209(640-650)Online publication date: 3-Nov-2020
https://dl.acm.org/doi/10.1145/3397536.3422209
Hu JWang BQian LPan YGuo XLiu LWang W(2019)MAT-netProceedings of the 28th International Joint Conference on Artificial Intelligence10.5555/3367032.3367143(774-781)Online publication date: 10-Aug-2019
https://dl.acm.org/doi/10.5555/3367032.3367143
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents