Researchers have achieved great success in dealing with 2D images using deep learning. In recent years, 3D computer vision and geometric deep learning have gained increasing attention. Many advanced techniques for 3D shapes have been proposed for different applications. Unlike 2D images, which can be uniformly represented by a regular grid of pixels, 3D shapes have various representations, such as depth images, multi-view images, voxels, point clouds, meshes, and implicit surfaces. The performance achieved in different applications largely depends on the representation used, and no single representation works well for all applications. Therefore, in this survey, we review recent developments in deep learning for 3D geometry from a representation perspective, summarizing the advantages and disadvantages of different representations for different applications. We also present existing datasets in these representations and discuss future research directions.
This work was supported by the National Natural Science Foundation of China (61828204, 61872440), the Beijing Municipal Natural Science Foundation (L182016), the Youth Innovation Promotion Association CAS, the CCF-Tencent Open Fund, a Royal Society-Newton Advanced Fellowship (NAF\R2\192151), and the Royal Society (IES\R1\180126).
Author information
Yun-Peng Xiao received his bachelor's degree in computer science from Nankai University. He is currently a master's student at the Institute of Computing Technology, the Chinese Academy of Sciences. His research interests include computer graphics and geometric processing.
Yu-Kun Lai received his bachelor's and Ph.D. degrees in computer science from Tsinghua University in 2003 and 2008, respectively. He is currently a Reader in the School of Computer Science & Informatics, Cardiff University. His research interests include computer graphics, geometry processing, image processing, and computer vision. He is on the editorial boards of Computer Graphics Forum and The Visual Computer.
Fang-Lue Zhang is currently a lecturer at Victoria University of Wellington, New Zealand. He received his bachelor's degree from Zhejiang University, Hangzhou, in 2009, and his doctoral degree from Tsinghua University, Beijing, in 2015. His research interests include image and video editing, computer vision, and computer graphics. He is a member of IEEE and ACM. He received a Victoria Early-Career Research Excellence Award in 2019.
Chunpeng Li received his Ph.D. degree in 2008 and is now an associate professor at the Institute of Computing Technology, the Chinese Academy of Sciences. His main research interests are virtual reality, human-computer interaction, and computer graphics.
Lin Gao received his bachelor's degree in mathematics from Sichuan University and his Ph.D. degree in computer science from Tsinghua University. He is currently an associate professor at the Institute of Computing Technology, the Chinese Academy of Sciences. His research interests include computer graphics and geometric processing. He received a Newton Advanced Fellowship award from the Royal Society in 2019.
Xiao, YP., Lai, YK., Zhang, FL. et al. A survey on deep geometry learning: From a representation perspective. Comp. Visual Media 6, 113–133 (2020). https://doi.org/10.1007/s41095-020-0174-8