A survey on deep learning advances on different 3D data representations

E Ahmed, A Saint, AER Shabayek, K Cherenkova, R Das, G Gusev, D Aouada, B Ottersten
arXiv preprint arXiv:1808.01462, 2018
3D data is a valuable asset in the computer vision field, as it provides rich information about the full geometry of sensed objects and scenes. With the recent availability of both large 3D datasets and computational power, it is now possible to consider applying deep learning to learn specific tasks on 3D data, such as segmentation, recognition and correspondence. Depending on the 3D data representation considered, different challenges may be foreseen in using existing deep learning architectures. In this work, we provide a comprehensive overview of various 3D data representations, highlighting the difference between Euclidean and non-Euclidean ones. We also discuss how deep learning methods are applied to each representation, analyzing the challenges to overcome.