Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3672919.3672983acmotherconferencesArticle/Chapter ViewAbstractPublication PagescsaideConference Proceedingsconference-collections
research-article

Homogenous multimodal 3D object detection based on deformable Transformer and attribute dependencies

Published: 24 July 2024 Publication History

Abstract

In this work, we propose a unified multi-representation 3D object detection framework called attribute dependent Homogenous fusion 3D object Detection Network ADHF. The proposed method aims to solve the problem of inefficient utilization of point cloud multiple representations. For this purpose, the cross-head self-attention mechanism CSA extracts key points and Voxels features, and it realizes cross-head information interaction using only multi-head attention and channel adjustment. Different from previous work, our method mines the similarity between Voxels features and key features in Deformable Attention Transformer by improving the data transmission information flow, so as to capture remote dependencies, aggregate multi-representation information, and make extracted point voxels features more discriminating. The previous methods usually predict the properties of 3D objects at one time, and fail to make full use of the hidden sequence information in the prediction features of objects. The ADHF framework is equipped with a novel attribute dependent detection head ADH to solve the above problems. Overall, ADHF is an interim attempt to represent different modes in a unified framework. The proposed method achieves leading performance for target detection in Kitti set.

References

[1]
Ashish V, Noam S, Niki P, Jakob U, Llion J, AidanN. G, Lukasz K and Illia P 2017 Attention is All you Need. Neural Information Processing Systems.
[2]
X Zhu, W Su, L Lu, B Li, X Wang and J. Dai 2020 Deformable DETR: Deformable Transformers for End-to-End Object Detection Computer Vision and Pattern Recognition.
[3]
Y Yan, Y Mao, and B Li 2018 SECOND: Sparsely Embedded Convolutional Detection Sensors, P 3337
[4]
L Wu, X Liu, and Q Liu 2021 Centroid Transformers: Learning to Abstract with Attention. arXiv: Learning.
[5]
J. Deng, S. Shi, P. Li, W. Zhou, Y. Zhang, and H. Li 2022 Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection,” Proceedings of the AAAI Conference on Artificial Intelligence P1201–1209
[6]
H Sheng, S Cai, Y Liu, B Deng, J Huang, X. Hua and M. Zhao 2021 Improving 3d object detection with channel-wise transformer IEEE/CVF International Conference on Computer Vision (ICCV) P 2743–2752.
[7]
Z Li, Y Yao, Z Quan, W Yang and J. Xie 2021 SIENet: Spatial Information Enhancement Network for 3D Object Detection from Point Cloud Cornell. arXiv, Cornell University – arXiv.
[8]
Charles R. Qi, W Liu, C. Wu, H Su and L. Guibas 2018 Frustum pointnets for 3d object detection from rgb-d data IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[9]
T Guan, J Wang, S Lan, Rohan C, Z Wu, Larry D and Dinesh M 2022 M3DETR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (USA: Waikoloa, HI).
[10]
JordanS. K. Hu, T. Kuai and StevenL. W 2022 Point Density-Aware Voxels for LiDAR 3D Object Detection IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[11]
Q. Xu, Y. Zhong and U. Neumann 2022 Behind the Curtain: Learning Occluded Shapes for 3D Object Detection Proceedings of the AAAI Conference on Artificial IntelligencP2893–2901.
[12]
S Shi, X Wang and H Li 2019 Pointrcnn:3d object proposal generation and detection from point cloud Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition P 770.
[13]
Z Yang, Y Sun, S Liu, X Shen and Jiaya Jia 2019 STD: sparse-to-dense 3d object detector for point cloud International Conference on Computer Vision (ICCV).
[14]
S Shi, C Guo, L Jiang, Z Wang, J Shi, X Wang and H Li 2020 Pv-rcnn: Pointvoxel feature set abstraction for 3d object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition P10529–10538.
[15]
Shi, Z Wang, X Wang and H Li 2019 Part-aˆ2 net: 3d part-aware and aggregation neural network for object detection from point cloud. arXiv preprint arXiv:1907.03670, vol 2(3).
[16]
W. Zheng, W. Tang, L. Jiang, and C.-W. Fu 2021 SE-SSD: Self-Ensembling Single-Stage Object Detector From Point Cloud IEEE/CVF(CVPR) (Nashville, TN, USA).

Index Terms

  1. Homogenous multimodal 3D object detection based on deformable Transformer and attribute dependencies

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    CSAIDE '24: Proceedings of the 2024 3rd International Conference on Cyber Security, Artificial Intelligence and Digital Economy
    March 2024
    676 pages
    ISBN:9798400718212
    DOI:10.1145/3672919
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 July 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    CSAIDE 2024

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 7
      Total Downloads
    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 25 Dec 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media