MDHA

[ACCEPTED IN IROS 2024]

MDHA is a novel sparse query-based framework for 3D object detection that constructs adaptive 3D output proposals using hybrid anchors from multi-view, multi-scale input. In addition, we propose a Circular Deformable Attention (CDA) mechanism for efficient multi-view attention. Our model diagram is shown below:

Model Diagram

Environment Setup

Create a conda environment:

conda create -n mdha python=3.8 -y
conda activate mdha

Install dependencies:

pip install -r requirements.txt

Note: please ensure that torch is installed correctly. We use CUDA 11.1; if you require a different CUDA version, follow the official PyTorch installation instructions, though we do not guarantee that the model will run or train successfully.
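
To confirm that the installed torch build matches before compiling the custom ops, a quick sanity check:

python -c "import torch; print(torch.__version__, torch.version.cuda, torch.cuda.is_available())"
# For our setup this should report a CUDA 11.1 build with is_available() == True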

Clone the repo:

git clone https://github.com/NaomiEX/MDHA

Navigate to the project folder:

cd path/to/MDHA/

Compile CUDA ops for CDA:

cd projects/mmdet3d_plugin/attentions/ops
./make.sh
cd ../../../../
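
To verify the build, you can try importing the compiled extension. The module name below is an assumption based on the upstream Deformable-DETR ops layout that scripts like make.sh commonly wrap; adjust it if the CDA ops install under a different name:

# Hypothetical import check; MultiScaleDeformableAttention is the extension name
# used by the upstream Deformable-DETR ops and may differ for the CDA ops here.
python -c "import MultiScaleDeformableAttention; print('CDA ops compiled OK')"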

Data Preparation

Download Dataset

Create a data directory:

mkdir ./data

Download nuScenes dataset to ./data directory.

  • (Preferred) download the Trainval and Test splits of the nuScenes Full dataset (v1.0) into ./data/nuscenes and ./data/nuscenes/test respectively; see the extraction sketch after this list [WARNING: this will download a large amount of data, ~340 GB]
  • (Not Recommended) download the Mini split of the nuScenes Full dataset (v1.0). This is a subset of trainval containing only 10 scenes; you may have to adjust some of the following steps if you choose this option.
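
A minimal extraction sketch for the preferred Trainval + Test layout, assuming the standard nuScenes v1.0 archive names (blob archive names can vary by release; adjust paths to wherever the archives were downloaded):

# Trainval: metadata plus the ten blob archives, extracted into ./data/nuscenes
tar -xzf v1.0-trainval_meta.tgz -C ./data/nuscenes
for i in $(seq -w 1 10); do tar -xzf v1.0-trainval${i}_blobs.tgz -C ./data/nuscenes; done
# Test split goes into its own subfolder
mkdir -p ./data/nuscenes/test
tar -xzf v1.0-test_meta.tgz -C ./data/nuscenes/test
tar -xzf v1.0-test_blobs.tgz -C ./data/nuscenes/test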

Creating annotation files

Our data preparation is a modification of StreamPETR's which additionally aligns the cameras to follow a circular order, as outlined in the paper.

python tools/create_data_nusc.py --trainval-path data/nuscenes --test-path data/nuscenes/test --out-dir data/nuscenes/anno --version v1.0 --with_circular_cams
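
If the script completes successfully, the annotation folder should contain the three info files shown in the folder-structure section below:

ls data/nuscenes/anno
# nuscenes_circular_infos_test.pkl  nuscenes_circular_infos_train.pkl  nuscenes_circular_infos_val.pkl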

Backbone weights

Weights are loaded from the ckpt folder:

mkdir ./ckpt

Download the backbone pre-trained ResNet-50 weights from PyTorch into ckpt/:

wget https://download.pytorch.org/models/resnet50-19c8e357.pth -O ckpt/resnet50-19c8e357.pth
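
PyTorch model-zoo filenames embed the first eight hex digits of the file's SHA-256 digest, so the download can be verified offline:

sha256sum ckpt/resnet50-19c8e357.pth | cut -c1-8
# Should print 19c8e357; re-download the file if it does not.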

Depth Coords (optional)

The depth coordinates are only used for MDHA-fixed.

Create the experiments/depth_coords folder:

mkdir -p experiments/depth_coords
cd experiments/depth_coords

Download depth coords from here into the experiments/depth_coords folder.

Folder structure

At this point your project folder structure should look like this:

├── ckpt/
|   ├── resnet50-19c8e357.pth
├── data/
|   ├── nuscenes/
|   |   ├── anno/
|   |   |   ├── nuscenes_circular_infos_test.pkl
|   |   |   ├── nuscenes_circular_infos_train.pkl
|   |   |   ├── nuscenes_circular_infos_val.pkl
|   |   ├── maps/
|   |   ├── samples/
|   |   ├── sweeps/
|   |   ├── v1.0-trainval/
|   |   ├── test/
|   |   |   ├── maps/
|   |   |   ├── samples/
|   |   |   ├── sweeps/
|   |   |   ├── v1.0-test
├── projects/
├── tools/

Training and Inference

Train

You can train the model on a local machine with 1 GPU as follows:

tools/dist_train.sh projects/configs/execution/local/cmdha_4ptenc_24ptdec_anchorref_updatepos_convdepth_mult_wrap_newproj_1gpu2bs.py 1 --work-dir work_dirs/test

Or with 2 GPUs as follows:

tools/dist_train.sh projects/configs/execution/local/cmdha_4ptenc_24ptdec_anchorref_updatepos_convdepth_mult_wrap_newproj_2gpu2bs.py 2 --work-dir work_dirs/test

Note 1: we also provide slurm_train.sh in the tools/ directory for training on a cluster with SLURM.

Note 2: by default, local training uses a batch size of 2 for 25 epochs due to the long training time. Our paper results were obtained by training on a cluster with 4 A100s for 100 epochs. See projects/configs/execution/hpc for training configs with larger batch sizes.
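
If a run is interrupted, launchers like dist_train.sh usually pass extra arguments through to the underlying train script, which in MMDetection-style codebases typically accepts a resume flag. The flag and checkpoint name below are assumptions; check tools/train.py for the exact interface:

# Hypothetical resume command; --resume-from and latest.pth follow MMDetection
# conventions and may differ in this codebase.
tools/dist_train.sh projects/configs/execution/local/cmdha_4ptenc_24ptdec_anchorref_updatepos_convdepth_mult_wrap_newproj_1gpu2bs.py 1 --work-dir work_dirs/test --resume-from work_dirs/test/latest.pth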

Evaluation

tools/dist_test.sh /path/to/execution/config /path/to/model/checkpoint 1 --eval bbox
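
For example, to evaluate the single-GPU local config against a checkpoint produced by the training run above (latest.pth is the default checkpoint link written by MMDetection-style trainers; the exact filename is an assumption):

tools/dist_test.sh projects/configs/execution/local/cmdha_4ptenc_24ptdec_anchorref_updatepos_convdepth_mult_wrap_newproj_1gpu2bs.py work_dirs/test/latest.pth 1 --eval bbox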
