Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble

Juhan Cha*, Minseok Joo*, Jihwan Park, Sanghyeok Lee, Injae Kim, Hyunwoo J. Kim†.

Installation

Follow the instructions below to set up MEFormer.

Environments

Python 3.8
CUDA 11.1
PyTorch 1.10

1. Clone Repository

git clone https://github.com/hanchaa/MEFormer.git
cd MEFormer

2. Create environment & Install libraries

conda create -n MEFormer python=3.8
conda activate MEFormer
pip install torch==1.10.1+cu111 torchvision==0.11.2+cu111 torchaudio==0.10.1 -f https://download.pytorch.org/whl/cu111/torch_stable.html

pip install openmim
mim install mmcv-full==1.6.0
pip install -r requirements.txt
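
After installation, it can help to confirm that the pinned versions were actually picked up by the environment. The following is a minimal sketch, not part of MEFormer: it assumes the distribution names `torch`, `torchvision`, and `mmcv-full` used in the commands above, and the helper names `installed_version` and `find_version_mismatches` are hypothetical.

```python
from importlib import metadata

# Pins taken from the install commands above; local tags such as "+cu111" are ignored.
EXPECTED = {"torch": "1.10.1", "torchvision": "0.11.2", "mmcv-full": "1.6.0"}


def installed_version(name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def find_version_mismatches(expected=EXPECTED, lookup=installed_version):
    """Hypothetical helper: map each mismatched package to (expected, found)."""
    mismatches = {}
    for name, want in expected.items():
        found = lookup(name)
        # Strip a local build tag like "+cu111" before comparing.
        base = found.split("+")[0] if found else None
        if base != want:
            mismatches[name] = (want, found)
    return mismatches


if __name__ == "__main__":
    for name, (want, found) in find_version_mismatches().items():
        print(f"{name}: expected {want}, found {found}")
```

Run inside the activated MEFormer environment, the script prints nothing when all three packages match the pins.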

3. Download pre-trained weights

Download the pretrained weights of the image backbone from Google Drive and move them to the ckpts directory.

MEFormer
├─ ckpts
│  ├─ fcos3d_vovnet_imgbackbone-remapped.pth
│  └─ nuim_r50.pth
├─ figures
├─ projects
└─ tools
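
A quick way to confirm that the layout above is in place before training is to check for the two weight files. This is only a sketch under the assumption that it is run from the repository root; the helper name `missing_checkpoints` is hypothetical and not part of MEFormer.

```python
from pathlib import Path

# File names taken from the directory tree above.
REQUIRED_CKPTS = ("fcos3d_vovnet_imgbackbone-remapped.pth", "nuim_r50.pth")


def missing_checkpoints(repo_root="."):
    """Hypothetical helper: list required backbone weights absent from <repo_root>/ckpts."""
    ckpt_dir = Path(repo_root) / "ckpts"
    return [name for name in REQUIRED_CKPTS if not (ckpt_dir / name).is_file()]


if __name__ == "__main__":
    missing = missing_checkpoints()
    print("all backbone weights found" if not missing else f"missing: {missing}")
```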

4. Prepare data

Follow the instructions from mmdet3d for data preparation.

Train & Inference

Train

tools/dist_train.sh $path_to_config$ 8

Inference

tools/dist_test.sh $path_to_config$ $path_to_weight$ 8 --eval bbox
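
The trailing `8` in both commands is presumably the number of GPUs, following the argument order of the mmdet3d launcher scripts; this is an assumption, so check `tools/dist_train.sh` and `tools/dist_test.sh` before relying on it. Under that assumption, the commands for a different GPU count could be assembled as sketched below. The helper names and the placeholder paths are hypothetical.

```python
import shlex


def train_command(config, gpus=8):
    """Build the training command for a config path and an assumed GPU count."""
    return ["tools/dist_train.sh", str(config), str(gpus)]


def inference_command(config, weight, gpus=8, eval_metric="bbox"):
    """Build the evaluation command for a config path and a checkpoint path."""
    return ["tools/dist_test.sh", str(config), str(weight), str(gpus), "--eval", eval_metric]


if __name__ == "__main__":
    # Placeholder paths; substitute a real config file and checkpoint.
    print(shlex.join(train_command("path/to/config.py", gpus=4)))
    print(shlex.join(inference_command("path/to/config.py", "path/to/weight.pth", gpus=4)))
```

The returned lists can be passed directly to `subprocess.run` from the repository root.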

Results

Results on nuScenes validation set.

Config	NDS	mAP	Schedule	FPS	Weights
MEFormer	73.9%	71.5%	6 epochs *	3.1	Google Drive
MEFormer w/o PME	73.7%	71.3%	20 epochs	3.4	Google Drive

FPS is measured with a single NVIDIA A6000 GPU.

* MEFormer with PME should be trained only after MEFormer w/o PME has been trained.
