Starred repositories
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Code, scripts and data to re-produce the results published in the paper "SelfVIO: Self-Supervised Deep Monocular Visual-Inertial Odometry and Depth Estimation"."
This repository contains the code for the paper "DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium", a self-supervised dept…
Code for ICRA2023 paper "NeRF-Loc: Visual Localization with Conditional Neural Radiance Field"
Code for Monocular Visual-Inertial Depth Estimation (ICRA 2023)
PyTorch Implementation of introducing diffusion approach to 3D depth perception ECCV 2024
3D Object Detection for Autonomous Driving: A Comprehensive Survey (IJCV 2023)
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
[ECCV 2022]JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes
[CVPR 2022 Oral, Best Student Paper] EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation
Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion (CVPR 2022, Oral)
This repo contains the projects: 'Virtual Normal', 'DiverseDepth', and '3D Scene Shape'. They aim to solve the monocular depth estimation, 3D scene reconstruction from single image problems.
OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving
visual attention network based monocular depth estimation
A resource repository for 3D machine learning
A simple and powerful web application framework.
[CVPR'22] NICE-SLAM: Neural Implicit Scalable Encoding for SLAM
Monocular Depth Estimation Toolbox based on MMSegmentation.
[CoRL 2022] SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation
Cross-Modal Unsupervised Domain Adaptationfor 3D Semantic Segmentation
Fisheye or Normal Camera Intrinsic and Extrinsic Calibration. Surround Camera Bird Eye View Generator.
Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals