# MoVis **Repository Path**: Raiden_cn/MoVis ## Basic Information - **Project Name**: MoVis - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-03-09 - **Last Updated**: 2025-03-09 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # MoVis: When 3D Object Detection is Like Human Monocular Vision
More demo videos can be downloaded [here](https://drive.google.com/file/d/1a45uTuUwVgAZH81JWi0q_8Cav6q7HOom/view?usp=drive_link) ## :newspaper:News - **[2024/11/12]** We updated the [PyTorch 2.5.1](https://pytorch.org/) environment with the latest cuda release and uploaded the full MoVis code :rocket: . - **[2024/9/4]** We uploaded an intermediate version of the MoVis code :smile:. - **[2024/3/28]** MoVis Project Creation :sunglasses:. ## :star:Overview ![overview](./assets/overview.jpg) - MoVis is designed based on the way human monocular vision perceives 3D objects. Spatial Relation Encoder (SRE) aims to decouple the interaction between features. Object-level depth modulator (ODM) obtains high-precision depth information by color sequence. The spatial Context Processor (SCP) decodes the different features. - Extensive experiments on KITTI and Rope3D demonstrate the state-of-the-art performance of our MoVis. ## :dart:Model Zoo
Method AP3D|R40|IoU>0.7 APBEV|R40|IoU>0.7 Download
Easy Mod. Hard Easy Mod. Hard
MoVis 28.02 20.80 17.73 37.56 27.28 23.55 model / log
28.51 20.77 17.65 37.84 27.04 23.32 model
28.33 20.75 17.58 37.03 26.77 23.01 model
## :see_no_evil:Results
## :computer:Installation
**Step 1**: Clone this project and create a conda environment: ```shell git clone https://github.com/KotlinWang/MoVis.git cd MoVis conda create -n movis python=3.11 conda activate movis ``` **Step 2**: Install pytorch and torchvision matching your CUDA version: ```shell conda install pytorch torchvision cudatoolkit # We adopt torch 2.5.1+cu121 ``` **Step 3**: Install requirements and compile the deformable attention: ```shell pip install -r requirements.txt cd lib/models/movis/ops/ bash make.sh cd ../../../.. ``` **Step 4**: Make dictionary for saving training losses: ```shell mkdir logs ```
## :satellite:Preparing Dataset
Download [KITTI](http://www.cvlibs.net/datasets/kitti/eval_object.php?obj_benchmark=3d) datasets and prepare the directory structure as: ``` │MonoDETR/ ├──... ├──data/KITTIDataset/ │ ├──ImageSets/ │ ├──training/ │ ├──testing/ ├──... ``` You can also change the data path at "dataset/root_dir" in `configs/movis.yaml`. Download [Rope3D](http://www.cvlibs.net/datasets/kitti/eval_object.php?obj_benchmark=3d) datasets and convert it to KITTI format via [DAIR-V2X](https://github.com/destinyls/DAIR-V2X).
## :running: Run ### Train You can modify the settings of models and training in `configs/movis.yaml` and indicate the GPU in `train.sh`: ```shell bash train.sh configs/movis.yaml movis ``` ### Test The best checkpoint will be evaluated as default. You can change it at "tester/checkpoint" in `configs/movis.yaml`: ```shell bash test.sh configs/movis.yaml checkpoint_best ``` ## :mag: Related Projects Our code is based on [MonoDETR](https://github.com/ZrrSkywalker/MonoDETR) and [RT-DETR](https://github.com/lyuwenyu/RT-DETR). ## Citation If you find this project useful in your research, please consider citing: ``` ```