# MoVis **Repository Path**: Raiden_cn/MoVis ## Basic Information - **Project Name**: MoVis - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-03-09 - **Last Updated**: 2025-03-09 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # MoVis: When 3D Object Detection is Like Human Monocular Vision

More demo videos can be downloaded [here](https://drive.google.com/file/d/1a45uTuUwVgAZH81JWi0q_8Cav6q7HOom/view?usp=drive_link) ## :newspaper:News - **[2024/11/12]** We updated the [PyTorch 2.5.1](https://pytorch.org/) environment with the latest cuda release and uploaded the full MoVis code :rocket: . - **[2024/9/4]** We uploaded an intermediate version of the MoVis code :smile:. - **[2024/3/28]** MoVis Project Creation :sunglasses:. ## :star:Overview ![overview](./assets/overview.jpg) - MoVis is designed based on the way human monocular vision perceives 3D objects. Spatial Relation Encoder (SRE) aims to decouple the interaction between features. Object-level depth modulator (ODM) obtains high-precision depth information by color sequence. The spatial Context Processor (SCP) decodes the different features. - Extensive experiments on KITTI and Rope3D demonstrate the state-of-the-art performance of our MoVis. ## :dart:Model Zoo

Method	AP_{3D\|R40\|IoU>0.7}			AP_{BEV\|R40\|IoU>0.7}			Download
Method	Easy	Mod.	Hard	Easy	Mod.	Hard	Download
MoVis	28.02	20.80	17.73	37.56	27.28	23.55	model / log
	28.51	20.77	17.65	37.84	27.04	23.32	model
	28.33	20.75	17.58	37.03	26.77	23.01	model

## :see_no_evil:Results

## :computer:Installation

**Step 1**: Clone this project and create a conda environment: ```shell git clone https://github.com/KotlinWang/MoVis.git cd MoVis conda create -n movis python=3.11 conda activate movis ``` **Step 2**: Install pytorch and torchvision matching your CUDA version: ```shell conda install pytorch torchvision cudatoolkit # We adopt torch 2.5.1+cu121 ``` **Step 3**: Install requirements and compile the deformable attention: ```shell pip install -r requirements.txt cd lib/models/movis/ops/ bash make.sh cd ../../../.. ``` **Step 4**: Make dictionary for saving training losses: ```shell mkdir logs ```

## :satellite:Preparing Dataset

Download [KITTI](http://www.cvlibs.net/datasets/kitti/eval_object.php?obj_benchmark=3d) datasets and prepare the directory structure as: ``` │MonoDETR/ ├──... ├──data/KITTIDataset/ │ ├──ImageSets/ │ ├──training/ │ ├──testing/ ├──... ``` You can also change the data path at "dataset/root_dir" in `configs/movis.yaml`. Download [Rope3D](http://www.cvlibs.net/datasets/kitti/eval_object.php?obj_benchmark=3d) datasets and convert it to KITTI format via [DAIR-V2X](https://github.com/destinyls/DAIR-V2X).

## :running: Run ### Train You can modify the settings of models and training in `configs/movis.yaml` and indicate the GPU in `train.sh`: ```shell bash train.sh configs/movis.yaml movis ``` ### Test The best checkpoint will be evaluated as default. You can change it at "tester/checkpoint" in `configs/movis.yaml`: ```shell bash test.sh configs/movis.yaml checkpoint_best ``` ## :mag: Related Projects Our code is based on [MonoDETR](https://github.com/ZrrSkywalker/MonoDETR) and [RT-DETR](https://github.com/lyuwenyu/RT-DETR). ## Citation If you find this project useful in your research, please consider citing： ``` ```