# MoVis
**Repository Path**: Raiden_cn/MoVis
## Basic Information
- **Project Name**: MoVis
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2025-03-09
- **Last Updated**: 2025-03-09
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
# MoVis: When 3D Object Detection is Like Human Monocular Vision
More demo videos can be downloaded [here](https://drive.google.com/file/d/1a45uTuUwVgAZH81JWi0q_8Cav6q7HOom/view?usp=drive_link)
## :newspaper:News
- **[2024/11/12]** We updated the [PyTorch 2.5.1](https://pytorch.org/) environment with the latest cuda release and uploaded the full MoVis code :rocket: .
- **[2024/9/4]** We uploaded an intermediate version of the MoVis code :smile:.
- **[2024/3/28]** MoVis Project Creation :sunglasses:.
## :star:Overview

- MoVis is designed based on the way human monocular vision perceives 3D objects. Spatial Relation Encoder (SRE) aims to decouple the interaction between features. Object-level depth modulator (ODM) obtains high-precision depth information by color sequence. The spatial Context Processor (SCP) decodes the different features.
- Extensive experiments on KITTI and Rope3D demonstrate the state-of-the-art performance of our MoVis.
## :dart:Model Zoo
| Method |
AP3D|R40|IoU>0.7 |
APBEV|R40|IoU>0.7 |
Download |
| Easy |
Mod. |
Hard |
Easy |
Mod. |
Hard |
| MoVis |
28.02 |
20.80 |
17.73 |
37.56 |
27.28 |
23.55 |
model / log |
| 28.51 |
20.77 |
17.65 |
37.84 |
27.04 |
23.32 |
model |
| 28.33 |
20.75 |
17.58 |
37.03 |
26.77 |
23.01 |
model |
## :see_no_evil:Results
## :computer:Installation
**Step 1**: Clone this project and create a conda environment:
```shell
git clone https://github.com/KotlinWang/MoVis.git
cd MoVis
conda create -n movis python=3.11
conda activate movis
```
**Step 2**: Install pytorch and torchvision matching your CUDA version:
```shell
conda install pytorch torchvision cudatoolkit
# We adopt torch 2.5.1+cu121
```
**Step 3**: Install requirements and compile the deformable attention:
```shell
pip install -r requirements.txt
cd lib/models/movis/ops/
bash make.sh
cd ../../../..
```
**Step 4**: Make dictionary for saving training losses:
```shell
mkdir logs
```
## :satellite:Preparing Dataset
Download [KITTI](http://www.cvlibs.net/datasets/kitti/eval_object.php?obj_benchmark=3d) datasets and prepare the directory structure as:
```
│MonoDETR/
├──...
├──data/KITTIDataset/
│ ├──ImageSets/
│ ├──training/
│ ├──testing/
├──...
```
You can also change the data path at "dataset/root_dir" in `configs/movis.yaml`.
Download [Rope3D](http://www.cvlibs.net/datasets/kitti/eval_object.php?obj_benchmark=3d) datasets and convert it to KITTI format via [DAIR-V2X](https://github.com/destinyls/DAIR-V2X).
## :running: Run
### Train
You can modify the settings of models and training in `configs/movis.yaml` and indicate the GPU in `train.sh`:
```shell
bash train.sh configs/movis.yaml movis
```
### Test
The best checkpoint will be evaluated as default. You can change it at "tester/checkpoint" in `configs/movis.yaml`:
```shell
bash test.sh configs/movis.yaml checkpoint_best
```
## :mag: Related Projects
Our code is based on [MonoDETR](https://github.com/ZrrSkywalker/MonoDETR) and [RT-DETR](https://github.com/lyuwenyu/RT-DETR).
## Citation
If you find this project useful in your research, please consider citing:
```
```