# ViP-DeepLab **Repository Path**: sakura_wz/ViP-DeepLab ## Basic Information - **Project Name**: ViP-DeepLab - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2021-07-07 - **Last Updated**: 2021-07-07 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # ViP-DeepLab ## Introduction In this repository, we present the datasets and the toolkits of [ViP-DeepLab](https://arxiv.org/abs/2012.05258). ViP-DeepLab is a unified model attempting to tackle the long-standing and challenging inverse projection problem in vision, which we model as restoring the point clouds from perspective image sequences while providing each point with instance-level semantic interpretations. Solving this problem requires the vision models to predict the spatial location, semantic class, and temporally consistent instance label for each 3D point. ViP-DeepLab approaches it by jointly performing monocular depth estimation and video panoptic segmentation. We name this joint task as Depth-aware Video Panoptic Segmentation (DVPS), and propose a new evaluation metric along with two derived datasets for it. This repository includes the datasets SemKITTI-DVPS and Cityscapes-DVPS along with the evaluation toolkits. [![Demo](readme_srcs/ViP-DeepLab.gif)](https://youtu.be/XR4HFiwwao0) ## Datasets ### SemKITTI-DVPS SemKITTI-DVPS is derived from [SemanticKITTI](http://semantic-kitti.org/) dataset. SemanticKITTI dataset is based on the odometry dataset of the [KITTI Vision benchmark](http://www.cvlibs.net/datasets/kitti/index.php). SemanticKITTI dataset provides perspective images and panoptic-labeled 3D point clouds. To convert it for DVPS, we project the 3D point clouds onto the image plane and name the derived dataset as SemKITTI-DVPS. SemKITTI-DVPS is distributed under [Creative Commons Attribution-NonCommercial-ShareAlike](https://creativecommons.org/licenses/by-nc-sa/4.0/) license. The dataset and the evaluation toolkit are in the folder `semkitti-dvps`. ![SemKITTI-DVPS example.](readme_srcs/sk_example.png) ### Cityscapes-DVPS Cityscapes-DVPS is derived from [Cityscapes-VPS](https://github.com/mcahny/vps) by adding re-computed depth maps from [Cityscapes](https://www.cityscapes-dataset.com/) dataset. Cityscapes-DVPS is distributed under [Creative Commons Attribution-NonCommercial-ShareAlike](https://creativecommons.org/licenses/by-nc-sa/4.0/) license. The dataset and the evaluation toolkit are in the folder `cityscapes-dvps`. ![Cityscapes-DVPS example.](readme_srcs/cs_example.png) ## Citation If you use the datasets in your research, please cite our project. ```BibTeX @article{vip_deeplab, title={ViP-DeepLab: Learning Visual Perception with Depth-aware Video Panoptic Segmentation}, author={Siyuan Qiao and Yukun Zhu and Hartwig Adam and Alan Yuille and Liang-Chieh Chen}, journal={arXiv preprint arXiv:2012.05258}, year={2020} } ```