Released code for Objects are Different: Flexible Monocular 3D Object Detection, CVPR21

Last update: Dec 06, 2022

Related tags

Deep Learning MonoFlex

Overview

MonoFlex

Released code for Objects are Different: Flexible Monocular 3D Object Detection, CVPR21.

Work in progress.

Installation

This repo is tested with Ubuntu 20.04, python==3.7, pytorch==1.4.0 and cuda==10.1

conda create -n monoflex python=3.7

conda activate monoflex

Install PyTorch and other dependencies:

conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit=10.1 -c pytorch

pip install -r requirements.txt

Build DCNv2 and the project

cd models/backbone/DCNv2

. make.sh

cd ../../..

python setup develop

Data Preparation

Please download KITTI dataset and organize the data as follows:

#ROOT		
  |training/
    |calib/
    |image_2/
    |label/
    |ImageSets/
  |testing/
    |calib/
    |image_2/
    |ImageSets/

Then modify the paths in config/paths_catalog.py according to your data path.

Training & Evaluation

Training with one GPU. (TODO: The multi-GPU training will be further tested.)

CUDA_VISIBLE_DEVICES=0 python tools/plain_train_net.py --batch_size 8 --config runs/monoflex.yaml --output output/exp

The model will be evaluated periodically (can be adjusted in the CONFIG) during training and you can also evaluate a checkpoint with

CUDA_VISIBLE_DEVICES=0 python tools/plain_train_net.py --config runs/monoflex.yaml --ckpt YOUR_CKPT  --eval

You can also specify --vis when evaluation to visualize the predicted heatmap and 3D bounding boxes. The pretrained model for train/val split and logs are here.

Note: we observe an obvious variation of the performance for different runs and we are still investigating possible solutions to stablize the results, though it may inevitably due to the utilized uncertainties.

Citation

If you find our work useful in your research, please consider citing:

@InProceedings{MonoFlex,
    author    = {Zhang, Yunpeng and Lu, Jiwen and Zhou, Jie},
    title     = {Objects Are Different: Flexible Monocular 3D Object Detection},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {3289-3298}
}

Acknowlegment

The code is heavily borrowed from SMOKE and thanks for their contribution.

Released code for Objects are Different: Flexible Monocular 3D Object Detection, CVPR21

Related tags

Overview

MonoFlex

Installation

Data Preparation

Training & Evaluation

Citation

Acknowlegment

Owner

Yunpeng

Face Mask Detection on Image and Video using tensorflow and keras

3D HourGlass Networks for Human Pose Estimation Through Videos

UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.

Generic image compressor for machine learning. Pytorch code for our paper "Lossy compression for lossless prediction".

Async API for controlling Hue Lights

Boundary-preserving Mask R-CNN (ECCV 2020)

Extracting knowledge graphs from language models as a diagnostic benchmark of model performance.

Keras implementation of the GNM model in paper ’Graph-Based Semi-Supervised Learning with Nonignorable Nonresponses‘

Jupyter notebooks for the code samples of the book "Deep Learning with Python"

Lightweight, Python library for fast and reproducible experimentation :microscope:

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

[ICCV 2021] Self-supervised Monocular Depth Estimation for All Day Images using Domain Separation

fastgradio is a python library to quickly build and share gradio interfaces of your trained fastai models.

A curated list of awesome Machine Learning frameworks, libraries and software.

Deep generative modeling for time-stamped heterogeneous data, enabling high-fidelity models for a large variety of spatio-temporal domains.

HDMapNet: A Local Semantic Map Learning and Evaluation Framework

GraphLily: A Graph Linear Algebra Overlay on HBM-Equipped FPGAs

The repository contains reproducible PyTorch source code of our paper Generative Modeling with Optimal Transport Maps, ICLR 2022.

Keyword2Text This repository contains the code of the paper: "A Plug-and-Play Method for Controlled Text Generation"

BEAS: Blockchain Enabled Asynchronous & Secure Federated Machine Learning