MonoRCNN is a monocular 3D object detection method for autonomous driving

Last update: Dec 27, 2022

Overview

MonoRCNN

MonoRCNN is a monocular 3D object detection method for autonomous driving, published at ICCV 2021. This project is an implementation of MonoRCNN.

Visualization

Methodology

Installation

Python 3.6
PyTorch 1.5.0
Detectron2 0.1.3
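
As a quick sanity check before building anything, the versions listed above can be compared against what is importable in the current environment. The following is a minimal sketch, not part of the project: the helper names `installed_version`, `find_mismatches`, and `report` are hypothetical, and it assumes each package exposes a `__version__` attribute.

```python
import importlib
import sys

# Versions listed in the requirements above.
REQUIRED = {"torch": "1.5.0", "detectron2": "0.1.3"}
REQUIRED_PYTHON = (3, 6)


def installed_version(module_name):
    """Return a module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", None)


def find_mismatches(installed, required=REQUIRED):
    """Return {name: (found, wanted)} for packages whose version differs.

    A local build suffix such as "1.5.0+cu101" is ignored in the comparison.
    """
    mismatches = {}
    for name, wanted in required.items():
        found = installed.get(name)
        base = found.split("+")[0] if found else None
        if base != wanted:
            mismatches[name] = (found, wanted)
    return mismatches


def report():
    """Print every difference between the environment and the listed versions."""
    if sys.version_info[:2] != REQUIRED_PYTHON:
        print("Python %d.%d found, 3.6 listed" % sys.version_info[:2])
    found = {name: installed_version(name) for name in REQUIRED}
    for name, (have, want) in find_mismatches(found).items():
        print("%s: found %s, listed %s" % (name, have, want))
```

Calling `report()` prints one line per mismatch and nothing when the environment matches the list.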

Please use the copy of Detectron2 included in this project: its build.py, rpn.py, and roi_heads.py have been modified to ignore fully occluded objects during training.

Dataset Preparation

KITTI

Model & Log

KITTI val1 split

Organize the downloaded files as follows:

├── projects
│   ├── MonoRCNN
│   │   ├── output
│   │   │   ├── model
│   │   │   ├── log.txt
│   │   │   ├── ...
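
To confirm that the downloaded files ended up where the tree above expects them, a small check along these lines may help. This is a sketch under assumptions: the helper name `missing_entries` is hypothetical, only the two entries named in the tree are checked, and no assumption is made about whether `model` is a file or a directory.

```python
import os

# Entries shown in the tree above, relative to the repository root.
EXPECTED = [
    os.path.join("projects", "MonoRCNN", "output", "model"),
    os.path.join("projects", "MonoRCNN", "output", "log.txt"),
]


def missing_entries(repo_root, expected=EXPECTED):
    """Return the expected paths that do not exist under repo_root."""
    return [p for p in expected if not os.path.exists(os.path.join(repo_root, p))]
```

For example, `missing_entries(".")` run from the repository root returns an empty list once both entries are in place.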

Test

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1 --resume --eval-only

Set VISUALIZE to True to visualize the 3D object detection results (saved in output/evaluation/test/visualization).

Training

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1
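
The Test and Training commands differ only in the `--resume --eval-only` flags. For scripted runs, that difference could be captured in a thin wrapper like the sketch below. The functions `build_command` and `run` are hypothetical helpers, not part of the project, and the README only shows `--num-gpus 1`, so other values are an untested assumption.

```python
import subprocess

CONFIG = "config/MonoRCNN_KITTI.yaml"


def build_command(eval_only=False, num_gpus=1, config=CONFIG):
    """Build the main.py argument list for training or evaluation."""
    cmd = ["./main.py", "--config-file", config, "--num-gpus", str(num_gpus)]
    if eval_only:
        # Evaluation adds the two flags used in the Test section.
        cmd += ["--resume", "--eval-only"]
    return cmd


def run(eval_only=False, num_gpus=1, cwd="projects/MonoRCNN"):
    """Run main.py from the project directory; raises if it exits non-zero."""
    subprocess.run(build_command(eval_only, num_gpus), cwd=cwd, check=True)
```

Here `run(eval_only=True)` corresponds to the Test command and `run()` to the Training command, both executed from the repository root.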

Citation

If you find this project useful in your research, please cite:

@inproceedings{MonoRCNN_ICCV21,
    title = {Geometry-based Distance Decomposition for Monocular 3D Object Detection},
    author = {Xuepeng Shi and Qi Ye and 
              Xiaozhi Chen and Chuangrong Chen and 
              Zhixiang Chen and Tae-Kyun Kim},
    booktitle = {ICCV},
    year = {2021},
}

Contact

[email protected]
