Object Depth via Motion and Detection Dataset

Last update: Dec 21, 2022

ODMD Dataset

ODMD is the first dataset for learning Object Depth via Motion and Detection. ODMD training data are configurable and extensible, with each training example consisting of a series of object detection bounding boxes, camera movement distances, and ground truth object depth. As a benchmark evaluation, we provide four ODMD validation and test sets with 21,600 examples in multiple domains, and we also convert 15,650 examples from the ODMS benchmark for use with detection. In our paper, we use a single ODMD-trained network with object detection or segmentation to achieve state-of-the-art results on existing driving and robotics benchmarks and to estimate object depth from a camera phone, demonstrating that ODMD is a viable tool for monocular depth estimation in a variety of mobile applications.
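
To make the structure of a training example concrete, the sketch below generates one synthetic example under an ideal pinhole camera that moves straight toward a centered object. Everything here is an assumption for illustration: the `DepthExample` container, its field names, the `make_example` helper, the value ranges, and the image size are hypothetical and are not the ODMD API or the generator in `demo_datagen.py`.

```python
import random
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class DepthExample:
    """Hypothetical container for one example (not the ODMD API)."""
    # One (x_min, y_min, x_max, y_max) box per observation, in pixels.
    boxes: List[Tuple[float, float, float, float]]
    # Camera distance from its final position along the optical axis, in meters.
    camera_moves: List[float]
    # Ground truth object depth at the final observation, in meters.
    depth: float


def make_example(n_obs: int = 10, focal_px: float = 600.0,
                 rng: Optional[random.Random] = None) -> DepthExample:
    """Generate one synthetic example with an ideal pinhole camera (assumed model)."""
    rng = rng or random.Random()
    width_m = rng.uniform(0.05, 0.5)      # assumed physical object width
    height_m = rng.uniform(0.05, 0.5)     # assumed physical object height
    final_depth = rng.uniform(0.3, 1.0)   # depth at the last observation
    total_move = rng.uniform(0.1, 0.5)    # total approach distance

    # The camera starts farthest away and approaches; the last move is 0.
    moves = [total_move * (1 - i / (n_obs - 1)) for i in range(n_obs)]

    cx, cy = 320.0, 240.0                 # assumed image center (640x480 image)
    boxes = []
    for d in moves:
        z = final_depth + d               # object depth at this observation
        w = focal_px * width_m / z        # projected box width in pixels
        h = focal_px * height_m / z       # projected box height in pixels
        boxes.append((cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2))
    return DepthExample(boxes, moves, final_depth)


example = make_example(rng=random.Random(0))
print(len(example.boxes), example.camera_moves[-1], round(example.depth, 3))
```

The bounding boxes grow as the camera approaches, which is the cue a model can combine with the known camera movement to recover depth.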

Contact: Brent Griffin (griffb at umich dot edu)

Depth results using a camera phone.

Using ODMD

Run ./demo/demo_datagen.py to generate random ODMD data to train or test your model.
Example data generation and camera configurations are provided in the ./config/ folder. demo_datagen.py has the option to save data into a static dataset for repeated use.
[native Python]

Run ./demo/demo_dataset_eval.py to evaluate your model on the ODMD validation and test sets.
demo_dataset_eval.py has an example evaluation for the Box_LS baseline and instructions for using our detection-based version of ODMS. Results are saved in the ./results/ folder.
[native Python]
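
For intuition about what a least-squares box baseline can look like, here is a minimal sketch. It assumes an ideal pinhole camera moving along the optical axis, so that box width times depth is constant: `w_i * (z + d_i) = c`, where `z` is the depth at the final observation and `d_i` is the camera's distance from its final position. The function names `depth_from_box_widths` and `percent_error` are hypothetical, and this is not the repository's Box_LS code.

```python
def depth_from_box_widths(widths, camera_moves):
    """Least-squares depth at the final camera position (illustrative sketch).

    Solves w_i * z - c = -w_i * d_i for the unknowns (z, c) using the
    2x2 normal equations, written out in closed form.
    """
    n = len(widths)
    if n < 2 or n != len(camera_moves):
        raise ValueError("need at least two observations of equal length")

    s_w = sum(widths)
    s_ww = sum(w * w for w in widths)
    s_wd = sum(w * d for w, d in zip(widths, camera_moves))
    s_wwd = sum(w * w * d for w, d in zip(widths, camera_moves))

    det = n * s_ww - s_w * s_w
    if abs(det) < 1e-12:
        # Identical widths mean no scale change, so depth is unobservable.
        raise ValueError("box widths do not change across observations")
    return (s_w * s_wd - n * s_wwd) / det


def percent_error(estimate, truth):
    """Absolute percent error of a depth estimate."""
    return 100.0 * abs(estimate - truth) / truth


# Noise-free example: true depth 0.5 m, focal length times object width = 60.
true_depth = 0.5
moves = [0.3, 0.2, 0.1, 0.0]
widths = [60.0 / (true_depth + d) for d in moves]
estimate = depth_from_box_widths(widths, moves)
print(estimate, percent_error(estimate, true_depth))
```

On noise-free inputs this kind of solve recovers the depth up to floating-point error, but it has no mechanism for absorbing errors in the camera movement or the detections.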

Benchmark

Method	Normal	Perturb Camera	Perturb Detect	Robot	All
DBox	1.73	2.45	2.54	11.17	4.47
DBox_Abs	1.11	2.05	1.75	13.29	4.55
Box_LS	0.00	4.47	21.60	21.23	11.83

Is your technique missing although it's published and the code is public? Let us know and we'll add it.
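
As a quick arithmetic check on the table above, the reported All column is consistent with the unweighted mean of the four test sets (Normal, Perturb Camera, Perturb Detect, Robot). The sketch below only re-derives that mean from the printed values; that All is defined this way is an inference from the numbers, not something stated in this README.

```python
# Values copied from the benchmark table: four test sets, then reported All.
results = {
    "DBox": {"sets": [1.73, 2.45, 2.54, 11.17], "all": 4.47},
    "DBox_Abs": {"sets": [1.11, 2.05, 1.75, 13.29], "all": 4.55},
    "Box_LS": {"sets": [0.00, 4.47, 21.60, 21.23], "all": 11.83},
}


def mean(values):
    """Unweighted arithmetic mean."""
    return sum(values) / len(values)


for name, row in results.items():
    gap = abs(mean(row["sets"]) - row["all"])
    # Inputs are rounded to two decimals, so allow a small tolerance.
    print(f"{name}: mean of four sets = {mean(row['sets']):.4f}, "
          f"reported All = {row['all']}, within tolerance = {gap < 0.01}")
```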

Using DBox Method

Run ./demo/demo_dataset_DBox_train.py to train your own DBox model using ODMD.
Run ./demo/demo_dataset_DBox_eval.py after training to evaluate your DBox model.
Example training and DBox model configurations are provided in the ./config/ folder. Models are saved in the ./results/model/ folder.
[native Python, has Torch dependency]

Publication

Please cite our paper if you find it useful for your research.

@inproceedings{GrCoCVPR21,
  author = {Griffin, Brent A. and Corso, Jason J.},
  booktitle={The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  title = {Depth from Camera Motion and Object Detection},
  year = {2021}
}

CVPR 2021 supplementary video: https://youtu.be/GruhbdJ2l7k

Use

This code is available for non-commercial research purposes only.

Owner

Brent Griffin
