Welcome to the comma.ai Calibration Challenge!

Your goal is to predict the direction of travel (in camera frame) from provided dashcam video.

This repo provides 10 videos. Every video is 1min long and 20 fps.
5 videos are labeled with a 2D array describing the direction of travel at every frame of the video with a pitch and yaw angle in radians.
5 videos are unlabeled. It is your task to generate the labels for them.
The example labels are generated using a Neural Network, and the labels were confirmed with a SLAM algorithm.
You can estimate the focal length to be 910 pixels.

Context

The devices that run openpilot are not mounted perfectly. The camera is not exactly aligned to the vehicle. There is some pitch and yaw angle between the camera of the device and the vehicle, which can vary between installations. Estimating these angles is essential for accurate control of the vehicle. The best way to start estimating these values is to predict the direction of motion in camera frame. More info can be found in this readme.

Deliverable

Your deliverable is the 5 labels called 5.txt to 9.txt. These labels should be a 2D array that contains the pitch and yaw angles of the direction of travel (in camera frame) of every frame of the respective videos. Zip them up and e-mail it to [email protected].

Evaluation

We will evaluate your mean squared error against our ground truth labels. Errors for frames where the car speed is less than 4m/s will be ignored. Those are also labeled as NaN in the example labels.

This repo includes an eval script that will give an error score (lower is better). You can use it to test your solutions against the labeled examples. We will use this script to evaluate your solution.

Hints

Keep the goal and evaluation script in mind, creative solutions are allowed.
Look at plots of your solutions before submitting.

$500 Prize CLAIMED

The first submission that scores an error under 25% on the unlabeled set, will receive a $500 prize.

The comma.ai Calibration Challenge!

Related tags

Overview

Welcome to the comma.ai Calibration Challenge!

Context

Deliverable

Evaluation

Hints

$500 Prize CLAIMED

Owner

comma.ai

Human annotated noisy labels for CIFAR-10 and CIFAR-100.

Machine Learning Platform for Kubernetes

AdamW optimizer for bfloat16 models in pytorch.

Convert weight file.pth to weight file.blob

Instance-wise Feature Importance in Time (FIT)

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis

Universal Adversarial Triggers for Attacking and Analyzing NLP (EMNLP 2019)

Disagreement-Regularized Imitation Learning

GeneralOCR is open source Optical Character Recognition based on PyTorch.

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

TCNN Temporal convolutional neural network for real-time speech enhancement in the time domain

Tensorflow 2 Object Detection API kurulumu, GPU desteği, custom model hazırlama

Speech Recognition is an important feature in several applications used such as home automation, artificial intelligence

Spherical CNNs

code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"

This is the official repository of XVFI (eXtreme Video Frame Interpolation)

An implementation of IMLE-Net: An Interpretable Multi-level Multi-channel Model for ECG Classification

Python Implementation of Chess Playing AI with variable difficulty

Bilinear attention networks for visual question answering

Analysis of Smiles through reservoir sampling & RDkit