A collection of educational notebooks on multi-view geometry and computer vision.

Last update: Dec 09, 2022

Multiview notebooks

This is a collection of educational notebooks on multi-view geometry and computer vision. Subjects covered in these notebooks include:

Camera calibration
Perspective projection
3D point triangulation
Quaternions as 3D pose representation
Perspective-n-point (PnP) algorithm
Levenberg–Marquardt optimization
Epipolar geometry
Relative pose of the second camera from stereo views, using the fundamental matrix
Relative pose of the second camera from stereo views, using a homography
Bundle adjustment
Structure from motion
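
To give a flavor of the first topics in the list, here is a minimal sketch of pinhole perspective projection. It is not taken from the notebooks: the function name `project_point` and the intrinsic values (focal lengths `fx`, `fy` and principal point `cx`, `cy`) are hypothetical, and lens distortion is ignored.

```python
def project_point(point_cam, fx, fy, cx, cy):
    """Project a 3D point given in camera coordinates to pixel coordinates.

    Assumes an ideal pinhole camera without lens distortion.
    """
    x, y, z = point_cam
    if z <= 0:
        raise ValueError("point must lie in front of the camera (z > 0)")
    # Divide by depth (perspective division), then scale and shift
    # by the intrinsics to get pixel coordinates.
    u = fx * x / z + cx
    v = fy * y / z + cy
    return (u, v)


# Hypothetical intrinsics for a 640x480 image.
pixel = project_point((0.5, -0.25, 2.0), fx=800.0, fy=800.0, cx=320.0, cy=240.0)
print(pixel)  # (520.0, 140.0)
```

Camera calibration estimates the intrinsics used here, and triangulation inverts this mapping by combining two or more views of the same point.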

Note: Notebook 5 works but is not yet as tidy as the rest. It covers the Faugeras method for inferring relative pose from a homography.
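
For context, the sketch below shows only the forward relation that such a decomposition inverts, not the Faugeras method itself and not code from the notebook. It assumes normalized image coordinates (identity intrinsics), a plane n·X = d expressed in the first camera frame, and the convention X2 = R·X1 + t, which gives H = R + t·nᵀ/d. Other texts use a different sign convention. All names and values are hypothetical.

```python
import math


def mat_vec(m, v):
    """Multiply a 3x3 matrix (list of rows) by a 3-vector."""
    return [sum(m[i][j] * v[j] for j in range(3)) for i in range(3)]


def plane_homography(R, t, n, d):
    """Homography H = R + t n^T / d induced by the plane n . X = d.

    Assumes n and d are expressed in the first camera frame and
    that the second camera sees X2 = R X1 + t.
    """
    return [[R[i][j] + t[i] * n[j] / d for j in range(3)] for i in range(3)]


def dehomogenize(v):
    """Convert a homogeneous 3-vector to 2D coordinates."""
    return (v[0] / v[2], v[1] / v[2])


# Hypothetical relative pose: 10 degree rotation about the y axis plus a translation.
angle = math.radians(10.0)
R = [[math.cos(angle), 0.0, math.sin(angle)],
     [0.0, 1.0, 0.0],
     [-math.sin(angle), 0.0, math.cos(angle)]]
t = [0.2, 0.0, 0.05]

# Plane z = 4 in the first camera frame.
n = [0.0, 0.0, 1.0]
d = 4.0

X1 = [0.7, -0.3, 4.0]                                # a 3D point on the plane
X2 = [a + b for a, b in zip(mat_vec(R, X1), t)]      # same point in camera 2

x1 = [X1[0] / X1[2], X1[1] / X1[2], 1.0]             # normalized image point, view 1
H = plane_homography(R, t, n, d)

x2_from_H = dehomogenize(mat_vec(H, x1))             # view 2 via the homography
x2_direct = dehomogenize(X2)                         # view 2 via direct projection
print(x2_from_H, x2_direct)
```

Both printed points agree up to floating-point error, which is the property a homography-based pose recovery relies on when it works backwards from H to R, t, and n.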

How to run

The notebooks can be run in the browser by clicking the Binder badge. To run the notebooks locally, I highly recommend using Docker, because the notebooks depend on g2opy and ipyvolume, which are challenging to install.

# Build the Docker image with the notebook environment
docker build -t multiview_notebooks .

# Start JupyterLab, which can then be opened in the browser
docker run -it --rm -p 8888:8888 multiview_notebooks jupyter-lab --ip=0.0.0.0 --port=8888

After starting JupyterLab, the notebooks can be found in the home directory.
For the source of the Dockerfile, see this repository.

Examples of visualizations

For more examples, see this video on YouTube.

Owner

Max