Lightweight stereo matching network based on MobileNetV1 and MobileNetV2

Last update: Nov 30, 2022

Related tags

Overview

MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

This repository contains the code for

2D-MobileStereoNet prediction	Error map

3D-MobileStereoNet prediction	Error map

Installation

Requirements

The code is tested on:

Ubuntu 18.04
Python 3.6
PyTorch 1.4.0
Torchvision 0.5.0
CUDA 10.0

Setting up the environment

conda env create --file mobilestereonet.yml
conda activate mobilestereonet

Training

Set a variable (e.g. DATAPATH) for the dataset directory DATAPATH="/Datasets/SceneFlow/" or DATAPATH="/Datasets/KITTI2015/". Then, you can run the train.py file as below:

Pretraining on SceneFlow

python train.py --dataset sceneflow --datapath $DATAPATH --trainlist ./filenames/sceneflow_train.txt --testlist ./filenames/sceneflow_test.txt --epochs 20 --lrepochs "10,12,14,16:2" --batch_size 8 --test_batch_size 8 --model MSNet2D

Finetuning on KITTI

python train.py --dataset kitti --datapath $DATAPATH --trainlist ./filenames/kitti15_train.txt --testlist ./filenames/kitti15_val.txt --epochs 400 --lrepochs "200:10" --batch_size 8 --test_batch_size 8 --loadckpt ./checkpoints/pretrained.ckpt --model MSNet2D

The arguments in both cases can be set differently depending on the model and the system.

Prediction

The following script creates disparity maps for a specified model:

python prediction.py --datapath $DATAPATH --testlist ./filenames/kitti15_test.txt --loadckpt ./checkpoints/finetuned.ckpt --dataset kitti --colored True --model MSNet2D

Credits

The implementation of this code is based on PSMNet and GwcNet. Also, thanks to Matteo Poggi for the KITTI python utils.

Lightweight stereo matching network based on MobileNetV1 and MobileNetV2

Related tags

Overview

MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

Installation

Requirements

Setting up the environment

Training

Pretraining on SceneFlow

Finetuning on KITTI

Prediction

Credits

License

Owner

Cognitive Systems Research Group

Official code for "EagerMOT: 3D Multi-Object Tracking via Sensor Fusion" [ICRA 2021]

The code for paper Efficiently Solve the Max-cut Problem via a Quantum Qubit Rotation Algorithm

This repository is an official implementation of the paper MOTR: End-to-End Multiple-Object Tracking with TRansformer.

Our solution for SSN Invente 2021's Hackathon

A Real-ESRGAN equipped Colab notebook for CLIP Guided Diffusion

unet-family: Ultimate version

Codes for [NeurIPS'21] You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership.

A solution to the 2D Ising model of ferromagnetism, implemented using the Metropolis algorithm

[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".

NDE: Climate Modeling with Neural Diffusion Equation, ICDM'21

Multi-Objective Loss Balancing for Physics-Informed Deep Learning

Opinionated code formatter, just like Python's black code formatter but for Beancount

"Learning and Analyzing Generation Order for Undirected Sequence Models" in Findings of EMNLP, 2021

This project uses ViT to perform image classification tasks on DATA set CIFAR10.

In this work, we will implement some basic but important algorithm of machine learning step by step.

Stroke-predictions-ml-model - Machine learning model to predict individuals chances of having a stroke

Knowledge Management for Humans using Machine Learning & Tags

[AAAI 2022] Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification

Deployment of PyTorch chatbot with Flask

Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice