Back to Event Basics: SSL of Image Reconstruction for Event Cameras

Last update: Dec 26, 2022

Related tags

Overview

Back to Event Basics: SSL of Image Reconstruction for Event Cameras

Minimal code for Back to Event Basics: Self-Supervised Learning of Image Reconstruction for Event Cameras via Photometric Constancy, CVPR'21.

Usage

This project uses Python >= 3.7.3. After setting up your virtual environment, please install the required python libraries through:

pip install -r requirements.txt

Code is formatted with Black (PEP8) using a pre-commit hook. To configure it, run:

pre-commit install

Data format

Similarly to researchers from Monash University, this project processes events through the HDF5 data format. Details about the structure of these files can be found in datasets/tools/.

Inference

Download our pre-trained models from here.

Our HDF5 version of sequences from the Event Camera Dataset can also be downloaded from here for evaluation purposes.

To estimate optical flow from the input events:

python eval_flow.py

To perform image reconstruction from the input events:

python eval_reconstruction.py

In configs/, you can find the configuration files associated to these scripts and vary the inference settings (e.g., number of input events, dataset).

Training

Our framework can be trained using any event camera dataset. However, if you are interested in using our training data, you can download it from here. The datasets are expected at datasets/data/, but this location can be modified in the configuration files.

To train an image reconstruction and optical flow model, you need to adapt the training settings in configs/train_reconstruction.yml. Here, you can choose the training dataset, the number of input events, the neural networks to be used (EV-FlowNet or FireFlowNet for optical flow; E2VID or FireNet for image reconstruction), the number of epochs, the optimizer and learning rate, etc. To start the training from scratch, run:

python train_reconstruction.py

Alternatively, if you have a model that you would like to keep training from, you can use

python train_reconstruction.py --prev_model

This is handy if, for instance, you just want to train the image reconstruction model and use a pre-trained optical flow network. For this, you can set train_flow: False in configs/train_reconstruction.yml, and run:

python train_reconstruction.py --prev_model

If you just want to train an optical flow network, adapt configs/train_flow.yml, and run:

python train_flow.py

Note that we use MLflow to keep track of all the experiments.

Citations

If you use this library in an academic context, please cite the following:

@article{paredes2020back,
  title={Back to Event Basics: Self-Supervised Learning of Image Reconstruction for Event Cameras via Photometric Constancy},
  author={Paredes-Vall{\'e}s, Federico and de Croon, Guido C. H. E.},
  journal={arXiv preprint arXiv:2009.08283},
  year={2020}
}

Acknowledgements

This code borrows from the following open source projects, whom we would like to thank:

Back to Event Basics: SSL of Image Reconstruction for Event Cameras

Related tags

Overview

Back to Event Basics: SSL of Image Reconstruction for Event Cameras

Usage

Data format

Inference

Training

Citations

Acknowledgements

Owner

TU Delft

Solutions of Reinforcement Learning 2nd Edition

Tensorflow Implementation of SMU: SMOOTH ACTIVATION FUNCTION FOR DEEP NETWORKS USING SMOOTHING MAXIMUM TECHNIQUE

Video Matting Refinement For Python

You Only Look Once for Panopitic Driving Perception

Video-based open-world segmentation

Code repository for our paper "Learning to Generate Scene Graph from Natural Language Supervision" in ICCV 2021

Code for Greedy Gradient Ensemble for Visual Question Answering （ICCV 2021, Oral）

Benchmark datasets, data loaders, and evaluators for graph machine learning

An executor that performs image segmentation on fashion items

This repo contains the implementation of YOLOv2 in Keras with Tensorflow backend.

Discord-Protect is a simple discord bot allowing you to have some security on your discord server by ordering a captcha to the user who joins your server.

ncnn is a high-performance neural network inference framework optimized for the mobile platform

This repository compare a selfie with images from identity documents and response if the selfie match.

TensorFlow-based implementation of "Pyramid Scene Parsing Network".

MAVE: : A Product Dataset for Multi-source Attribute Value Extraction

CAMoE + Dual SoftMax Loss (DSL): Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss

Code of paper Interact, Embed, and EnlargE (IEEE): Boosting Modality-specific Representations for Multi-Modal Person Re-identification.

This repository contains the implementation of the HealthGen model, a generative model to synthesize realistic EHR time series data with missingness

To provide 100 JAX exercises over different sections structured as a course or tutorials to teach and learn for beginners, intermediates as well as experts

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.