This tool uses Deep Learning to help you draw and write with your hand and webcam.

Last update: Dec 10, 2022

Related tags

Overview

air-drawing 👆

This tool uses Deep Learning to help you draw and write with your hand and webcam. A Deep Learning model is used to try to predict whether you want to have 'pencil up' or 'pencil down'.

Try it online : loicmagne.github.io/air-drawing

Technical Details

This pipeline is made up of two steps: detecting the hand, and predicting the drawing. Both steps are done using Deep Learning.
The handpose detection is performed using MediaPipe toolbox
The drawing prediction part uses only the finger position, not the image. The input is a sequence of 2D points (actually i'm using the speed and acceleration of the finger instead of the position to make the prediction translation-invariant), and the output is a binary classification 'pencil up' or 'pencil down'. I used a simple bidirectionnal LSTM architecture. I made a small dataset myself (~50 samples) which I annotated thanks to tools provided in the python-stuff/data-wrangling/. At first I wanted to make the 'pencil up'/'pencil down' prediction in real-time, i.e. make the predictions at the same time the user draws. However this task was too difficult and I had poor results, which is why I'm now using bidirectionnal LSTM. You can find details of the deep learning pipeline in the jupyter-notebook in python-stuff/deep-learning/
The application is entirely client-side. I deployed the deep learning model by converting the PyTorch model to .onnx, and then using the ONNX Runtime which is very convenient and compatible with a lot of layers.

Going Forward

Overall the pipeline still struggles and needs some improvement. Ideas of amelioration include :

Having a bigger dataset, with more diverse user data.
Process and smooth the finger signal, to be less dependent on camera quality, and to improve model generalization.

This tool uses Deep Learning to help you draw and write with your hand and webcam.

Related tags

Overview

air-drawing 👆

Technical Details

Going Forward

Owner

lmagne

An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.

Code for "Learning From Multiple Experts: Self-paced Knowledge Distillation for Long-tailed Classification", ECCV 2020 Spotlight

Wide Residual Networks (WideResNets) in PyTorch

Source code for "Pack Together: Entity and Relation Extraction with Levitated Marker"

This repository contains the map content ontology used in narrative cartography

This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX

Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch

Code for the paper: Sketch Your Own GAN

RLDS stands for Reinforcement Learning Datasets

deep_image_prior_extension

CenterNet:Objects as Points目标检测模型在Pytorch当中的实现

Implementation of "Meta-rPPG: Remote Heart Rate Estimation Using a Transductive Meta-Learner"

EvDistill: Asynchronous Events to End-task Learning via Bidirectional Reconstruction-guided Cross-modal Knowledge Distillation (CVPR'21)

Algorithmic Trading using RNN

Code release for "Making a Bird AI Expert Work for You and Me".

SimplEx - Explaining Latent Representations with a Corpus of Examples

Learning Open-World Object Proposals without Learning to Classify

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Computational Pathology Toolbox developed by TIA Centre, University of Warwick.

Code to replicate the key results from Exploring the Limits of Out-of-Distribution Detection