a dnn ai project to classify which food people are eating on audio recordings

Last update: Oct 24, 2021

Related tags

Overview

Deep Learning - EAT Challenge

About

This project is part of an AI challenge of the DeepLearning course 2021 at the University of Augsburg. The objective to be learned is a classification task telling which food people are eating on audio recordings.

Students

This project was created by:

Benjamin Möckl
Julian Göser
Marco Tröster

EAT Dataset Setup

For your convenience, the download of all external project assets (dataset and evaluation metrics) has been automated by a shell script. After executing the script you should be ready to run / develop the project code.

# download and unpack the dataset and metric files
./init_dataset_and_metrics.sh <dataset zip password>

How to Run

First, cache the input dataset as TFRecord files for a training session (e.g. naive training). This should massively improve your training performance (especially with low CPU / GPU resources).

# cache the preprocessed audio dataset as TFRecord file
python src/main.py preprocess_dataset naive

Now, you can launch a training session (e.g. naive training).

# process a training session
python src/main.py run_training naive

After that you can sample all inputs of the unknown test dataset using a trained model and export the prediction results for EAT challenge submission.

# evaluate the results for submission
python src/main.py eval_results naive

Valid training configurations are:

naive
noisy
autoenc
amplitude

Remark: Use a GPU empowered machine for amplitude training (although it won't be too rewarding anyways). Tested on Ubuntu 20.04. For running on Windows, the keras ModelCheckpoint Callback has to be switched to our SaveBestAccuracyCallback.

Training Results

Training	Approach Description	Test Acc.	Real Acc.
Naive	Train on audio melspectrograms using Conv2D	0.41	0.36
Noisy	Train on audio melspectrograms using custom noisy Conv2D	0.44	0.39
Amplitude	Train on audio amplitude using Conv1D	0.23	?.??
AutoEnc	Train on audio melspectrograms using an Auto Encoder	0.25	?.??

a dnn ai project to classify which food people are eating on audio recordings

Related tags

Overview

Deep Learning - EAT Challenge

About

Students

EAT Dataset Setup

How to Run

Training Results

Owner

Marco Tröster

Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis

Official Pytorch implementation of the paper "Action-Conditioned 3D Human Motion Synthesis with Transformer VAE", ICCV 2021

RealTime Emotion Recognizer for Machine Learning Study Jam's demo

Official PyTorch implementation of "Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition" in AAAI2022.

It's final year project of Diploma Engineering. This project is based on Computer Vision.

A `Neural = Symbolic` framework for sound and complete weighted real-value logic

PyTorch implementation of the Transformer in Post-LN (Post-LayerNorm) and Pre-LN (Pre-LayerNorm).

A tensorflow implementation of GCN-LPA

Next-gen Rowhammer fuzzer that uses non-uniform, frequency-based patterns.

Joint learning of images and text via maximization of mutual information

Deep Learning for Human Part Discovery in Images - Chainer implementation

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

Introducing neural networks to predict stock prices

TCube generates rich and fluent narratives that describes the characteristics, trends, and anomalies of any time-series data (domain-agnostic) using the transfer learning capabilities of PLMs.

Session-based Recommendation, CoHHN, price preferences, interest preferences, Heterogeneous Hypergraph, Co-guided Learning, SIGIR2022

ExCon: Explanation-driven Supervised Contrastive Learning

This repo is to present various code demos on how to use our Graph4NLP library.

Official code for 'Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentationon Complex Urban Driving Scenes'

Code for the paper BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural Networks

Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly