This is a Deep Leaning API for classifying emotions from human face and human audios.

Last update: Oct 02, 2022

Overview

Emotion AI

This is a Deep Leaning API for classifying emotions from human face and human audios.

Starting the server

To start the server first you need to install all the packages used by running the following command:

pip install -r requirements.txt
# make sure your current directory is "server"

After that you can start the server by running the following commands:

change the directory from server to api:

cd api

run the app.py

python app.py

The server will start at a default PORT of 3001 which you can configure in the api/app.py on the Config class:

class AppConfig:
    PORT = 3001
    DEBUG = False

If everything went well you will be able to make api request to the server.

EmotionAI

Consist of two parallel models that are trained with different model architectures to save different task. The one is for audio classification and the other is for facial emotion classfication. Each model is served on a different endpoint but on the same server.

Audio Classification

Sending an audio file to the server at http://127.0.0.1:3001/api/classify/audio using the POST method we will be able to get the data that looks as follows as the json response from the server:

{
  "predictions": {
    "emotion": { "class": "sad", "label": 3, "probability": 0.22 },
    "emotion_intensity": { "class": "normal", "label": 0, "probability": 0.85 },
    "gender": { "class": "male", "label": 0, "probability": 1.0 }
  },
  "success": true
}

Classifying audios

Using cURL

To classify the audio using cURL make sure that you open the command prompt where the audio files are located for example in my case the audios are located in the audios folder so i open the command prompt in the audios folder or else i will provide the absolute path when making a cURL request for example

curl -X POST -F [email protected] http://127.0.0.1:3001/api/classify/audio

If everything went well we will get the following response from the server:

{
  "predictions": {
    "emotion": { "class": "sad", "label": 3, "probability": 0.22 },
    "emotion_intensity": { "class": "normal", "label": 0, "probability": 0.85 },
    "gender": { "class": "male", "label": 0, "probability": 1.0 }
  },
  "success": true
}

Using Postman client

To make this request with postman we do it as follows:

Change the request method to POST at http://127.0.0.1:3001/api/classify/audio
Click on form-data
Select type to be file on the KEY attribute
For the KEY type audio and select the audio you want to predict under value Click send
If everything went well you will get the following response depending on the audio you have selected:

{
  "predictions": {
    "emotion": { "class": "sad", "label": 3, "probability": 0.22 },
    "emotion_intensity": { "class": "normal", "label": 0, "probability": 0.85 },
    "gender": { "class": "male", "label": 0, "probability": 1.0 }
  },
  "success": true
}

Using JavaScript fetch api.
First you need to get the input from html
Create a formData object
make a POST requests

res.json()) .then((data) => console.log(data));">

const input = document.getElementById("input").files[0];
let formData = new FormData();
formData.append("audio", input);
fetch("http://127.0.0.1:3001/api/classify/audio", {
  method: "POST",
  body: formData,
})
  .then((res) => res.json())
  .then((data) => console.log(data));

If everything went well you will be able to get expected response.

{
  "predictions": {
    "emotion": { "class": "sad", "label": 3, "probability": 0.22 },
    "emotion_intensity": { "class": "normal", "label": 0, "probability": 0.85 },
    "gender": { "class": "male", "label": 0, "probability": 1.0 }
  },
  "success": true
}

Notebooks

If you want to see how the models were trained you can open the respective notebooks:

Audio Classification

This is a Deep Leaning API for classifying emotions from human face and human audios.

Related tags

Overview

Emotion AI

Starting the server

EmotionAI

Audio Classification

Classifying audios

Notebooks

Owner

crispengari

TensorFlow CNN for fast style transfer

机器学习、深度学习、自然语言处理等人工智能基础知识总结。

yolov5 deepsort 行人车辆跟踪检测计数

Unsupervised Representation Learning by Invariance Propagation

This is the solution for 2nd rank in Kaggle competition: Feedback Prize - Evaluating Student Writing.

An LSTM based GAN for Human motion synthesis

An implementation of "Learning human behaviors from motion capture by adversarial imitation"

Development Kit for the SoccerNet Challenge

Code for Understanding Pooling in Graph Neural Networks

ManipulaTHOR, a framework that facilitates visual manipulation of objects using a robotic arm

Learning Features with Parameter-Free Layers (ICLR 2022)

Code for ICE-BeeM paper - NeurIPS 2020

Rocket-recycling with Reinforcement Learning

Code for the paper "Multi-task problems are not multi-objective"

Simple improvement of VQVAE that allow to generate x2 sized images compared to baseline

Github Traffic Insights as Prometheus metrics.

Pointer-generator - Code for the ACL 2017 paper Get To The Point: Summarization with Pointer-Generator Networks

Official code for the paper "Self-Supervised Prototypical Transfer Learning for Few-Shot Classification"

[ICCV 2021] A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation

Tools for computational pathology

This is a Deep Leaning API for classifying emotions from human face and human audios.

Related tags

Overview

Emotion AI

Starting the server

EmotionAI

Audio Classification

Classifying audios

Notebooks

Owner

crispengari

TensorFlow CNN for fast style transfer

机器学习、深度学习、自然语言处理等人工智能基础知识总结。

yolov5 deepsort 行人 车辆 跟踪 检测 计数

Unsupervised Representation Learning by Invariance Propagation

This is the solution for 2nd rank in Kaggle competition: Feedback Prize - Evaluating Student Writing.

An LSTM based GAN for Human motion synthesis

An implementation of "Learning human behaviors from motion capture by adversarial imitation"

Development Kit for the SoccerNet Challenge

Code for Understanding Pooling in Graph Neural Networks

ManipulaTHOR, a framework that facilitates visual manipulation of objects using a robotic arm

Learning Features with Parameter-Free Layers (ICLR 2022)

Code for ICE-BeeM paper - NeurIPS 2020

Rocket-recycling with Reinforcement Learning

Code for the paper "Multi-task problems are not multi-objective"

Simple improvement of VQVAE that allow to generate x2 sized images compared to baseline

Github Traffic Insights as Prometheus metrics.

Pointer-generator - Code for the ACL 2017 paper Get To The Point: Summarization with Pointer-Generator Networks

Official code for the paper "Self-Supervised Prototypical Transfer Learning for Few-Shot Classification"

[ICCV 2021] A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation

Tools for computational pathology

yolov5 deepsort 行人车辆跟踪检测计数