Replication Package for AequeVox: Automated Fairness Testing for Speech Recognition Systems

Last update: Aug 28, 2022

Overview

AequeVox

README under development.

Python Packages Required

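numpy
scipy
librosa
nltk

The following modules are also used but ship with the Python standard library, so they need no separate installation:

math
random
time
json
threading
re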

ASR Specific Packages

Google Cloud

speech
storage

Microsoft Azure

azure.cognitiveservices.speech

IBM Cloud

ibm_watson
ibm_watson.websocket
ibm_cloud_sdk_core.authenticators

The code is separated into two sections: Generation and Analysis.

Generation:

transGen.py

Lists all transformation types and magnitudes to be used. These can be modified as necessary.
Requires the file names of all the original speech files to be specified.

Generates transformed speech files named in the form {Original File Name}{Transformation Type Abbreviation}{Magnitude of Transformation Parameter, theta}.wav

List of Abbreviations.

A - Amplitude
C - Clipping
D - Drop
F - Frame
HP - Highpass
LP - Lowpass
N - Noise
S - Scale
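
The naming scheme above can be sketched as follows. This is a minimal illustration, not the code in transGen.py: the helper transformed_name, the file name speaker01.wav and the magnitudes are hypothetical, and the sketch assumes the original file's extension is dropped before the abbreviation is appended.

```python
from pathlib import Path

# Abbreviations as listed in this README.
ABBREVIATIONS = {
    "Amplitude": "A",
    "Clipping": "C",
    "Drop": "D",
    "Frame": "F",
    "Highpass": "HP",
    "Lowpass": "LP",
    "Noise": "N",
    "Scale": "S",
}


def transformed_name(original_file, transformation, theta):
    """Build '{original name}{abbreviation}{theta}.wav' (hypothetical helper)."""
    # Assumption: the directory and the '.wav' extension are stripped first.
    stem = Path(original_file).stem
    return f"{stem}{ABBREVIATIONS[transformation]}{theta}.wav"


# Hypothetical input file and magnitudes, for illustration only.
names = [transformed_name("speaker01.wav", "Noise", theta) for theta in (10, 20)]
print(names)
```

Keeping the abbreviations in one dictionary means a new transformation type only needs a single new entry.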

GCP_Recog.py

Requires Google cloud client libraries and associated keys.

Takes a group name and the list of all original files in the group to generate transcripts.

MS_Recog.py

Requires Microsoft Azure client libraries and associated key and region.

Takes a group name and the list of all original files in the group to generate transcripts.

IBM_Recog.py

Requires IBM client libraries and associated key and service URL.

Takes a group name and the list of all original files in the group to generate transcripts.
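
All three recognition scripts take the same inputs: a group name and the list of original files in that group. The sketch below shows that shared shape only. It does not call any cloud service: transcribe_group and fake_recognize are hypothetical names, and in practice the recognize argument would wrap the Google, Microsoft, or IBM client configured with the keys mentioned above.

```python
import json


def transcribe_group(group_name, original_files, recognize):
    """Collect one transcript per file for a group (hypothetical helper).

    `recognize` is any callable that maps a file name to a transcript string.
    """
    transcripts = {name: recognize(name) for name in original_files}
    return {"group": group_name, "transcripts": transcripts}


def fake_recognize(file_name):
    """Stand-in for a real ASR call; returns a placeholder transcript."""
    return f"transcript of {file_name}"


# Hypothetical group name and file names, for illustration only.
result = transcribe_group("groupA", ["a1.wav", "a2.wav"], fake_recognize)
print(json.dumps(result, indent=2))
```

Passing the recognizer in as a function keeps the per-group loop identical across the three ASR providers.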

compASR.py

Takes the names of two ASR systems and the group names, and computes a distance metric. The results are written to text files containing the distance metrics for the specified groups.

Users are requested to use the distance metrics to calculate the D values for each transformation.
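
This README does not state which distance metric compASR.py computes. As an illustration of what a transcript distance can look like, the sketch below uses word-level Levenshtein (edit) distance between two transcripts. This choice is an assumption, and word_edit_distance is a hypothetical helper, not a function from the package.

```python
def word_edit_distance(first, second):
    """Word-level Levenshtein distance between two transcripts.

    Assumption: this is one plausible metric, not necessarily the one
    used by compASR.py.
    """
    a = first.lower().split()
    b = second.lower().split()
    # previous[j] holds the distance between the words of `a` seen so far
    # and the first j words of `b`.
    previous = list(range(len(b) + 1))
    for i, word_a in enumerate(a, start=1):
        current = [i]
        for j, word_b in enumerate(b, start=1):
            cost = 0 if word_a == word_b else 1
            current.append(min(
                previous[j] + 1,         # delete word_a
                current[j - 1] + 1,      # insert word_b
                previous[j - 1] + cost,  # substitute or match
            ))
        previous = current
    return previous[-1]


# Hypothetical transcripts from two ASR systems for the same audio file.
distance = word_edit_distance("the cat sat", "the hat sat down")
print(distance)
```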


Owner

Sai Sathiesh
