kaldi-asr/kaldi is the official location of the Kaldi project.

Last update: Jan 05, 2023

Overview

Kaldi Speech Recognition Toolkit

To build the toolkit: see ./INSTALL. These instructions are valid for UNIX systems including various flavors of Linux; Darwin; and Cygwin (has not been tested on more "exotic" varieties of UNIX). For Windows installation instructions (excluding Cygwin), see windows/INSTALL.

To run the example system builds, see egs/README.txt

If you encounter problems (and you probably will), please do not hesitate to contact the developers (see below). In addition to specific questions, please let us know if there are specific aspects of the project that you feel could be improved, that you find confusing, etc., and which missing features you most wish it had.

Kaldi information channels

For HOT news about Kaldi see the project site.

Documentation of Kaldi:

Info about the project, description of techniques, tutorial for C++ coding.
Doxygen reference of the C++ code.

Kaldi forums and mailing lists:

We have two different lists

User list kaldi-help
Developer list kaldi-developers:

To sign up to any of those mailing lists, go to http://kaldi-asr.org/forums.html:

Development pattern for contributors

Create a personal fork of the main Kaldi repository in GitHub.
Make your changes in a named branch different from master, e.g. you create a branch my-awesome-feature.
Generate a pull request through the Web interface of GitHub.
As a general rule, please follow Google C++ Style Guide. There are a few exceptions in Kaldi. You can use the Google's cpplint.py to verify that your code is free of basic mistakes.

Platform specific notes

PowerPC 64bits little-endian (ppc64le)

Kaldi is expected to work out of the box in RHEL >= 7 and Ubuntu >= 16.04 with OpenBLAS, ATLAS, or CUDA.
CUDA drivers for ppc64le can be found at https://developer.nvidia.com/cuda-downloads.
An IBM Redbook is available as a guide to install and configure CUDA.

Android

Kaldi supports cross compiling for Android using Android NDK, clang++ and OpenBLAS.
See this blog post for details.

kaldi-asr/kaldi is the official location of the Kaldi project.

Related tags

Overview

Kaldi Speech Recognition Toolkit

Kaldi information channels

Development pattern for contributors

Platform specific notes

PowerPC 64bits little-endian (ppc64le)

Android

Owner

Kaldi

Source Code for AAAI 2022 paper "Graph Convolutional Networks with Dual Message Passing for Subgraph Isomorphism Counting and Matching"

Handwritten_Text_Recognition

Bu uygulamada Python ve Opencv kullanarak bilgisayar kamerasından yüz tespiti yapıyoruz.

kaldi-asr/kaldi is the official location of the Kaldi project.

Open Source Computer Vision Library

fishington.io bot with OpenCV and NumPy

AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.https://github.com/huoyijie/raspberrypi-car

Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation, CVPR 2020 (Oral)

Python Computer Vision application that allows users to draw/erase on the screen using their webcam.

Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrieval.

An official PyTorch implementation of the paper "Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences", ICCV 2021.

A tool combining EasyOCR and LaMa to automatically detect text and replace it with an inpainted background.

OCR of Chicago 1909 Renumbering Plan

Page to PAGE Layout Analysis Tool

Some Boring Research About Products Recognition 、Duplicate Img Detection、Img Stitch、OCR

基于openpose和图像分类的手语识别项目

Automatically download multiple papers by keywords in CVPR

When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework (CVPR 2021 oral)

Official PyTorch implementation for "Mixed supervision for surface-defect detection: from weakly to fully supervised learning"

Deskewing images with slanted content