無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXの音声合成エンジン

Last update: Jul 05, 2022

Related tags

Text Data & NLP voicevox_engine

Overview

VOICEVOX ENGINE

VOICEVOXの音声合成エンジン。実態は HTTP サーバーなので、リクエストを送信すればテキスト音声合成できます。

API ドキュメント

VOICEVOX ソフトウェアを起動した状態で、ブラウザから http://localhost:50021/docs にアクセスするとドキュメントが表示されます。
VOICEVOX 音声合成エンジンとの連携も参考になるかもしれません。

HTTP リクエストで音声合成するサンプルコード

query.json curl -s \ -H "Content-Type: application/json" \ -X POST \ -d @query.json \ localhost:50021/synthesis?speaker=1 \ > audio.wav ">

text="ABCDEFG"

curl -s \
    -X POST \
    "localhost:50021/audio_query?text=$text&speaker=1"\
    > query.json

curl -s \
    -H "Content-Type: application/json" \
    -X POST \
    -d @query.json \
    localhost:50021/synthesis?speaker=1 \
    > audio.wav

貢献者の方へ

Issue を解決するプルリクエストを作成される際は、別の方と同じ Issue に取り組むことを避けるため、 Issue 側で取り組み始めたことを伝えるか、最初に Draft プルリクエストを作成してください。

環境構築

# 開発に必要なライブラリのインストール
pip install -r requirements-test.txt

# とりあえず実行したいだけなら代わりにこちら
pip install -r requirements.txt

実行

# 製品版 VOICEVOX でサーバーを起動
VOICEVOX_DIR="C:/path/to/voicevox" # 製品版 VOICEVOX ディレクトリのパス
python run.py --voicevox_dir=$VOICEVOX_DIR

# モックでサーバー起動
python run.py

コードフォーマット

コードのフォーマットを整えます。プルリクエストを送る前に実行してください。

pysen run format lint

ビルド

Build Tools for Visual Studio 2019 が必要です。

pip install -r requirements-dev.txt

python -m nuitka \
    --standalone \
    --plugin-enable=numpy \
    --follow-import-to=numpy \
    --follow-import-to=aiofiles \
    --include-package=uvicorn \
    --include-package-data=pyopenjtalk \
    --include-data-file=VERSION.txt=./ \
    --include-data-file=speakers.json=./ \
    --include-data-file=C:/音声ライブラリへのパス/Release/*.dll=./ \
    --include-data-file=C:/音声ライブラリへのパス/*.bin=./ \
    --include-data-dir=.venv/Lib/site-packages/_soundfile_data=./_soundfile_data \
    --msvc=14.2 \
    --follow-imports \
    --no-prefer-source-code \
    run.py

ライセンス

LGPL v3 と、ソースコードの公開が不要な別ライセンスのデュアルライセンスです。別ライセンスを取得したい場合は、ヒホ（twitter: @hiho_karuta）に求めてください。

無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXの音声合成エンジン

Related tags

Overview

VOICEVOX ENGINE

API ドキュメント

HTTP リクエストで音声合成するサンプルコード

貢献者の方へ

環境構築

実行

コードフォーマット

ビルド

ライセンス

You might also like...

Releases(check-code-sign-8)

check-code-sign-8(Jul 10, 2022)

Owner

Hiroshiba

A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.

Neural-Machine-Translation - Implementation of revolutionary machine translation models

Poetry PEP 517 Build Backend & Core Utilities

This code is the implementation of Text Emotion Recognition (TER) with linguistic features

NLTK Source

An Explainable Leaderboard for NLP

Autoregressive Entity Retrieval

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

Facilitating the design, comparison and sharing of deep text matching models.

LSTM based Sentiment Classification using Tensorflow - Amazon Reviews Rating

Dust model dichotomous performance analysis

Unsupervised text tokenizer focused on computational efficiency

Blue Brain text mining toolbox for semantic search and structured information extraction

MiCECo - Misskey Custom Emoji Counter

Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech tagging and word segmentation.

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

NLPretext packages in a unique library all the text preprocessing functions you need to ease your NLP project.

Nmt - TensorFlow Neural Machine Translation Tutorial

Basic Utilities for PyTorch Natural Language Processing (NLP)

Implemented shortest-circuit disambiguation, maximum probability disambiguation, HMM-based lexical annotation and BiLSTM+CRF-based named entity recognition