Translate

Command-line interface to translation pipelines, powered by Huggingface transformers. This tool can download translation models, and then using them to translate sentences offline. By default, tries using models from Helsinki-NLP (each model is about 300MB large).

Install

$ git clone https://github.com/Teuze/translate
$ cd translate
$ pip3 install --user -r requirements.py

If you want to be able to use this script from anywhere in your system, you can symlink or copy the translate script file into one of your path folders, like for example $HOME/.local/bin.

Usage

Listing available and installed translation models :

$ # Also available on https://huggingface.co/models
$ ./translate model list online | less
$ ./translate model list local | less

Downloading models :

$ ./translate download model "Helsinki-NLP/opus-mt-en-es"
$ ./translate download model "Helsinki-NLP/opus-mt-fr-en"

Using models to translate from CLI arguments or from standard input :

$ ./translate text -e "Helsinki-NLP/opus-mt-en-es" "Hello World!"
¡Hola Mundo!
$ echo "Ceci est une phrase d'exemple simple" | ./translate text -s fr -t en
This is a simple example sentence

Partially offline multi-language translator built upon Huggingface transformers.

Related tags

Overview

Translate

Install

Usage

Owner

Richard Jarry

BiNE: Bipartite Network Embedding

DVC-NLP-Simple-usecase

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.

Random Directed Acyclic Graph Generator

Natural Language Processing for Adverse Drug Reaction (ADR) Detection

Pre-training BERT masked language models with custom vocabulary

Translate U is capable of translating the text present in an image from one language to the other.

Labelling platform for text using distant supervision

Officile code repository for "A Game-Theoretic Perspective on Risk-Sensitive Reinforcement Learning"

Code for the paper "Are Sixteen Heads Really Better than One?"

Galois is an auto code completer for code editors (or any text editor) based on OpenAI GPT-2.

Behavioral Testing of Clinical NLP Models

Utilities for preprocessing text for deep learning with Keras

Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure and Naming Sequences"

Python wrapper for Stanford CoreNLP tools v3.4.1

A library for finding knowledge neurons in pretrained transformer models.

Training code of Spatial Time Memory Network. Semi-supervised video object segmentation.

OceanScript is an Esoteric language used to encode and decode text into a formulation of characters

Tools, wrappers, etc... for data science with a concentration on text processing