The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity

Last update: Dec 16, 2022

Contents

Maintainer wanted
Introduction
Installation
Documentation
License
History
Source code
Authors

Maintainer wanted

I am looking for a new maintainer for the project. I have not needed this library for well over 7 years, because it is a C-only library and its original license is somewhat restrictive.

Introduction

The Levenshtein Python C extension module contains functions for fast computation of

Levenshtein (edit) distance, and edit operations
string similarity
approximate median strings, and generally string averaging
string sequence and set similarity

It supports both normal and Unicode strings.

Python 2.2 or newer is required; Python 3 is supported.
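
To make the terms above concrete, here is a minimal pure-Python sketch of what an edit distance and a simple form of string averaging compute. It is not the module's code: the names edit_distance and set_median are hypothetical and chosen only for this illustration, and the C extension uses its own, much faster implementations.

```python
def edit_distance(a, b):
    """Levenshtein distance with unit costs, computed row by row."""
    # Keep the shorter string in the inner loop to save memory.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


def set_median(strings):
    """Pick the member with the smallest total distance to all members."""
    return min(strings, key=lambda s: sum(edit_distance(s, t) for t in strings))


print(edit_distance("kitten", "sitting"))  # 3
print(set_median(["abc", "abd", "xyz"]))   # abc
```

The sketch runs in time proportional to the product of the two string lengths, which is the main reason to prefer a compiled implementation for large inputs.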

StringMatcher.py is an example SequenceMatcher-like class built on top of Levenshtein. It lacks some of SequenceMatcher's functionality but, on the other hand, adds some extras.
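
For orientation, the snippet below shows the standard library's difflib.SequenceMatcher, the class that StringMatcher is modeled on. It uses only the stdlib; which of these methods StringMatcher provides is not stated here, so treat this as a sketch of the interface style rather than of StringMatcher itself. The helper name describe is made up for this example.

```python
from difflib import SequenceMatcher


def describe(a, b):
    """Collect the main SequenceMatcher results for two strings."""
    matcher = SequenceMatcher(None, a, b)
    return {
        "ratio": matcher.ratio(),
        "opcodes": matcher.get_opcodes(),
        "matching_blocks": [tuple(m) for m in matcher.get_matching_blocks()],
    }


result = describe("abc", "abd")
for tag, i1, i2, j1, j2 in result["opcodes"]:
    # Each opcode says how to turn a[i1:i2] into b[j1:j2].
    print(tag, repr("abc"[i1:i2]), repr("abd"[j1:j2]))
```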

Levenshtein.c can also be used as a pure C library. You only have to define the NO_PYTHON preprocessor symbol (-DNO_PYTHON) when compiling it. The functionality is similar to that of the Python extension. No separate documentation is provided yet, so read the source. The two builds are not interchangeable, however:

C functions exported when compiling with -DNO_PYTHON (see Levenshtein.h) are not exported when compiling as a Python extension, and vice versa.
The Unicode character type used with -DNO_PYTHON is wchar_t, while the Python extension uses Py_UNICODE. They may be the same type, but don't count on it.

Installation

pip install python-Levenshtein
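
After installing, a quick smoke test along these lines can confirm that the extension imports. This is a minimal sketch: the helper name smoke_test is hypothetical, and the guard around the import is only there so the snippet also runs where the package is missing.

```python
try:
    import Levenshtein  # module provided by the python-Levenshtein package
except ImportError:
    Levenshtein = None  # package not installed in this environment


def smoke_test():
    """Return the distance for a classic example, or None if not installed."""
    if Levenshtein is None:
        return None
    return Levenshtein.distance("kitten", "sitting")


print(smoke_test())
```

If this prints None, the import failed and the installation step should be repeated in the active environment.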

Documentation

Documentation for the current version

gendoc.sh generates HTML API documentation. You probably want the self-contained version rather than the includable one, so run ./gendoc.sh --selfcontained. It requires genextdoc.py and an already installed Levenshtein module.

Source code

http://github.com/ztane/python-Levenshtein/

Authors

Maintainer: Antti Haapala
Python 3 compatibility: Esa Määttä
Documentation generation fixes: Jonatas CD
Previous maintainer: Mikko Ohtamaa
Original code: David Necas (Yeti) <yeti at physics.muni.cz>
