pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

Last update: Dec 29, 2021

Related tags

Overview

Python binding for Morfologik

Morfologik is Polish morphological analyzer. For more information see http://github.com/morfologik/morfologik-stemming/ and http://http://www.morfologik.blogspot.com/

Requirements

This binding works with Python 2 and Python 3.

Installation

Install it from pip

pip install pyMorfologik

or directly from github

git clone https://github.com/dmirecki/pyMorfologik.git

Usage

Now, only simple stems are supported:

>>> from pymorfologik import Morfologik
>>> from pymorfologik.parsing import ListParser
>>>
>>> parser = ListParser()
>>> stemmer = Morfologik()
>>> stemmer.stem(['Ala ma kota'], parser)
[(u'Ala',
  {u'Al': [u'subst:sg:acc:m1+subst:sg:gen:m1'],
   u'Ala': [u'subst:sg:nom:f'],
   u'Alo': [u'subst:sg:acc:m1+subst:sg:gen:m1']}),
 (u'ma',
  {u'mieć': [u'verb:fin:sg:ter:imperf:refl.nonrefl'],
   u'mój': [u'adj:sg:nom.voc:f:pos']}),
 (u'kota', {u'kot': [u'subst:sg:acc:m1'], u'kota': [u'subst:sg:nom:f']})]

Acknowledgements

This repo is based on Morfologik, a great contribution of Marcin Miłowski (http://marcinmilkowski.pl) and Dawid Weiss (http://www.dawidweiss.com).

Contributions

Damian Mirecki

Adrian Bohdanowicz

pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

Related tags

Overview

Python binding for Morfologik

Requirements

Installation

Usage

Acknowledgements

Contributions

Owner

Damian Mirecki

Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.

I label phrases on a scale of five values: negative, somewhat negative, neutral, somewhat positive, positive

Levenshtein and Hamming distance computation

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

Exploration of BERT-based models on twitter sentiment classifications

A python gui program to generate reddit text to speech videos from the id of any post.

Twewy-discord-chatbot - Build a Discord AI Chatbot that Speaks like Your Favorite Character

Generating Korean Slogans with phonetic and structural repetition

Test finetuning of XLSR (multilingual wav2vec 2.0) for other speech classification tasks

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。

Modified GPT using average pooling to reduce the softmax attention memory constraints.

The implementation of Parameter Differentiation based Multilingual Neural Machine Translation

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

Predicting the usefulness of reviews given the review text and metadata surrounding the reviews.

this repository has datasets containing information of Uber pickups in NYC from April 2014 to September 2014 and January to June 2015. data Analysis , virtualization and some insights are gathered here

A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

An Open-Source Package for Neural Relation Extraction (NRE)

Code for the project carried out fulfilling the course requirements for Fall 2021 NLP at NYU

This project aims to conduct a text information retrieval and text mining on medical research publication regarding Covid19 - treatments and vaccinations.

pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

Related tags

Overview

Python binding for Morfologik

Requirements

Installation

Usage

Acknowledgements

Contributions

Owner

Damian Mirecki

Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.

I label phrases on a scale of five values: negative, somewhat negative, neutral, somewhat positive, positive

Levenshtein and Hamming distance computation

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

Exploration of BERT-based models on twitter sentiment classifications

A python gui program to generate reddit text to speech videos from the id of any post.

Twewy-discord-chatbot - Build a Discord AI Chatbot that Speaks like Your Favorite Character

Generating Korean Slogans with phonetic and structural repetition

Test finetuning of XLSR (multilingual wav2vec 2.0) for other speech classification tasks

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含 自然语言处理各领域的 面试题积累。

Modified GPT using average pooling to reduce the softmax attention memory constraints.

The implementation of Parameter Differentiation based Multilingual Neural Machine Translation

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

Predicting the usefulness of reviews given the review text and metadata surrounding the reviews.

this repository has datasets containing information of Uber pickups in NYC from April 2014 to September 2014 and January to June 2015. data Analysis , virtualization and some insights are gathered here

A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

An Open-Source Package for Neural Relation Extraction (NRE)

Code for the project carried out fulfilling the course requirements for Fall 2021 NLP at NYU

This project aims to conduct a text information retrieval and text mining on medical research publication regarding Covid19 - treatments and vaccinations.

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。