dcavar/fomaMWT

Foma-based multi-word tagger and morphological analyzer


Experimental

This tool helps computational linguists and natural language processing researchers quickly analyze multi-word expressions in text. You input a text file with tokenized sentences and a dictionary of multi-word expressions, and it outputs an analysis of their internal structure and morphosyntactic properties. It's designed for those who need to efficiently tag and understand complex phrases in large textual datasets.

No commits in the last 6 months.

Use this if you need to quickly identify multi-word expressions in text and analyze their morphological and syntactic structure, based on a predefined dictionary.

Not ideal if you are unfamiliar with C++ development environments or if you need a system that learns multi-word expressions without an explicit dictionary.
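To make the dictionary-driven workflow described above concrete, here is a minimal C++ sketch of tagging multi-word expressions in a tokenized sentence. It is not fomaMWT's implementation: the tool is foma-based, whereas this sketch swaps in a plain `std::map` lookup with greedy longest-match scanning. All names here (`Tokens`, `MweMatch`, `MweDictionary`, `tagMwes`) are hypothetical and chosen only for illustration.

```cpp
#include <algorithm>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical types for illustration; none of these come from fomaMWT.
using Tokens = std::vector<std::string>;

struct MweMatch {
    std::size_t begin;   // index of the first token covered by the match
    std::size_t length;  // number of tokens covered
    std::string tag;     // label stored for this expression in the dictionary
};

// Assumed dictionary shape: token sequence -> tag.
using MweDictionary = std::map<Tokens, std::string>;

// Scan the sentence left to right. At each position, try the longest
// candidate span first and fall back to shorter ones (greedy longest match).
std::vector<MweMatch> tagMwes(const Tokens& sentence, const MweDictionary& dict) {
    // The longest dictionary entry bounds how far ahead we need to look.
    std::size_t maxLen = 0;
    for (const auto& entry : dict) {
        maxLen = std::max(maxLen, entry.first.size());
    }

    std::vector<MweMatch> matches;
    std::size_t i = 0;
    while (i < sentence.size()) {
        const std::size_t limit = std::min(maxLen, sentence.size() - i);
        bool found = false;
        for (std::size_t len = limit; len >= 1; --len) {
            const auto first = sentence.begin() + static_cast<std::ptrdiff_t>(i);
            const auto last = first + static_cast<std::ptrdiff_t>(len);
            const auto it = dict.find(Tokens(first, last));
            if (it != dict.end()) {
                matches.push_back({i, len, it->second});
                i += len;  // skip past the matched expression
                found = true;
                break;
            }
        }
        if (!found) {
            ++i;  // no expression starts here; move to the next token
        }
    }
    return matches;
}
```

Calling `tagMwes` with a sentence and a dictionary returns the matched spans in order of appearance. Because longer candidates are tried first, a dictionary holding both "New York" and "New York City" yields the three-token match when all three tokens are present. A finite-state approach such as the one foma provides would typically also cover the morphological analysis that this lookup-only sketch leaves out.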

computational-linguistics natural-language-processing morphological-analysis syntactic-analysis text-analysis

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 9 / 25


Language: C++

License: Apache-2.0

Higher-rated alternatives

nert-nlp/streusle

STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)

bretttolbert/verbecc

Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French, Italian, Portuguese and...

natasha/yargy

Rule-based facts extraction for Russian language

bjascob/LemmInflect

A python module for English lemmatization and inflection.

google-research/turkish-morphology

A two-level morphological analyzer for Turkish.
