dcavar/fomaMWT
Foma-based multi-word tagger and morphological analyzer
This tool helps computational linguists and natural language processing researchers quickly analyze multi-word expressions in text. You input a text file with tokenized sentences and a dictionary of multi-word expressions, and it outputs an analysis of their internal structure and morphosyntactic properties. It's designed for those who need to efficiently tag and understand complex phrases in large textual datasets.
No commits in the last 6 months.
Use this if you need a very fast way to identify and analyze multi-word expressions with their morphological and syntactic structure within text, based on a predefined dictionary.
Not ideal if you're not comfortable with C++ development environments or if you need a system that learns multi-word expressions without an explicit dictionary.
Stars
7
Forks
1
Language
C++
License
Apache-2.0
Category
Last pushed
Aug 06, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/dcavar/fomaMWT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
nert-nlp/streusle
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)
bretttolbert/verbecc
Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French, Italian, Portuguese and...
natasha/yargy
Rule-based facts extraction for Russian language
bjascob/LemmInflect
A python module for English lemmatization and inflection.
google-research/turkish-morphology
A two-level morphological analyzer for Turkish.