LanguageMachines/mbt

MBT: Memory-based tagger generation and tagging MBT is a memory-based tagger-generator and tagger in one.

/ 100

Emerging

This tool helps computational linguists and natural language processing researchers automatically assign grammatical tags (like noun, verb, adjective) to words in a text. You provide raw text, and it outputs the same text with each word labeled with its grammatical category. It's designed for researchers working on language analysis and text processing.

Use this if you need to automatically tag words in text with their part-of-speech or other linguistic categories for research or analysis.

Not ideal if you're looking for a pre-trained, ready-to-use solution for general text analysis without custom model generation, or if your primary need is not linguistic tagging.

computational-linguistics natural-language-processing text-annotation linguistic-analysis part-of-speech-tagging

No Package No Dependents

Maintenance 10 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 7 / 25

How are scores calculated?

Stars

Forks

Language

C++

License

GPL-3.0

Higher-rated alternatives

EmilStenstrom/conllu

A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.

OpenPecha/Botok

🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python

taishi-i/nagisa

A Japanese tokenizer based on recurrent neural networks

zaemyung/sentsplit

A flexible sentence segmentation library using CRF model and regex rules

natasha/razdel

Rule-based token, sentence segmentation for Russian language

Explore NLP Tools

All categories Trending NLP directory Insights