asahala/BabyLemmatizer

State-of-the-art neural tagger and lemmatizer for ancient languages

Experimental

This tool helps scholars of ancient languages analyze transliterated texts in languages such as Akkadian, Sumerian, or Ancient Greek. It takes transliterated text as input and assigns each word its lemma (dictionary form) and part-of-speech (POS) tag, which makes the text searchable and ready for further study. It is aimed at anyone who works with historical linguistic data and needs to categorize words systematically.
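The input-to-output mapping described above can be pictured with a toy lookup. This is only an illustrative sketch: BabyLemmatizer itself is a neural model, and the `annotate` function, the `Annotation` type, the three-entry lexicon, and the `_` placeholder for unknown words are all hypothetical names and data, not part of the tool.

```python
from typing import NamedTuple


class Annotation(NamedTuple):
    """One transliterated word with its lemma and POS tag."""
    form: str
    lemma: str
    pos: str


# Hypothetical toy lexicon for illustration only; a real lemmatizer
# predicts these values with a trained model instead of a fixed table.
TOY_LEXICON = {
    "a-na": ("ana", "PRP"),
    "i-na": ("ina", "PRP"),
    "šar-ri": ("šarru", "N"),
}


def annotate(line: str) -> list[Annotation]:
    """Attach a lemma and POS tag to each whitespace-separated word."""
    annotations = []
    for form in line.split():
        # Words missing from the toy lexicon get "_" for both fields.
        lemma, pos = TOY_LEXICON.get(form, ("_", "_"))
        annotations.append(Annotation(form, lemma, pos))
    return annotations


# Print one word per line as tab-separated form, lemma, and POS.
for ann in annotate("a-na šar-ri be-li2-ia"):
    print(f"{ann.form}\t{ann.lemma}\t{ann.pos}")
```

A per-word table of this shape is what makes a corpus searchable by lemma or POS rather than by surface spelling.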

No commits in the last 6 months.

Use this if you need to automatically identify lemmas and part-of-speech tags for words in transliterated ancient texts, particularly texts in languages written in cuneiform, so that they become searchable and analyzable.

Not ideal if you are working with modern languages or if you require a simple, out-of-the-box solution without any command-line setup.

ancient-languages philology cuneiform-studies linguistic-analysis historical-texts

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 10 / 25

Language: Python

License: none

Higher-rated alternatives

chakki-works/seqeval

A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.)

Hironsan/anago

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

jbesomi/texthero

Text preprocessing, representation and visualization from zero to hero.

hamelsmu/ktext

Utilities for preprocessing text for deep learning with Keras

asahi417/tner

Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An...
