asahala/BabyLemmatizer

State-of-the-art neural tagger and lemmatizer for ancient languages

23 / 100
Experimental

This tool helps ancient language scholars and researchers analyze transliterated texts from languages like Akkadian, Sumerian, or Ancient Greek. It takes a transliterated text as input and identifies the root form (lemma) and part-of-speech (POS) tag for each word, making the text searchable and useful for further study. The primary user is anyone working with historical linguistic data who needs to systematically categorize words.

No commits in the last 6 months.

Use this if you need to automatically identify lemmas and part-of-speech tags for words in transliterated ancient texts, particularly those in cuneiform languages, so that the texts become searchable and analyzable.

Not ideal if you are working with modern languages or if you require a simple, out-of-the-box solution without any command-line setup.

ancient-languages philology cuneiform-studies linguistic-analysis historical-texts
No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 10 / 25

Stars: 14
Forks: 2
Language: Python
License: none listed
Last pushed: Mar 09, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/asahala/BabyLemmatizer"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
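
The same request as the curl command above can be sketched in Python using only the standard library. This is a minimal sketch under two assumptions that the listing does not confirm: that the URL follows a category/owner/repo pattern (inferred from the single example URL), and that the endpoint returns a JSON body. The names `quality_url` and `fetch_quality` are hypothetical helpers introduced for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example in the listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    one example URL shown in the listing.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body.

    Assumption: the response is JSON; its fields are not documented here,
    so the decoded object is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "asahala", "BabyLemmatizer")` would issue the same request as the curl command. With the keyless limit of 100 requests/day, it may be worth caching the result locally rather than refetching on every run.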