ilinguistics/geoLid
Geographically-informed language identification
This tool identifies the language of written text, with a focus on less common languages, by using geographic information. You supply a piece of text and specify a geographic region, and it returns the most probable language. This is useful for linguists, social scientists, or anyone working with text data from diverse global communities.
No commits in the last 6 months.
Use this if you need to accurately determine the language of short text samples, particularly those from specific geographic areas with many local languages.
Not ideal if you only work with widely-spoken, high-resource languages where standard language identification tools are sufficient.
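To make the text-plus-region idea concrete, here is a toy sketch of one way a region hint can narrow the candidate languages before scoring. It is not geoLid's actual API or model: the region table, the tiny word lists, and the `identify` function are all hypothetical and exist only for illustration.

```python
# Toy illustration only: NOT geoLid's API, data, or algorithm.
# Hypothetical region -> candidate language codes (ISO 639-3).
REGION_LANGUAGES = {
    "southeast_asia": ["ind", "tgl", "eng"],
    "europe": ["eng", "spa"],
}

# Tiny hand-picked function-word lists standing in for trained models.
COMMON_WORDS = {
    "ind": {"dan", "yang", "di", "ini", "itu"},
    "tgl": {"ang", "ng", "sa", "mga", "at"},
    "eng": {"the", "and", "of", "is"},
    "spa": {"el", "la", "de", "y", "que"},
}


def identify(text, region):
    """Return the best-matching language code among the region's candidates.

    Returns None when no candidate matches any word in the text.
    """
    if region not in REGION_LANGUAGES:
        raise ValueError(f"unknown region: {region!r}")
    tokens = text.lower().split()
    best_lang, best_score = None, 0
    for lang in REGION_LANGUAGES[region]:
        # Score = number of tokens found in the language's word list.
        score = sum(1 for token in tokens if token in COMMON_WORDS[lang])
        if score > best_score:
            best_lang, best_score = lang, score
    return best_lang
```

Because only the region's candidates are scored, a language outside the region can never be returned, which is the point of the geographic hint. A real system would replace the word lists with trained classifiers.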
Stars: 7
Forks: 1
Language: Python
License: GPL-3.0
Category:
Last pushed: Mar 14, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/ilinguistics/geoLid"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
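For scripted access, the same request can be sketched in Python with only the standard library. Two things here are assumptions rather than documented facts: the path pattern `quality/<category>/<owner>/<repo>` is inferred from the single curl example above, and the response body is assumed to be JSON. The helper names `build_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category, owner, repo):
    """Build the endpoint URL, following the pattern of the curl example.

    Assumption: other repositories use the same category/owner/repo layout.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """GET the endpoint and decode the body.

    Assumption: the service responds with JSON.
    """
    request = urllib.request.Request(
        build_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("nlp", "ilinguistics", "geoLid")
```

Keeping URL construction separate from the network call makes the path logic easy to check without spending any of the daily request quota.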
Higher-rated alternatives
indix/whatthelang
Lightning Fast Language Prediction 🚀
nitotm/efficient-language-detector-js
Fast and accurate natural language detection. Detector written in JavaScript. Nito-ELD, ELD.
pemistahl/lingua-py
The most accurate natural language detection library for Python, suitable for short text and...
nitotm/efficient-language-detector
Fast and accurate natural language detection. Detector written in PHP. Nito-ELD, ELD.
mbanon/fastspell
Targeted language identifier, based on FastText and Hunspell.