pemistahl/lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
lingua-py detects the language a piece of text is written in. Given any input, from a single word to a full document, it reports which of its 75 supported languages the text most likely belongs to. It suits anyone working with multilingual text, such as customer service managers routing emails or researchers processing linguistic data.
Use this if you need to accurately identify the language of short text snippets, single words, or mixed-language text, even when other tools struggle.
Not ideal if you only work with very long, homogeneous text and don't need highly precise detection for short or mixed-language inputs.
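The email-routing use case mentioned above could be sketched roughly as follows. This is a minimal illustration, not the library's recommended setup. It assumes the package is installed from PyPI as `lingua-language-detector`, and the four candidate languages are an arbitrary choice. `make_lingua_detector` and `route_by_language` are hypothetical helper names introduced here. The routing helper accepts any callable, so it can be tried out with a stub before lingua is installed.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Optional

# A detector maps a text to a language name, or None when it cannot decide.
Detector = Callable[[str], Optional[str]]


def make_lingua_detector() -> Detector:
    """Build a detector backed by lingua (hypothetical helper, not part of lingua)."""
    # Imported lazily so route_by_language stays usable without lingua installed.
    from lingua import Language, LanguageDetectorBuilder

    # Restricting the candidate languages is an illustrative assumption.
    detector = LanguageDetectorBuilder.from_languages(
        Language.ENGLISH, Language.FRENCH, Language.GERMAN, Language.SPANISH
    ).build()

    def detect(text: str) -> Optional[str]:
        language = detector.detect_language_of(text)  # None if no reliable guess
        return language.name if language is not None else None

    return detect


def route_by_language(messages: Iterable[str], detect: Detector) -> Dict[str, List[str]]:
    """Group messages into per-language queues; undecided ones go to 'UNKNOWN'."""
    queues: Dict[str, List[str]] = defaultdict(list)
    for message in messages:
        queues[detect(message) or "UNKNOWN"].append(message)
    return dict(queues)
```

With lingua installed, `route_by_language(emails, make_lingua_detector())` would return a dictionary keyed by language name. Keeping the detector behind a plain callable also makes it easy to swap in a different library for comparison.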
Stars: 1,659
Forks: 57
Language: Python
License: Apache-2.0
Category: (not specified)
Last pushed: Mar 09, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/pemistahl/lingua-py"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
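The same request can be made from Python using only the standard library. The sketch below reuses the URL from the curl command. It assumes the endpoint returns JSON, which the page does not state, and `build_quality_url` and `fetch_quality` are hypothetical helper names. How an API key is passed is not documented here, so the sketch covers only the keyless case.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a category and an 'owner/name' repository."""
    # quote() leaves '/' unescaped by default, so 'owner/name' stays intact.
    return f"{BASE_URL}/{quote(category)}/{quote(repo)}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the record for a repository (assumes a JSON response body)."""
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request and counts against the daily limit.
    print(fetch_quality("nlp", "pemistahl/lingua-py"))
```

Separating URL construction from the network call keeps the former checkable offline.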
Related tools
indix/whatthelang
Lightning Fast Language Prediction 🚀
nitotm/efficient-language-detector-js
Fast and accurate natural language detection. Detector written in Javascript. Nito-ELD, ELD.
nitotm/efficient-language-detector
Fast and accurate natural language detection. Detector written in PHP. Nito-ELD, ELD.
mbanon/fastspell
Targeted language identifier, based on FastText and Hunspell.
nickdavidhaynes/spacy-cld
Language detection extension for spaCy 2.0+