pemistahl/lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

50 / 100 (Established)

This tool automatically identifies the language a piece of text is written in. You provide any text, from a single word to a full document, and it reports which of its 75 supported languages the text is in. It suits anyone working with multilingual text data, such as customer service managers routing emails or researchers processing linguistic data.
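As a rough illustration of the workflow described above, the following sketch wraps the library's detector builder in a small helper. It assumes the library is installed and importable as `lingua`. The helper names `build_detector` and `detect`, and the choice of four candidate languages, are made up for this example and are not part of the library.

```python
from functools import lru_cache
from typing import Optional


@lru_cache(maxsize=1)
def build_detector():
    """Build the detector once and reuse it across calls."""
    # Imported lazily so this module still loads where lingua is not installed.
    from lingua import Language, LanguageDetectorBuilder

    # Assumption: a small candidate set chosen arbitrarily for the example.
    return LanguageDetectorBuilder.from_languages(
        Language.ENGLISH, Language.FRENCH, Language.GERMAN, Language.SPANISH
    ).build()


def detect(text: str) -> Optional[str]:
    """Return the detected language's name, or None if no decision was made."""
    language = build_detector().detect_language_of(text)
    return language.name if language is not None else None
```

Calling `detect("languages are awesome")` would return the name of whichever candidate language the detector considers most likely. Narrowing the candidate set as done here is optional and mainly a way to trade coverage for speed and memory.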

1,659 stars.

Use this if you need to accurately identify the language of short text snippets, single words, or mixed-language text, even when other tools struggle.

Not ideal if you only work with very long, homogeneous text and don't need highly precise detection for short or mixed-language inputs.

natural-language-processing, customer-service-routing, text-analysis, multilingual-data, linguistic-data-management
No Package, No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 14 / 25

Stars 1,659
Forks 57
Language Python
License Apache-2.0
Last pushed Mar 09, 2026
Commits (30d) 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/pemistahl/lingua-py"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
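The same request can be made from Python using only the standard library. This is a minimal sketch: it assumes the endpoint returns JSON, which the page does not state, and it makes no assumptions about the response fields. The helper names `build_url` and `fetch_quality` are hypothetical, and how an API key would be passed is not documented here, so the sketch covers only the keyless tier.

```python
import json
import urllib.request
from typing import Any

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, repo: str) -> str:
    """Compose the endpoint URL for a category and an owner/name repo."""
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0) -> Any:
    """Fetch the quality record; assumes the body is JSON (not confirmed)."""
    with urllib.request.urlopen(build_url(category, repo), timeout=timeout) as response:
        return json.load(response)
```

For the project on this page, the call would be `fetch_quality("nlp", "pemistahl/lingua-py")`. With the keyless limit of 100 requests/day, caching the result locally is a sensible precaution for anything that runs repeatedly.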