pemistahl/lingua

The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

44
/ 100
Emerging

This tool identifies the natural language of any text you provide, from single words to long documents, supporting 75 languages. It takes any textual input and outputs the detected language. This is ideal for anyone working with multilingual text, such as customer service managers routing emails or data analysts preparing text for further processing.

802 stars. No commits in the last 6 months.

Use this if you need to accurately determine the language of short or long text snippets without complex setup or an internet connection.

Not ideal if you need to detect languages that are not among the 75 currently supported or require a full-fledged natural language processing framework.

multilingual-content customer-service data-preprocessing text-analysis global-communications
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

802

Forks

75

Language

Kotlin

License

Apache-2.0

Last pushed

Mar 21, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/pemistahl/lingua"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.