pemistahl/lingua
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Lingua identifies the natural language of a text, from single words to long documents, and supports 75 languages. It takes text as input and returns the detected language, which makes it useful for anyone working with multilingual text, such as customer service managers routing emails or data analysts preparing text for further processing.
802 stars. No commits in the last 6 months.
Use this if you need to accurately determine the language of short or long text snippets without complex setup or an internet connection.
Not ideal if you need to detect languages that are not among the 75 currently supported or require a full-fledged natural language processing framework.
Stars: 802
Forks: 75
Language: Kotlin
License: Apache-2.0
Category:
Last pushed: Mar 21, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/pemistahl/lingua"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
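The same request can be made from a script. The following is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the reading of the URL path as category/owner/repository is inferred from the curl command, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base URL taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Hypothetical helper: rebuild the endpoint path used in the curl example.

    Treating the three path segments as category/owner/repo is an assumption.
    """
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch one record. Assumes (unverified) that the response body is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "pemistahl", "lingua")` should issue the same request as the curl command. Unauthenticated access is limited to 100 requests/day, so cache the result rather than calling it in a loop.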
Higher-rated alternatives
wooorm/franc: Natural language detection
microsoft/Recognizers-Text: Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time,...
winkjs/wink-pos-tagger: English Part-of-speech (POS) tagger
sillsdev/machine: Machine is a natural language processing library for .NET that is focused on providing tools for...
ayoungprogrammer/Lango: Language Lego