caiselvas/language-identification
An NLP project leveraging character trigrams and smoothing techniques (Lidstone, Linear Discounting, Absolute Discounting) for language identification. Trained on for Spanish, Italian, English, French, Dutch, and German, achieving 99.8932% accuracy. Includes datasets, model parameters, and comprehensive documentation.
No commits in the last 6 months.
Stars
2
Forks
—
Language
Jupyter Notebook
License
—
Category
Last pushed
Mar 04, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/caiselvas/language-identification"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
indix/whatthelang
Lightning Fast Language Prediction 🚀
nitotm/efficient-language-detector-js
Fast and accurate natural language detection. Detector written in Javascript. Nito-ELD, ELD.
pemistahl/lingua-py
The most accurate natural language detection library for Python, suitable for short text and...
mbanon/fastspell
Targetted language identifier, based on FastText and Hunspell.
nitotm/efficient-language-detector
Fast and accurate natural language detection. Detector written in PHP. Nito-ELD, ELD.