SudhirGadhvi/open-vernacular-ai-kit
Clean Indian code-mixed text before it reaches your LLM.
46
/ 100
Emerging
Available on PyPI.
Maintenance
10 / 25
Adoption
4 / 25
Maturity
20 / 25
Community
12 / 25
Stars
5
Forks
1
Language
Python
License
MIT
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/SudhirGadhvi/open-vernacular-ai-kit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
acl-org/acl-anthology
Data and software for building the ACL Anthology.
76
anoopkunchukuttan/indic_nlp_library
Resources and tools for Indian language Natural Language Processing
64
CLUEbenchmark/CLUECorpus2020
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
53
KennethEnevoldsen/scandinavian-embedding-benchmark
A Scandinavian Benchmark for sentence embeddings
47
Separius/awesome-sentence-embedding
A curated list of pretrained sentence and word embedding models
47