derintelligence/en-az-parallel-corpus
English-Azerbaijani parallel language corpus
A collection of English and Azerbaijani sentences aligned as translation pairs. It is used to train and improve machine translation systems between the two languages, and is aimed at machine learning engineers and researchers working on natural language processing for Azerbaijani.
No commits in the last 6 months.
Use this if you are a machine learning engineer or researcher developing or fine-tuning AI models for English-Azerbaijani or Azerbaijani-English translation.
Not ideal if you need a pre-built, ready-to-use translation application, or if you are not working on building language models.
Stars: 20
Forks: not listed
Language: not listed
License: MIT
Category: not listed
Last pushed: Aug 16, 2019
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/derintelligence/en-az-parallel-corpus"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
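The same request can be made from Python. The following is a minimal sketch using only the standard library; it assumes the endpoint returns a JSON body, which the listing does not confirm, and the helper names `build_url` and `fetch_quality` are hypothetical, introduced here for illustration.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/nlp"


def build_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for a repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay safe.
    owner_part = urllib.parse.quote(owner, safe="")
    repo_part = urllib.parse.quote(repo, safe="")
    return f"{BASE_URL}/{owner_part}/{repo_part}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. If it is not,
    json.loads raises a ValueError (json.JSONDecodeError).
    """
    url = build_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example usage (performs a network request, so it is left commented out):
# data = fetch_quality("derintelligence", "en-az-parallel-corpus")
# print(data)
```

Without a key, requests count against the 100 requests/day limit, so it is worth caching the result locally rather than calling the endpoint repeatedly.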
Higher-rated alternatives
Helsinki-NLP/OpusFilter
OpusFilter - Parallel corpus processing toolkit
natasha/corus
Links to Russian corpora + Python functions for loading and parsing
SergeyShk/ruTS
A library for extracting statistics from Russian-language texts.
darija-open-dataset/dataset
darija <-> english dataset
omicsNLP/Auto-CORPus
Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London...