iflytek/cino
CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)
This project offers pre-trained language models specifically designed for processing text in Chinese minority languages and dialects. It takes raw text in languages like Tibetan, Mongolian, Uyghur, Kazakh, Korean, Zhuang, and Cantonese, and outputs a deeper understanding of the language, which can then be used for tasks like text classification. This tool is for researchers, linguists, or content managers working with these specific languages who need to analyze or process large volumes of text.
262 stars. No commits in the last 6 months.
Use this if you need to build applications or conduct research that accurately understands and processes text in Chinese minority languages like Tibetan, Mongolian, Uyghur, or Cantonese.
Not ideal if your primary focus is on processing standard Mandarin Chinese or other widely spoken global languages, as other models may be more efficient or comprehensive for those languages.
Stars
262
Forks
32
Language
Python
License
Apache-2.0
Category
Last pushed
Jul 15, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/iflytek/cino"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ThilinaRajapakse/simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling,...
jsksxs360/How-to-use-Transformers
Transformers 库快速入门教程
google/deepconsensus
DeepConsensus uses gap-aware sequence transformers to correct errors in Pacific Biosciences...
Denis2054/Transformers-for-NLP-2nd-Edition
Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning,...
abhimishra91/transformers-tutorials
Github repo with tutorials to fine tune transformers for diff NLP tasks