iflytek/cino

CINO: Pre-trained Language Models for Chinese Minority Languages

/ 100

Emerging

This project offers pre-trained language models designed for processing text in the minority languages and dialects of China. It takes raw text in languages such as Tibetan, Mongolian, Uyghur, Kazakh, Korean, Zhuang, and Cantonese and produces contextual representations of that text, which can then be used for tasks like text classification. It is aimed at researchers, linguists, and content managers who need to analyze or process large volumes of text in these languages.
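To make the "raw text in, representations out" flow concrete, here is a minimal sketch of turning sentences into fixed-size vectors with the Hugging Face `transformers` library. It makes several assumptions that this listing does not confirm: that a CINO checkpoint is available on the Hugging Face Hub as `hfl/cino-base-v2`, that it loads through the `AutoTokenizer` and `AutoModel` classes, and that mean pooling over tokens is an acceptable way to get a sentence vector. The helper names `masked_mean` and `embed` are hypothetical and not part of the project.

```python
from typing import List, Sequence

# Assumption: a CINO checkpoint published on the Hugging Face Hub under this ID.
# The listing does not name one; substitute whichever checkpoint you actually use.
MODEL_ID = "hfl/cino-base-v2"


def masked_mean(token_vectors: Sequence[Sequence[float]], mask: Sequence[int]) -> List[float]:
    """Average the token vectors whose mask entry is 1, so padding is ignored."""
    kept = [vec for vec, keep in zip(token_vectors, mask) if keep]
    if not kept:
        raise ValueError("mask selects no tokens")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


def embed(texts: List[str], model_id: str = MODEL_ID) -> List[List[float]]:
    """Return one mean-pooled vector per input text.

    Needs `torch` and `transformers` installed and downloads the
    checkpoint on first use, so the imports are kept inside the function.
    """
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModel.from_pretrained(model_id)
    model.eval()

    # Pad to the longest text in the batch and return PyTorch tensors.
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state  # shape: (batch, tokens, hidden)

    # Pool each sequence separately, skipping padded positions.
    return [
        masked_mean(h.tolist(), m.tolist())
        for h, m in zip(hidden, batch["attention_mask"])
    ]
```

Calling `embed(["first sentence", "second sentence"])` would return one vector per sentence, which could then be passed to any standard classifier. For real classification work, fine-tuning the encoder end to end is usually preferable to pooling frozen features; the pooling route is shown only because it is the shortest path from text to vectors.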

262 stars. No commits in the last 6 months.

Use this if you need to build applications or conduct research that accurately understands and processes text in Chinese minority languages like Tibetan, Mongolian, Uyghur, or Cantonese.

Not ideal if your primary focus is on processing standard Mandarin Chinese or other widely spoken global languages, as other models may be more efficient or comprehensive for those languages.

minority-language-processing text-analysis language-understanding linguistics-research content-categorization

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 16 / 25


Stars

262

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

ThilinaRajapakse/simpletransformers

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling,...

jsksxs360/How-to-use-Transformers

A quick-start tutorial for the Transformers library

google/deepconsensus

DeepConsensus uses gap-aware sequence transformers to correct errors in Pacific Biosciences...

Denis2054/Transformers-for-NLP-2nd-Edition

Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning,...

abhimishra91/transformers-tutorials

GitHub repo with tutorials on fine-tuning transformers for different NLP tasks
