zhusleep/pytorch_chinese_lm_pretrain

pytorch中文语言模型预训练

41
/ 100
Emerging

This project helps machine learning engineers and NLP researchers improve the performance of Chinese language models for specific applications. You can input domain-specific Chinese text data and use it to fine-tune existing popular language models like BERT, RoBERTa, or ERNIE. The output is a more specialized language model that performs better on tasks relevant to your domain.

385 stars. No commits in the last 6 months.

Use this if you need to adapt a general Chinese language model to perform exceptionally well on text data from a niche domain or specific task, like legal documents or medical reports.

Not ideal if you are looking for a pre-trained Chinese language model without any custom training or if your task doesn't require domain-specific adaptation.

natural-language-processing machine-learning-engineering chinese-text-analysis language-model-adaptation text-mining
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 23 / 25

How are scores calculated?

Stars

385

Forks

78

Language

Python

License

Last pushed

Jul 17, 2020

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/zhusleep/pytorch_chinese_lm_pretrain"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.