zhanlaoban/NLP_PEMDC

NLP Predtrained Embeddings, Models and Datasets Collections(NLP_PEMDC). The collection will keep updating.

34
/ 100
Emerging

This is a continuously updated collection of pre-trained word embeddings, models, and datasets specifically designed for natural language processing (NLP) tasks, primarily focusing on Chinese. It provides ready-to-use components to help researchers and students explore and build various text-based applications, taking in raw Chinese (and some English) text data and supporting the creation of systems for tasks like classification, sentiment analysis, or question answering. This is for NLP researchers, data scientists, and students working on text analysis, especially with Chinese language data.

No commits in the last 6 months.

Use this if you need a convenient, centralized resource for Chinese NLP components, including word vectors, pre-trained language models like BERT and RoBERTa, and diverse datasets for tasks like sentiment analysis, text classification, and reading comprehension.

Not ideal if you are looking for a plug-and-play NLP application or a library with high-level APIs for immediate integration into production systems, as this is a collection of resources for learning and research.

natural-language-processing chinese-text-analysis text-classification sentiment-analysis reading-comprehension
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 8 / 25
Community 18 / 25

How are scores calculated?

Stars

65

Forks

15

Language

License

Last pushed

Jan 14, 2020

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/zhanlaoban/NLP_PEMDC"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.