catqaq/OpenTextClassification
OpenTextClassification is all you need for text classification! Open text classification for everyone, enjoy your NLP journey! 这可能是目前为止最全面的开源文本分类项目,支持中英双语、多种模型、多种任务。
This project helps anyone who needs to automatically sort large volumes of text into predefined categories. You provide your raw text data and a list of categories, and it outputs each text piece with its assigned category (or categories). This is ideal for data scientists, analysts, or researchers dealing with text data in fields like content moderation, customer feedback analysis, or document organization.
209 stars. No commits in the last 6 months.
Use this if you need to classify text in English or Chinese, want to choose from various machine learning models (from simpler options to advanced transformers), or have text that belongs to multiple categories.
Not ideal if your classification task requires real-time, ultra-low latency processing on massive data streams, or if you need to classify text in languages other than English or Chinese without custom development.
Stars
209
Forks
22
Language
Python
License
—
Category
Last pushed
May 03, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/catqaq/OpenTextClassification"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
yongzhuo/Pytorch-NLU
中文文本分类、序列标注工具包(pytorch),支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Chinese text...
aniass/Product-Categorization-NLP
Multi-Class Text Classification for products based on their description with Machine Learning...
VinAIResearch/BERTweet
BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
hppRC/bert-classification-tutorial
【2023年版】BERTによるテキスト分類
zhanlaoban/Transformers_for_Text_Classification
基于Transformers的文本分类