Roshanson/TextInfoExp
自然语言处理实验(sougou数据集),TF-IDF,文本分类、聚类、词向量、情感识别、关系抽取等
This project offers a foundational toolkit for experimenting with various natural language processing tasks using the Sogou dataset. You can input Chinese text data and explore methods for text classification, clustering, sentiment analysis, and relationship extraction. It's designed for data scientists, NLP students, or researchers who are learning or testing different NLP algorithms.
1,729 stars. No commits in the last 6 months.
Use this if you need a pre-configured environment and basic implementations to learn about or benchmark common Chinese NLP techniques like TF-IDF, word embeddings, and text categorization.
Not ideal if you're looking for an out-of-the-box solution for production NLP tasks or if you require advanced, state-of-the-art models for specific applications.
Stars
1,729
Forks
764
Language
Python
License
—
Last pushed
Jul 18, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Roshanson/TextInfoExp"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
gaussic/text-classification-cnn-rnn
CNN-RNN中文文本分类,基于TensorFlow
ShawnyXiao/TextClassification-Keras
Text classification models implemented in Keras, including: FastText, TextCNN, TextRNN,...
prakashpandey9/Text-Classification-Pytorch
Text classification using deep learning models in Pytorch
TobiasLee/Text-Classification
Implementation of papers for text classification task on DBpedia
FreedomIntelligence/TextClassificationBenchmark
A Benchmark of Text Classification in PyTorch