cloudera/CML_AMP_Few-Shot_Text_Classification
Perform topic classification on news articles in several limited-labeled data regimes.
This project helps categorize news articles into predefined topics, even when you have very few or no labeled examples. You input a collection of news articles, and it outputs topic classifications for each article. This is ideal for content managers, market researchers, or anyone needing to quickly sort large volumes of text data with minimal manual effort.
No commits in the last 6 months.
Use this if you need to classify text data into categories but lack a large dataset of already-labeled examples.
Not ideal if you already have a well-labeled, extensive dataset for training traditional text classification models.
Stars
18
Forks
6
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Dec 03, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/cloudera/CML_AMP_Few-Shot_Text_Classification"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
huggingface/setfit
Efficient few-shot learning with Sentence Transformers
YujiaBao/Distributional-Signatures
"Few-shot Text Classification with Distributional Signatures" ICLR 2020
zhongyuchen/few-shot-text-classification
Few-shot binary text classification with Induction Networks and Word2Vec weights initialization
ELM-Research/ECG-Language-Models
A Training and Evaluation Framework for ECG-Language Models (ELMs)
StefanHeng/ECG-Representation-Learning
Self-supervised pre-training for ECG representation with inspiration from transformers & computer vision