microsoft/ASTRA
Self-training with Weak Supervision (NAACL 2021)
This framework helps data scientists and machine learning engineers create robust classification models faster when manually labeling large datasets is too expensive. By combining domain-specific rules, a small amount of labeled data, and a large pool of unlabeled data, it automatically generates high-quality weak labels. The output is a trained deep neural network capable of accurately classifying new instances.
163 stars. No commits in the last 6 months.
Use this if you need to train a classification model but lack sufficient manually labeled data and can define some heuristic rules for your domain.
Not ideal if you already have large-scale, high-quality labeled datasets or if your task doesn't easily lend itself to rule-based weak supervision.
Stars
163
Forks
21
Language
Python
License
MIT
Category
Last pushed
Jul 24, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/microsoft/ASTRA"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
AdaptiveMotorControlLab/CEBRA
Learnable latent embeddings for joint behavioral and neural analysis - Official implementation of CEBRA
theolepage/sslsv
Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker...
PaddlePaddle/PASSL
PASSL包含 SimCLR,MoCo v1/v2,BYOL,CLIP,PixPro,simsiam, SwAV, BEiT,MAE 等图像自监督算法以及 Vision...
YGZWQZD/LAMDA-SSL
30 Semi-Supervised Learning Algorithms
ModSSC/ModSSC
ModSSC: A Modular Framework for Semi Supervised Classification