JieyuZ2/Awesome-Weak-Supervision

A curated list of programmatic weak supervision papers and resources

35
/ 100
Emerging

Building machine learning models often requires vast amounts of labeled data, which can be expensive and time-consuming to create. This resource compiles research papers and tools focused on "weak supervision," a technique that uses programmatic rules or heuristics to automatically generate training data. If you are a machine learning practitioner or researcher, you can explore various approaches to efficiently label data using existing knowledge or simple scripts, helping you build models faster with less manual effort. This collection provides insights into generating training sets from rule-based inputs.

191 stars. No commits in the last 6 months.

Use this if you need to build machine learning models but struggle with the cost and time required to manually label large datasets.

Not ideal if you already have perfectly labeled, high-quality datasets for your machine learning tasks or are not involved in model development.

machine-learning-engineering data-labeling natural-language-processing computer-vision data-centric-ai
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 17 / 25

How are scores calculated?

Stars

191

Forks

28

Language

TeX

License

Last pushed

Mar 01, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/JieyuZ2/Awesome-Weak-Supervision"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.