JayThibs/Weak-Supervised-Learning-Case-Study

Exploring NLP weak supervision approaches to train text classification models. The project is also a prototype for a semi-automated text data labelling platform. Approaches: Snorkel and Zero-Shot Learning.

/ 100

Experimental

This project offers a semi-automated way to label text data for classification tasks. You input raw, unlabelled text documents, and it helps you produce a dataset where each text is categorized. This is useful for data scientists or machine learning engineers who need to prepare text datasets for training classification models.

No commits in the last 6 months.

Use this if you have a large volume of unlabelled text and need to quickly generate a labelled dataset for machine learning, reducing the manual effort of human annotators.

Not ideal if you require extremely high precision for critical applications where even small labelling errors could have significant consequences, as weak supervision may introduce some noise.

text-classification data-labelling natural-language-processing machine-learning-engineering

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

AdaptiveMotorControlLab/CEBRA

Learnable latent embeddings for joint behavioral and neural analysis - Official implementation of CEBRA

theolepage/sslsv

Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker...

PaddlePaddle/PASSL

PASSL包含 SimCLR，MoCo v1/v2，BYOL，CLIP，PixPro，simsiam, SwAV, BEiT，MAE 等图像自监督算法以及 Vision...

YGZWQZD/LAMDA-SSL

30 Semi-Supervised Learning Algorithms

ModSSC/ModSSC

ModSSC: A Modular Framework for Semi Supervised Classification

Explore ML Frameworks

All categories Trending ML Framework directory Insights