joisino/seafaring

Code for "Active Learning from the Web" (WWW 2023)

Score: 46 / 100 (Emerging)

This project helps machine learning engineers or researchers efficiently gather high-quality, labeled image data from vast online sources like Flickr or Open Images. By processing a small set of initial labeled images and a large pool of unlabeled web data, it identifies the most informative images to label next. The output is a more accurate machine learning model trained with less manual labeling effort.
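The selection step described above can be illustrated with entropy-based uncertainty sampling, a common generic active learning strategy. This is a minimal sketch and not necessarily the method used in the Seafaring paper or code. The function names and the example probabilities are hypothetical, and the class probabilities are assumed to come from a model already trained on the small labeled set.

```python
import math


def predictive_entropy(probs):
    """Shannon entropy of one predicted class distribution (higher = less certain)."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def select_most_informative(pool_probs, k):
    """Return the indices of the k pool items with the highest predictive entropy."""
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: predictive_entropy(pool_probs[i]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical model outputs for four unlabeled images (two classes each).
pool_probs = [
    [0.98, 0.02],  # confident prediction, little to gain from a label
    [0.55, 0.45],  # uncertain
    [0.80, 0.20],
    [0.50, 0.50],  # maximally uncertain
]

picked = select_most_informative(pool_probs, k=2)
print(picked)  # indices of the images to send for labeling
```

In a full loop, the picked images would be labeled, added to the training set, and the model retrained before the next round of selection.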

116 stars. No commits in the last 6 months.

Use this if you are building an image classification model and want to reduce the cost and time spent manually labeling training data by intelligently selecting which images to annotate from the web.

Not ideal if your data is not publicly available on the web, if you are not working with image data, or if you need to label data from a private, curated dataset rather than broad web sources.

Tags: machine-learning-engineering, image-classification, data-acquisition, web-scraping, model-training-optimization
Status: Stale (6m), No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25


Stars: 116
Forks: 23
Language: Python
License: MIT
Last pushed: Feb 14, 2023
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/joisino/seafaring"

Open to everyone: 100 requests/day with no key. A free key raises the limit to 1,000/day.
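The same request can be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body, which this page does not confirm.

```python
import json
import urllib.parse
import urllib.request

# Base URL taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, repo):
    """Build the endpoint URL for a category and an 'owner/name' repository."""
    path = "/".join([category, repo])
    return API_BASE + "/" + urllib.parse.quote(path, safe="/")


def fetch_quality(category, repo, timeout=10):
    """Fetch the quality record; assumes (unverified) that the body is JSON."""
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


url = build_quality_url("ml-frameworks", "joisino/seafaring")
print(url)
```

Calling `fetch_quality("ml-frameworks", "joisino/seafaring")` performs the actual request, so each call counts against the daily request limit.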