treeverse/example-get-started

Get started DVC project (NLP, random forest)

43
/ 100
Emerging

This helps data scientists or machine learning engineers manage and reproduce machine learning projects. You provide raw data, like StackOverflow questions in XML format, and it helps you process that data, train a model (specifically a random forest classifier for text tagging), and evaluate its performance. The outcome is a trained model that can predict tags for new text, along with metrics and plots showing how well it performs.

194 stars. No commits in the last 6 months.

Use this if you are a data scientist or ML engineer looking for a structured way to manage different versions of your data, models, and experiments, especially in a natural language processing context.

Not ideal if you are looking for a pre-built, production-ready solution to automatically tag text without needing to manage the underlying machine learning pipeline yourself.

Machine Learning Project Management Natural Language Processing Experiment Tracking Data Versioning Model Training
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 25 / 25

How are scores calculated?

Stars

194

Forks

186

Language

Python

License

Last pushed

May 27, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/treeverse/example-get-started"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.