treeverse/example-get-started
Get started DVC project (NLP, random forest)
This helps data scientists or machine learning engineers manage and reproduce machine learning projects. You provide raw data, like StackOverflow questions in XML format, and it helps you process that data, train a model (specifically a random forest classifier for text tagging), and evaluate its performance. The outcome is a trained model that can predict tags for new text, along with metrics and plots showing how well it performs.
194 stars. No commits in the last 6 months.
Use this if you are a data scientist or ML engineer looking for a structured way to manage different versions of your data, models, and experiments, especially in a natural language processing context.
Not ideal if you are looking for a pre-built, production-ready solution to automatically tag text without needing to manage the underlying machine learning pipeline yourself.
Stars
194
Forks
186
Language
Python
License
—
Category
Last pushed
May 27, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/treeverse/example-get-started"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
MMBazel/Springboard-DataScienceTrack-Student
Springboard Program: Data Science Career Track - NLP
TomLin/Playground
store my personal project
PhDLeToanThang/BA-DA-DS_vs_CI-CD_Copilot
Business Analytics - Data Analytics vs Data Scientist and Develop CI/CD Pipeline Data Framework...
gayathri1462/Suven-Consultants-and-Technology
During this Online Coding Internship, I have worked on projects related to Data Analytics and...
ucalyptus/dirac-dev
Internship tasks at Dirac business Solutions