PalaashAgrawal/allmond
LLM Training made Quick and Easy
This project helps machine learning engineers efficiently train, fine-tune, and evaluate Large Language Models (LLMs) for specific tasks or datasets. It takes raw text data (like a Hugging Face dataset) and a base LLM, then outputs a specialized, trained LLM ready for deployment. The ideal user is a data scientist or machine learning engineer working with LLMs who needs to quickly adapt models without deep dives into complex code.
No commits in the last 6 months.
Use this if you are a machine learning engineer or researcher who needs to rapidly prototype and train custom Large Language Models on various hardware configurations, from a single GPU to distributed setups.
Not ideal if you are looking for a no-code solution or don't have experience with Python and machine learning concepts.
Stars
7
Forks
—
Language
Python
License
—
Category
Last pushed
Sep 18, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/PalaashAgrawal/allmond"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
liangyuwang/Tiny-DeepSpeed
Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library
catherinesyeh/attention-viz
Visualizing query-key interactions in language + vision transformers (VIS 2023)
microsoft/Text2Grad
🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model...
FareedKhan-dev/Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's...