teticio/llama-squad
Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a generalized (foundation) model.
This project helps machine learning engineers and researchers adapt large, general-purpose models such as Llama 2 or 3 to a specific extractive question-answering task. It takes a foundation model and a dataset of questions, contexts, and exact answers, and outputs a specialized model that extracts the answer from the context or states that no answer is present. It is aimed at those fine-tuning existing large language models for precise information extraction.
No commits in the last 6 months.
Use this if you need to train a large language model to extract answers precisely from provided text and to recognize when the text does not contain the answer.
Not ideal if you're looking for a general-purpose chatbot or a model that generates creative, open-ended responses.
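The input and output described above can be sketched as a small formatting step. This is a minimal illustration, not the repository's actual code: it assumes records in the SQuAD v2 layout (`question`, `context`, and an `answers` field whose `text` list is empty for unanswerable questions), and the prompt wording, the `NO_ANSWER` sentinel, and the `to_training_example` helper are all hypothetical names chosen for this example.

```python
from typing import TypedDict


class Answers(TypedDict):
    text: list[str]          # empty for unanswerable questions in SQuAD v2
    answer_start: list[int]  # character offsets into the context


class SquadRecord(TypedDict):
    question: str
    context: str
    answers: Answers


# Hypothetical sentinel for unanswerable questions; not taken from the repository.
NO_ANSWER = "unanswerable"


def to_training_example(record: SquadRecord) -> dict[str, str]:
    """Turn one SQuAD v2 style record into a prompt/target pair (illustrative template)."""
    prompt = (
        "Extract the answer from the context. "
        f"If the context does not contain it, reply '{NO_ANSWER}'.\n\n"
        f"Context: {record['context']}\n"
        f"Question: {record['question']}\n"
        "Answer:"
    )
    texts = record["answers"]["text"]
    # Use the first reference answer, or the sentinel when none exists.
    target = texts[0] if texts else NO_ANSWER
    return {"prompt": prompt, "target": target}


# Illustrative records, written for this example only.
answerable: SquadRecord = {
    "question": "What is the capital of France?",
    "context": "Paris is the capital of France.",
    "answers": {"text": ["Paris"], "answer_start": [0]},
}
unanswerable: SquadRecord = {
    "question": "What is the population of Paris?",
    "context": "Paris is the capital of France.",
    "answers": {"text": [], "answer_start": []},
}

answerable_example = to_training_example(answerable)
unanswerable_example = to_training_example(unanswerable)
```

Training on both kinds of pairs is what lets a fine-tuned model abstain instead of guessing; the exact template and abstention token used by the project may differ from this sketch.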
Stars: 53
Forks: 10
Language: Python
License: GPL-3.0
Category:
Last pushed: Jun 07, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/teticio/llama-squad"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
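The same request can be made from Python with the standard library. This is a sketch under two assumptions that the listing does not confirm: that the endpoint returns JSON, and that other repositories follow the same `owner/repo` path pattern as the curl example. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path copied from the curl example; the owner/repo suffix pattern is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository."""
    return f"{BASE_URL}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for a repository, assuming the body is JSON."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("teticio", "llama-squad")` issues the same GET request as the curl command; with no key, it counts against the 100 requests/day limit.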
Related models
NotYuSheng/DialogSmith
Fine-tune an LLM on your Telegram chats to replicate your writing style using LLaMA-Factory and LoRA
yihong1120/Llama2-Telegram-Bot
Integration of the advanced llama2 AI model with Telegram to provide real-time chatbot...
michael-borck/simplebot
LLMs made simple for students and educators - A case study from Ship Python, Orchestrate AI
MohdSaleh/ThinkShift_LLM
🧠🔀 Seamlessly manage multiple LLM clients to overcome API limitations. ThinkShift_LLM automatically...
Garrachonr/LlamaDos
Fine-tuning of Llama2-7B to enable fluent conversation in Spanish.