Azure-Samples/azureai-foundry-finetuning-raft
A recipe that walks you through using either Meta Llama 3.1 405B or OpenAI GPT-4o deployed on Azure AI to generate a synthetic dataset with the RAFT method from UC Berkeley's Gorilla project.
This project helps improve the accuracy of Retrieval Augmented Generation (RAG) systems by teaching language models to better use retrieved information. It takes an existing RAG setup and uses powerful "teacher" models like GPT-4o or Llama 3.1 405B to create specialized training data. This data then fine-tunes a smaller "student" model, leading to more precise and relevant answers. This is for AI practitioners and RAG system developers looking to enhance their model's performance on specific knowledge domains.
No commits in the last 6 months.
Use this if you have a RAG system and want to make your language model more precise and reliable when answering questions based on retrieved documents or data.
Not ideal if you are looking for a general model distillation technique that doesn't focus on improving RAG system precision.
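The core RAFT idea described above pairs each teacher-generated question with the "oracle" document it came from plus a few distractor documents, so the fine-tuned student learns to answer from relevant context and ignore the rest. A minimal sketch of that data-assembly step is below; the helper names are hypothetical, and `teacher_generate` is a stub standing in for a real call to a deployed GPT-4o or Llama 3.1 405B endpoint.

```python
import random

def teacher_generate(prompt: str) -> str:
    # Stand-in for a chat-completion call to the teacher model
    # (GPT-4o or Llama 3.1 405B on Azure AI in the actual recipe).
    return f"[teacher output for: {prompt[:40]}...]"

def build_raft_example(oracle_chunk: str, all_chunks: list[str],
                       num_distractors: int = 3, seed: int = 0) -> dict:
    """Assemble one RAFT-style training example: a synthetic question,
    the oracle chunk mixed with distractor chunks, and a
    chain-of-thought answer grounded in the oracle."""
    rng = random.Random(seed)
    distractors = rng.sample(
        [c for c in all_chunks if c != oracle_chunk],
        k=min(num_distractors, len(all_chunks) - 1),
    )
    question = teacher_generate(
        f"Write a question answerable from: {oracle_chunk}"
    )
    answer = teacher_generate(
        f"Answer with reasoning, citing the context: {oracle_chunk}\n"
        f"Q: {question}"
    )
    context = distractors + [oracle_chunk]
    rng.shuffle(context)  # the oracle's position should not be a cue
    return {"question": question, "context": context, "cot_answer": answer}

chunks = [
    "RAFT mixes oracle and distractor docs.",
    "Fine-tuning uses a smaller student model.",
    "Teacher models generate synthetic QA pairs.",
]
example = build_raft_example(chunks[0], chunks)
```

The resulting records (question, mixed context, chain-of-thought answer) are what gets written out as the fine-tuning dataset for the student model.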
Stars: 77
Forks: 27
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Jul 17, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/Azure-Samples/azureai-foundry-finetuning-raft"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
llmware-ai/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
Sinapsis-AI/sinapsis-chatbots
Monorepo for Sinapsis templates supporting LLM-based agents
aimclub/ProtoLLM
Framework for prototyping of LLM-based applications
xi029/Qwen3-VL-MoeLORA
Compares the results of several LoRA fine-tuning variants on Qwen3-VL-4B-Instruct, Qwen's latest multimodal image-text model, deployed via LangChain + RAG + multi-agent (Multi-Agent) orchestration
pkargupta/taxoadapt
Dynamically constructs and adapts an LLM-generated taxonomy to a given corpus across multiple dimensions.