Hemanthkumar2112/Reward-Modeling-RLHF-Finetune-and-RAG

Sample code base for fine-tuning Gemma2 (9B) and Llama3-8B with RAG, implemented on the Kaggle platform

Score: 39 / 100 (Emerging)

This project helps AI practitioners and researchers improve the quality and relevance of large language model outputs. By collecting human preferences on different model responses, you can train a 'reward model' that guides the language model to generate text that better aligns with human expectations. This allows for fine-tuning models like Llama3 8B or Gemma2 9B to produce more desirable and contextually accurate results for various applications.
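Reward models of this kind are typically trained with a pairwise (Bradley-Terry) objective: given a human-preferred response and a rejected one, the model should assign the preferred response a higher scalar score. A minimal sketch of that loss, assuming scalar reward scores per response (the function name and values are illustrative, not taken from this repository):

```python
import math

def pairwise_preference_loss(chosen_score: float, rejected_score: float) -> float:
    """Bradley-Terry reward-modeling loss: -log(sigmoid(r_chosen - r_rejected)).

    The loss shrinks as the reward model scores the human-preferred
    (chosen) response increasingly above the rejected one.
    """
    margin = chosen_score - rejected_score
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# A model that ranks the preferred response higher incurs a small loss;
# ranking it lower incurs a large one.
print(pairwise_preference_loss(2.0, 0.0))
print(pairwise_preference_loss(0.0, 2.0))
```

Minimizing this loss over a dataset of human preference pairs is what produces the reward signal used to guide the fine-tuned language model.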

No commits in the last 6 months.

Use this if you need to fine-tune existing large language models to produce highly relevant, human-aligned, and contextually rich text outputs for specific tasks or domains.

Not ideal if you are looking for a plug-and-play solution without any technical knowledge of machine learning or data collection for model training.

Tags: AI-research, NLP-development, language-model-fine-tuning, generative-AI, content-generation

Status: Stale (6 months) · No Package · No Dependents

Maintenance: 0 / 25
Adoption: 6 / 25
Maturity: 16 / 25
Community: 17 / 25


Stars: 22
Forks: 8
Language: Jupyter Notebook
License: Apache-2.0
Last pushed: Feb 08, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/Hemanthkumar2112/Reward-Modeling-RLHF-Finetune-and-RAG"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
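The curl call above maps directly onto any HTTP client. A minimal sketch using only the Python standard library; the helper names are ours, and only the endpoint URL comes from this page (the response is assumed to be JSON, which the API's shape suggests but this page does not state):

```python
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality/rag"

def quality_url(owner: str, repo: str) -> str:
    """Build the quality-API endpoint URL for a GitHub owner/repo pair."""
    return f"{API_BASE}/{owner}/{repo}"

def fetch_quality(owner: str, repo: str) -> dict:
    """Fetch and parse the quality report (no API key needed for
    up to 100 requests/day, per the notice above)."""
    with urllib.request.urlopen(quality_url(owner, repo)) as resp:
        return json.load(resp)

print(quality_url("Hemanthkumar2112", "Reward-Modeling-RLHF-Finetune-and-RAG"))
```

`fetch_quality` performs the network call, so the URL construction is kept separate and usable offline.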