nabeelshan78/safe-llm-adaptation-peft-rlhf
An end-to-end pipeline for adapting FLAN-T5 for dialogue summarization, exploring the full spectrum of modern LLM tuning. Implements and compares Full Fine-Tuning, PEFT (LoRA), and Reinforcement Learning (RLHF) for performance and alignment. Features a PPO-tuned model to reduce toxicity, in-depth analysis notebooks, and interactive Streamlit demo.
No commits in the last 6 months.
Stars
1
Forks
—
Language
Jupyter Notebook
License
—
Category
Last pushed
Aug 04, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/nabeelshan78/safe-llm-adaptation-peft-rlhf"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
HHousen/TransformerSum
Models to perform neural summarization (extractive and abstractive) using machine learning...
sergio11/llm_finetuning_and_evaluation
The LLM FineTuning and Evaluation project 🚀 enhances FLAN-T5 models for tasks like summarizing...
sahilichake/Document-Summarization-App-using-LLM
Document Summarization App using large language model (LLM) and Langchain framework. Used a...
Nihal108-bi/TextSummrizer
End-to-end text summarization project that fine-tunes PEGASUS (`google/pegasus-cnn_dailymail`)...
arq105/llm-speech-summarization
📘 Summarize speeches and documents swiftly with advanced techniques using LangChain and Groq's...