nabeelshan78/safe-llm-adaptation-peft-rlhf

An end-to-end pipeline for adapting FLAN-T5 for dialogue summarization, exploring the full spectrum of modern LLM tuning. Implements and compares Full Fine-Tuning, PEFT (LoRA), and Reinforcement Learning (RLHF) for performance and alignment. Features a PPO-tuned model to reduce toxicity, in-depth analysis notebooks, and interactive Streamlit demo.

/ 100

Experimental

No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 1 / 25

Maturity 7 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Jupyter Notebook

License

—

Higher-rated alternatives

HHousen/TransformerSum

Models to perform neural summarization (extractive and abstractive) using machine learning...

sergio11/llm_finetuning_and_evaluation

The LLM FineTuning and Evaluation project 🚀 enhances FLAN-T5 models for tasks like summarizing...

sahilichake/Document-Summarization-App-using-LLM

Document Summarization App using large language model (LLM) and Langchain framework. Used a...

Nihal108-bi/TextSummrizer

End-to-end text summarization project that fine-tunes PEGASUS (`google/pegasus-cnn_dailymail`)...

arq105/llm-speech-summarization

📘 Summarize speeches and documents swiftly with advanced techniques using LangChain and Groq's...

Explore Transformer Models

All categories Trending Transformer directory Insights