arunprsh/ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO
A Practical Guide to Developing a Reliable FAQ Chatbot with Reinforcement Learning and Human Feedback using GPT-2 on AWS
This project guides you through creating a reliable, domain-specific FAQ chatbot. You'll learn how to take your organization's frequently asked questions and answers to build an intelligent bot that can respond to user queries. This is ideal for machine learning engineers or data scientists looking to implement advanced chatbot solutions.
No commits in the last 6 months.
Use this if you are an ML engineer or data scientist tasked with building a robust FAQ chatbot for your organization using GPT-2 and reinforcement learning.
Not ideal if you are looking for a pre-built chatbot solution or lack experience with machine learning, AWS, and Python.
Stars
14
Forks
4
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Feb 11, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/arunprsh/ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hud-evals/hud-python
OSS RL environment + evals toolkit
hiyouga/EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
OpenRL-Lab/openrl
Unified Reinforcement Learning Framework
sail-sg/oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning,...
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)