rlhf
Here are 116 public repositories matching this topic...
RAG Law systems based on Google Search and Gemini Pro
Updated Mar 14, 2024 - Python
This repository was committed while carrying out key tasks that underpin modern Generative AI concepts. In particular, we focused on three coding tasks involving Large Language Models. Further necessary details are given in the README.md file.
Updated Mar 28, 2024 - Jupyter Notebook
This repository is dedicated to small projects and some theoretical material that I used to get into NLP and LLMs in a practical and efficient way.
Updated May 6, 2024 - Jupyter Notebook
Projects and Models built in Python leveraging PyTorch, implementing Reinforcement Learning algorithms for reward-based tasks.
Updated May 7, 2024 - Jupyter Notebook
Researching the reinforcement learning algorithm of ChatGPT
Updated Apr 7, 2023 - Jupyter Notebook
Reinforcement Learning Tutorial (强化学习教程)
Updated Sep 10, 2023
[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
Updated May 20, 2024 - Python
Library built on TextRL for easy training and usage of fine-tuned models using RLHF, a reward model, and PPO
Updated Feb 28, 2024 - Python
A program that enhances and customizes ChatGPT's underlying pre-trained LLM with a transformer architecture. Based on OpenAI's beta InstructGPT fine-tuned model.
Updated Jul 30, 2023
Large Language Model for Competitive Programming
Updated Apr 28, 2023 - Python
JavaScript client library for managing your LLM data in one place
Updated May 3, 2023 - JavaScript
Robot Learning from Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.
Updated Apr 16, 2023 - Python
Applying quantum computing principles to large language models for more reliable, interpretable, and steerable systems.
Updated Jan 5, 2024 - Python
Survey of preference alignment algorithms
Updated Feb 25, 2024
Intelligent AI chatbot that can learn from the user
Updated Mar 22, 2024 - Python
Some experiments with activation steering in LLMs
Updated Jan 21, 2024 - Python