Welcome to “RL for Language Model Training”!
Language Models: Introduction
Language Model Training & Introduction to RL
Homework 1
Reinforcement Learning (part 2)
Reinforcement Learning (part 3): RLHF and PPO
SOTA systems
Evaluating LLMs: Part 1
Homework 2
Evaluating LLMs: Part 2
Evaluating LLMs: Part 3
RL fine-tuning: Outlook
Limitations & Implications of LLMs: Part 1
Limitations & Implications of LLMs: Part 2
Outlook: LLMs
Homework 3
Outlook & Recap