Welcome to “RL for Language Model Training”!

Language Models: Introduction
Language Model Training & Introduction to RL
- Homework 1
Reinforcement Learning (part 2)
Reinforcement Learning (part 3): RLHF and PPO
SOTA systems
Evaluating LLMs: Part 1
- Homework 2
Evaluating LLMs: Part 2
Evaluating LLMs: Part 3
RL fine-tuning: Outlook
Limitations & Implications of LLMs: Part 1
Limitations & Implications of LLMs: Part 2
Outlook: LLMs
- Homework 3
Outlook & Recap

By Polina Tsvilodub

© Copyright 2023.