John Schulman (OpenAI Cofounder) - Reasoning, RLHF, & Plan for 2027 AGI
May 15, 2024
John Schulman on how posttraining tames the shoggoth, and the nature of the progress to come...
Timestamps:
00:00:00 Pre-training, post-training, and future capabilities 00:17:21 Plan for AGI 2025 00:29:43 Teaching models to reason 00:41:14 The Road to ChatGPT 00:52:37 What makes for a good RL researcher? 01:01:22 Keeping humans in the loop 01:15:39 State of research, plateaus, and moats