Reasoning Models in Generative AI: How the Next Generation of LLMs Can Think
Data Hub Tech Talk
Friday April 18, 2025 12:30pm-1:30pm
Library 107 – Data Hub
The latest frontier “thinking models” can apparently begin to match the reasoning performance of humans. We will do a technical deep dive on the likely underpinnings of the latest generation of frontier reasoning models, including OpenAI’s o1 and o3. We will discuss and implement in Python a simple LLM that uses self-taught reasoning (STR) and Q-learning at both training and inference.