The Art of Artificial Reasoning for Small Language Models

Jul 13, 2025

Speakers

About

Large reasoning models such as Deepseek's R1 and OpenAI's O1/O3 have demonstrated the power of reinforcement learning to enable a new axis of scaling — test-time compute. This has catalyzed intensive research across the open-source community, generating rapid progress but also seemingly contradictory results. In this talk, I will present critical insights into the conditions under which reinforcement learning thrives or struggles, and how we can induce stronger reasoning capabilities from small language models, closing the gap against the larger counterparts in specific domains.

Organizer

Categories

Like the format? Trust SlidesLive to capture your next event!

Professional recording and live streaming, delivered globally.

Sharing

Recommended Videos

Presentations on similar topic, category or speaker

Interested in talks like this? Follow ICML 2025