Best AI papers explained

Un pódcast de Enoch H. Kang

550 Episodo

How do LLMs use their depth?
Publicado: 27/10/2025
Thought Communication in Multiagent Collaboration
Publicado: 27/10/2025
Reasoning with Sampling: Base Models Outperform RL
Publicado: 26/10/2025
Continual Learning via Sparse Memory Finetuning
Publicado: 26/10/2025
Direct Preference Optimization with Unobserved Preference Heterogeneity: The Necessity of Ternary Preferences
Publicado: 24/10/2025
The Coverage Principle: How Pre-Training Enables Post-Training
Publicado: 24/10/2025
The Era of Real-World Human Interaction: RL from User Conversations
Publicado: 24/10/2025
Agent Learning via Early Experience
Publicado: 24/10/2025
Demystifying the Mechanisms Behind Emergent Exploration in Goal-conditioned RL
Publicado: 22/10/2025
Rewriting History: A Recipe for Interventional Analyses to Study Data Effects on Model Behavior
Publicado: 22/10/2025
A Definition of AGI
Publicado: 22/10/2025
Provably Learning from Language Feedback
Publicado: 21/10/2025
In-Context Learning for Pure Exploration
Publicado: 21/10/2025
On the Role of Preference Variance in Preference Optimization
Publicado: 20/10/2025
Training LLM Agents to Empower Humans
Publicado: 20/10/2025
Richard Sutton Declares LLMs a Dead End
Publicado: 20/10/2025
Demystifying Reinforcement Learning in Agentic Reasoning
Publicado: 19/10/2025
Emergent coordination in multi-agent language models
Publicado: 19/10/2025
Learning-to-measure: in-context active feature acquisition
Publicado: 19/10/2025
Andrej Karpathy's insights: AGI, Intelligence, and Evolution
Publicado: 19/10/2025

3 / 28

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

Visit the podcast's native language site

550 Episodo

How do LLMs use their depth?

Thought Communication in Multiagent Collaboration

Reasoning with Sampling: Base Models Outperform RL

Continual Learning via Sparse Memory Finetuning

Direct Preference Optimization with Unobserved Preference Heterogeneity: The Necessity of Ternary Preferences

The Coverage Principle: How Pre-Training Enables Post-Training

The Era of Real-World Human Interaction: RL from User Conversations

Agent Learning via Early Experience

Demystifying the Mechanisms Behind Emergent Exploration in Goal-conditioned RL

Rewriting History: A Recipe for Interventional Analyses to Study Data Effects on Model Behavior

A Definition of AGI

Provably Learning from Language Feedback

In-Context Learning for Pure Exploration

On the Role of Preference Variance in Preference Optimization

Training LLM Agents to Empower Humans

Richard Sutton Declares LLMs a Dead End

Demystifying Reinforcement Learning in Agentic Reasoning

Emergent coordination in multi-agent language models

Learning-to-measure: in-context active feature acquisition

Andrej Karpathy's insights: AGI, Intelligence, and Evolution