Best AI papers explained
A podcast by Enoch H. Kang
506 Episodio
-  Direct Preference Optimization with Unobserved Preference Heterogeneity: The Necessity of Ternary PreferencesPubblicato: 24/10/2025
-  The Coverage Principle: How Pre-Training Enables Post-TrainingPubblicato: 24/10/2025
-  The Era of Real-World Human Interaction: RL from User ConversationsPubblicato: 24/10/2025
-  Agent Learning via Early ExperiencePubblicato: 24/10/2025
-  Demystifying the Mechanisms Behind Emergent Exploration in Goal-conditioned RLPubblicato: 22/10/2025
-  Rewriting History: A Recipe for Interventional Analyses to Study Data Effects on Model BehaviorPubblicato: 22/10/2025
-  A Definition of AGIPubblicato: 22/10/2025
-  Provably Learning from Language FeedbackPubblicato: 21/10/2025
-  In-Context Learning for Pure ExplorationPubblicato: 21/10/2025
-  On the Role of Preference Variance in Preference OptimizationPubblicato: 20/10/2025
-  Training LLM Agents to Empower HumansPubblicato: 20/10/2025
-  Richard Sutton Declares LLMs a Dead EndPubblicato: 20/10/2025
-  Demystifying Reinforcement Learning in Agentic ReasoningPubblicato: 19/10/2025
-  Emergent coordination in multi-agent language modelsPubblicato: 19/10/2025
-  Learning-to-measure: in-context active feature acquisitionPubblicato: 19/10/2025
-  Andrej Karpathy's insights: AGI, Intelligence, and EvolutionPubblicato: 19/10/2025
-  Front-Loading Reasoning: The Synergy between Pretraining and Post-Training DataPubblicato: 18/10/2025
-  Representation-Based Exploration for Language Models: From Test-Time to Post-TrainingPubblicato: 18/10/2025
-  The attacker moves second: stronger adaptive attacks bypass defenses against LLM jail- Breaks and prompt injectionsPubblicato: 18/10/2025
-  When can in-context learning generalize out of task distribution?Pubblicato: 16/10/2025
Cut through the noise. We curate and break down the most important AI papers so you don’t have to.
