Best AI papers explained
A podcast by Enoch H. Kang
506 Episodio
-  The Art of Scaling Reinforcement Learning Compute for LLMsPubblicato: 16/10/2025
-  A small number of samples can poison LLMs of any sizePubblicato: 16/10/2025
-  Dual Goal RepresentationsPubblicato: 14/10/2025
-  Welcome to the Era of ExperiencePubblicato: 14/10/2025
-  Value Flows: Flow-Based Distributional Reinforcement LearningPubblicato: 14/10/2025
-  Self-Adapting Language ModelsPubblicato: 12/10/2025
-  The Markovian ThinkerPubblicato: 12/10/2025
-  Moloch’s Bargain: emergent misalignment when LLMs compete for audiencesPubblicato: 12/10/2025
-  Transformer Predictor Dynamics and Task DiversityPubblicato: 11/10/2025
-  Base models know how to reason, thinking models learn whenPubblicato: 11/10/2025
-  Spectrum tuning: Post-training for distributional coverage and in-context steerabilityPubblicato: 11/10/2025
-  Understanding Prompt Tuning and In-Context Learning via Meta-LearningPubblicato: 11/10/2025
-  MLPs Learn In-Context on Regression and Classification tasksPubblicato: 11/10/2025
-  Is Pre-Training Truly Better than Meta-Learning?Pubblicato: 11/10/2025
-  Agentic Context Engineering: Evolving Contexts for Self-Improving Language ModelsPubblicato: 11/10/2025
-  Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMsPubblicato: 09/10/2025
-  Learning dynamics of LLM finetuningPubblicato: 09/10/2025
-  Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHFPubblicato: 09/10/2025
-  OpenAI Agent Builder and n8n: Orchestrating Reasoning Versus Automating ProcessPubblicato: 08/10/2025
-  Training Agents Inside of Scalable World ModelsPubblicato: 08/10/2025
Cut through the noise. We curate and break down the most important AI papers so you don’t have to.
