Kubernetes GPU Management Just Got a Major Upgrade

The New Stack Podcast - A podcast by The New Stack

Podcast artwork

Categorie:

Nvidia Distinguished Engineer Kevin Klues noted that low-level systems work is invisible when done well and highly visible when it fails — a dynamic that frames current Kubernetes innovations for AI. At KubeCon + CloudNativeCon North America 2025, Klues and AWS product manager Jesse Butler discussed two emerging capabilities: dynamic resource allocation (DRA) and a new workload abstraction designed for sophisticated AI scheduling.

Visit the podcast's native language site