"Single-source cross-platform GPU LLM inference with Slang and Rust" ( 2026 )

Saturday at 13:05, 20 minutes, UD2.120 (Chavanne), UD2.120 (Chavanne), AI Plumbers Crozet Sébastien , video

Leveraging Rust and Khronos' emerging Slang initiative, we introduce our efforts toward a cross-platform GPU LLM inference ecosystem. With a single-source approach we aim to minimize backend-specific code and foster community participation by writing inference kernels once and run them everywhere.