FOSDEM 2026

"Single-source cross-platform GPU LLM inference with Slang and Rust" ( 2026 )

Saturday at 13:05, 20 minutes, UD2.120 (Chavanne), UD2.120 (Chavanne), AI Plumbers Crozet Sébastien , video

Leveraging Rust and Khronos' emerging Slang initiative, we introduce our efforts toward a cross-platform GPU LLM inference ecosystem. With a single-source approach we aim to minimize backend-specific code and foster community participation by writing inference kernels once and run them everywhere.

2026

0.49 "Vulkan API for Machine Learning? Competing with CUDA and ROCm in llama.cpp"
0.49 "From Infrastructure to Production: A Year of Self-Hosted LLMs"
0.48 "Rust meets cheap bare-metal RISC-V"
0.48 "Supercharging LLM serving with Dynamo"

2025

0.52 "Expanding GGML Hardware Support using the Vulkan API"
0.50 "GPUStack: Building a Simple and Scalable Management Experience for Diverse AI Models"
0.50 "The bare metal perspective on AMD's GPU ASICs"
0.49 "Rust for Linux: an overview"
0.48 "Rust for Linux"

2024

0.48 "ε-serde / mem_dbg / sux / dsi-bitstream / webgraph: a Rust ecosystem for large graph processing"

Related: