BotBeat
OPEN SOURCE · NVIDIA · 2026-05-08

CUDA-oxide: New Rust-to-CUDA Compiler Enables Pure Rust GPU Programming

Key Takeaways

  • cuda-oxide enables writing GPU kernels in pure Rust without DSLs or foreign-language bindings, bringing GPU programming closer to the Rust developer experience
  • The compiler demonstrates a complete pipeline from Rust MIR through multiple IR stages to CUDA PTX generation in a single cargo build invocation
  • Support for generic kernels and closure captures allows developers to write type-safe, composable GPU code with automatic parameter passing
Sources:
Hacker News: https://github.com/NVlabs/cuda-oxide
Hacker News: https://nvlabs.github.io/cuda-oxide/index.html

Summary

cuda-oxide is an experimental open-source Rust-to-CUDA compiler that lets developers write GPU kernels in pure Rust and compiles them directly to CUDA PTX. The project pairs a custom rustc codegen backend with device-side abstractions for type-safe GPU programming, removing the need for DSLs or foreign-language bindings. It supports single-source compilation, where host and device code live in the same Rust file, and offers generic kernel functions, closure captures, and both synchronous and asynchronous launch APIs.
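To make "generic kernel functions" and "closure captures" concrete, here is a minimal CPU-only sketch in plain Rust. It does not use cuda-oxide and does not reflect its actual API: `launch_on_cpu`, `map2`, and `saxpy` are hypothetical names invented for illustration, and the "launch" is just a sequential loop over thread indices. The point is only the shape of the code: one body generic over the element type, with the scalar `a` captured by a closure rather than passed as an explicit kernel argument.

```rust
use std::ops::{Add, Mul};

/// Hypothetical CPU stand-in for a kernel launch (NOT a cuda-oxide API):
/// calls `kernel` once per logical thread index, sequentially.
fn launch_on_cpu<F: FnMut(usize)>(threads: usize, mut kernel: F) {
    for tid in 0..threads {
        kernel(tid);
    }
}

/// Generic "kernel body": out[i] = f(x[i], y[i]) for any copyable T.
/// The closure `f` may capture values from the caller's scope.
fn map2<T, F>(x: &[T], y: &[T], out: &mut [T], f: F)
where
    T: Copy,
    F: Fn(T, T) -> T,
{
    // Guard against mismatched lengths, as a bounds check in a kernel would.
    let n = out.len().min(x.len()).min(y.len());
    launch_on_cpu(n, |tid| out[tid] = f(x[tid], y[tid]));
}

/// SAXPY-style helper: the scalar `a` reaches the body through closure capture.
fn saxpy<T>(a: T, x: &[T], y: &[T]) -> Vec<T>
where
    T: Copy + Default + Add<Output = T> + Mul<Output = T>,
{
    let mut out = vec![T::default(); x.len().min(y.len())];
    map2(x, y, &mut out, move |xi, yi| a * xi + yi);
    out
}

fn main() {
    // The same generic body is instantiated for f32 and for i32.
    println!("{:?}", saxpy(2.0f32, &[1.0, 2.0, 3.0], &[10.0, 20.0, 30.0]));
    println!("{:?}", saxpy(3i32, &[1, 2], &[4, 5]));
}
```

On a real GPU each index would run on its own thread; the sequential loop here only mimics the indexing model.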

The implementation uses a native Rust compilation pipeline built on Pliron, an MLIR-like intermediate representation framework, and lowers Rust code through several IR stages (Rust MIR → Pliron IR → LLVM IR) before emitting PTX for GPU execution. The project is in alpha and under active development, and the authors invite community feedback and contributions. Key capabilities include device-side memory abstractions, thread indexing, shared memory management, atomic operations, and support for NVIDIA's Tensor Memory Accelerator (TMA) and warp/cluster operations.

  • Both synchronous and asynchronous APIs are provided, enabling flexible GPU computation patterns with .sync() or .await for DeviceOperation execution
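The sync/async split described above can be sketched in plain Rust using only the standard library. This is not cuda-oxide's `DeviceOperation`: `LazyOp`, `NoopWake`, and `block_on` are hypothetical stand-ins, and the work runs on the CPU. The sketch assumes only the general pattern of a deferred operation that does nothing until the caller either blocks on it with `.sync()` or turns it into a future, which is the step `.await` performs via `IntoFuture`.

```rust
use std::future::{ready, Future, IntoFuture, Ready};
use std::pin::Pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

/// Hypothetical deferred operation (NOT cuda-oxide's type): holds work
/// that does not run until the caller chooses how to execute it.
struct LazyOp<F> {
    work: F,
}

impl<T, F: FnOnce() -> T> LazyOp<F> {
    fn new(work: F) -> Self {
        LazyOp { work }
    }

    /// Blocking path: run the work on the calling thread and return the result.
    fn sync(self) -> T {
        (self.work)()
    }
}

/// Async path: `op.await` inside async code calls `into_future` first.
/// This sketch runs the work at that point and returns a completed future;
/// a GPU-backed version would instead poll for device completion.
impl<T, F: FnOnce() -> T> IntoFuture for LazyOp<F> {
    type Output = T;
    type IntoFuture = Ready<T>;

    fn into_future(self) -> Ready<T> {
        ready((self.work)())
    }
}

/// Waker that does nothing; enough for the busy-polling executor below.
struct NoopWake;

impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

/// Minimal executor for the example: polls the future until it is ready.
fn block_on<Fut: Future>(fut: Fut) -> Fut::Output {
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);
    let mut fut: Pin<Box<Fut>> = Box::pin(fut);
    loop {
        if let Poll::Ready(value) = fut.as_mut().poll(&mut cx) {
            return value;
        }
        std::thread::yield_now();
    }
}

fn sum_to(n: u64) -> u64 {
    (1..=n).sum()
}

fn main() {
    // Same operation, two execution styles.
    let blocking = LazyOp::new(|| sum_to(10)).sync();
    let awaited = block_on(LazyOp::new(|| sum_to(10)).into_future());
    println!("sync = {}, async = {}", blocking, awaited);
}
```

Inside an `async` function, `block_on(op.into_future())` would simply be written `op.await`; the explicit form is used here so the example needs no async runtime.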
