BotBeat
NVIDIA · OPEN SOURCE · 2026-05-08

CUDA-oxide: New Rust-to-CUDA Compiler Enables Pure Rust GPU Programming

Key Takeaways

  • cuda-oxide enables writing GPU kernels in pure Rust without DSLs or foreign language bindings, bringing GPU programming closer to the Rust developer experience
  • The compiler demonstrates a complete pipeline from Rust MIR through multiple IR stages to CUDA PTX generation in a single cargo build invocation
  • Support for generic kernels and closure captures allows developers to write type-safe, composable GPU code with automatic parameter passing
Sources:
Hacker News: https://github.com/NVlabs/cuda-oxide
Hacker News: https://nvlabs.github.io/cuda-oxide/index.html

Summary

cuda-oxide is an experimental open-source Rust-to-CUDA compiler that lets developers write GPU kernels in pure Rust and compile them directly to PTX. The project combines a custom rustc codegen backend with device-side abstractions for type-safe GPU programming, eliminating the need for DSLs or foreign language bindings. It supports single-source compilation, where host and device code live in the same Rust file, along with generic kernel functions, closure captures, and both synchronous and asynchronous launch APIs.
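To make "generic kernels and closure captures" concrete, here is a minimal CPU-only sketch in plain Rust. It does not use the cuda-oxide API, which this article does not show; `ThreadIdx`, `map_kernel`, and `launch_map` are hypothetical names invented for illustration, and the "launch" is just a sequential loop over a 1-D grid.

```rust
// CPU-only emulation of a generic, closure-driven kernel.
// NOT the cuda-oxide API: every name below is a hypothetical stand-in.

/// Stand-in for a GPU thread's position in a 1-D launch grid.
#[derive(Clone, Copy)]
struct ThreadIdx {
    block: usize,
    thread: usize,
    block_dim: usize,
}

impl ThreadIdx {
    /// Global index: the usual block * block_dim + thread arithmetic.
    fn global(self) -> usize {
        self.block * self.block_dim + self.thread
    }
}

/// A "kernel" generic over the element type `T` and the per-element closure `F`.
fn map_kernel<T: Copy, F: Fn(T) -> T>(idx: ThreadIdx, input: &[T], output: &mut [T], f: &F) {
    let i = idx.global();
    // Bounds guard: the grid may contain more threads than elements.
    if i < input.len() && i < output.len() {
        output[i] = f(input[i]);
    }
}

/// Sequential stand-in for a launch: visits every (block, thread) pair once.
fn launch_map<T: Copy, F: Fn(T) -> T>(
    grid_dim: usize,
    block_dim: usize,
    input: &[T],
    output: &mut [T],
    f: F,
) {
    for block in 0..grid_dim {
        for thread in 0..block_dim {
            map_kernel(ThreadIdx { block, thread, block_dim }, input, output, &f);
        }
    }
}
```

Calling `launch_map(2, 4, &input, &mut output, move |x| x * scale)` with a captured `scale` shows the idea: the closure's captured value travels with the kernel, and the same kernel body works for any `Copy` element type.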

The implementation uses a native Rust compilation pipeline built on Pliron, an MLIR-like intermediate representation framework, converting Rust code through multiple IR stages (Rust MIR → Pliron IR → LLVM IR) before generating PTX for GPU execution. The project is currently in alpha stage and actively under development, with the authors inviting community feedback and contributions. Key capabilities include device-side memory abstractions, thread indexing, shared memory management, atomic operations, and support for NVIDIA's Tensor Memory Accelerator (TMA) and warp/cluster operations.
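The stage ordering described above can be written down as a small Rust model. This is only a sketch of the sequence the article names (Rust MIR, Pliron IR, LLVM IR, PTX); the `Stage` enum and its functions are hypothetical and say nothing about how the compiler is actually structured internally.

```rust
// Toy model of the lowering order named in the article.
// Hypothetical types; not taken from the cuda-oxide source.

/// The representations named in the article, in lowering order.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Stage {
    RustMir,
    PlironIr,
    LlvmIr,
    Ptx,
}

impl Stage {
    /// The stage produced by lowering this one, or `None` once PTX is reached.
    fn lower(self) -> Option<Stage> {
        match self {
            Stage::RustMir => Some(Stage::PlironIr),
            Stage::PlironIr => Some(Stage::LlvmIr),
            Stage::LlvmIr => Some(Stage::Ptx),
            Stage::Ptx => None,
        }
    }
}

/// Collects every stage visited when lowering from `start` down to PTX.
fn lowering_path(start: Stage) -> Vec<Stage> {
    let mut path = vec![start];
    let mut current = start;
    while let Some(next) = current.lower() {
        path.push(next);
        current = next;
    }
    path
}
```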

  • Both synchronous and asynchronous APIs are provided, enabling flexible GPU computation patterns with .sync() or .await for DeviceOperation execution
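The difference between the blocking and awaitable styles can be illustrated with standard-library Rust alone. The sketch below is an assumption-laden stand-in, not cuda-oxide's `DeviceOperation`: `CpuOperation` and `block_on` are hypothetical, the work runs on the CPU, and the tiny executor simply polls in a loop.

```rust
// CPU-only stand-in for an operation that can be run with .sync() or .await.
// Hypothetical names; this is not cuda-oxide's DeviceOperation.
use std::future::{ready, Future, IntoFuture, Ready};
use std::pin::pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

/// A deferred unit of work, executed only when consumed.
struct CpuOperation<F> {
    work: F,
}

impl<T, F: FnOnce() -> T> CpuOperation<F> {
    fn new(work: F) -> Self {
        CpuOperation { work }
    }

    /// Blocking style: run the work now and return its result.
    fn sync(self) -> T {
        (self.work)()
    }
}

/// Awaitable style: `.await` on the operation goes through `IntoFuture`.
/// In this sketch the work runs when the future is created.
impl<T, F: FnOnce() -> T> IntoFuture for CpuOperation<F> {
    type Output = T;
    type IntoFuture = Ready<T>;

    fn into_future(self) -> Ready<T> {
        ready((self.work)())
    }
}

/// Waker that does nothing; enough for a polling loop.
struct NoopWake;

impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

/// Minimal executor: polls the future until it completes.
fn block_on<Fut: Future>(fut: Fut) -> Fut::Output {
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);
    let mut fut = pin!(fut);
    loop {
        if let Poll::Ready(value) = fut.as_mut().poll(&mut cx) {
            return value;
        }
        std::thread::yield_now();
    }
}
```

In real asynchronous code the executor would come from a runtime rather than a hand-written loop; the loop here only keeps the example free of external crates.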
Machine Learning, Deep Learning, MLOps & Infrastructure, AI Hardware, Product Launch, Open Source
