NVIDIA Releases CUDA 13.3 With Stable Python Support and Enhanced C++ Programming

Key Takeaways

▸CUDA Python 1.0 reaches stability milestone, enabling production-ready Python applications for AI and data science
▸CUDA Tile programming model extended to C++ developers for optimized tile-based computation
▸CompileIQ compiler auto-tuning framework delivers up to 15% performance improvements on GEMM and attention kernels

Source:

Hacker Newshttps://www.phoronix.com/news/NVIDIA-CUDA-13.3-Released↗

Summary

NVIDIA released CUDA 13.3 on Tuesday, marking a significant milestone with CUDA Python 1.0 achieving stable, production-ready status for Python developers. This enables developers to leverage GPU acceleration in Python for AI, data science, and scientific computing applications. The release also introduces CUDA Tile for C++, extending the tile programming model to C++ developers alongside new performance optimization features.

Key new features in CUDA 13.3 include the CompileIQ compiler auto-tuning framework, which delivers up to 15% performance improvements on critical kernels like GEMM and attention operations. The release also adds a Numba CUDA MLIR backend, C++23 support in NVCC and NVRTC compilers, and mmap() support. These updates reflect NVIDIA's continued investment in simplifying GPU programming across multiple languages and improving performance across the CUDA ecosystem.

Comprehensive platform updates including C++23 support, new math libraries, Numba CUDA MLIR backend, and mmap() support

NVIDIA Releases CUDA 13.3 With Stable Python Support and Enhanced C++ Programming

Key Takeaways

Summary

More from NVIDIA

95% of NVIDIA's Announced Grace Blackwell GPUs Remain Undeployed

EnclaveX: End-to-End Confidential AI with CPU and GPU TEEs

Researchers Enable Multiple Double Arithmetic on NVIDIA Tensor Cores with Ozaki Scheme Solution

Comments

Suggested

MenteDB Launches Open-Source AI Memory Engine for Persistent Agent Context

Anthropic Unveils Hidden 'J-Space' Inside Claude Using New Mechanistic Interpretability Technique

Anthropic Faces Billing System Crisis: $16.6M Phantom Invoice Charges Korean User

NVIDIA Releases CUDA 13.3 With Stable Python Support and Enhanced C++ Programming

Key Takeaways

Summary

More from NVIDIA

95% of NVIDIA's Announced Grace Blackwell GPUs Remain Undeployed

EnclaveX: End-to-End Confidential AI with CPU and GPU TEEs

Researchers Enable Multiple Double Arithmetic on NVIDIA Tensor Cores with Ozaki Scheme Solution

Comments

Suggested

MenteDB Launches Open-Source AI Memory Engine for Persistent Agent Context

Anthropic Unveils Hidden 'J-Space' Inside Claude Using New Mechanistic Interpretability Technique

Anthropic Faces Billing System Crisis: $16.6M Phantom Invoice Charges Korean User