Redpanda Benchmark Shows NVIDIA Vera Delivers 5.5x Lower Latencies for Real-Time Streaming Workloads
Key Takeaways
- ▸NVIDIA Vera achieved up to 5.5x lower P99 latencies compared to AMD EPYC "Turin" in real-time streaming tests, demonstrating superior performance for mission-critical applications
- ▸Vera showed up to 73% higher throughput than competing systems and improved latency scaling as configurations grew from 8 cores to 32 cores and across distributed clusters
- ▸The CPU's optimized architecture with increased memory and reduced per-core overhead enables enterprises to scale agentic AI and real-time data streaming applications more effectively
Summary
Redpanda has published benchmark results demonstrating significant performance advantages for NVIDIA Vera, the company's new high-performance CPU based on the Olympus core architecture. Testing Vera against leading competitors including AMD EPYC "Turin," AMD EPYC "Genoa," and Intel Xeon 6 "Granite Rapids" showed that Vera delivered up to 5.5x lower latencies for streaming workloads and up to 73% higher throughput than competing systems. The benchmarks specifically highlight Vera's superior performance on Kafka-compatible workloads and its ability to decrease latency as scaling increases across multiple nodes and cores.
Vera is optimized for the demands of reinforcement learning, agentic AI applications, and large-scale data processing, making it particularly well-suited for enterprises deploying data-intensive applications near inference engines. Redpanda's shard-per-core, shared-nothing architecture proved especially effective on Vera's CPU design, which provides more memory and less overhead per core. The benchmark results underscore Vera's positioning as a key component of NVIDIA's Vera Rubin platform and as a standalone CPU option for hyperscale cloud, analytics, HPC, storage, and enterprise workloads.
- Vera is designed to support emerging infrastructure requirements for AI-driven applications in industries ranging from financial services to entertainment
Editorial Opinion
NVIDIA's Vera represents a meaningful advancement in CPU architecture for data-intensive workloads, with Redpanda's benchmark providing compelling evidence of its advantages over incumbent solutions. The dramatic improvements in latency and throughput, particularly the counter-intuitive reduction in latency at scale, suggest Vera's design philosophy fundamentally addresses bottlenecks in existing architectures. However, independent third-party validation and broader real-world deployment data will be essential to fully assess whether these benchmark results translate consistently across diverse enterprise environments and use cases.



