BotBeat
...
← Back

> ▌

NVIDIANVIDIA
RESEARCHNVIDIA2026-03-26

NVIDIA Releases Nemotron-Cascade 2: Compact 30B MoE Model Achieves Gold Medal Performance in IMO and IOI

Key Takeaways

  • ▸Nemotron-Cascade 2 achieves Gold Medal-level IMO and IOI performance with only 3B activated parameters, demonstrating unprecedented intelligence density for open-weight models
  • ▸Expanded Cascade RL and novel multi-domain on-policy distillation techniques enable competitive reasoning performance in a compact MoE architecture
  • ▸Open-weight release of model checkpoints and training data democratizes access to advanced post-training techniques and high-capability reasoning models
Source:
Hacker Newshttps://arxiv.org/abs/2603.19220↗

Summary

NVIDIA has introduced Nemotron-Cascade 2, an open-weight 30B mixture-of-experts (MoE) model with just 3B activated parameters that delivers competitive reasoning and agentic capabilities despite its compact size. The model achieves remarkable performance on prestigious benchmarks, becoming only the second open-weight LLM to earn Gold Medal-level scores in the 2025 International Mathematical Olympiad (IMO), International Olympiad in Informatics (IOI), and ICPC World Finals—accomplishing this with 20x fewer parameters than its nearest competitor. This breakthrough demonstrates exceptional "intelligence density" by combining mathematical and coding reasoning capabilities typically found only in much larger frontier models.

The model's success stems from two key technical advancements over its predecessor, Nemotron-Cascade 1. First, NVIDIA significantly expanded Cascade Reinforcement Learning to cover a broader spectrum of reasoning and agentic tasks. Second, the team introduced multi-domain on-policy distillation from the strongest intermediate teacher models for each domain throughout the training process, allowing efficient recovery from benchmark regressions while maintaining strong performance gains. NVIDIA has released both the model checkpoints and training data to the open-source community, advancing research in efficient post-training methodologies.

Editorial Opinion

Nemotron-Cascade 2 represents a significant milestone in making frontier-level reasoning capabilities accessible through more efficient models. The achievement of IMO/IOI Gold Medals with 20x fewer parameters than competitors challenges assumptions about the compute requirements for advanced reasoning and suggests the post-training methodology may be as important as raw model scale. However, the practical applicability of this 30B MoE model for real-world agentic tasks beyond specialized math and coding domains remains to be seen, and the release's true impact will depend on community adoption and reproducibility.

Large Language Models (LLMs)Generative AIReinforcement LearningAI AgentsOpen Source

More from NVIDIA

NVIDIANVIDIA
POLICY & REGULATION

China Bans Nvidia RTX 5090D V2 During CEO Huang's Visit, Escalating AI Hardware Trade War

2026-05-20
NVIDIANVIDIA
PRODUCT LAUNCH

GTAP Enables Transparent Remote GPU Access: Ollama Runs on MacBook with Remote Blackwell GPU

2026-05-20
NVIDIANVIDIA
RESEARCH

Researchers Discover Critical Confused Deputy Vulnerabilities in AI Accelerators Affecting 100+ Million Devices

2026-05-19

Comments

Suggested

Google / AlphabetGoogle / Alphabet
PRODUCT LAUNCH

Google DeepMind Launches Gemini 3.5 Flash: New Lightweight AI Model

2026-05-20
Executive Office of the President of the United States (Policy/Regulation)Executive Office of the President of the United States (Policy/Regulation)
RESEARCH

SID Achieves Search Breakthrough with SID-1, Outperforming GPT-5 at 1k+ QPS Using Reinforcement Learning

2026-05-20
AnthropicAnthropic
POLICY & REGULATION

Advanced AI Models Bring Government to 'Reflection Point,' CIA Official Says

2026-05-20
← Back to news
© 2026 BotBeat
AboutPrivacy PolicyTerms of ServiceContact Us