BotBeat
...
← Back

> ▌

NVIDIANVIDIA
PRODUCT LAUNCHNVIDIA2026-06-01

NVIDIA Launches Cosmos 3: An Open Foundation Model That Unifies Physical AI Reasoning, World Generation, and Action

Key Takeaways

  • ▸Cosmos 3 unifies physical reasoning, world generation, and action generation in a single model, eliminating the need for complex multi-model orchestration
  • ▸Two model sizes (Nano 16B and Super 64B) address different deployment scenarios—edge/workstation computing vs. large-scale datacenter inference
  • ▸Full open-source release includes model weights, training scripts, deployment microservices, and six domain-specific synthetic datasets for post-training
Source:
Hacker Newshttps://developer.nvidia.com/blog/develop-physical-ai-reasoning-world-and-action-models-with-nvidia-cosmos-3/↗

Summary

NVIDIA has released Cosmos 3, a frontier foundation model designed for physical AI that combines reasoning, world generation, and action generation in a single unified architecture. The model uses a Mixture-of-Transformers design with two towers—a "Reasoner" tower (vision-language model) that interprets multimodal observations and understands physical context, and a "Generator" tower that uses diffusion-based processes to produce physics-aware video and action outputs. NVIDIA is offering two versions: Cosmos 3 Nano (16B parameters) for efficient inference on workstation-grade GPUs, and Cosmos 3 Super (64B parameters) for datacenter deployment with maximum quality and capability.

In a significant move toward democratizing physical AI development, NVIDIA is open-sourcing the model checkpoints, training scripts, deployment tools via Cosmos NIM microservices, and six synthetic datasets covering robotics, physics simulation, autonomous driving, warehouse operations, and human motion. The release eliminates the need to orchestrate multiple separate models, streamlining the development workflow for applications like robotic manipulation systems, autonomous vehicles, and warehouse monitoring solutions. This unified approach represents a major step forward in making physical AI more accessible and reproducible for research and industry.

  • Mixture-of-Transformers architecture with autoregressive Reasoner and diffusion-based Generator enables both understanding and generation of physics-aware behaviors

Editorial Opinion

NVIDIA's decision to open-source Cosmos 3 represents a significant inflection point for physical AI development. By providing not just model weights but also training scripts, datasets, and deployment infrastructure, NVIDIA is lowering barriers to entry and accelerating the adoption of foundation models for robotics and autonomous systems. This contrasts sharply with closed proprietary approaches and positions NVIDIA as a platform provider rather than just a model vendor, likely to strengthen its ecosystem dominance in enterprise AI infrastructure.

Computer VisionGenerative AIRoboticsOpen Source

More from NVIDIA

NVIDIANVIDIA
PRODUCT LAUNCH

Nvidia Challenges Apple Silicon with New RTX Spark PC Chip

2026-06-01
NVIDIANVIDIA
PRODUCT LAUNCH

NVIDIA Releases Nemotron 3 Super: Open-Source 120B Hybrid Model with 2.2x Faster Inference

2026-06-01
NVIDIANVIDIA
PRODUCT LAUNCH

Nvidia Announces RTX Spark: Entry into Consumer PC Chip Market with Local AI Agent Support

2026-06-01

Comments

Suggested

AnthropicAnthropic
INDUSTRY REPORT

AI Agents Era Arrives: Anthropic's Claude Code Opus 4.5 Triggers Developer Frenzy and Reshapes Software Development

2026-06-01
JetBrainsJetBrains
OPEN SOURCE

JetBrains Open-Sources Mellum2: Fast, Efficient LLM for Production AI Workflows

2026-06-01
IntelIntel
PRODUCT LAUNCH

Intel Unveils Crescent Island: Data Center GPU with Up to 480GB LPDDR5X Memory for AI Inference

2026-06-01
← Back to news
© 2026 BotBeat
AboutPrivacy PolicyTerms of ServiceContact Us