NVIDIA Introduces Kimodo: Large-Scale Kinematic Motion Diffusion Model for Human and Robot Motion Generation
Key Takeaways
- Kimodo is trained on 700 hours of optical mocap data, a corpus substantially larger than previous public datasets, enabling superior motion quality and generalization
- The model supports multiple control modalities, including text prompts, full-body keyframe constraints, end-effector positioning, and path-based navigation
- A two-stage denoiser architecture minimizes common artifacts such as foot skating and floating while maintaining natural motion characteristics
Summary
NVIDIA has unveiled Kimodo, a kinematic motion diffusion model trained on 700 hours of optical motion capture data that generates high-quality 3D human and robot motions. The model can be controlled through intuitive text prompts and a comprehensive suite of kinematic constraints, including full-body keyframes, sparse joint positions/rotations, 2D waypoints, and dense 2D paths. This represents a significant advancement over previous motion generation systems, which were limited by small-scale public mocap datasets that constrained motion quality, control accuracy, and generalization capabilities.
Kimodo features a carefully designed motion representation and a two-stage denoiser architecture that decomposes root and body prediction, minimizing motion artifacts while enabling flexible constraint conditioning. Key capabilities include text-to-motion generation, compositional locomotion, full-body constraints, environment interaction through end-effector constraints, and path-based global character translation. The model demonstrates practical applications in robotics, where it can generate humanoid demonstration data more efficiently than traditional teleoperation, with motions exportable to formats compatible with ProtoMotions and MuJoCo for physics-based policy training.
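NVIDIA's announcement describes the root/body decomposition only at a high level, but the idea can be illustrated with a minimal sketch. The PyTorch module below is a hypothetical illustration, not Kimodo's actual network: every class name, dimension, and layer choice is an assumption. What it shows is the ordering the announcement implies, where the root trajectory is denoised first and the body pose is then denoised conditioned on the cleaned root, the kind of decomposition that helps suppress foot skating and floating.

```python
# Hypothetical sketch of a root/body decomposed denoising step.
# All names, dimensions, and layers are assumptions for illustration;
# Kimodo's actual architecture is described only at a high level.
import torch
import torch.nn as nn

class TwoStageDenoiser(nn.Module):
    """Stage 1 denoises the root trajectory; stage 2 denoises the body
    pose conditioned on the cleaned root, so global translation errors
    do not leak into limb motion (a source of foot skating)."""

    def __init__(self, root_dim=4, body_dim=63, cond_dim=512, hidden=256):
        super().__init__()
        # Stage 1: predicts a clean root state (e.g. planar position,
        # height, heading) from the noisy root plus conditioning.
        self.root_net = nn.Sequential(
            nn.Linear(root_dim + cond_dim, hidden), nn.SiLU(),
            nn.Linear(hidden, root_dim),
        )
        # Stage 2: predicts body joint state given the noisy body pose,
        # the conditioning, and the stage-1 root estimate.
        self.body_net = nn.Sequential(
            nn.Linear(body_dim + root_dim + cond_dim, hidden), nn.SiLU(),
            nn.Linear(hidden, body_dim),
        )

    def forward(self, noisy_root, noisy_body, cond):
        # Stage 1: clean the root first, stabilizing global motion.
        root_hat = self.root_net(torch.cat([noisy_root, cond], dim=-1))
        # Stage 2: denoise the body relative to the fixed root.
        body_hat = self.body_net(
            torch.cat([noisy_body, root_hat, cond], dim=-1))
        return root_hat, body_hat

# Toy usage: one denoising step on a batch of 8 frames.
denoiser = TwoStageDenoiser()
root_hat, body_hat = denoiser(
    torch.randn(8, 4), torch.randn(8, 63), torch.randn(8, 512))
```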
- The model has direct applications in robotics for generating training data and in entertainment/simulation for character animation
- NVIDIA provides open-source code, a Python API, and a motion-authoring demo for practical use (see the usage sketch after this list)
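The announcement names a Python API but does not document its surface here, so the following is a purely hypothetical usage sketch: the `kimodo` module and every function, class, and parameter in it are assumptions chosen to mirror the controls described above (text prompts, end-effector constraints, 2D waypoints, and ProtoMotions/MuJoCo export), not NVIDIA's actual interface.

```python
# Hypothetical usage sketch; the `kimodo` module and all names below
# are assumptions, not NVIDIA's published API.
import kimodo

model = kimodo.load_pretrained()  # assumed convenience loader

# Text-conditioned generation with sparse kinematic constraints:
# pin the right hand at frame 60 and follow a 2D waypoint path.
motion = model.generate(
    text="walk to the table and pick up the cup",
    constraints=[
        kimodo.EndEffector(joint="right_hand", frame=60,
                           position=(0.4, 0.9, 0.3)),
        kimodo.Waypoints2D([(0.0, 0.0), (1.5, 0.5), (3.0, 0.5)]),
    ],
    num_frames=120,
)

# Export targets named in the announcement; the exact export calls
# and file formats here are assumptions.
motion.export("demo.npz", format="protomotions")
motion.export("demo.xml", format="mujoco")
```

Whatever the real interface looks like, this is the workflow the announcement implies: author a motion from a prompt plus kinematic constraints, then hand the result to a physics-based training stack.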
Editorial Opinion
Kimodo represents a meaningful step forward in controllable motion generation by addressing a critical bottleneck—the scarcity of large-scale training data. The combination of substantial dataset scale (700 hours of mocap) with a thoughtfully designed architecture that explicitly handles common motion artifacts could make this a valuable tool for roboticists and animation professionals. However, the practical impact will depend on how accessible the model becomes and whether the quality gains justify adoption over existing workflows.