BotBeat
...
← Back

> ▌

Moonshot AI (Kimi)Moonshot AI (Kimi)
RESEARCHMoonshot AI (Kimi)2026-02-28

Moonshot AI Releases Kimi K2: 1 Trillion Parameter Open-Source Model with State-of-the-Art Agentic Capabilities

Key Takeaways

  • ▸Kimi K2 is a 1 trillion parameter MoE model with 32 billion activated parameters, pre-trained on 15.5 trillion tokens using the novel MuonClip optimizer
  • ▸The model achieves state-of-the-art performance among open-source non-thinking models, with particular strength in agentic tasks and software engineering benchmarks
  • ▸K2 scores 65.8 on SWE-Bench Verified and 47.3 on SWE-Bench Multilingual, surpassing most open and closed-source baselines
Source:
Hacker Newshttps://arxiv.org/abs/2507.20534↗

Summary

Moonshot AI has released Kimi K2, a massive open-source Mixture-of-Experts (MoE) language model featuring 1 trillion total parameters with 32 billion activated parameters. The model was pre-trained on 15.5 trillion tokens using a novel optimizer called MuonClip, which the team developed to address training instability while maintaining token efficiency. The MuonClip optimizer introduces a QK-clip technique that enabled zero loss spikes during the extensive pre-training process.

Kimi K2 underwent a sophisticated multi-stage post-training process that emphasized agentic capabilities through large-scale agentic data synthesis and joint reinforcement learning stages. The model was trained to improve through interactions with both real and synthetic environments, positioning it as a particularly strong performer in autonomous agent tasks and software engineering applications.

The model achieves impressive benchmark results that surpass most open-source and many closed-source competitors in non-thinking settings. Notable scores include 66.1 on Tau2-Bench, 76.5 on ACEBench (English), 65.8 on SWE-Bench Verified, and 47.3 on SWE-Bench Multilingual. In coding and mathematics tasks, K2 scored 53.7 on LiveCodeBench v6, 49.5 on AIME 2025, 75.1 on GPQA-Diamond, and 27.1 on OJBench—all without extended thinking mechanisms.

Moonshot AI has released both the base and post-trained versions of the model, making it one of the most capable open-source large language models currently available. The release represents a significant contribution to the open-source AI ecosystem, particularly for developers working on agentic systems and software engineering applications.

  • Both base and post-trained versions have been released open-source, making advanced agentic AI capabilities accessible to the developer community
  • The model was trained with a multi-stage process including large-scale agentic data synthesis and joint RL stages for enhanced autonomous capabilities

Editorial Opinion

Kimi K2's release marks a significant milestone in democratizing advanced AI capabilities, particularly for agentic applications. The model's exceptional performance on software engineering benchmarks like SWE-Bench suggests we're entering an era where open-source models can genuinely compete with proprietary systems for real-world developer tasks. The introduction of the MuonClip optimizer and the focus on stable training at this scale also represents important technical contributions that could benefit the broader AI research community. Perhaps most notably, the emphasis on agentic capabilities through specialized post-training reflects the industry's strategic pivot toward AI systems that can autonomously complete complex, multi-step tasks.

Large Language Models (LLMs)Reinforcement LearningAI AgentsMachine LearningOpen Source

More from Moonshot AI (Kimi)

Moonshot AI (Kimi)Moonshot AI (Kimi)
RESEARCH

Kimi K2.6 Beats Claude, GPT-5.5, and Gemini in Programming Challenge

2026-05-03
Moonshot AI (Kimi)Moonshot AI (Kimi)
PRODUCT LAUNCH

Kimi Launches Vendor Verifier Tool to Ensure Accuracy of AI Inference Providers

2026-04-20
Moonshot AI (Kimi)Moonshot AI (Kimi)
RESEARCH

Moonshot AI's Kimi K2.5, Cursor's Composer 2, and Chroma's Context-1 Advance Agentic AI Through Reinforcement Learning

2026-03-28

Comments

Suggested

Research CommunityResearch Community
RESEARCH

New Methodology Proposed for Selecting Runtime Architecture Patterns in Production LLM Agents

2026-05-20
Google / AlphabetGoogle / Alphabet
PRODUCT LAUNCH

Google DeepMind Launches Gemini 3.5 Flash: New Lightweight AI Model

2026-05-20
Executive Office of the President of the United States (Policy/Regulation)Executive Office of the President of the United States (Policy/Regulation)
RESEARCH

SID Achieves Search Breakthrough with SID-1, Outperforming GPT-5 at 1k+ QPS Using Reinforcement Learning

2026-05-20
← Back to news
© 2026 BotBeat
AboutPrivacy PolicyTerms of ServiceContact Us