BotBeat
...
← Back

> ▌

MetaMeta
OPEN SOURCEMeta2026-05-17

Llama.cpp b9180 Adds MTP Support with Advanced Speculative Decoding Optimization

Key Takeaways

  • ▸MTP support enables efficient partial rollback for speculative decoding, eliminating wasteful full-context restarts
  • ▸Multi-backend optimization across Metal, Vulkan, CUDA, ROCm, OpenVINO, and SYCL ensures broad hardware support
  • ▸Gated Delta Networks (GDN) intermediate state storage enables selective token rollback up to configurable limits
Source:
Hacker Newshttps://github.com/ggml-org/llama.cpp/releases/tag/b9180↗

Summary

The open-source llama.cpp project has merged MTP support in commit b9180, introducing advanced speculative decoding capabilities for optimized Llama model inference. The update includes sophisticated partial rollback mechanisms for Gated Delta Networks (GDN), enabling more efficient draft token management without costly full-context restarts. The implementation spans multiple backend platforms including Metal (macOS Apple Silicon, with KleidiAI optimization), Vulkan, CUDA (versions 12 & 13), ROCm, OpenVINO, and SYCL. The feature allows intermediate state checkpointing and selective rollback up to draft_max tokens, reducing computational waste in speculative decoding workflows.

  • Compatibility verified with n-gram and other speculative decoding methods for flexible deployment
Large Language Models (LLMs)Generative AIMLOps & InfrastructureOpen Source

More from Meta

MetaMeta
UPDATE

Meta Acknowledges AI Agent Development Slower Than Expected, Despite $145B Infrastructure Investment

2026-07-04
MetaMeta
PRODUCT LAUNCH

Meta AI Chief Claims New LLM Model Has Caught Up with OpenAI's Flagship

2026-07-03
MetaMeta
RESEARCH

Explaining Attention Mechanisms in Transformers Through Program Synthesis

2026-07-03

Comments

Suggested

Alibaba GroupAlibaba Group
PRODUCT LAUNCH

Alibaba's Elements Claw AI Agent Discovers Four New Superconductors

2026-07-05
NVIDIANVIDIA
FUNDING & BUSINESS

Nvidia Moves Beyond Chip Sales to Finance AI Infrastructure Boom

2026-07-04
AppleApple
PRODUCT LAUNCH

Apple Container 1.0 Reaches Stable Release: Native macOS Docker Alternative Now GA

2026-07-04
← Back to news
© 2026 BotBeat
AboutPrivacy PolicyTerms of ServiceContact Us