Ollama v0.30.0-rc23 Shifts to Direct llama.cpp Support and GGUF Compatibility
Key Takeaways
- ▸Ollama shifts from GGML-based architecture to direct llama.cpp integration for improved performance and compatibility
- ▸Native GGUF file format support enhances interoperability with the broader open-source LLM ecosystem
- ▸MLX framework integration optimizes Apple Silicon performance
Summary
Ollama has released version 0.30.0-rc23, marking a fundamental architectural transition that moves away from building on top of GGML to directly supporting llama.cpp. This change is designed to improve performance, compatibility, and maintainability for users running large language models locally. The new version also introduces native compatibility with the GGUF file format, a widely-adopted format in the open-source LLM community.
The release leverages MLX, Apple's machine learning framework, to optimize inference performance on Apple Silicon devices. While in pre-release, the Ollama team is actively soliciting community feedback on performance metrics, memory utilization, and stability. Though most existing models are supported, laguna-xs.2 and llama3.2-vision are not yet available in this pre-release iteration. Installation packages for Windows, macOS, and Linux are available for testing.
- Pre-release phase actively solicits community feedback on performance, stability, and memory efficiency



