Ollama v0.30.0-rc23 Shifts to Direct llama.cpp Support and GGUF Compatibility

Key Takeaways

▸Ollama shifts from GGML-based architecture to direct llama.cpp integration for improved performance and compatibility
▸Native GGUF file format support enhances interoperability with the broader open-source LLM ecosystem
▸MLX framework integration optimizes Apple Silicon performance

Source:

Hacker Newshttps://github.com/ollama/ollama/releases/tag/v0.30.0-rc23↗

Summary

Ollama has released version 0.30.0-rc23, marking a fundamental architectural transition that moves away from building on top of GGML to directly supporting llama.cpp. This change is designed to improve performance, compatibility, and maintainability for users running large language models locally. The new version also introduces native compatibility with the GGUF file format, a widely-adopted format in the open-source LLM community.

The release leverages MLX, Apple's machine learning framework, to optimize inference performance on Apple Silicon devices. While in pre-release, the Ollama team is actively soliciting community feedback on performance metrics, memory utilization, and stability. Though most existing models are supported, laguna-xs.2 and llama3.2-vision are not yet available in this pre-release iteration. Installation packages for Windows, macOS, and Linux are available for testing.

Pre-release phase actively solicits community feedback on performance, stability, and memory efficiency

Ollama v0.30.0-rc23 Shifts to Direct llama.cpp Support and GGUF Compatibility

Key Takeaways

Summary

More from Ollama

Ollama Raises $65M Series B to Expand AI Model Accessibility, Reaches 8.9M Monthly Users

Critical Unpatched Vulnerabilities in Ollama Desktop App Enable Phishing and Data Exfiltration

Critical NPM Supply Chain Attack Spreads as Self-Propagating Worm Through Binding.gyp Exploits

Comments

Suggested

AudarAI Launches Audar-ASR-V1, Open-Weight Arabic Speech Recognition Models

OpenAI Extends Reasoning Models with Multi-Turn State Retention

Fusion Embedding 1: Open-Weight Multimodal Model Beats Gemini Embedding 2 with Only 16M Trained Parameters

Ollama v0.30.0-rc23 Shifts to Direct llama.cpp Support and GGUF Compatibility

Key Takeaways

Summary

More from Ollama

Ollama Raises $65M Series B to Expand AI Model Accessibility, Reaches 8.9M Monthly Users

Critical Unpatched Vulnerabilities in Ollama Desktop App Enable Phishing and Data Exfiltration

Critical NPM Supply Chain Attack Spreads as Self-Propagating Worm Through Binding.gyp Exploits

Comments

Suggested

AudarAI Launches Audar-ASR-V1, Open-Weight Arabic Speech Recognition Models

OpenAI Extends Reasoning Models with Multi-Turn State Retention

Fusion Embedding 1: Open-Weight Multimodal Model Beats Gemini Embedding 2 with Only 16M Trained Parameters