New Distribution Fine Tuning Algorithm Reduces Formulaic 'Slop' in LLM Writing
Key Takeaways
- ▸DFT improves model output distribution matching by 49% (MMD) and 63% (JMQ) compared to standard supervised fine-tuning
- ▸Writing quality metrics increased dramatically: +164% creativity, +28% coherence, +16% clarity, and +146% meaningful detail
- ▸A 14B parameter model trained with DFT achieved 100% human-written detection rate on 100 sample outputs using the Pangram AI detector
Summary
A new research paper from jbotz introduces Distribution Fine Tuning (DFT), a post-training algorithm designed to fix one of the most frustrating aspects of modern large language models: their tendency to overuse certain words, phrases, and writing patterns. The research quantifies this problem using three metrics—Maximum Mean Discrepancy (MMD), Judge Model Quality (JMQ), and L2 Token Distribution—and demonstrates that standard supervised fine-tuning (SFT) fails to match the distribution of human writing.
The DFT algorithm improves output distribution matching significantly, increasing MMD scores by 49% and JMQ by 63% compared to SFT baselines. The results are striking across multiple dimensions: writing samples showed +164% improvement in creativity, +28% in coherence, +16% in clarity, and +146% in meaningful detail. The algorithm also eliminates common "slop signs" like excessive em-dashes and overused formulaic phrases such as "it's not X, it's Y."
A demonstration is available at rosmine.ai featuring a 14-billion-parameter model trained with DFT. In a striking validation, 100 sample outputs from the demo model were scored as 100% human-written by the Pangram AI detector, suggesting the algorithm successfully produces text that closely mirrors natural human writing patterns. The research includes extensive appendices with additional metrics, dataset analysis, and comparisons to other approaches.
- The algorithm eliminates common LLM writing patterns (em-dashes, clichéd phrases) without sacrificing other dimensions of quality
- Future roadmap includes collaboration opportunities, open-weight model releases, and scaling to larger models
Editorial Opinion
This research addresses a genuinely felt problem in AI-generated text—the formulaic, repetitive 'slop' that makes LLM outputs immediately recognizable as machine-generated. The metrics are well-designed (using distribution matching rather than vague notions of 'quality'), and the 49-63% improvements are substantial. If these results generalize and the approach scales to frontier models, DFT could be a meaningful step toward AI writing that doesn't feel robotically formulaic.



