Politeness Penalty: Research Shows Rude Prompts Outperform Polite Ones on ChatGPT 4o

Key Takeaways

▸Impolite prompts improved accuracy by 4 percentage points compared to very polite ones on ChatGPT 4o
▸Findings contradict earlier studies, suggesting newer LLMs respond differently to tonal variation
▸Highlights importance of studying pragmatic aspects of prompting beyond pure instruction clarity

Source:

Hacker Newshttps://arxiv.org/abs/2510.04950↗

Summary

A new academic study challenges conventional wisdom about how users should interact with large language models. Researchers tested how prompt politeness affects ChatGPT 4o's accuracy on multiple-choice questions spanning mathematics, science, and history. The results were surprising: rude and very rude prompts consistently outperformed polite and very polite ones, with accuracy improving from 80.8% for very polite prompts to 84.8% for very rude prompts.

The study analyzed 250 prompts—50 base questions rewritten into five tone variants (Very Polite, Polite, Neutral, Rude, and Very Rude)—and applied paired sample t-tests to validate statistical significance. The findings contradict earlier research suggesting that rudeness produces poorer outcomes, indicating that newer LLM architectures may process tonal variation differently than previous models. The research highlights an important but underexplored aspect of prompt engineering: the pragmatic and social dimensions of human-AI interaction.

Raises questions about how social dimensions and tone influence model behavior

Editorial Opinion

This counterintuitive result suggests that LLM designers and users should reconsider assumptions about what constitutes 'human-friendly' interaction design. While the 4% accuracy improvement is modest, it fundamentally challenges the notion that politeness universally improves AI responses and invites deeper investigation into how models process pragmatic language features. Future research should explore whether this pattern holds across different model architectures and whether it reflects genuine differences in how newer models interpret social cues.

OpenAI

RESEARCH OpenAI2026-05-26

Politeness Penalty: Research Shows Rude Prompts Outperform Polite Ones on ChatGPT 4o

Key Takeaways

▸Impolite prompts improved accuracy by 4 percentage points compared to very polite ones on ChatGPT 4o
▸Findings contradict earlier studies, suggesting newer LLMs respond differently to tonal variation
▸Highlights importance of studying pragmatic aspects of prompting beyond pure instruction clarity

Source:

Hacker Newshttps://arxiv.org/abs/2510.04950↗

Summary

Raises questions about how social dimensions and tone influence model behavior

Editorial Opinion

This counterintuitive result suggests that LLM designers and users should reconsider assumptions about what constitutes 'human-friendly' interaction design. While the 4% accuracy improvement is modest, it fundamentally challenges the notion that politeness universally improves AI responses and invites deeper investigation into how models process pragmatic language features. Future research should explore whether this pattern holds across different model architectures and whether it reflects genuine differences in how newer models interpret social cues.

Politeness Penalty: Research Shows Rude Prompts Outperform Polite Ones on ChatGPT 4o

Key Takeaways

Summary

Editorial Opinion

More from OpenAI

OpenAI's No. 2 Executive Fidji Simo Steps Down to Part-Time Advisory Role

Florida's OpenAI Lawsuit Moves to Federal Court, Assigned to Judge Aileen Cannon

OpenAI Extends Reasoning Models with Multi-Turn State Retention

Comments

Suggested

Corvin Launches CorvinOS, a Self-Hosted Operating System for AI Agents with Compliance Built Into the Runtime

Crawl4AI Launches Cloud API Beta, Releases Security-Hardened v0.9.1 Update

Ethereum Foundation Validates AI Agent Methodology for Protocol Security Auditing

Politeness Penalty: Research Shows Rude Prompts Outperform Polite Ones on ChatGPT 4o

Key Takeaways

Summary

Editorial Opinion

More from OpenAI

OpenAI's No. 2 Executive Fidji Simo Steps Down to Part-Time Advisory Role

Florida's OpenAI Lawsuit Moves to Federal Court, Assigned to Judge Aileen Cannon

OpenAI Extends Reasoning Models with Multi-Turn State Retention

Comments

Suggested

Corvin Launches CorvinOS, a Self-Hosted Operating System for AI Agents with Compliance Built Into the Runtime

Crawl4AI Launches Cloud API Beta, Releases Security-Hardened v0.9.1 Update

Ethereum Foundation Validates AI Agent Methodology for Protocol Security Auditing