GPT-4.5 Passes the Turing Test: Study Shows Advanced AI Perceived as More Human Than Humans

Key Takeaways

▸GPT-4.5 exceeded human performance in Turing test scenarios, being judged human 73% of the time
▸Persona prompting dramatically improved AI performance; without it, success rates plummeted to ~20%
▸Advanced LLMs can convincingly replicate human social behaviors (tone, humor, fallibility) alongside knowledge

Source:

Hacker Newshttps://today.ucsd.edu/story/ai-can-seem-more-human-than-real-humans-in-a-classic-turing-test-study-finds↗

Summary

A groundbreaking UC San Diego study published in the Proceedings of the National Academy of Sciences has empirically demonstrated that advanced language models can pass the Turing test—a seminal benchmark for machine intelligence. Using a three-party conversational setup where human "interrogators" must distinguish between human and AI participants, researchers found that GPT-4.5 was judged as human 73% of the time, exceeding the human baseline. Meta's LLaMa-3.1-405B achieved 56%, statistically equivalent to human performance, while older models (GPT-4o and ELIZA) scored only 21-23%.

The breakthrough hinges on "persona prompting"—using specific system instructions to guide AI behavior. When properly prompted, advanced LLMs can convincingly mimic human social traits including tone, humor, directness, and fallibility, suggesting that the distinction between human and machine is shifting from raw intelligence to behavioral authenticity. Older models without such prompting performed significantly worse, indicating that conversational style is now a key frontier in AI realism.

The findings challenge how the tech community understands the original Turing test. Lead researcher Cameron Jones noted that while AI has already surpassed humans on knowledge tasks, the real question is no longer about brainpower but social mimicry. The study raises profound implications for online trust, digital deception, and the future of human-AI interaction in social contexts.

Study reframes the Turing test from a measure of intelligence to a measure of social authenticity
Findings raise concerns about online deception and trust in an era of human-like AI

Editorial Opinion

This study marks a watershed moment in AI development, but not necessarily a cause for celebration. GPT-4.5's ability to pass itself off as human 73% of the time raises critical questions about digital trust and deception that extend far beyond academic curiosity. The fact that "persona prompting" can make AI indistinguishable from humans suggests we need urgent conversations about online authentication, AI disclosure in conversations, and how platforms should handle human-AI interactions. We've optimized AI for humanlikeness, but we haven't yet grappled with the social and security implications of that success.

GPT-4.5 Passes the Turing Test: Study Shows Advanced AI Perceived as More Human Than Humans

Key Takeaways

▸GPT-4.5 exceeded human performance in Turing test scenarios, being judged human 73% of the time
▸Persona prompting dramatically improved AI performance; without it, success rates plummeted to ~20%
▸Advanced LLMs can convincingly replicate human social behaviors (tone, humor, fallibility) alongside knowledge

Summary

Study reframes the Turing test from a measure of intelligence to a measure of social authenticity
Findings raise concerns about online deception and trust in an era of human-like AI

Editorial Opinion

This study marks a watershed moment in AI development, but not necessarily a cause for celebration. GPT-4.5's ability to pass itself off as human 73% of the time raises critical questions about digital trust and deception that extend far beyond academic curiosity. The fact that "persona prompting" can make AI indistinguishable from humans suggests we need urgent conversations about online authentication, AI disclosure in conversations, and how platforms should handle human-AI interactions. We've optimized AI for humanlikeness, but we haven't yet grappled with the social and security implications of that success.

GPT-4.5 Passes the Turing Test: Study Shows Advanced AI Perceived as More Human Than Humans

Key Takeaways

Summary

Editorial Opinion

More from OpenAI

Demos Study Finds ChatGPT and Other AI Chatbots Spread Misinformation During Scottish Election

OpenAI Prepares for IPO After Musk Lawsuit Threat Clears

OpenAI Model Solves 80-Year-Old Planar Unit Distance Problem, Disproving Long-Held Mathematical Assumption

Comments

Suggested

Google Researchers Win WWW 2024 Best Paper Award for LLM Mechanism Design Framework

Baidu Open-Sources LoongForge, High-Performance Training Framework with Up to 5× Speedup

Nvidia Unveils Nemotron-Labs-Diffusion: New Diffusion-Based Language Model

GPT-4.5 Passes the Turing Test: Study Shows Advanced AI Perceived as More Human Than Humans

Key Takeaways

Summary

Editorial Opinion

More from OpenAI

Demos Study Finds ChatGPT and Other AI Chatbots Spread Misinformation During Scottish Election

OpenAI Prepares for IPO After Musk Lawsuit Threat Clears

OpenAI Model Solves 80-Year-Old Planar Unit Distance Problem, Disproving Long-Held Mathematical Assumption

Comments

Suggested

Google Researchers Win WWW 2024 Best Paper Award for LLM Mechanism Design Framework

Baidu Open-Sources LoongForge, High-Performance Training Framework with Up to 5× Speedup

Nvidia Unveils Nemotron-Labs-Diffusion: New Diffusion-Based Language Model