GPT-4.5 Passes the Turing Test: Study Shows Advanced AI Perceived as More Human Than Humans
Key Takeaways
- ▸GPT-4.5 exceeded human performance in Turing test scenarios, being judged human 73% of the time
- ▸Persona prompting dramatically improved AI performance; without it, success rates plummeted to ~20%
- ▸Advanced LLMs can convincingly replicate human social behaviors (tone, humor, fallibility) alongside knowledge
Summary
A groundbreaking UC San Diego study published in the Proceedings of the National Academy of Sciences has empirically demonstrated that advanced language models can pass the Turing test—a seminal benchmark for machine intelligence. Using a three-party conversational setup where human "interrogators" must distinguish between human and AI participants, researchers found that GPT-4.5 was judged as human 73% of the time, exceeding the human baseline. Meta's LLaMa-3.1-405B achieved 56%, statistically equivalent to human performance, while older models (GPT-4o and ELIZA) scored only 21-23%.
The breakthrough hinges on "persona prompting"—using specific system instructions to guide AI behavior. When properly prompted, advanced LLMs can convincingly mimic human social traits including tone, humor, directness, and fallibility, suggesting that the distinction between human and machine is shifting from raw intelligence to behavioral authenticity. Older models without such prompting performed significantly worse, indicating that conversational style is now a key frontier in AI realism.
The findings challenge how the tech community understands the original Turing test. Lead researcher Cameron Jones noted that while AI has already surpassed humans on knowledge tasks, the real question is no longer about brainpower but social mimicry. The study raises profound implications for online trust, digital deception, and the future of human-AI interaction in social contexts.
- Study reframes the Turing test from a measure of intelligence to a measure of social authenticity
- Findings raise concerns about online deception and trust in an era of human-like AI
Editorial Opinion
This study marks a watershed moment in AI development, but not necessarily a cause for celebration. GPT-4.5's ability to pass itself off as human 73% of the time raises critical questions about digital trust and deception that extend far beyond academic curiosity. The fact that "persona prompting" can make AI indistinguishable from humans suggests we need urgent conversations about online authentication, AI disclosure in conversations, and how platforms should handle human-AI interactions. We've optimized AI for humanlikeness, but we haven't yet grappled with the social and security implications of that success.



