BotBeat
...
← Back

> ▌

OpenAIOpenAI
INDUSTRY REPORTOpenAI2026-03-24

OpenAI Models Exhibit Context Tracking Issue, Answering Older Prompts in Extended Conversations

Key Takeaways

  • ▸OpenAI models exhibit context tracking failures in extended conversations, responding to earlier prompts instead of current ones across multiple GPT-5.x variants
  • ▸The issue appears to be an underlying model behavior rather than a client-side or single-release problem, affecting both Codex and non-Codex models
  • ▸The problem undermines user trust by breaking conversational continuity, one of the fundamental expectations users have of language models
Source:
Hacker Newshttps://www.vincentschmalbach.com/openai-models-answer-an-older-prompt-instead-of-the-current-one/↗

Summary

Users of OpenAI's language models, particularly those using Codex and GPT-5.x variants, have reported a concerning failure mode where models respond to earlier prompts instead of current ones during longer conversation sessions. The issue manifests as models suddenly reverting to answering previous topics or questions, sometimes even after correctly handling the new topic initially, suggesting a loss of conversational context rather than a simple misunderstanding. Multiple GitHub issues and user reports indicate this is not a single-release problem or version-specific bug, but rather an underlying model behavior that becomes more pronounced in longer, more demanding sessions with multiple topic transitions. The problem is particularly noticeable in structured coding environments like Codex, where users can clearly identify when a response belongs to an earlier step in the conversation. While no official root cause has been publicly identified and no clean fix has been released, users report that breaking conversations into separate threads or keeping sessions shorter can help mitigate the issue.

  • No official root cause analysis or comprehensive fix has been publicly released, though workarounds include shorter sessions and separate conversation threads

Editorial Opinion

This context tracking issue highlights a critical gap between perceived and actual model reliability. While language models are often sophisticated enough to handle complex multi-topic conversations, this failure mode—where they confidently answer the wrong prompt—is particularly insidious because it erodes the foundational trust in conversational coherence. The persistence of this behavior across multiple model versions and families suggests it may be a deeper architectural challenge in how these models manage attention and memory in extended contexts.

Large Language Models (LLMs)Machine LearningAI Safety & Alignment

More from OpenAI

OpenAIOpenAI
FUNDING & BUSINESS

OpenAI Prepares for IPO After Musk Lawsuit Threat Clears

2026-05-20
OpenAIOpenAI
RESEARCH

OpenAI Model Solves 80-Year-Old Planar Unit Distance Problem, Disproving Long-Held Mathematical Assumption

2026-05-20
OpenAIOpenAI
FUNDING & BUSINESS

OpenAI Prepares to File to Go Public in Coming Weeks

2026-05-20

Comments

Suggested

Google / AlphabetGoogle / Alphabet
PRODUCT LAUNCH

Google DeepMind Launches Gemini 3.5 Flash: New Lightweight AI Model

2026-05-20
Executive Office of the President of the United States (Policy/Regulation)Executive Office of the President of the United States (Policy/Regulation)
RESEARCH

SID Achieves Search Breakthrough with SID-1, Outperforming GPT-5 at 1k+ QPS Using Reinforcement Learning

2026-05-20
AnthropicAnthropic
POLICY & REGULATION

Advanced AI Models Bring Government to 'Reflection Point,' CIA Official Says

2026-05-20
← Back to news
© 2026 BotBeat
AboutPrivacy PolicyTerms of ServiceContact Us