When the Product Brief Becomes the Indictment
The most damaging AI criticism is the kind the company writes itself. OpenAI's launch materials for GPT-5.5 positioned the model as a trustworthy autonomous agent — capable of handling "messy, multi-part" tasks without hand-holding . The screenshot that circulated on June 7 did not need to argue against that claim; it simply placed the claim next to the output and let the distance speak. What makes this episode structurally different from typical model-criticism threads is that the failure evidence was a live product behaving contrary to its specification rather than a synthetic benchmark edge case. Communities already running Cursor and local models as alternatives have treated the post as confirmation of a pattern, not an isolated incident .