In chats that run long enough on ChatGPT, you'll see it begin to confuse prompts with responses, and eventually confuse both with its system prompt. I suspect this problem is widespread across LLMs.
In Gemini chat I find that you should avoid continuing a conversation after a wrong or seriously flawed answer. Rather than sending a new message, it's better to edit the previous prompt so that it comes up with a better answer in the first place.
The key with Gemini is to migrate to a new chat once it makes a single dumb mistake. It's a very strong model, but once it steps in the mud, you'll lose your mind trying to recover it.
Delete the bad response, ask it for a summary or to update [context].md, then start a new instance.
Makes me wonder whether, during training, LLMs are asked to tell whether they've written something themselves or not. It should be quite easy: ask the LLM to produce many continuations of a prompt, mix them with many others produced by humans, and then ask the LLM to tell them apart. This should be possible by introspecting on the hidden layers and comparing with the provided continuation. I believe Anthropic has already demonstrated that models have partially developed this capability, but it seems straightforward and useful to train it explicitly.
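For what it's worth, the procedure you describe can be sketched in a few lines. This is a toy illustration, not a real LLM: synthetic feature vectors stand in for hidden-layer activations (with a small, assumed mean shift between model-written and human-written continuations), and a tiny logistic-regression probe is trained to tell the two classes apart.

```python
import random
import math

random.seed(0)

DIM = 8  # dimensionality of the stand-in "hidden state"

def synth_features(is_model: bool) -> list[float]:
    # Assumption for the toy: model-written text produces activations
    # with a slight mean shift relative to human-written text.
    shift = 0.5 if is_model else -0.5
    return [random.gauss(shift, 1.0) for _ in range(DIM)]

# Mixed dataset: half "model" continuations (label 1), half "human" (label 0).
data = [(synth_features(m), 1.0 if m else 0.0)
        for m in [True, False] * 200]
random.shuffle(data)

# Train a logistic-regression probe with plain stochastic gradient descent.
w = [0.0] * DIM
b = 0.0
lr = 0.1
for _ in range(50):
    for x, y in data:
        z = sum(wi * xi for wi, xi in zip(w, x)) + b
        p = 1.0 / (1.0 + math.exp(-z))       # predicted P(model-written)
        g = p - y                            # gradient of log loss w.r.t. z
        w = [wi - lr * g * xi for wi, xi in zip(w, x)]
        b -= lr * g

# Evaluate: how often does the probe identify the author correctly?
correct = sum((sum(wi * xi for wi, xi in zip(w, x)) + b > 0) == (y == 1.0)
              for x, y in data)
accuracy = correct / len(data)
```

With even a modest activation shift the probe separates the two classes well above chance, which is the whole training signal the comment proposes; in a real setup the features would be actual hidden states and the "probe" could be the model itself.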
Isn't that something different? If I prompt an LLM to identify the speaker, that's different from keeping track of the speaker while processing a different prompt.
At work, where LLM-based tooling is being pushed haaard, I'm amazed every day that developers don't know, let alone intuit as second nature, this and other emergent behaviors of LLMs. But seeing that same lack here on HN, under an article on the front page, boggles my mind. The future really is unevenly distributed.
Author here. Interesting to hear. I generally start a new chat for each interaction, so I've never noticed this in the chat interfaces, only with Claude via Claude Code. But I guess my sessions there do get much longer, so maybe I'm wrong that it's a harness bug.
Yes, and with very long chats, you'll see it even forget how to do things like make tool calls, or how to respond at all. I've had ChatGPT reply with raw JSON, regurgitate an earlier prompt, reply with a single newline, regurgitate information from a completely different chat, reply in a foreign language, and more.
Things get really wacky as it approaches decoherence.
Yeah, the raw JSON (in my case) is the result of a failed tool call: it was trying to generate an image. With thinking models, you can observe the degeneration of its understanding of image tool calls over the lifetime of a chat. It eventually puzzles over where images are supposed to be emitted, how it's supposed to write text, whether it's allowed to provide commentary, and eventually it gets all of it wrong. The same happens with file citations (in projects) and web search calls.
It makes sense. It's all probabilistic, and it all gets fuzzy as garbage accumulates in the context. User messages and the system prompt go through the same network of math as the model's thinking and responses.