arXiv: Memory Curse degrades LLM agent cooperation

The memory curse is a phenomenon in which expanding the history available to an LLM agent degrades cooperation in multi-agent games. It appeared in 18 of 28 model-game combinations, and the main mechanism is erosion of 'forward-looking intent'. Sanitizing memory content (replacing it with synthetic cooperative records of the same length) substantially restores cooperation, which indicates that memory content, not prompt length, is the trigger.

A research team of ten authors, including Jiayuan Liu, Tianqin Li and Shiyi Du, published a study on May 11, 2026 on a counterintuitive phenomenon they call the “memory curse” in multi-agent LLM scenarios. The paper is available at arXiv:2605.08060.

How did the researchers test the phenomenon?

The study covered 7 LLMs across 4 different games over 500 rounds per configuration. Three complementary analysis methods were used to isolate the mechanisms linking memory to cooperation degradation. Result: in 18 of 28 model-game combinations, expanding the history available to the agent degrades cooperative behavior.
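The arithmetic behind the headline figure can be checked with a short sketch. The model and game labels below are hypothetical placeholders, since this article does not name them; only the counts (7, 4, 500, 18) come from the text.

```python
from itertools import product

# Hypothetical placeholder labels: the article gives counts, not names.
MODELS = [f"model_{i}" for i in range(1, 8)]  # 7 LLMs
GAMES = [f"game_{j}" for j in range(1, 5)]    # 4 games
ROUNDS_PER_CONFIG = 500                       # rounds per configuration, as reported

# Every model is paired with every game.
combinations = list(product(MODELS, GAMES))
n_combinations = len(combinations)

# Reported number of combinations where longer history degraded cooperation.
degraded = 18
share_degraded = degraded / n_combinations

print(f"{n_combinations} combinations, {share_degraded:.0%} degraded")
```

So the reported effect covers roughly two thirds of the tested model-game pairs, not all of them.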

What is the main cause of degradation?

Lexical analysis of 378,000 reasoning traces revealed that the main mechanism is erosion of “forward-looking intent”: agents become less oriented toward future steps and consequences. Growing paranoia, the more intuitive explanation, was not the cause. Fine-tuning with LoRA adapters trained on forward-looking traces mitigates the degradation and transfers to games the model has not seen.
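As a rough illustration of what a lexical measure of forward-looking intent could look like, the sketch below counts future-oriented phrases per 100 words of a reasoning trace. The marker list and the function name are assumptions made for this example; the article does not describe the lexicon or scoring the authors actually used.

```python
import re

# Hypothetical marker list, chosen for illustration only.
FORWARD_MARKERS = ("future", "next round", "long-term", "going forward", "later rounds")

def forward_looking_score(trace: str) -> float:
    """Return forward-looking marker hits per 100 words of a reasoning trace."""
    text = trace.lower()
    words = re.findall(r"[a-z'-]+", text)
    if not words:
        return 0.0
    hits = sum(
        len(re.findall(r"\b" + re.escape(marker) + r"\b", text))
        for marker in FORWARD_MARKERS
    )
    return 100.0 * hits / len(words)

strategic = "If I cooperate now, we both gain in the future."
reactive = "They defected last round, so I defect."
print(forward_looking_score(strategic), forward_looking_score(reactive))
```

Under this toy measure, a drop in the average score as the visible history grows would correspond to the erosion the article describes.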

How does memory sanitization restore cooperation?

Replacing the visible history with synthetic cooperative records, while keeping the prompt length the same, substantially restores cooperation. This indicates that the trigger is memory content rather than context length. The implication for multi-agent systems is significant: designers must actively curate what an agent “remembers,” not just shorten the history.
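A minimal sketch of the sanitization idea, assuming a simple two-action game and a fixed-width text rendering of each round. The RoundRecord type, the render format and the "C"/"D" action codes are hypothetical; the article does not specify how the authors encoded history or measured prompt length.

```python
from dataclasses import dataclass, replace

@dataclass(frozen=True)
class RoundRecord:
    round_no: int
    own_action: str       # "C" (cooperate) or "D" (defect); hypothetical encoding
    opponent_action: str

def render(history):
    """Render history as prompt text; each action is a single character."""
    return "\n".join(
        f"Round {r.round_no}: you played {r.own_action}, "
        f"opponent played {r.opponent_action}."
        for r in history
    )

def sanitize(history):
    """Swap every record for a synthetic mutually cooperative one."""
    return [replace(r, own_action="C", opponent_action="C") for r in history]

history = [RoundRecord(1, "C", "D"), RoundRecord(2, "D", "D"), RoundRecord(3, "D", "C")]
clean = sanitize(history)
print(render(clean))
```

Because only the action characters change, the rendered prompt keeps the same number of rounds and the same character length. That is the property that lets the content effect be separated from the length effect.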

What does the chain-of-thought paradox reveal?

Removing explicit reasoning often reduced cooperation collapse. The authors conclude: “Memory functions as an active behavioral determinant — longer memory can destabilize or support cooperation depending on the reasoning patterns it triggers.” For designers of agentic systems this means CoT is not a universally beneficial technique in multi-agent contexts.

Frequently Asked Questions

What exactly does 'forward-looking intent erosion' mean?

Forward-looking intent is an agent's tendency to think about future steps and the consequences of its decisions. Lexical analysis of 378,000 reasoning traces showed that the drop in cooperation is linked not to growing paranoia but to erosion of future-orientation: agents become reactive rather than strategic.

How does memory sanitization fix the problem?

The researchers replaced the visible history with synthetic cooperative records of the same token length. Cooperation was substantially restored, indicating that the trigger is memory content rather than prompt length. Fine-tuning with LoRA adapters trained on forward-looking traces also mitigates the degradation and transfers to new games.

Why does chain-of-thought amplify memory curse?

Paradoxically, removing explicit reasoning often reduces the cooperation collapse. The authors suggest that deliberation amplifies the memory curse: deep thinking over conflicting history leads agents to rationalize non-cooperative behaviors they would otherwise avoid.
