ArXiv: Bans Work, Instructions Backfire — Empirical Study of Rules for AI Coding Agents

A large empirical study with over 5,000 AI agent runs on the SWE-bench Verified benchmark delivers a surprising finding: rule files such as CLAUDE.md or .cursorrules don’t function the way developers think they do.

What did the researchers discover?

The research team analyzed 679 rule files collected from GitHub, containing a total of 25,532 individual rules. They tested how these rules affect the performance of AI coding agents.

Key findings:

Rules overall deliver a 7-14 percentage point improvement in task completion
Randomly generated rules perform just as well as expertly written ones — suggesting that context “priming” is at play, not specific instructions
Prohibitions (negative constraints like “never do X”) improve performance when applied individually
Positive instructions (prescriptions like “always use approach Y”) actively hurt performance — the agent makes more mistakes than with no rules at all

Why does this matter?

Millions of developers today use CLAUDE.md, .cursorrules, and similar rule files to guide AI assistants. This study suggests that the approach of “tell the agent what NOT to do” is far more effective than “tell it how to work.”

The researchers recommend: constrain what the agent must not do, rather than prescribing what it should. In other words, a short list of prohibitions outperforms lengthy best-practice guides.

Implications for the industry

This calls into question the popular practice of writing detailed rules for AI agents. It appears that agents perform better with clear boundaries than with detailed instructions — similar to human teams that also function better with clear constraints than with excessive micromanagement prescriptiveness.

ArXiv: Bans Work, Instructions Backfire — Empirical Study of Rules for AI Coding Agents

What did the researchers discover?

Why does this matter?

Implications for the industry

Sources

Related news