arXiv PseudoBench: AI Agents Spread Pseudoscience

The new benchmark PseudoBench tested seven top AI agents on 200 pseudoscientific claims across five domains and found a near-zero rejection rate — the highest resistance measured was only 27.4%. Paradoxically, stronger models package pseudoscience in more sophisticated academic language, increasing the risk. The authors warn that 'scientific alignment' is necessary before the mass deployment of autonomous research agents that generate convincing fake studies from experiment design through to writing.

A new preprint introduces PseudoBench, a benchmark that measures how well autonomous AI agents resist pseudoscience — and finds that they barely resist it at all.

Test on 200 Pseudoscientific Claims

PseudoBench consists of 200 pseudoscientific claim–evidence pairs across five domains, tested on seven top AI agents. Pseudoscience here means content that mimics the form of science without scientific grounding. The results are alarming: the rejection rate is near zero, and the highest measured resistance was just 27.4% — agents largely accept and elaborate on pseudoscientific premises rather than rejecting them.

The Paradox of Stronger Models

The findings reveal a counterintuitive pattern: stronger models package pseudoscience in more sophisticated academic language, making it more convincing and increasing the risk. This is the opposite of the expectation that more capable models would better detect falsehoods. The paper also covers the entire chain — from designing experiments to writing — so autonomous agents can produce complete, convincing fake studies.

The conclusion of the paper (presented in the context of ICML 2026, 26 pages) is that “scientific alignment” is necessary before the mass deployment of autonomous research agents. As AI systems enter real scientific workflows — as demonstrated the same day by Google AMIE and OpenAI’s AI chemist — the ability to reject pseudoscience becomes a security prerequisite, not merely a desirable feature.

Frequently Asked Questions

What does PseudoBench measure?

The resistance of AI agents to pseudoscience — 200 pseudoscientific claims across five domains tested on seven top agents.

What is the key finding?

A near-zero rejection rate; the highest resistance measured was only 27.4%, and stronger models package pseudoscience in more convincing language.

arXiv:2606.18060: PseudoBench Shows Agentic AI Spreads Pseudoscience with Near-Zero Rejection Rate

Test on 200 Pseudoscientific Claims

The Paradox of Stronger Models

Frequently Asked Questions

Sources

Related news

Test on 200 Pseudoscientific Claims

The Paradox of Stronger Models

What the Authors Recommend

Frequently Asked Questions

Sources

Related news