arXiv:2606.16723: AgentFairBench Measures Demographic Discrimination in LLM Agent Actions
AgentFairBench is the first benchmark that measures demographic inequality in the actual actions of LLM agents, not just in their responses, across employment, lending, and medical triage domains. It uses counterfactual flip rate and action-rate disparity metrics and tests four agent scaffolds. In a pilot of 864 decisions, Claude Haiku showed no demographic effect above the noise floor, and the paper warns that naive comparison of six groups can overestimate inequality by a factor of roughly 2.4.
This article was generated using artificial intelligence from primary sources.
A new preprint introduces AgentFairBench, the first benchmark that measures demographic discrimination in the actions of LLM agents rather than merely in their text responses.
What does AgentFairBench measure differently?
Existing fairness tests have mainly checked model responses, while AgentFairBench looks at the actions an agent actually takes: the decisions it makes in employment, lending, and medical triage tasks. It uses two metrics: counterfactual flip rate (how often a decision changes when only a demographic attribute changes) and action-rate disparity (the difference in action rates across groups). It tests four agent scaffolds, ranging from simple to tool-equipped.
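The two metrics can be sketched in a few lines of Python. This is a minimal illustration, not the paper's implementation: the record layout `(scenario_id, group, action)`, the function names, and the choice of "largest rate minus smallest rate" as the disparity are all assumptions made here, and the paper's exact definitions may differ.

```python
from collections import defaultdict
from itertools import combinations


def counterfactual_flip_rate(decisions):
    """Share of counterfactual pairs (same scenario, different group)
    in which the agent's action differs.

    decisions: iterable of (scenario_id, group, action) tuples
    (hypothetical layout, assumed for this sketch).
    """
    by_scenario = defaultdict(dict)
    for scenario_id, group, action in decisions:
        by_scenario[scenario_id][group] = action

    pairs = flips = 0
    for actions in by_scenario.values():
        for g1, g2 in combinations(sorted(actions), 2):
            pairs += 1
            flips += actions[g1] != actions[g2]
    return flips / pairs if pairs else 0.0


def action_rate_disparity(decisions, positive_action):
    """Gap between the highest and lowest per-group rate of
    `positive_action`, plus the per-group rates themselves."""
    totals = defaultdict(int)
    positives = defaultdict(int)
    for _, group, action in decisions:
        totals[group] += 1
        positives[group] += action == positive_action

    rates = {g: positives[g] / totals[g] for g in totals}
    if not rates:
        return 0.0, rates
    return max(rates.values()) - min(rates.values()), rates


# Toy data: two scenarios, each run once per group (values are made up).
decisions = [
    ("s1", "A", "approve"), ("s1", "B", "approve"),
    ("s2", "A", "approve"), ("s2", "B", "deny"),
]
flip_rate = counterfactual_flip_rate(decisions)
gap, rates = action_rate_disparity(decisions, "approve")
print(flip_rate, gap, rates)
```

In the toy data, one of the two counterfactual pairs changes outcome, so the flip rate is 0.5; group A is approved in both scenarios and group B in one, so the disparity is also 0.5.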
What are the key findings?
In a pilot of 864 decisions, Claude Haiku showed no demographic effect above the level of statistical noise. The paper also warns of a methodological trap: because of a statistical artifact, naively comparing six demographic groups can overestimate inequality by a factor of roughly 2.4. The design is low-budget and reproducible, so independent teams can replicate the tests easily.
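The summary does not spell out the artifact, so the following is only one plausible reading, labeled here as an assumption: when several groups share the same true action rate, the gap between the highest and lowest observed rate is inflated by sampling noise alone, and it grows with the number of groups compared. The simulation below assumes an even split of 144 decisions per group (864 / 6) and a true rate of 0.5; neither figure comes from the paper, and the sketch is not meant to reproduce the 2.4 estimate.

```python
import random


def simulate_gaps(n_groups=6, n_per_group=144, true_rate=0.5,
                  trials=2000, seed=0):
    """Simulate groups with an identical true action rate and return
    (mean max-min gap across all groups, mean gap between two groups).

    All parameter defaults are assumptions for illustration only.
    """
    rng = random.Random(seed)
    range_gaps, pair_gaps = [], []
    for _ in range(trials):
        # Observed action rate per group; every group has the same true rate.
        rates = [
            sum(rng.random() < true_rate for _ in range(n_per_group)) / n_per_group
            for _ in range(n_groups)
        ]
        range_gaps.append(max(rates) - min(rates))   # naive multi-group gap
        pair_gaps.append(abs(rates[0] - rates[1]))   # single pairwise gap
    return sum(range_gaps) / trials, sum(pair_gaps) / trials


mean_range, mean_pair = simulate_gaps()
inflation = mean_range / mean_pair
print(f"mean max-min gap: {mean_range:.3f}")
print(f"mean pairwise gap: {mean_pair:.3f}")
print(f"inflation factor: {inflation:.2f}")
```

Even though no group differs from any other in this setup, the max-min gap comes out clearly larger than a single pairwise gap. An audit that reports the raw max-min gap without a noise baseline would therefore read pure sampling variation as disparity.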
Why is this relevant for regulation?
The benchmark directly addresses EU AI Act requirements for auditing fairness in high-risk systems. As agents take on decisions with material consequences, measuring bias at the action level becomes a prerequisite for compliance and trust.
Frequently Asked Questions
- What does AgentFairBench measure?
- Demographic inequality in LLM agent actions across employment, lending, and medical triage — not just in text responses.
- Which metrics does it use?
- Counterfactual flip rate and action-rate disparity, evaluated across four agent scaffolds.