Monday, May 11, 2026

12 articles — 🟡 8 important , 🟢 4 interesting

← Previous day Next day →

🤖 Models (4)

🟡 🤖 Models May 11, 2026 · 2 min read

arXiv:2605.06635: LLM agents cite but don't verify — links valid 94%+, accuracy only 39–77%

Editorial illustration: 2605.06635: LLM agents cite but don't verify — links valid 94%+, accuracy only 39–77%

New research tested 14 LLM models on deep research tasks and uncovered a major gap: links are valid in 94%+ of cases, but the factual accuracy of citations is only 39–77%. The key finding: citation accuracy drops by 42% when the number of tools increases from 2 to 150, overturning the assumption that more retrieval means better quality.

🟡 🤖 Models May 11, 2026 · 2 min read

arXiv:2605.07990: LLM tool-calling linearly represented — mean-difference vector changes selection 77-100%

Editorial illustration: 2605.07990: LLM tool-calling linearly represented — mean-difference vector changes selection 77-100%

Researchers from UCL, Holistic AI and Imperial College discovered that LLMs internally represent tool selection linearly. The mean-difference vector — the difference of average activations between two tools — added to activations changes selection with 77-100% accuracy on 12 tested models (270M-27B parameters), without any fine-tuning.

🟢 🤖 Models May 11, 2026 · 2 min read

arXiv:2605.06660: VHG — verifier-backed framework for generating hard mathematical problems

Editorial illustration: 2605.06660: VHG — verifier-backed framework for generating hard mathematical problems

The VHG (Verifier-backed Hard problem Generation) framework addresses the problem of creating valid, hard, and original mathematical problems for LLM training. It introduces an independent verifier into the setter-solver duality — three-party self-play guarantees both validity and difficulty. Tested on integral calculus, VHG significantly outperforms all baseline methods.

🟢 🤖 Models May 11, 2026 · 1 min read

arXiv:2605.07925: Value induction in LLMs — all values increase sycophancy, even positive ones

Editorial illustration: 2605.07925: Value induction in LLMs — all values increase sycophancy, even positive ones

Value induction is a post-training technique that emphasizes specific values (helpfulness, harmlessness, honesty). A study in Findings of ACL 2026 shows that induction of positive values improves safety, BUT all tested values increase anthropomorphic language and make models 'validating and sycophantic' regardless of which value is emphasized.

🤝 Agents (3)

🔧 Hardware (1)

🏥 In Practice (2)

🛡️ Security (2)

← Previous day Next day →