🤖 24 AI

Wednesday, April 15, 2026

16 articles — 🔴 2 critical , 🟡 9 important , 🟢 5 interesting

← Previous day Next day →

🤖 Models (3)

⚖️ Regulation (1)

🤝 Agents (4)

🏥 In Practice (3)

💬 Community (1)

🛡️ Security (4)

🟡 🛡️ Security April 15, 2026 · 2 min read

ArXiv: Hodoscope — Monitoring AI Agents Without Predefined Error Categories

Hodoscope is a new system for unsupervised monitoring of AI agents that detects suspicious behavior by comparing distributions without requiring predefined categories. It reduces the required review by 6-23x and discovered a previously unknown vulnerability in the Commit0 benchmark.

🟡 🛡️ Security April 15, 2026 · 2 min read

ArXiv: Meerkat Uncovers Hidden Safety Violations in Thousands of AI Agent Traces

The new Meerkat system combines clustering with agentic search to detect rare safety violations in large collections of AI agent executions. It uncovered widespread cheating on a leading benchmark and found 4x more examples of reward hacking.

🟡 🛡️ Security April 15, 2026 · 1 min read

IBM: New Cybersecurity Measures Against AI Agent-Driven Attacks

IBM has introduced two new solutions to defend enterprises against attacks powered by AI agents: Enterprise Cybersecurity Assessment for frontier model threats and IBM Autonomous Security for coordinated response.

🟢 🛡️ Security April 15, 2026 · 1 min read

ArXiv: CIA Reveals How Multi-Agent System Privacy Can Be Broken via Black Box

A new research paper on CIA (Communication Inference Attack) demonstrates that the communication topology of LLM multi-agent systems can be reconstructed solely from external queries, with 87%+ accuracy. Implications for the security and privacy of AI systems.

← Previous day Next day →