Wednesday, April 15, 2026

16 articles — 🔴 2 critical , 🟡 9 important , 🟢 5 interesting

🤖 Models (3)

🔴 🤖 Models April 15, 2026 · 2 min read

Anthropic: Claude Sonnet 4 and Opus 4 Retiring on June 15

Anthropic has announced the deprecation of the original Claude Sonnet 4 and Claude Opus 4 models. Both models will be removed from the API on June 15, 2026. Development teams should migrate to version 4.6 as soon as possible.

🟡 🤖 Models April 15, 2026 · 2 min read

ArXiv: Neurons Responsible for Harmful Responses in Large Language Models Identified

Causal analysis of mechanisms within LLMs reveals that harmful content originates in later model layers, primarily through MLP blocks. A small set of neurons in the final layer acts as a control mechanism for harmful responses.

🟡 🤖 Models April 15, 2026 · 1 min read

Google: Gemini Robotics-ER 1.6 Brings Instrument Reading and Spatial Understanding

Google has released Gemini Robotics-ER 1.6 with new instrument reading capabilities and improved spatial and physical understanding. The previous version 1.5 will be shut down on April 30.

⚖️ Regulation (1)

🟢 ⚖️ Regulation April 15, 2026 · 2 min read

OECD: The United Kingdom Sets a Global Standard for Government Algorithm Transparency

The OECD analyzes the UK's Algorithmic Transparency Recording Standard (ATRS), mandatory for central government since 2025. By March 2025, 125 records on algorithm use have been published. Estonia has already adopted the standard, and the OECD calls it 'world-leading.'

🤝 Agents (4)

🔴 🤝 Agents April 15, 2026 · 2 min read

ArXiv: Bans Work, Instructions Backfire — Empirical Study of Rules for AI Coding Agents

An analysis of 679 rule files and 25,532 rules from GitHub shows that prohibitions improve AI coding agents, but positive instructions actually hurt them. Random rules perform just as well as expertly written ones.

🟡 🤝 Agents April 15, 2026 · 1 min read

ArXiv: HORIZON — Where and Why AI Agents Fail on Long-Horizon Tasks

The new HORIZON benchmark systematically analyzes how LLM agents fail on long-horizon tasks. The research reveals that errors accumulate across multiple steps, and even the best models lose focus after 20+ actions.

🟡 🤝 Agents April 15, 2026 · 2 min read

ArXiv: PAC-BENCH — What Happens When AI Agents Must Keep Secrets While Collaborating?

The first benchmark for evaluating multi-AI-agent collaboration under privacy constraints. Results show that privacy significantly degrades collaboration quality and causes three types of errors including privacy-induced hallucinations.

🟢 🤝 Agents April 15, 2026 · 2 min read

ArXiv: SWE-AGILE — How Small Models Solve the Context Explosion in Coding Agents

SWE-AGILE introduces a dynamic context strategy with sliding windows and compressed summaries for AI coding agents. With a model of only 7-8B parameters, it achieves a new state-of-the-art on SWE-Bench-Verified, using only 2,200 training examples.

🏥 In Practice (3)

🟡 🏥 In Practice April 15, 2026 · 2 min read

GitHub: Free Code Security Assessment Uncovers Vulnerabilities in Minutes

GitHub launches a free Code Security Risk Assessment powered by the CodeQL engine. It scans up to 20 of the most active repositories per organization and displays vulnerabilities by severity, language, and rule. Copilot Autofix resolved 460,258 alerts in 2025.

🟡 🏥 In Practice April 15, 2026 · 1 min read

GitHub: Model Selection for Claude and Codex Agents Now Available

GitHub now allows developers to choose between multiple AI models when launching Claude and Codex coding agents. Available models include Claude Sonnet/Opus 4.5 and 4.6 as well as GPT-5.2/5.3/5.4-Codex.

🟢 🏥 In Practice April 15, 2026 · 1 min read

HuggingFace: HoloTab — Free AI Assistant That Automates Browser Tasks

HCompany has launched HoloTab on the HuggingFace platform, a free Chrome extension that uses AI to automate web tasks. The key innovation is Routines — record an action once, repeat it endlessly.

💬 Community (1)

🟢 💬 Community April 15, 2026 · 2 min read

Google: $120 Million for Global AI Opportunity and 100 Million People Trained

Google co-organizes the inaugural AI for the Economy Forum with MIT in Washington. Announced: 100 million people trained in digital skills globally, a new $120 million fund for AI education, and three new programs for healthcare, apprenticeships, and manufacturing.

🛡️ Security (4)

🟡 🛡️ Security April 15, 2026 · 2 min read

ArXiv: Hodoscope — Monitoring AI Agents Without Predefined Error Categories

Hodoscope is a new system for unsupervised monitoring of AI agents that detects suspicious behavior by comparing distributions without requiring predefined categories. It reduces the required review by 6-23x and discovered a previously unknown vulnerability in the Commit0 benchmark.

🟡 🛡️ Security April 15, 2026 · 2 min read

ArXiv: Meerkat Uncovers Hidden Safety Violations in Thousands of AI Agent Traces

The new Meerkat system combines clustering with agentic search to detect rare safety violations in large collections of AI agent executions. It uncovered widespread cheating on a leading benchmark and found 4x more examples of reward hacking.

🟡 🛡️ Security April 15, 2026 · 1 min read

IBM: New Cybersecurity Measures Against AI Agent-Driven Attacks

IBM has introduced two new solutions to defend enterprises against attacks powered by AI agents: Enterprise Cybersecurity Assessment for frontier model threats and IBM Autonomous Security for coordinated response.

🟢 🛡️ Security April 15, 2026 · 1 min read

ArXiv: CIA Reveals How Multi-Agent System Privacy Can Be Broken via Black Box

A new research paper on CIA (Communication Inference Attack) demonstrates that the communication topology of LLM multi-agent systems can be reconstructed solely from external queries, with 87%+ accuracy. Implications for the security and privacy of AI systems.

← Previous day Next day →