Wednesday, May 20, 2026

18 articles: 🔴 2 critical, 🟡 10 important, 🟢 6 interesting

🤖 Models (4)

🔴 🤖 Models May 20, 2026 · 3 min read

Google: Gemini 3.5 Flash and Pro — the fastest frontier models yet

Google unveiled Gemini 3.5 Flash and Pro at Google I/O 2026 — frontier models that are 4× faster than the competition, with a special focus on agentic tasks, the new Antigravity 2.0 developer platform, and Gemini Spark, a personal AI agent available 24/7.

🔴 🤖 Models May 20, 2026 · 3 min read

Google: Gemini Omni Flash brings native video generation from mixed inputs

Google unveiled Gemini Omni Flash at I/O 2026 — a new multimodal model that generates and edits video from a combination of images, audio, video, and text. Available immediately on YouTube Shorts, with mandatory SynthID digital watermarks on every generated clip.

🟡 🤖 Models May 20, 2026 · 2 min read

Google: ERA — AI system that automates scientific code writing

Google published ERA (Empirical Research Assistance) in Nature — a Gemini-powered system that uses tree search to evaluate thousands of computational approaches and automates the writing of expert scientific software. The Computational Discovery platform is already available to researchers.

🟢 🤖 Models May 20, 2026 · 2 min read

arXiv:2605.19660: OScaR — INT2 KV Cache Quantization Delivers 3× Faster Decoding

Researchers have published OScaR, a method that solves the fundamental problem of KV cache quantization in large language models. Using INT2 precision — just 2 bits per value — it achieves near-lossless accuracy, 3× faster decoding, 5.3× less memory, and 4.1× higher throughput compared to BF16 FlashDecoding-v2.

📦 Open Source (1)

⚖️ Regulation (3)

🤝 Agents (7)

🟡 🤝 Agents May 20, 2026 · 2 min read

Anthropic Claude Code: Live session scripting and security fixes in v2.1.145

Anthropic Claude Code v2.1.145 brings JSON output of live sessions for scripting, extended OTEL trace attributes for agent tracking, and fixes for a security vulnerability in bash command approval.

🟡 🤝 Agents May 20, 2026 · 2 min read

Anthropic: Claude for 276,000 KPMG employees in 138 countries

Anthropic and KPMG have entered into a strategic global alliance that gives all employees of KPMG, one of the four largest audit firms in the world, access to Claude. Claude is being embedded in KPMG's Digital Gateway, and KPMG becomes Anthropic's preferred partner for the private equity sector.

🟡 🤝 Agents May 20, 2026 · 2 min read

AWS: Three architectural patterns for scalable voice agents with Amazon Nova Sonic

AWS published a detailed guide for scalable voice agents using Amazon Nova Sonic and AgentCore Gateway. Three clear patterns — direct tools, sub-agents, and session segmentation — offer different tradeoffs between latency and complexity.

🟡 🤝 Agents May 20, 2026 · 2 min read

GitHub Copilot Gets Gemini 3.5 Flash: Speed and Quality for Everyday Coding

Google's Gemini 3.5 Flash model is becoming generally available for all GitHub Copilot plans. It promises near-Pro-tier quality combined with Flash-tier speed and lower cost, with emphasis on agentic workflows and multiple IDE environments.

🟢 🤝 Agents May 20, 2026 · 2 min read

arXiv:2605.18703: EnvFactory – RL training of tool-use agents with 5× fewer environments

EnvFactory is a new framework for automatically synthesizing executable training environments for tool-use AI agents. Using only 85 verified environments across 7 domains, it achieves +15% on BFCLv3 and +8.6% on MCP-Atlas — roughly 5× more efficient than comparable approaches.

🟢 🤝 Agents May 20, 2026 · 2 min read

arXiv:2605.18565: LongMINT — why AI agents forget everything you tell them

Researchers at the University of North Carolina have published LongMINT — the first benchmark that systematically measures how poorly AI agents manage memory in long, dynamic scenarios. Average accuracy is just 27.9%, worse than random guessing in many cases.

🟢 🤝 Agents May 20, 2026 · 2 min read

arXiv:2605.20173: 6 Architectural Patterns for Production LLM Agents

A new arXiv paper introduces the stochastic-deterministic boundary as the foundational design principle for production LLM agents and defines 6 composable runtime patterns — from hierarchical delegation to human-in-the-loop — selected according to three architectural concerns: coordination, state, and control.

🛡️ Security (3)
