Anthropic: CAD 10 million for Canadian AI and data on Claude usage in Canada

Editorial illustration: map of Canada with university center markers and an AI adoption growth chart

Anthropic is investing CAD 10 million in 8 Canadian institutions with a focus on AI safety, healthcare, and low-resource languages. A simultaneously published Economic Index reveals that Canada accounts for 2.6% of global Claude.ai traffic, with per-capita adoption 4.4 times higher than expected.

🟡 🤖 Models

Anthropic: Claude for Teachers — free Claude for US K-12 teachers

Editorial illustration: a teacher using an AI assistant on a tablet in a classroom in front of the board

Anthropic has launched a free version of Claude for verified K-12 teachers in the US, available until June 2027. The program includes a library of teaching skills, a curriculum aligned with educational standards from all 50 states, and integrations with 9 educational tools.

🟡 🤖 Models

arXiv:2607.11598: interaction as the 'third axis' of test-time compute removes up to 74% of errors

Diagram of the three axes of test-time compute: longer thinking, best-of-N sampling, and interaction with tools

Test-time compute is the additional computation a model spends during inference to produce a better answer. Bojie Li and Noah Shi define interaction with external tools as the third axis of test-time compute — alongside longer thinking and best-of-N sampling. A proposer-reviewer system achieves a 100% pass rate, while self-thinking and best-of-N plateau.

Latest in AI July 14, 2026

🟡 🤝 Agents July 14, 2026 · 2 min read

arXiv:2607.11185: SCALECUA scales computer-use agents with RL — 68.7% on OSWorld

Editorial illustration: AI agent navigating a computer graphical interface with a reinforcement learning reward-and-penalty loop

SCALECUA is a new framework from Tsinghua/THUDM researchers that scales computer-use agents using online reinforcement learning, achieving a new SOTA result of 68.7% on the OSWorld benchmark and 54.0% on ScienceBoard.

🟡 🏥 In Practice July 14, 2026 · 2 min read

Anthropic: Claude Code v2.1.208 and v2.1.209 — screen reader, vim remaps, and wrapper support

Editorial illustration: terminal with Claude Code interface, accessibility icon, and vim shortcuts

Claude Code v2.1.208 introduces three new capabilities: an opt-in screen reader mode for accessibility, vim insert-mode key remapping, and a corporate process wrapper covering all self-spawned processes. Hotfix v2.1.209 immediately fixed a bug that was blocking dialogs in background agent sessions.

🟡 🛡️ Security July 14, 2026 · 2 min read

GitHub: AI security detections on pull requests and /security-review in the Copilot app

Editorial illustration: GitHub pull request interface with highlighted AI security warnings and a Copilot chat window with the /security-review command

GitHub has introduced AI-generated security detections directly on pull requests in Code Scanning, along with a new /security-review command in the Copilot app that analyzes code for injection, XSS, path traversal, and weak cryptography.

🟡 🔧 Hardware July 14, 2026 · 2 min read

NVIDIA: performance per watt as the key metric — Blackwell GB300 up to 25× more efficient than Hopper

Editorial illustration: NVIDIA Blackwell GB300 NVL72 chip with a graphical efficiency comparison against the previous Hopper generation

NVIDIA Blackwell GB300 NVL72 is up to 25× more efficient than Hopper measured by performance per watt on the DeepSeek V4 Pro model, and software-only optimizations within a single month delivered up to a 5× improvement.

🟢 📦 Open Source July 14, 2026 · 2 min read

AMD: LogsLop — open-source tool compresses large log files to 11% and saves 78% of tokens

Terminal view of the LogsLop tool compressing a log file and displaying the reduced line count

AMD has released LogsLop, an open-source Python CLI tool that compresses large log files through normalization and clustering. On 73 real log files it reduces the line count to 11% and bytes to 7.5%, and with Llama 3.3 70B it uses 78% fewer tokens.

🟢 🤝 Agents July 14, 2026 · 2 min read

arXiv:2607.10891: SETA — 4,500+ RL environments boost terminal agent results

Diagram of the SETA system for generating RL environments with SETA-Synth and SETA-Evol components for training terminal agents

A terminal agent is an AI agent that solves tasks through commands in a terminal or shell. SETA (Shen et al., Oxford, KAUST) generates 4,500+ verifiable RL environments for training terminal agents. DeepSeek-V4-Flash improves pass@1 from 40% to 43% and pass@5 from 54% to 58% on Terminal-Bench 2.0.

🟢 🏥 In Practice July 14, 2026 · 2 min read

arXiv:2607.11307: ToMap — multi-agent proof autoformalization +19% above SOTA

Diagram of the ToMap pipeline with three agents: Decomposer, Formalizer, and Prover for mathematical proof autoformalization

Autoformalization is the automatic translation of natural-language mathematics into a machine-verifiable formal form. The ToMap pipeline by Tian-Shuo Liu and collaborators achieves a 19.0% improvement over the previous SOTA method on the ProofFlowBench dataset. The key innovation is concentrating test-time compute on the Decomposer as the bottleneck instead of distributing it evenly.

🟢 🤝 Agents July 14, 2026 · 2 min read

AWS: Strands Agents and Bedrock for multi-agent B2B buyer prospecting

Diagram of Swarm and Graph orchestration patterns for B2B buyer prospecting with Amazon Bedrock

Multi-agent orchestration is the coordination of multiple specialized AI agents on a shared task. Thrad.ai implemented a system for finding B2B buyers across 6 sources using Amazon Bedrock and the Strands Agents SDK. The Swarm pattern achieves email quality of 8.2/10, while the Graph pattern costs 25% less and runs 28% faster.

🟢 📦 Open Source July 14, 2026 · 2 min read

CNCF: Is the Kubernetes Pod the right unit for AI agents?

Diagram of a Kubernetes WorkerPool architecture with multiple suspended AI agents on a single Worker

Lin Sun from Solo.io questions whether the Kubernetes Pod is the right abstraction for deploying AI agents. Agents are ephemeral — they activate briefly, and a dedicated idle Pod wastes resources. He proposes a new 'agent-substrate' model with Actors and WorkerPools.

🟢 💬 Community July 14, 2026 · 2 min read

Google DeepMind: ATL Saathi — Gemini tool for Indian teachers targets 11 million students

A teacher in an Indian school using the ATL Saathi application on a tablet in a STEM lab

Google DeepMind and the Atal Innovation Mission have launched ATL Saathi, a Gemini-powered web application that helps teachers in Indian school STEM labs. It is piloting in 100 schools and targets more than 11 million students, with support for 8 languages and 12 teaching modules.

Previous edition July 13, 2026

All news from July 13, 2026 →

🟡 🤝 Agents July 13, 2026 · 3 min read

Allen Institute: What Building the Shippy Agent Taught Us About Reliable AI Agents

Editorial illustration: architecture diagram of the Soul-Skills-Config AI agent with per-user Kubernetes isolation

Allen Institute for AI published a detailed analysis of the maritime agent Shippy's architecture, which serves 70+ countries on Claude Opus 4.6. The key takeaway: agent reliability depends less on model strength and more on deterministic tools, isolated infrastructure, and evaluations grounded in real operational workflows.

🟡 🔧 Hardware July 13, 2026 · 2 min read

AMD: GEAK Agent Automatically Optimized DeepSeek-V4 MLA Kernel on MI355 with Up to 9× Speedup

Editorial illustration: diagram of AMD MI355 GPU accelerator with the GEAK agent generating an optimized Triton kernel

AMD's GEAK agent for automated GPU kernel optimization migrated the DeepSeek-V4 MLA kernel from PyTorch to Triton for MI355 accelerators. Results show up to 9.13× prefill speedup, 4.94× geomean decode speedup, and 2.10× higher end-to-end throughput in the SGLang framework.

🟡 🤖 Models July 13, 2026 · 2 min read

Anthropic: Claude's Values Vary by Model and Language

Editorial illustration: graphic showing the four value axes of Claude's models distributed across languages

Anthropic analyzed 309,815 anonymized Claude.ai conversations and found statistically significant differences in expressed values between models Sonnet 4.6, Opus 4.6, and Opus 4.7, as well as across 20 analyzed languages — Hindi shows the highest warmth, while Russian and English show the highest rigor.

🟡 🤖 Models July 13, 2026 · 2 min read

arXiv:2607.09375: Li Auto Presents Mach-Mind-4-Flash — 35B MoE with 3B Active Parameters

Li Auto, a Chinese automaker, has released its own foundation model Mach-Mind-4-Flash. It is a 35-billion parameter MoE model that activates only 3 billion parameters per input, and through a three-stage training pipeline achieves performance in the 100B+ model class with dramatically lower inference costs.

🟡 🤝 Agents July 13, 2026 · 2 min read

arXiv:2607.09493: Shared Selective Persistent Memory for Agents — 96% Success at 97× Lower Token Cost

Editorial illustration: graph showing the rise in agent success with selective memory versus a declining line for full history

A new architectural framework for agentic LLM systems introduces selective persistent memory that retains four categories of reusable context while discarding transient reasoning traces. The result is 96% task completion success and 97× lower token cost per call.

🟡 📦 Open Source July 13, 2026 · 2 min read

arXiv:2607.09424: Soofi S — Sovereign Open-Source Model for German and English Trained on Telekom's Cloud

Editorial illustration: open language model Soofi S with Deutsche Telekom cloud infrastructure and EU flag markings

Soofi S is a 30B MoE hybrid Mamba-Transformer model with 3B active parameters pre-trained on approximately 27 trillion tokens, including enhanced German language data. Trained exclusively on Deutsche Telekom's cloud infrastructure, it outperforms all European sovereign alternatives and leads the code benchmark among 17 open models.

Earlier news

Saturday, July 11, 2026

6 articles →

🟡 💬 Community July 11, 2026 · 2 min read

arXiv:2607.07779: Terence Tao among authors of paper on limits of large language models in frontier mathematics

Editorial illustration: a blackboard covered in formal proofs with a neural network passing through it

'From Solvers to Research' is an arXiv paper, co-authored by Fields Medal recipient Terence Tao, examining whether LLM agents can perform research-level mathematics through auto-formalization and proof synthesis. The paper identifies systematic weaknesses in existing proof-synthesis systems and maps the boundary between solving problems and genuine research.

🟡 🛡️ Security July 11, 2026 · 2 min read

arXiv:2607.08395: Token-Flow Firewall — runtime monitoring that protects long-running AI agents and reduces attack success rate to 12.5%

Editorial illustration: a firewall filtering a stream of words towards an AI agent at work

Token-Flow Firewall is a defensive mechanism presented on arXiv that monitors token flows in natural language in real time for persistent AI agents. In tests against prompt injection and other adversarial attacks it reduces the attack success rate to 12.5%, offering a runtime protection layer instead of relying solely on model training.

🟡 🏥 In Practice July 11, 2026 · 2 min read

Anthropic: Claude Code v2.1.207 enables auto mode on all enterprise clouds, switches default model to Opus 4.8, and closes shell-injection vulnerability

Editorial illustration: terminal with an automatic model selector above three cloud provider logos

Claude Code v2.1.207 is a new release of Anthropic's CLI tool that makes 'auto mode' automatically available on Amazon Bedrock, Google Vertex AI, and Microsoft Foundry without an opt-in variable. The release closes a shell-injection security vulnerability in hook configuration and switches the default model to Claude Opus 4.8 on Bedrock, Vertex, and the Claude platform on AWS.

🟢 🤝 Agents July 11, 2026 · 2 min read

arXiv:2607.08093: CausalDS — a new benchmark measuring causal reasoning of AI agents in data-science tasks

Editorial illustration: an agent separating two intertwined curves labeled cause and correlation

CausalDS is a benchmark presented on arXiv that tests whether AI agents can distinguish causality from correlation within data-science workflows. It fills a gap in the evaluation of autonomous analytical agents, where standard tests measure execution accuracy but not the ability to reason correctly about causation.

Friday, July 10, 2026

12 articles →

🟡 🛡️ Security July 10, 2026 · 2 min read

arXiv:2607.08173: 'Overthinking' — Amplified Reasoning Forces Reasoning Models to Reveal Learned Secrets, New Extraction Attack at ICML 2026

Editorial illustration: a brain made of thought loops with locked documents leaking through a crack

Overthinking is a paper accepted at ICML 2026 showing that amplifying the reasoning weight in large language models can extract hidden learned information the model would not otherwise expose. The finding opens a new class of extraction attacks targeting reasoning models such as o1, DeepSeek R1, and Claude with extended thinking.

🟡 🤖 Models July 10, 2026 · 2 min read

arXiv:2607.08733: 'Super Weights' Explain Why Selective Fine-Tuning Fails — Paper Accepted at COLM 2026

Editorial illustration: a network of parameters in which a few bright nodes hold the entire structure together

Super Weights in LLMs is a paper accepted at COLM 2026 that identifies 'super weights' — a small number of parameters whose modification disproportionately changes a language model's behavior. The paper shows that these weights explain why selective fine-tuning of only certain layers often fails, with direct implications for PEFT and LoRA approaches.

🟡 🏥 In Practice July 10, 2026 · 2 min read

AWS: SageMaker Brings Serverless Fine-Tuning for NVIDIA Nemotron 3 Models with SFT, RLVR, and RLAIF Techniques

Editorial illustration: a modular model being refined through three automated pipelines with no servers in sight

Amazon SageMaker AI has introduced serverless customization for NVIDIA Nemotron 3 models, requiring no infrastructure management. Three techniques are available: SFT (supervised fine-tuning), RLVR (reinforcement learning with verifiable rewards), and RLAIF (reinforcement learning from AI feedback), making advanced RL methods accessible to enterprise teams without ML infrastructure expertise.

🟡 🛡️ Security July 10, 2026 · 2 min read

GitHub: CodeQL 2.26 Introduces AI Prompt Injection Detection — First Mainstream SAST Tool to Treat AI Attacks on Par with Classic Ones

Editorial illustration: a code scanner catching a malicious message embedded in an AI prompt

CodeQL 2.26.0 is the new version of GitHub's static security analysis tool, introducing detection of AI prompt injection attacks as a new analysis type, along with support for Kotlin 2.4.0. It is the first integration of an AI-specific attack vector into a mainstream SAST tool, bringing prompt injection into the same security workflows as XSS and SQL injection.

Thursday, July 9, 2026

16 articles →

🔴 🤖 Models July 9, 2026 · 2 min read

Google: SensorFM — foundation model trained on one trillion minutes of wearable data wins on 34 of 35 health tasks

Editorial illustration: smartwatch with waves of biometric signals flowing into a neural network

SensorFM is Google's foundation model for health data from wearable devices, trained on more than one trillion minutes of signals from Fitbit and Pixel Watch devices worn by 5 million users in over 100 countries. The model outperforms specialized approaches on 34 of 35 tasks, with +9% AUC on classification and +21% correlation on regression.

🔴 🤖 Models July 9, 2026 · 2 min read

Microsoft: Aurora 1.5 open-source model outperforms ECMWF ensemble on 88.9% of variables — new standard in AI weather forecasting

Editorial illustration: globe with swirling weather fronts and data network

Aurora 1.5 is Microsoft's open-source foundation model for the Earth system that outperforms the ECMWF ensemble forecast on 88.9% of evaluated variables and horizons. The new release adds 22 meteorological variables, hourly resolution, and probabilistic ensemble forecasting — achieving around 33% lower track error for Hurricane Helene compared to the original Aurora.

🔴 🤖 Models July 9, 2026 · 2 min read

OpenAI: GPT-5.6 arrives in three variants — Sol, Terra and Luna, with multi-agent orchestration and same-day availability in GitHub Copilot

Editorial illustration: three planetary spheres (Sol, Terra, Luna) connected by data flows

GPT-5.6 is OpenAI's new model family with three variants: Sol (flagship for complex reasoning), Terra (balanced), and Luna (high-volume, cost-efficient). It introduces Programmatic Tool Calling, explicit prompt cache control, persisted reasoning and multi-agent orchestration in beta — available from day one in GitHub Copilot.

🟡 💬 Community July 9, 2026 · 2 min read

Anthropic: Nobel laureate Ben Bernanke, former Fed chair, joins the Long-Term Benefit Trust — the body overseeing the company's mission

Editorial illustration: classical bank building whose columns transition into neural networks

Ben Bernanke is the former chair of the US Federal Reserve (2006–2014) and 2022 Nobel laureate in economics, appointed by Anthropic to its Long-Term Benefit Trust. The LTBT is an independent body with no equity or profit interest that oversees the company's public benefit mission — Bernanke brings expertise on the economic impacts of AI.

Wednesday, July 8, 2026

15 articles →

🔴 🤖 Models July 8, 2026 · 5 min read

Mistral Robostral Navigate: Robotic AI That Navigates Using Only an RGB Camera

Editorial illustration: Mistral Robostral embodied robot navigation model based purely on RGB vision

Mistral introduced Robostral Navigate, its first model for embodied robotic navigation with 8 billion parameters. Using only a single RGB camera — no LiDAR or depth sensors — it achieves 76.6% success on the R2R-CE benchmark for unseen environments and surpasses multi-sensor competitors by 4.5 percentage points.

🔴 🤖 Models July 8, 2026 · 4 min read

OpenAI Launches GPT-Live: Voice Model for Lifelike AI Conversations

Editorial illustration: OpenAI GPT-Live voice model integrated into the ChatGPT Voice interface

OpenAI introduced GPT-Live, a new voice model designed for natural, lifelike conversational AI interactions. Integrated into ChatGPT Voice from day one, it arrives as part of an accelerated voice model development cycle — just two days after GPT-Realtime-2.1. Technical specifications and pricing have not yet been published.

🔴 📦 Open Source July 8, 2026 · 5 min read

PyTorch 2.13: Up to 4× Less GPU Memory for LLM Training and 12.3× Faster FlexAttention

Editorial illustration: PyTorch 2.13 new release with key improvements for AI training and inference

PyTorch 2.13 ships with 3,328 commits from 526 contributors. Key highlights: nn.LinearCrossEntropyLoss cuts peak GPU memory footprint by up to 4× for LLM training, FlexAttention on Apple Silicon achieves up to 12.3× speedup, and the new torchcomms backend modernizes distributed training.

🟡 🤖 Models July 8, 2026 · 4 min read

Anthropic Develops an 'Off Switch' for Dangerous Knowledge: GRAM Isolates Dual-Use Capabilities into Removable Modules

Editorial illustration: Anthropic GRAM interchangeable knowledge modules for controlling and removing dual-use AI capabilities

Anthropic and AE Studio announce GRAM (Gradient-Routed Auxiliary Modules) — a method that isolates dual-use knowledge such as virology, cybersecurity, and nuclear physics into removable neural modules during training, enabling a single training run to produce multiple model variants with different capability sets.

Tuesday, July 7, 2026

15 articles →

🔴 ⚖️ Regulation July 7, 2026 · 5 min read

EU Action Plan on Cybersecurity and AI: Commission Coordinates Defense Without New Laws

Editorial illustration: European Commission action plan on cybersecurity and AI

The European Commission today presented a joint Action Plan on Cybersecurity and Artificial Intelligence that builds on the existing EU legal framework, coordinates member states and industry, and addresses AI simultaneously as an attack vector and a defensive tool.

🔴 🤖 Models July 7, 2026 · 5 min read

Meta Launches Muse Image and Muse Video: Agentic AI That Self-Corrects Its Own Mistakes

Editorial illustration: Meta Muse generative AI for creating images and video content

Meta Superintelligence Labs has unveiled Muse Image and Muse Video — models that operate as agents, internally invoking code and web-search tools, ranking #2 and #3 on the Arena leaderboard, with mandatory Content Seal watermarking.

🟡 🛡️ Security July 7, 2026 · 4 min read

AISI: Frontier AI Models Found Critical Cloud Vulnerabilities for Under £150

Editorial illustration: AISI uses frontier AI for automatic cloud misconfiguration discovery

The UK AI Security Institute used frontier models to audit its own research platform and found a five-step attack chain that had previously escaped standard tools — all for under £150 in token costs.

🟡 🛡️ Security July 7, 2026 · 4 min read

Agent Data Injection: A New Attack Class That Bypasses AI Agent Defenses

Editorial illustration: data injection attacks on AI agents and Claude Code vulnerabilities

Researchers from Seoul National University and the Universities of Indiana and Wisconsin introduce Agent Data Injection (ADI) — a new attack type that embeds malicious data into trusted data structures of agents, achieving arbitrary click attacks and remote code execution on Claude Code, Codex, and Gemini CLI while bypassing existing defenses.

View full archive →