Google DeepMind Decoupled DiLoCo: 20× lower network bandwidth for AI training across geographically distributed datacenters
Google DeepMind has introduced Decoupled DiLoCo, a distributed architecture for training AI models. It reduces the required network bandwidth from 198 Gbps to 0.84 Gbps across 8 datacenters and achieves 88% goodput compared to 27% with conventional methods.