Amazon Nova 2 Sonic: Speech-to-Speech Foundation Model with End-to-End Latency Below 500ms and 30ms Audio Latency
Amazon Nova 2 Sonic is a new-generation speech-to-speech foundation model announced on May 14, 2026, through Amazon Bedrock. As a single speech-to-speech model, it eliminates the need for separate speech-to-text and text-to-speech services. It offers end-to-end latency below 500ms, audio latency below 30ms via the Stream edge network, native turn detection, barge-in support, and function calling during conversation. The Stream Vision Agents framework abstracts away the management of the bidirectional audio stream.
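
To make barge-in and mid-conversation function calling more concrete, here is a minimal Python sketch of how a client might react to events on a bidirectional audio stream. It is an illustration under stated assumptions, not the Nova 2 Sonic or Stream Vision Agents API: the event types (assistant_audio, user_speech_started, tool_call), their fields, and the ConversationState and handle_event names are all hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class ConversationState:
    """Client-side state for one voice session (hypothetical structure)."""
    playback: deque = field(default_factory=deque)  # assistant audio chunks awaiting playback
    tool_calls: list = field(default_factory=list)  # (name, arguments) pairs requested by the model
    interruptions: int = 0                          # number of barge-ins that cut off queued audio


def handle_event(state: ConversationState, event: dict) -> None:
    """Apply one stream event to the state. Event shapes are assumptions."""
    kind = event["type"]
    if kind == "assistant_audio":
        # Queue model speech for playback.
        state.playback.append(event["chunk"])
    elif kind == "user_speech_started":
        # Barge-in: the user spoke over the assistant, so drop any queued audio.
        if state.playback:
            state.playback.clear()
            state.interruptions += 1
    elif kind == "tool_call":
        # Function calling during conversation: record the request for execution.
        state.tool_calls.append((event["name"], event["arguments"]))
    # Unknown event types are ignored.
```

A real client would receive these events from the streaming session and feed them to handle_event one at a time. Keeping the state in a single object makes the interruption behavior easy to test without any audio hardware.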