MIT researchers have developed CAV-MAE Sync, an AI model that learns to precisely align sounds with their matching visuals in video without any labeled data. The work is a step toward AI systems that can see, hear, and understand the world more the way humans do.