OVH Groupe, Europe's leading cloud provider and self-described "sovereign" AI player, announced June 11th that it has entered exclusive negotiations to acquire Gladia, a Paris-based AI startup specializing in speech-to-text (STT) technology. The deal aims to strengthen OVH's capabilities in multimodal and agentic generative AI, key battlegrounds as hyperscalers race to dominate the next wave of enterprise automation.

Founded in 2022 by voice AI specialists operating out of France's capital, Gladia has quietly built a substantial developer following through its single API offering. The platform handles real-time transcription and batch processing across more than 100 languages, converting raw audio into structured, actionable data for downstream applications. Today the company serves over 300,000 developers and 2,000 enterprise customers, including HeyGen, Livestorm, Attention, Circleback, Method Financial, Recall.ai, and Leexi, a roster that spans video generation, webinar platforms, sales intelligence, and meeting automation tools.

This acquisition marks OVH Groupe's second major purchase targeting its AI ambitions. The company is positioning this deal to reinforce its internal AI Lab, which has stated goals of developing "next generations of sovereign generative, agentic and multimodal AI technologies." By bringing Gladia's STT building blocks in-house, OVHcloud and its OVHai division can offer voice AI services directly to customers without relying on third-party transcription providers: a classic vertical integration play.

Gladia's customer base reveals where the real value lies. HeyGen alone has become a major player in synthetic video generation, requiring robust audio processing pipelines. Livestorm powers enterprise webinars at scale. Recall.ai builds meeting intelligence infrastructure. These aren't hobbyist projects. They're production systems handling sensitive business communications, which makes the "sovereign" positioning particularly attractive for European enterprises navigating GDPR and data residency requirements.

OVHcloud operates over 500,000 servers across 46 datacenters spanning four continents, serving roughly 1.6 million customers in more than 140 countries. The company has built its identity on controlling every layer of the stack, from custom server designs to datacenter construction to fiber network orchestration, and this acquisition extends that philosophy into AI inference and audio intelligence.

Key Takeaways

  • Gladia brings 300K+ developers and 2,000 enterprise customers using its speech-to-text API across 100+ languages
  • This is OVH Groupe's second acquisition targeting its AI Lab strategy for sovereign generative AI
  • Internalizing STT tech lets OVHcloud offer voice AI services natively within its cloud ecosystem

The Bottom Line

OVH isn't just buying a transcription API. It's securing talent, technology, and customer relationships in a critical multimodal AI layer. For enterprises tired of routing sensitive audio through American hyperscalers, this sovereign play might finally offer a credible European alternative that doesn't compromise on capability.