Thursday, April 2, 2026


Microsoft takes on AI rivals with three new foundational models

Microsoft AI, the tech giant's research lab, announced the release of three foundational AI models on Thursday that can generate text, voice, and images.

The release signals Microsoft's continued push to build out its own stack of multimodal AI models, and to compete with rival AI labs, even though it remains tied to OpenAI.

MAI-Transcribe-1 transcribes speech across 25 different languages into text and is 2.5 times faster than Microsoft's Azure Fast offering, according to a company press release. MAI-Voice-1 is an audio-generating model that lets users generate 60 seconds of audio in one second and create a custom voice. MAI-Image-2 is an image-generating model.

MAI-Image-2 was originally released on March 19 on MAI Playground, a new testing tool for large language models. Now, all three models are being released on Microsoft Foundry, and the transcription and voice models are available in MAI Playground as well.

The models were developed by Microsoft's MAI Superintelligence team, an AI research team that was formed and announced in November 2025 and is led by Mustafa Suleyman, the CEO of Microsoft AI.

“At Microsoft AI, we’re building Humanist AI. We have a distinct view when creating our AI models: putting humans at the center, optimizing for how people actually communicate, training for practical use,” Suleyman wrote in the blog post. “You’ll see more models from us soon in Foundry and directly in Microsoft products and experiences.”

In an increasingly crowded LLM market, MAI hopes a selling point for these models is that they're cheaper than those from Google and OpenAI, the company wrote in the blog post.

MAI-Transcribe-1 starts at $0.36 per hour. MAI-Voice-1 starts at $22 per 1 million characters, and MAI-Image-2 starts at $5 per 1 million tokens for text input and $33 per 1 million tokens for image output.
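
To make those starting prices concrete, here is a rough back-of-the-envelope sketch in Python. The rates are the ones quoted above; the function names and the workload figures are hypothetical, and because these are "starts at" prices, an actual bill could come out higher.

```python
# Starting prices as quoted in the article (USD). Real billing may differ.
TRANSCRIBE_USD_PER_HOUR = 0.36
VOICE_USD_PER_MILLION_CHARS = 22.0
IMAGE_TEXT_IN_USD_PER_MILLION_TOKENS = 5.0
IMAGE_OUT_USD_PER_MILLION_TOKENS = 33.0


def transcription_cost(audio_hours: float) -> float:
    """Estimated cost of transcribing the given number of audio hours."""
    return audio_hours * TRANSCRIBE_USD_PER_HOUR


def voice_cost(characters: int) -> float:
    """Estimated cost of generating speech from the given number of characters."""
    return characters / 1_000_000 * VOICE_USD_PER_MILLION_CHARS


def image_cost(text_input_tokens: int, image_output_tokens: int) -> float:
    """Estimated image-model cost: text input tokens plus image output tokens."""
    return (
        text_input_tokens / 1_000_000 * IMAGE_TEXT_IN_USD_PER_MILLION_TOKENS
        + image_output_tokens / 1_000_000 * IMAGE_OUT_USD_PER_MILLION_TOKENS
    )


if __name__ == "__main__":
    # Hypothetical monthly workload, chosen only for illustration.
    print(f"100 hours of transcription: ${transcription_cost(100):.2f}")
    print(f"500,000 characters of speech: ${voice_cost(500_000):.2f}")
    print(f"1M text tokens in, 1M image tokens out: ${image_cost(1_000_000, 1_000_000):.2f}")
```

The estimate is purely linear in usage; it ignores any tiers, minimums, or discounts, none of which the article describes.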

Even as Microsoft releases its own models, Suleyman reaffirmed the company's commitment to its partnership with OpenAI in an interview with VentureBeat. A recent renegotiation of that partnership is what allowed Microsoft to actually pursue this superintelligence research, Suleyman told The Verge.

Microsoft has invested more than $13 billion in OpenAI and hosts the lab's models in its various products through a multi-year partnership. Microsoft takes the same stance with chips: it both produces its own and buys from outside suppliers.
