Microsoft has introduced three new AI models that can generate text, voice and images. The new models- MAI-Transcribe-1, MAI-Voice-1 and MAI-Image-2- are now available through Microsoft Foundry and MAI Playground. The tech giant says these models focus on speed, accuracy and affordability.
MAI-Transcribe-1 is designed to convert speech into text. It supports transcription in the 25 most-used languages and is designed to perform well even in noisy, real-world environments. Microsoft says MAI-Transcribe-1 can process batch transcription about 2.5 times faster than the company’s existing Azure Fast offering. MAI-Transcribe-1 is ‘not just the most accurate, but also lightning fast,’ the company said.
Also read: Google launches Gemma 4 AI models: Features, capabilities and how to use
The second model, MAI-Voice-1, is designed to generate realistic AI voices. Microsoft says it can produce ‘natural, realistic speech, rich with nuance, emotional range and expression that preserves speaker identity even across long-form content.’
A key feature of MAI-Voice-1 is the ability for developers to create a custom AI voice using only a few seconds of recorded audio. Also, Microsoft claims this model can generate 60 seconds of audio in just one second.
Also read: OpenAI buys Sam Altman favourite tech show TBPN, internet calls it PR move
The third model, MAI-Image-2, focuses on image generation. The model is said to offer at least twice the image generation speed compared to previous systems. The MAI-Image-2 AI model has already started rolling out in Bing and PowerPoint.
‘MAI-Image-2 was created with photographers, designers, and visual storytellers that demand natural lighting, accurate skin tones and texture, and clear in-image text for diagrams, layouts, and graphics,’ the tech giant said.
MAI-Transcribe-1 starts at $0.36 per hour, while MAI-Voice-1 starts at $22 per 1M characters. MAI-Image-2 starts at $5 per 1M tokens for text input and $33 per 1M tokens for image output.
Also read: Google AI Pro plan now offers 5TB storage at no extra cost: How to get it