How do I access Microsoft MAI models?

Sign up for Microsoft Foundry at foundry.microsoft.com. Create a resource, get an API key, and call the MAI model endpoints. You can also try them free in MAI Playground at microsoft.ai.

MAI Playground offers free trials. Azure Foundry uses pay-per-use pricing — MAI models are priced aggressively against OpenAI and competitors. Image and voice generation have per-request pricing.

Can I use MAI models with my existing Azure OpenAI code?

Yes, mostly. Microsoft Foundry shares authentication with Azure OpenAI services. You switch the model parameter (e.g., 'mai-image-2' instead of 'dall-e-3') and most code continues to work with minor endpoint changes.

Quick Answer

How to Use Microsoft MAI Models on Foundry (2026)

Published: April 5, 2026

How to Use Microsoft MAI Models on Foundry

A practical guide to using Microsoft’s MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 models.

Last verified: April 2026

What You Get

Model	Purpose	Status
MAI-Transcribe-1	Speech-to-text	Public preview
MAI-Voice-1	Text-to-speech	Public preview
MAI-Image-2	Image generation	Public preview

All three are in public preview on Microsoft Foundry (Azure) and MAI Playground.

Option 1: MAI Playground (Quick Start)

Go to microsoft.ai/playground
Sign in with your Microsoft account
Pick a model (Transcribe, Voice, or Image)
Try it with a prompt or upload audio
See real-time output

Perfect for exploring capabilities before committing to API integration.

Option 2: Azure Foundry (Production)

Step 1: Set Up Azure Foundry

# Install Azure CLI if you don't have it
curl -sL https://aka.ms/InstallAzureCLIDeb | sudo bash

# Login
az login

# Create a Foundry resource
az cognitiveservices account create \
  --name my-foundry \
  --resource-group my-rg \
  --kind AIFoundry \
  --sku S0 \
  --location eastus

Step 2: Get Your API Key

az cognitiveservices account keys list \
  --name my-foundry \
  --resource-group my-rg

Step 3: Call the APIs

MAI-Image-2:

curl -X POST https://my-foundry.cognitiveservices.azure.com/v1/images/generations \
  -H "api-key: YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "mai-image-2",
    "prompt": "a futuristic city at sunset, cinematic lighting",
    "size": "1024x1024",
    "n": 1
  }'

MAI-Transcribe-1:

curl -X POST https://my-foundry.cognitiveservices.azure.com/v1/audio/transcriptions \
  -H "api-key: YOUR_KEY" \
  -F "model=mai-transcribe-1" \
  -F "[email protected]"

MAI-Voice-1:

curl -X POST https://my-foundry.cognitiveservices.azure.com/v1/audio/speech \
  -H "api-key: YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "mai-voice-1",
    "input": "Hello, this is MAI-Voice-1 speaking.",
    "voice": "alloy"
  }' \
  --output speech.mp3

Pricing (Public Preview)

Microsoft hasn’t published final pricing. Preview pricing is aggressive — typically undercutting OpenAI equivalents by 20-40%. Check foundry.microsoft.com for current pricing.

Python SDK Example

from openai import AzureOpenAI

client = AzureOpenAI(
    api_key="YOUR_KEY",
    api_version="2024-12-01-preview",
    azure_endpoint="https://my-foundry.cognitiveservices.azure.com"
)

# Generate image with MAI-Image-2
response = client.images.generate(
    model="mai-image-2",
    prompt="a futuristic city at sunset",
    size="1024x1024",
    n=1
)
print(response.data[0].url)

When to Use MAI vs Alternatives

Scenario	Pick
Azure-heavy org	MAI models
Microsoft 365 integration	MAI models
Need OpenAI’s ecosystem	Azure OpenAI
Maximum quality (art)	Midjourney
Open source	Stable Diffusion / Flux

Tips

Start in Playground — Test prompts before writing code
Use streaming — For voice and transcription, stream for better UX
Monitor quota — Preview has lower rate limits than production
Pin API version — Preview APIs change; pin specific versions

Last verified: April 2026