Models supported by Azure AI Agent Service

Agents are powered by a diverse set of models with different capabilities and price points. Model availability varies by region and cloud. Certain tools and capabilities require the latest models. The following models are available in the REST API and SDKs.

Azure OpenAI models

Azure OpenAI provides customers with choices on the hosting structure that fits their business and usage patterns. The service offers two main types of deployment:

  • Standard is offered with a global deployment option, routing traffic globally to provide higher throughput.
  • Provisioned is also offered with a global deployment option, allowing customers to purchase and deploy provisioned throughput units across Azure global infrastructure.

All deployments can perform the exact same inference operations, however the billing, scale, and performance are substantially different. To learn more about Azure OpenAI deployment types see our deployment types guide.

Azure AI Agent Service supports the same models as the chat completions API in Azure OpenAI, in the following regions.

Note

The following table is for pay-as-you-go. For information on Provisioned Throughput Unit (PTU) availability, see provisioned throughput in the Azure OpenAI documentation. GlobalStandard customers also have access to global standard models.

Region gpt-4o, 2024-05-13 gpt-4o, 2024-08-06 gpt-4o-mini, 2024-07-18 gpt-4, 0613 gpt-4, 1106-Preview gpt-4, 0125-Preview gpt-4, turbo-2024-04-09 gpt-4-32k, 0613 gpt-35-turbo, 0613 gpt-35-turbo, 1106 gpt-35-turbo, 0125 gpt-35-turbo-16k, 0613
australiaeast - - - - -
eastus - - - -
eastus2 - - - -
francecentral - - - - - -
japaneast - - - - - - - - -
norwayeast - - - - - - - - - - -
southindia - - - - - - - - -
swedencentral - -
uaenorth - - - - - - - - - - - -
uksouth - - - - - -
westus - - - - -
westus3 - - - - - -

Non-Microsoft models

The Azure AI Agent Service also supports the following models from the Azure AI Foundry model catalog.

  • Meta-Llama-405B-Instruct
  • Cohere-command-r-plus
  • Cohere-command-r

To use these models, you can use Azure AI Foundry portal to make a deployment, and then reference the deployment name in your agent. For example:

agent = project_client.agents.create_agent( model="llama-3", name="my-agent", instructions="You are a helpful agent" ) 

Next steps

Create a new Agent project