Domain 1 β€” Module 1 of 8 13%
1 of 27 overall
Domain 1: Develop containerized solutions on Azure Free ⏱ ~10 min read

Containers for AI Workloads β€” Why and When

Why AI apps live in containers in 2026 β€” and what containers, registries, and orchestrators have to do with passing AI-200. Meet the four characters whose stories run through this whole course.

Why containers are the default for AI on Azure

Simple explanation

A container is a sealed lunchbox for your code. It packs your app, every library it needs, the right Python version, the right CUDA version β€” everything β€” into one self-contained box. You can hand the same lunchbox to a laptop, a test server, or a giant AI cluster and it tastes the same.

AI apps especially love containers because AI dependencies are fussy. Wrong NumPy version, wrong tokenizer, wrong CUDA driver β€” and your model breaks. Containers freeze β€œwhat works on my machine” so it works everywhere.

On Azure, you’ll meet four ways to run containers: App Service (a managed web host), Container Apps (serverless containers that scale to zero), AKS (full Kubernetes for control freaks), and Azure Container Registry (the warehouse where your container images live). The exam tests when to pick which.

The four hosting tiers at a glance

The four container tiers covered by AI-200 Domain 1
FeatureContainer RegistryApp ServiceContainer AppsAKS
What it isImage storage + build serviceManaged web app host (with container support)Serverless containers, scale to zeroFull managed Kubernetes
You manageImages, tags, replicationApp settings, slots, scaling rulesContainer revisions, ingress, scale rulesNodes, pods, networking, RBAC
Best forStoring every other tier's imagesWeb APIs, simple AI inference endpointsEvent-driven AI pipelines, microservices, scale-to-zeroComplex multi-service AI platforms, GPU pools
Scale modelN/A β€” it's storageManual + autoscale on metricsKEDA event-driven (queues, HTTP, custom)HPA, KEDA, cluster autoscaler β€” your choice
Operational costTiny (storage + bandwidth)Low β€” fully managedMedium β€” pay per active containerHigh β€” you run a cluster

Meet the four characters

This course follows four people building real AI workloads on Azure. You’ll see them across all 27 modules and again in the practice questions.

CharacterWho they areAI use case
🦘 Mira at Roo RoboticsBackend engineer, 18-person robotics startupVision models for warehouse inventory robots β€” needs GPU containers + event scaling
🌊 Theo at Tidewater HealthSenior platform engineer, 4500-staff hospital networkClinical AI assistant over patient records β€” RAG, secrets, audit logs are non-negotiable
β˜• Priya at BeanCraft CoffeeTech lead, 240-store coffee chainLoyalty app personalisation, real-time order recs, menu Q&A bot
πŸ‘¨β€πŸ’» LinFreelance Azure consultantBuilds AI POCs for SMB clients β€” values simplicity and time-to-deploy
Real-world example: how the four tiers fit together

Mira ships a warehouse robot. Here’s how all four containers tiers cooperate:

  1. Azure Container Registry holds the inference image (roo-vision:v3.4.1) β€” built nightly, signed, geo-replicated to two regions.
  2. Container Apps runs the image as a serverless inference endpoint. KEDA scales it from zero up to 50 replicas based on Service Bus queue depth.
  3. App Service hosts the operator dashboard β€” a Node.js admin UI in a container, on a Linux App Service plan.
  4. AKS runs the model training cluster β€” GPU node pools, scheduled jobs, MLflow tracking.

All four tiers pull from the same ACR. That single registry is the spine of the whole system.

What β€œdeveloping AI” means on this exam

The AI-200 exam title is Developing AI Cloud Solutions on Azure. The audience profile makes the focus clear: you are responsible for the back-end services and components of an AI solution. That distinction matters.

AI-102 / AI-103AI-200
Build the AI itself β€” pick the model, train, fine-tune, build the agentWire the AI into a production app β€” host it, scale it, secure it, observe it
Heavy on Foundry portal, model selection, RAG designHeavy on containers, Cosmos DB, pgvector, Service Bus, Key Vault, OpenTelemetry
Python + Foundry SDKPython + Azure SDKs + container tooling + KQL

If AI-103 is β€œbe the AI engineer”, AI-200 is β€œbe the cloud developer who ships the AI engineer’s work to production.”

Exam tip: read the question for 'where does the data live?'

Many AI-200 questions hinge on whether the answer needs container hosting, a vector database, a message bus, or a secret store. Build a habit: as you read each scenario, jot down the data path β€” where is the input coming from, where is state stored, where do events flow?

The right Azure service almost always falls out of that data-path sketch. Wrong answers are usually services that work, but don’t fit the data path.

How this course is organised

The exam’s four domains map directly to four parts of this course:

  1. Domain 1 β€” Containers on Azure (you are here): ACR, App Service, Container Apps, KEDA, AKS, troubleshooting.
  2. Domain 2 β€” AI data services: Cosmos DB NoSQL (incl. vector search + change feed), PostgreSQL with pgvector + RAG, Azure Managed Redis.
  3. Domain 3 β€” Connect to Azure services: Service Bus, Event Grid, Azure Functions.
  4. Domain 4 β€” Secure, monitor, troubleshoot: Key Vault, App Configuration, OpenTelemetry, KQL.

You’ll see the same four characters in each domain. Their problems get more specific as you go β€” Mira’s container hits Domain 1, her Cosmos vector index hits Domain 2, her Service Bus queue hits Domain 3, her OpenTelemetry traces hit Domain 4. The more you sit with the cast, the more β€œwhich Azure service?” answers itself.

Key terms

Question

What is a container image?

Click or press Enter to reveal answer

Answer

A read-only, layered file system bundle that includes an application, its runtime, libraries, and configuration. Built once, run anywhere that supports the OCI image format. The unit you store in Azure Container Registry.

Click to flip back

Question

What is Azure Container Registry (ACR)?

Click or press Enter to reveal answer

Answer

A managed Azure service for storing and managing OCI container images and Helm charts. Supports geo-replication, image signing (Notary v2), vulnerability scanning, and ACR Tasks for cloud-side image builds.

Click to flip back

Question

What is Azure Container Apps?

Click or press Enter to reveal answer

Answer

A serverless container hosting service built on Kubernetes + KEDA + Dapr + Envoy. You bring an image; Azure handles the cluster. Scales from zero to thousands of replicas based on HTTP traffic, queue depth, or custom metrics. Best fit for event-driven AI inference workers.

Click to flip back

Question

What is Azure Kubernetes Service (AKS)?

Click or press Enter to reveal answer

Answer

A managed Kubernetes service. You get a real Kubernetes cluster β€” kubectl, manifests, operators, all of it β€” without having to run the control plane yourself. Best fit when you need full K8s control: GPU node pools, complex networking, custom CRDs, multi-tenant platforms.

Click to flip back

Question

What is the difference between App Service container support and Azure Container Apps?

Click or press Enter to reveal answer

Answer

App Service is a managed web app platform that happens to support container images (single container per app, traditional autoscale, deployment slots). Container Apps is a serverless containers-first platform built on KEDA β€” it scales to zero, supports event-driven workloads, and runs multi-container apps with revisions. App Service β‰ˆ web app with a container; Container Apps β‰ˆ container with a web.

Click to flip back

Knowledge check

Knowledge Check

Mira at Roo Robotics is choosing between App Service and Container Apps to host an inference endpoint that processes images from a Service Bus queue. Most of the day the queue is empty, but during warehouse shift changes it spikes to 800 messages per minute. Which is the better fit?

Knowledge Check

Theo at Tidewater Health is mapping the back-end architecture for the new clinical AI assistant. Which Azure service should hold the container images that ALL the other tiers (Container Apps, App Service, AKS) pull from?

Knowledge Check

Lin is deciding how to host a small RAG demo for a client β€” one Python container that exposes an HTTP endpoint, used 5–10 times per hour by the client's three pilot users. The client cares about cost above all. Which is the best fit?