Domain 1 β€” Module 7 of 8 88%
7 of 27 overall
Domain 1: Plan and Manage an Azure AI Solution Free ⏱ ~14 min read

Monitoring & Security

Your AI solution is only as good as its data pipeline and security posture. Learn how to monitor search index health, data ingestion quality, and lock down your AI infrastructure with managed identity, private endpoints, and RBAC.

Monitoring data and search quality

Simple explanation

Your AI is only as good as the data it searches. If the data pipeline breaks or the search index goes stale, your AI starts giving wrong answers β€” and nobody tells you.

Monitoring means watching two things: (1) Is new data flowing in correctly? (2) When users search, are they finding what they need? If either breaks, your RAG application starts hallucinating or giving irrelevant responses.

Data ingestion monitoring

What to MonitorWhyRed Flag
Indexer statusConfirms documents are being processedIndexer in β€œfailed” or β€œdegraded” state
Document countTracks how many documents are indexedCount plateaus when new docs should be flowing in
Parsing errorsCatches corrupt or unsupported filesError rate above 1-2%
Field completenessEnsures extracted metadata is populatedRequired fields (title, date) returning null
Embedding freshnessConfirms vectors match current modelEmbeddings generated with outdated model version

Search relevance monitoring

MetricWhat It MeasuresHow to Improve
Precision at KOf the top K results, how many are relevant?Tune ranking profiles, adjust chunking
RecallOf all relevant documents, how many were found?Add more search types (hybrid), broaden index
Mean Reciprocal RankHow high does the correct answer rank?Improve semantic ranker configuration
User satisfactionDo users rephrase and retry?Track query reformulations as a proxy for poor results
Exam tip: Stale index vs model drift

The exam may present declining quality and ask for the cause. Key distinction:

  • Stale index = data pipeline stopped, new documents aren’t indexed, answers are outdated
  • Model drift = model behaviour changed, but data is fine

Check the data pipeline first, then the model. In practice, stale indexes cause more quality issues than model drift.

Security fundamentals for AI

Four security pillars for AI infrastructure
FeatureSecurity FeatureWhat It Does
Managed identitySystem-assigned identityAzure resources authenticate to each other without storing credentials in code. No API keys needed.
Private endpointsPrivate networkingAI services communicate over Azure's private network backbone β€” never touching the public internet.
Keyless credentialsToken-based authApplications use Microsoft Entra ID tokens instead of API keys. Tokens expire automatically, keys don't.
RBAC (role policies)Role-Based Access ControlFine-grained permissions: who can deploy models, who can read data, who can manage agents.

Managed identity in practice

Managed identity is the number one security best practice for Azure AI:

Without Managed IdentityWith Managed Identity
Store API key in app config or Key VaultNo keys to store β€” Azure handles authentication
Rotate keys manuallyNo rotation needed β€” tokens are short-lived
Risk of key exposure in logs or codeNo secret to expose
Configure key per serviceOne identity, grant roles to each resource
Real-world example: Atlas Financial's security posture

Atlas Financial handles sensitive financial data. Their AI security setup:

  • Managed identity on all Foundry resources β€” zero API keys in code
  • Private endpoints for Foundry Project, AI Search, and Storage β€” no public internet exposure
  • RBAC roles:
    • Data scientists: β€œCognitive Services User” (can call models, can’t deploy)
    • AI engineers: β€œCognitive Services Contributor” (can deploy models)
    • Security team: β€œReader” + custom role for audit log access
  • VNet integration β€” all AI traffic stays within Atlas’s private network
  • Key Vault β€” only for third-party API keys (external services that don’t support managed identity)

RBAC roles for AI services

RoleWhat It AllowsWho Gets It
Cognitive Services UserCall deployed models and agentsApplication service principals, developers
Cognitive Services ContributorDeploy and manage modelsAI engineers, DevOps
Search Index Data ReaderQuery search indexesApplications, agents
Search Index Data ContributorRead and write search index dataIndexing pipelines
Search Service ContributorManage search service configurationInfrastructure admins

Key terms

Question

What is managed identity in Azure?

Click or press Enter to reveal answer

Answer

A system-assigned identity for Azure resources that enables passwordless authentication between services. Eliminates the need to store API keys or credentials in code. Resources authenticate using short-lived Microsoft Entra ID tokens.

Click to flip back

Question

What are private endpoints?

Click or press Enter to reveal answer

Answer

Network interfaces that connect Azure services over Microsoft's private backbone network instead of the public internet. Ensures AI traffic (model calls, search queries, data transfers) never leaves the private network.

Click to flip back

Question

What does 'keyless credentials' mean?

Click or press Enter to reveal answer

Answer

Using Microsoft Entra ID token-based authentication instead of API keys. Tokens expire automatically (no rotation needed) and can't be accidentally committed to code. Managed identity is the primary mechanism for keyless auth.

Click to flip back

Question

What is search index health?

Click or press Enter to reveal answer

Answer

The operational status of your Azure AI Search index β€” whether documents are being indexed correctly, embeddings are current, and the index is serving relevant results. Degraded index health directly impacts RAG and agent quality.

Click to flip back

Knowledge check

Knowledge Check

NeuralMed's RAG chatbot has been giving outdated drug interaction information, even though new research papers are being uploaded to storage daily. What should the team investigate first?

Knowledge Check

Kai's team stores an API key for the Foundry model deployment in their application's environment variables. The security team flags this as a risk. What's the recommended fix?

Knowledge Check

Which RBAC role should Atlas Financial assign to their AI application's service principal so it can call deployed models but NOT deploy or delete them?