Domain 2 β€” Module 9 of 10 90%
20 of 27 overall
Domain 2: Identify Benefits, Capabilities, and Opportunities for Microsoft AI Apps and Services Free ⏱ ~12 min read

Azure AI Services: Vision, Search & Beyond

Explore Azure AI Services β€” Vision, Speech, Language, and AI Search β€” and understand when to use these specialised capabilities alongside Copilot and Foundry.

Specialised AI for specific problems

Simple explanation

If Copilot is a general-purpose assistant and Foundry is a custom kitchen, Azure AI Services are specialist tools β€” a microscope, a translator, a scanner.

Azure AI Services are ready-made AI capabilities you can plug into your applications:

  • Vision β€” AI that understands images. It can read text from photos, detect objects, and analyse scenes.
  • Speech β€” AI that converts speech to text and text to speech. Real-time transcription, voice interfaces.
  • Language β€” AI that understands text. Sentiment analysis, key phrase extraction, entity recognition.
  • Translator β€” AI that translates text between 100+ languages. A separate service from Language.
  • AI Search β€” Enterprise search that finds the right information across millions of documents. Powers the RAG pattern for grounding AI in your data.

Service-by-service breakdown

Azure AI Vision

Vision gives AI the ability to understand images and video.

CapabilityWhat It DoesBusiness Example
Image analysisDescribes image content, detects objects, reads textProduct catalogue automation β€” AI tags product photos
OCRExtracts printed and handwritten text from imagesDigitise paper forms, scan receipts, read whiteboards
Object detectionIdentifies and locates specific objects in imagesRetail shelf monitoring β€” detect out-of-stock products
Face detectionDetects faces (not identification β€” privacy-safe)Count foot traffic in retail without identifying individuals
Custom VisionTrain custom image classifiersManufacturing quality control β€” detect defective products. Note: Microsoft is migrating away from Custom Vision β€” new projects should use Florence/Azure AI Vision model customisation in Foundry.

Azure AI Speech

Speech bridges the gap between human voice and digital text.

CapabilityWhat It DoesBusiness Example
Speech-to-text (STT)Converts spoken audio to textMeeting transcription, call centre analysis
Text-to-speech (TTS)Converts text to natural-sounding voiceVoice interfaces, accessibility, automated announcements
Speech translationReal-time translation of spoken languageMultilingual meetings without interpreters
Speaker recognitionIdentifies who is speakingCall centre authentication by voice

Azure AI Language

Language processes and understands text at scale.

CapabilityWhat It DoesBusiness Example
Sentiment analysisDetermines positive, negative, or neutral toneMonitor customer feedback across reviews and social media
Named entity recognitionIdentifies people, places, organisations, datesExtract key information from contracts and legal documents
Key phrase extractionPulls out the most important terms from textSummarise support tickets to identify trending issues
Text summarisationCondenses long text into key pointsAuto-summarise lengthy reports or articles

Azure AI Translator

Translator is a separate service from Azure AI Language, purpose-built for text translation.

CapabilityWhat It DoesBusiness Example
Text translationTranslates text between 100+ languagesMultilingual documentation, global communication
Document translationTranslates entire documents while preserving formattingLocalise contracts, manuals, and marketing materials
Custom TranslatorTrain custom translation models with your terminologyIndustry-specific translations (legal, medical, technical)

AI Search is the enterprise search engine that powers intelligent information retrieval.

CapabilityWhat It DoesBusiness Example
Full-text searchTraditional keyword search across indexed contentSearch across millions of documents
Vector searchSemantic search that understands meaning, not just keywords”Find policies about flexible working” finds results even if they don’t contain those exact words
Semantic rankingRe-ranks results by relevance using AIMost relevant document appears first, even in large result sets
RAG patternRetrieves relevant documents to ground AI responsesCopilot or Foundry app answers questions using YOUR documents as evidence
Multi-source indexingIndexes content from SharePoint, Blob Storage, SQL, and custom sourcesOne search experience across all your data
What is RAG and why does it matter?

RAG (Retrieval-Augmented Generation) is the pattern that connects AI models to your data:

  1. User asks a question
  2. AI Search retrieves the most relevant documents from your data
  3. Those documents are sent to the AI model as context
  4. The model generates an answer grounded in YOUR data, not just its training knowledge

RAG is how enterprise AI avoids hallucination β€” by grounding responses in real documents. Azure AI Search is the most common retrieval engine for RAG in the Microsoft ecosystem.

Question

What is Azure AI Search and what role does it play in the RAG pattern?

Click or press Enter to reveal answer

Answer

Azure AI Search is an enterprise search service that indexes content from multiple sources. In the RAG (Retrieval-Augmented Generation) pattern, AI Search retrieves the most relevant documents for a user's question, which are then sent to the AI model as context β€” grounding the response in real data instead of general knowledge.

Click to flip back

Question

Name the four main Azure AI Services categories.

Click or press Enter to reveal answer

Answer

1) Vision β€” image analysis, OCR, object detection. 2) Speech β€” speech-to-text, text-to-speech, translation. 3) Language β€” sentiment analysis, entity recognition, summarisation. 4) Translator β€” text translation across 100+ languages. 5) AI Search β€” enterprise search, vector search, RAG retrieval. Plus Document Intelligence for extracting data from forms and invoices.

Click to flip back

When to use Azure AI Services vs Copilot vs Foundry

NeedAzure AI ServicesCopilot for M365Microsoft Foundry
Image analysis and OCRYes β€” Vision APINoCan integrate Vision APIs
Speech transcriptionYes β€” Speech APITeams has built-in transcriptionCan integrate Speech APIs
Sentiment analysis at scaleYes β€” Language APINoCan integrate Language APIs
Enterprise document searchYes β€” AI SearchCopilot Chat uses Graph (not Search)Uses AI Search for RAG
Draft emails and documentsNoYes β€” built-inNo
Custom AI model trainingNo (pre-built APIs)NoYes β€” fine-tuning and custom models
Meeting summariesNoYes β€” Teams CopilotNo

The key insight: Azure AI Services, Copilot, and Foundry are complementary. A single solution might use all three:

  • Copilot for everyday productivity
  • AI Search for document retrieval (RAG)
  • Vision for image processing
  • Foundry for custom model orchestration
Question

How do Azure AI Services, Copilot, and Foundry work together?

Click or press Enter to reveal answer

Answer

They are complementary layers. Copilot handles everyday M365 productivity. Azure AI Services provide specialised capabilities (vision, speech, language, search). Foundry orchestrates custom AI applications that may use both AI Services and custom models. A single enterprise solution often uses all three.

Click to flip back

πŸ”„ PacificSteel uses Vision for quality inspection

TomΓ‘s, DT Lead at PacificSteel Manufacturing, is piloting AI-powered quality inspection on the production line.

The problem: Steel sheets are inspected manually. Inspectors check 500 sheets per shift and miss about 3% of defects. A missed defect costs $12,000 in rework and customer returns.

The solution: Azure AI Vision with custom image classification.

  1. Training: TomΓ‘s’s team photographs 10,000 steel sheets β€” 8,000 good, 2,000 with various defects (scratches, pitting, discolouration). They train a custom vision model in Azure AI Services.
  2. Deployment: Cameras on the production line capture images of every sheet. The Vision API classifies each sheet in real time.
  3. Integration: Defective sheets are automatically flagged and diverted. The system logs defect types and locations for trend analysis.

Results after 90 days (in this scenario):

  • Defect detection rate: improved from 97% to over 99%
  • Time per inspection: under 1 second (vs 45 seconds manual)
  • Estimated annual savings: in the range of $500,000–$1,000,000 in reduced rework (depending on defect rates and production volume)
Why this couldn't be done with Copilot

This is a classic example of where Azure AI Services are the right choice:

  • Real-time image classification is not something Copilot does
  • The solution needs to run on the production line, not inside Microsoft 365
  • It requires a custom-trained vision model specific to steel defects
  • The API-based architecture integrates with existing manufacturing systems

If the exam describes a specialised AI need (vision, speech, document processing) on a production or operational system, the answer is almost always Azure AI Services β€” not Copilot.

Question

A company needs real-time image classification on a production line. Which Microsoft AI service should they use?

Click or press Enter to reveal answer

Answer

Azure AI Vision (part of Azure AI Services). It provides image analysis, object detection, and custom image classification via APIs. This is a specialised AI capability that Copilot and Copilot Studio don't offer β€” they focus on productivity and chatbots, not real-time vision processing.

Click to flip back

Knowledge Check

Elena's consulting firm has a client β€” a legal firm that wants to automatically extract key information (parties, dates, amounts, clauses) from thousands of contract PDFs. Which Azure AI Service should Elena recommend?

Knowledge Check

Ravi wants to build an AI assistant for TechVantage that answers employee questions using information from SharePoint, Confluence, and an internal SQL database. Which Azure AI capability enables searching across all these sources?


Next up: Matching the Right AI Model to Your Business Need β€” learn how to choose between large models, small models, and open-source options.