Domain 4 of 5 — 10–15% of Exam
Azure AI Language • Azure AI Speech • Translator • Content Safety
Get Full Access on FlashGenius →Domain 4 covers how to implement text analysis and speech processing solutions using Azure AI services. This domain represents 10–15% of the AI-103 exam — roughly 7–11 questions. You'll need to know when and how to apply Azure AI Language, Azure AI Speech, Azure Translator, and Content Safety.
| Domain | Topic | Weight |
|---|---|---|
| Domain 1 | Design and plan AI solutions on Azure | 10–15% |
| Domain 2 | Implement computer vision solutions | 10–15% |
| Domain 3 | Implement natural language processing solutions | 25–30% |
| Domain 4 | Implement text analysis solutions ← This Page | 10–15% |
| Domain 5 | Implement generative AI solutions | 30–35% |
Master the Azure AI services you need for Domain 4. Each section covers the key features, decision points, and exam-relevant details.
Prebuilt NER categories: Person, Location, Organization, DateTime, Quantity, URL, IP Address, Email
Custom NER: Train the model with your own labeled entities. Evaluation uses precision (of what the model predicted, how much was correct), recall (of all actual entities, how many were found), and F1 score (harmonic mean of precision and recall).
Identifies the main topics or talking points from unstructured text. Returns a list of phrases that represent the key ideas. No training required — fully prebuilt.
Document-level sentiment: Classifies the entire text as positive, neutral, or negative (with confidence scores for each).
Opinion Mining (aspect-based sentiment): Goes deeper — identifies specific aspects (e.g., "coffee", "service") and the sentiment expressed toward each. Example: "The coffee was great but the service was slow" → coffee=positive, service=negative.
Disambiguates recognized entities to known entries in a knowledge base (Wikipedia). Example: "Mercury" in an astronomy context links to the planet, not the element or the god. Returns a data source (Wikipedia URL) and confidence score.
Identifies the language of input text and returns a confidence score (0.0–1.0). Handles mixed-language content by returning the dominant language. Use when source language is unknown before translation.
Categories detected: Name, Phone number, Social Security Number (SSN), Credit card number, Email address, IP address, date of birth, passport number, and more.
Redaction: The API can return text with PII replaced by category labels (e.g., "[PERSON]") for safe downstream processing. Separate endpoint for PHI (Protected Health Information).
Single-label classification: Each document gets exactly one category. Used for simple categorization (e.g., "sports", "politics", "tech").
Multi-label classification: A document can belong to multiple categories simultaneously. Used when content spans multiple topics.
Both require custom training with labeled examples in Language Studio.
Extractive summarization: Selects and returns the most important existing sentences from the source document. Output sentences are verbatim from input.
Abstractive summarization: Generates a new summary in the model's own words. May not use exact source sentences.
Conversation summarization: Summarizes multi-turn dialogue — returns issue, resolution, and chapter structure.
Build FAQ-style knowledge bases from documents, URLs, or manually entered Q&A pairs. Returns answers with a confidence score (0–1). Supports follow-up prompts for multi-turn conversations. Hosted in Azure AI Language (replaces QnA Maker).
The modern replacement for LUIS. Understands natural language input by predicting intents (what the user wants) and extracting entities (key data). Training utterances teach the model variations of each intent.
| Factor | Azure AI Language | GPT-4o (Azure OpenAI) |
|---|---|---|
| Output structure | Consistent, typed JSON | Structured outputs via JSON Schema |
| Latency | Lower | Higher |
| Cost | Lower | Higher (token-based) |
| Training data needed | More labeled examples | Few-shot or zero-shot |
| Complex reasoning | Limited | Chain-of-thought capable |
| Compliance/audit | Easier (deterministic) | More variable outputs |
Use Language service for: PII detection at scale, structured NER, compliance scenarios, cost-sensitive applications.
Use GPT-4o for: nuanced sentiment with reasoning, complex entity extraction with context, when you have few labeled examples.
<speak> and use tags like <prosody rate="slow">, <break time="500ms"/>, <emphasis level="strong">.| Task | Recommended Service / Feature |
|---|---|
| Extract named entities from text | Azure AI Language — NER |
| Detect sentiment + specific aspect opinions | Azure AI Language — Sentiment + Opinion Mining |
| Transcribe meeting audio with speaker labels | Speech Service — Batch Transcription + Diarization |
| Translate 10,000 documents to French | Azure Translator — Document Translation (async batch) |
| Build FAQ chatbot from existing docs | Azure AI Language — Question Answering |
| Detect PII in medical records | Azure AI Language — PII Detection (PHI endpoint) |
| Build voice assistant with intent understanding | Speech SDK (STT) + CLU + TTS |
| Complex nuanced text reasoning | Azure OpenAI — GPT-4o |
| Detect harmful / unsafe text | Azure Content Safety |
| Convert Arabic script to Latin characters | Azure Translator — Transliteration |
| Detect jailbreak in user chat message | Content Safety — Prompt Shield (user) |
| Summarize using exact source sentences | Azure AI Language — Extractive Summarization |
| Generate fluent paraphrased summary | Azure AI Language — Abstractive Summarization |
| Understand spoken commands ("book me a flight") | Speech SDK Intent Recognition (STT + CLU) |
| Translate technical documents with jargon | Azure Translator — Custom Translator |
High-impact mnemonics and mental models to anchor exam concepts. These are the patterns that stick when exam pressure is high.
10 scenario-based questions covering the key decision points in Domain 4. Select the best answer for each question.
20 cards covering essential Domain 4 concepts. Click any card to flip and reveal the answer.
Personalized focus recommendations based on what you're building. Match your use case to the services that matter most for your scenario.
Direct links to Microsoft Learn documentation and the official AI-103 certification page. All resources are free to access.
Microsoft's official exam objectives and topic breakdown for AI-103.
Exam registration, prerequisites, and skills measured overview.
NER, Sentiment, PII, CLU, QnA, Summarization — complete service reference.
STT, TTS, SSML, Custom Speech, Diarization, Speech Translation quickstarts.
Text translation, Document Translation, Custom Translator, transliteration API reference.
Harm categories, Prompt Shield, Groundedness, Protected Material detection.
Get all 5 AI-103 domains with adaptive flashcards, full-length practice exams, and progress tracking on FlashGenius.
Start Free on FlashGenius →