AAISM · AI Security Architecture & Secure AI Lifecycle

Domain 3A & 3B — What the Exam Tests

Domain 3 carries the heaviest exam weight at 38% of the 90-question AAISM exam. Sub-area A covers AI Security Architecture and Design — how to apply security-by-design principles, Zero Trust, and defense-in-depth to every layer of an AI system. Sub-area B covers the Secure AI Lifecycle — selecting safe models, securing the training pipeline, validating model outputs, and hardening MLSecOps CI/CD. Mastery of these two sub-areas is essential to passing AAISM.

Security by Design Zero Trust AI STRIDE for ML Data Poisoning Adversarial Training Differential Privacy Federated Learning Model Red Teaming MLSecOps Model Registry Feature Store Security Prompt Injection

Core Concepts at a Glance

Six foundational pillars of AI security architecture and the secure AI lifecycle

🏗️

Security by Design for AI

Embedding security controls into every phase of AI system design — from data pipeline architecture through inference endpoints. Threat modeling (STRIDE applied to ML) occurs before the first line of code is written, not as an afterthought.

🔒

Zero Trust for AI Systems

Never implicitly trust model inputs, outputs, or inter-component communications. Microsegmentation isolates ML workloads; every request is authenticated and authorized. Model outputs are validated before downstream consumption.

🛡️

Defense in Depth for AI

Multiple overlapping control layers protect AI systems: input validation → model guardrails → output filtering → behavioral monitoring. No single control failure leads to full system compromise.

🧪

Secure Training Pipeline

Training data requires access controls, encryption at rest/in transit, and integrity verification. Data poisoning attacks are mitigated through provenance tracking, anomaly detection, and differential privacy techniques.

✅

Model Validation & Red Teaming

Before production: adversarial robustness evaluation, bias/fairness testing, model red teaming, and explainability checks (SHAP/LIME). Automated security gates in MLOps pipelines prevent insecure models from being promoted.

⚙️

MLSecOps Pipeline Security

Securing CI/CD for ML — model registry signing, secrets management for pipeline credentials, container image scanning, feature store access controls, and immutable audit logs for all training runs and model promotions.

AAISM Domain 3 Sub-Area Comparison

Dimension	3A: AI Security Architecture	3B: Secure AI Lifecycle
Focus	How AI systems are structured and protected architecturally	How AI models are selected, trained, validated, and deployed safely
Key threat	Adversarial inputs, prompt injection, inference endpoint attacks	Data poisoning, model theft, training-time backdoors
Primary controls	Zero Trust, microsegmentation, API gateways, input sanitization	Data provenance, differential privacy, red teaming, MLSecOps gates
Frameworks used	STRIDE, MITRE ATLAS, Zero Trust Architecture (NIST SP 800-207)	OWASP Top 10 for ML, NIST AI RMF, model cards, MLSecOps
Infrastructure scope	GPU clusters, MLOps platforms, network segmentation, API layer	Training environments, feature stores, model registries, CI/CD
Exam tip	Know WHERE controls are placed in the architecture diagram	Know WHEN controls apply in the ML lifecycle phases

AI System Components to Secure

Component	Primary Threats	Security Controls
Data Pipelines	Data poisoning, interception, unauthorized modification	Encryption in transit, integrity hashing, access controls, data provenance
Feature Stores	Feature poisoning, unauthorized read/write, stale features	RBAC, feature versioning, anomaly detection on feature distributions
Model Training Infra	Insider threats, supply chain attacks, compute hijacking	Isolated environments, least privilege, audit logging, GPU cluster hardening
Model Registry	Tampered model artifacts, unauthorized promotion	Artifact signing, version integrity checks, approval gates
Inference Endpoints	Prompt injection, model extraction, DoS, unauthorized access	API gateway, rate limiting, authentication, output filtering
Monitoring Systems	Log tampering, evasion, alert fatigue	Immutable logs, behavioral baselines, anomaly alerting

AI Security Architecture & Design

STRIDE Applied to Machine Learning Systems

The STRIDE threat model maps directly onto ML components. Applying it during design phase — before infrastructure is built — is the most cost-effective time to address vulnerabilities.

STRIDE Category	ML System Manifestation	Mitigation
Spoofing	Adversarial inputs that deceive classifiers	Adversarial training, input validation, confidence thresholds
Tampering	Data poisoning of training datasets	Data integrity checks, provenance tracking, anomaly detection
Repudiation	Inability to trace which data produced a model decision	Immutable audit logs, model cards, explainability tools
Information Disclosure	Model inversion attacks, membership inference	Differential privacy, output perturbation, access controls
Denial of Service	Flooding inference endpoints, expensive adversarial inputs	Rate limiting, input complexity bounds, auto-scaling limits
Elevation of Privilege	Prompt injection bypassing system prompt restrictions	System prompt hardening, output validation, sandboxing

Zero Trust Architecture for AI

Zero Trust principles applied to AI systems mean no implicit trust for any actor, component, or data flow — regardless of network location.

Verify explicitly: Every model API call is authenticated; service-to-service calls use mTLS or signed tokens
Use least privilege: ML engineers have minimal access to production models; training pipelines cannot directly promote to production
Assume breach: Model outputs are always validated before use in downstream systems; monitoring detects anomalous behavior
Microsegmentation: Training environments are network-isolated from inference environments and corporate networks
Never trust model inputs/outputs without validation: Input sanitization and output filtering are mandatory at boundaries

Defense in Depth for AI Systems

Multiple independent layers of control, so that failure of any single control does not result in a full compromise.

Layer 1 — Data Validation: Schema validation, range checks, anomaly detection on inputs before they reach the model
Layer 2 — Model Guardrails: Built-in refusal behaviors, topic classifiers, toxicity filters within model inference
Layer 3 — Output Filtering: Post-generation content checks, PII detection, hallucination detection before returning to caller
Layer 4 — Behavioral Monitoring: Continuous drift detection, anomalous query pattern alerting, audit logging
Layer 5 — Incident Response: Automated model rollback triggers, circuit breakers on inference endpoints

Prompt Injection Defense (LLMs)

Input sanitization before reaching LLM context
System prompt separation from user input (privileged context)
Output validation to detect instruction leakage
Sandboxing LLM tool calls and code execution
Privilege levels in prompt context (system > user > data)
Monitoring for anomalous instruction patterns in user inputs

MLOps Platform Security (Kubeflow, MLflow, SageMaker)

RBAC on experiment tracking and model registry
Network policies isolating ML namespaces in Kubernetes
GPU cluster hardening: disable unused ports, patch drivers
Service account least privilege for pipeline workers
Secrets management (Vault, AWS Secrets Manager) — no hardcoded credentials
Air-gapped training for highly sensitive model workloads

Federated Learning Security Architecture

Federated learning trains models across distributed data sources without centralizing raw data — but introduces unique security challenges.

Aggregation server security: The central aggregation point is a high-value target; must be hardened and access-controlled
Gradient poisoning: Malicious participants can send manipulated gradient updates; mitigated by anomaly detection on updates and Byzantine-robust aggregation
Differential privacy integration: Noise added to local gradients before sharing to prevent membership inference from the aggregated model
Secure aggregation protocols: Cryptographic aggregation ensures the server learns only the aggregate, not individual updates

Secure Model Selection

Evaluating Third-Party and Open-Source Models

Evaluation Criterion	What to Check	Risk if Ignored
Model Card Review	Documented limitations, intended use cases, known biases, performance boundaries	Model deployed outside safe operating envelope
Weight Integrity	Hash verification of downloaded weights against published checksums	Malicious or tampered weights from repository
Provenance	Training data sources, data rights, GDPR/CCPA compliance of training data	Legal liability, privacy violations in outputs
Vendor Security Posture	SOC 2 Type II, pen-test reports, data handling policies, breach history	Data exfiltration through commercial API calls
Fine-Tuning Data Risk	What data leaves your environment when fine-tuning via third-party APIs	Proprietary data retained/used by vendor
Capability Assessment	Understanding attack surface of model capabilities (code generation, tool use)	Unexpected capabilities enabling exploitation

Secure Model Training

Data Poisoning Prevention

Data provenance tracking: Every training record traced to its origin; immutable lineage records
Anomaly detection: Statistical outlier detection on training batches flags suspicious samples
Data validation pipelines: Schema enforcement, range checks, deduplication before training ingestion
Clean-label attack detection: Clustering analysis to identify mislabeled poison samples
Multi-source verification: Cross-referencing data from independent sources to detect manipulation

Differential Privacy in Training

Adds calibrated noise (Gaussian or Laplace) to gradients during training
Provides mathematical privacy guarantee: ε-differential privacy
Prevents membership inference attacks on training data
Trade-off: privacy budget ε vs. model accuracy — lower ε = stronger privacy but more accuracy loss
Implemented in TensorFlow Privacy, Opacus (PyTorch)
Especially important for healthcare and financial training data

Adversarial Training for Robustness

Deliberately training models on adversarial examples — inputs crafted to cause misclassification — to improve resistance to real-world adversarial attacks.

FGSM (Fast Gradient Sign Method): Simple, fast adversarial example generation used to augment training data
PGD (Projected Gradient Descent): Stronger adversarial training — iterative attack used to generate harder examples
Trade-off: Adversarial training improves robustness but can reduce clean-data accuracy (accuracy-robustness trade-off)
Scope: Does not defend against all attack types — black-box adaptive attacks may still succeed
Certified defenses: Randomized smoothing provides provable robustness guarantees within a perturbation radius

Secure Training Environment Controls

Isolated compute: Training environments network-isolated from internet and production; egress filtering for data exfiltration prevention
Least privilege for ML engineers: Scientists access data via controlled notebooks; no direct database or model registry write access
Immutable audit logs: Every training run logs: dataset version, hyperparameters, environment hash, user ID, timestamps — write-once storage
Hyperparameter security: Tuning APIs (Optuna, Ray Tune) can leak information about training data through optimization trajectories — access-controlled
Container security: Base images scanned for vulnerabilities; runtime security (Falco) monitors for unexpected process execution in training containers

Model Validation & Security Testing

Model Red Teaming

Structured adversarial probing of AI models before production deployment — performed by a dedicated team attempting to find failures, harmful outputs, and security gaps.

Scope definition: Define what the model should never do (harmful content, PII exposure, instruction bypass)
Automated fuzzing: Systematic variation of inputs to find edge cases and failure modes
Prompt injection testing: Structured attempts to override system prompts, extract instructions, or enable disallowed behaviors
Jailbreak taxonomy: Testing known jailbreak categories (roleplay, hypothetical, encoding tricks, language switching)
Model extraction probing: Testing whether sufficient queries can reconstruct model behavior (IP theft risk)
Documented findings: Red team results feed into go/no-go deployment decision and residual risk acceptance

Bias & Fairness Testing Metrics

Disparate Impact: Ratio of favorable outcome rates across demographic groups (≥0.8 = 80% rule threshold)
Equalized Odds: Equal true positive and false positive rates across groups
Demographic Parity: Equal positive prediction rates regardless of protected attribute
Individual Fairness: Similar individuals receive similar predictions
Tools: IBM AI Fairness 360, Google What-If Tool, Fairlearn

Explainability for Security Review

SHAP: Game theory–based feature attribution; explains any model's predictions
LIME: Local surrogate models explain individual predictions
Attention maps: Visualize which tokens influence transformer model outputs
Security use: verify model is using legitimate features, not exploitable shortcuts (Clever Hans effect)
Required for high-stakes decisions (credit, medical, legal)

MLOps Security Gates & Staging Deployments

Automated security gates: CI/CD checks that a model must pass before promotion — adversarial robustness score, fairness metrics, red team clearance, model card completeness
Canary deployments: Route a small traffic percentage (e.g., 5%) to new model version; monitor for anomalies before full rollout
Blue/green deployments: Maintain previous model version for rapid rollback if security issue detected in production
Regression testing for security: Test suite verifies that model updates don't re-introduce previously patched vulnerabilities
Human-in-loop approval: High-risk models require explicit security officer sign-off before production promotion

MLSecOps Pipeline Security

Securing ML CI/CD Pipelines

Pipeline Stage	Security Control	Tools / Examples
Data Ingestion	Data provenance, integrity hashing, access logging	Great Expectations, dbt, AWS Glue Data Catalog
Feature Engineering	Feature store RBAC, feature versioning, drift detection	Feast, Tecton, Vertex AI Feature Store
Model Training	Isolated compute, secrets management, audit logging	HashiCorp Vault, AWS Secrets Manager, Weights & Biases
Model Registry	Artifact signing, version integrity, approval workflow	MLflow Model Registry, SageMaker Model Registry
Deployment	Container scanning, IaC security, canary rollout	Trivy, Checkov, Kubernetes admission controllers
Monitoring	Immutable logs, drift alerting, anomaly detection	Evidently AI, Arize, Fiddler AI, CloudWatch

Model Artifact Signing & Registry Security

Cryptographic signing of model artifacts (weights, configs) at training completion
Signature verification before any deployment — prevents tampered model promotion
Version pinning: deployments reference specific signed artifact versions, not "latest"
Access control: separate write (training pipelines) from read (inference) permissions
Approval workflows: model promotion requires human reviewer plus automated gate passage

IaC Security for ML Platforms

Terraform and Kubernetes manifests for ML infrastructure treated as code — version-controlled and reviewed
Static analysis: Checkov, tfsec scan for misconfigurations (open ports, public buckets, overly permissive IAM)
Admission controllers: OPA/Gatekeeper enforces security policies on Kubernetes workloads
GPU workload isolation: node taints and tolerations prevent non-ML workloads from co-locating with training jobs

Memory Hooks

Six mnemonics to lock in the hardest AAISM Domain 3 concepts for exam day

🏗️

STRIDE for ML — Threat Modeling Hook

"STRIDE Hits Every ML System"

Spoofing → adversarial inputs fool classifiers.
Tampering → data poisoning corrupts training sets.
Repudiation → no audit trail of model decisions.
Information Disclosure → model inversion reveals training data.
Denial of Service → expensive inputs overwhelm endpoints.
Elevation of Privilege → prompt injection bypasses restrictions.
Apply STRIDE before building — not after.

🔒

Zero Trust for AI — "VALVE" Framework

"VALVE shuts off implicit trust"

Verify every request (auth on all model API calls).
Assume breach (monitor model outputs always).
Least privilege (ML engineers can't touch prod).
Validate inputs and outputs before use.
Enforce microsegmentation (training ≠ inference network).
Remember: in Zero Trust AI, even the model's own outputs are untrusted until validated.

☠️

Data Poisoning Defenses — "PADV"

"Poisoned data? PADV to the rescue"

Provenance tracking — know every record's origin.
Anomaly detection on training batches.
Data validation pipelines (schema + range checks).
Verification across multiple independent data sources.
Data poisoning is a training-time attack; defenses must be embedded in the pipeline — not bolted on after training.

🔬

Differential Privacy — The ε Trade-off

"Epsilon down = privacy up = accuracy down"

Differential privacy adds calibrated noise to model gradients. The privacy budget ε (epsilon) controls the trade-off: a smaller ε means stronger privacy guarantee but greater accuracy loss. Remember the direction: lower ε → stronger privacy → lower accuracy. Exam questions often test whether you know this trade-off exists and which direction it runs.

🎯

Model Red Teaming — "SAFJD" Gate

"SAFJD: Safe After Five Jailbreak Domains"

Before deploying: test all five red team domains —
Scope violations (does it do what it shouldn't?).
Adversarial inputs (does it misclassify under attack?).
Fairness gaps (disparate impact across groups?).
Jailbreak resistance (prompt injection attempts?).
Data leakage (does it expose training data?).
Red teaming is a structured pre-deployment activity, not ad-hoc testing.

⚙️

MLSecOps Pipeline — "DRIFT" Controls

"Secure pipelines DRIFT to production safely"

Data provenance and integrity at ingestion.
Registry signing — cryptographically sign model artifacts.
Isolated compute for training (least privilege, secrets management).
Fuzzing and security gates before promotion.
Traceability — immutable audit logs for every pipeline stage.
MLSecOps applies DevSecOps principles to the ML lifecycle — security is a pipeline concern, not just a deployment concern.

Practice Quiz

10 exam-style questions · AAISM Domain 3A & 3B

Score: 0 / 0

Question 1 of 10

A security architect is designing a new AI-powered fraud detection system. According to AAISM best practices, at which phase should STRIDE-based threat modeling be applied to the ML system?

Correct — D. Security by design mandates threat modeling (including STRIDE) during the design phase, before infrastructure is provisioned. This is when changes are cheapest and most effective. Post-deployment threat modeling misses the most impactful intervention window.

Question 2 of 10

An organization is applying Zero Trust principles to its AI inference infrastructure. Which of the following BEST represents a Zero Trust control specifically applied to model outputs?

Correct — B. Zero Trust for AI includes never implicitly trusting model outputs. Output validation and filtering — checking for harmful content, PII, or policy violations before returning results — is the Zero Trust control applied specifically at the model output boundary. The other options are valid security controls but do not specifically address output trust.

Question 3 of 10

An ML engineer notices that a production image classifier began misclassifying stop signs as speed limit signs after a recent training data update. Which training-time attack does this MOST likely represent?

Correct — C. The described scenario — targeted misclassification of a specific class after a training data update — is the classic signature of a data poisoning attack, potentially with a backdoor trigger embedded in the new training data. Model inversion (A) and membership inference (B) are inference-time privacy attacks. Adversarial examples (D) are crafted at inference time, not through training data modification.

Question 4 of 10

A healthcare organization wants to train a diagnostic AI model on patient records while ensuring that the model cannot be used to determine whether any specific individual's data was in the training set. Which privacy technique BEST addresses this requirement?

Correct — D. Differential privacy provides a mathematical guarantee (defined by the privacy budget ε) that the model cannot be used to determine membership in the training set — directly addressing membership inference risk. k-anonymity (A) protects the dataset, not the trained model. RBAC (B) and encryption at rest (C) are access controls that don't protect against membership inference from the model itself.

Question 5 of 10

An organization reviews a pre-trained open-source language model for production deployment. The security team finds no model card documentation. What is the PRIMARY security risk of deploying this model without that documentation?

Correct — B. Model cards document known limitations, biases, failure modes, and intended use cases. Without this documentation, a security team cannot assess whether the model is being deployed within its safe operating envelope. Deploying a model without understanding its limitations risks harmful or biased outputs in production. The other options are not accurate consequences of missing model card documentation.

Question 6 of 10

A red team is testing an LLM-based customer service assistant. They successfully craft an input that causes the assistant to ignore its system prompt and reveal internal instructions. Which attack category does this represent, and what is the PRIMARY architectural defense?

Correct — C. Crafting inputs to override system prompt instructions is a prompt injection attack — classified under Elevation of Privilege in STRIDE applied to LLMs. The architectural defenses are input sanitization before the LLM context, privileged separation of system prompts from user input, and output validation to detect when system prompt content leaks. Rate limiting (A) addresses DoS, not prompt injection specifically.

Question 7 of 10

In an MLSecOps pipeline, a newly trained model must be cryptographically signed before being stored in the model registry. What security objective does model artifact signing PRIMARILY address?

Correct — D. Model artifact signing provides integrity assurance: any tampering with the model weights after signing will invalidate the cryptographic signature, preventing a tampered model from being deployed. This is the primary objective of artifact signing in MLSecOps. Confidentiality (A) requires encryption, not signing. Signing does not address availability (B) or engineer authentication (C) directly.

Question 8 of 10

An AAISM security manager reviews a bias and fairness report for a loan-approval AI model. The report shows that the model's approval rate for Group A is 60% and for Group B is 45%. Applying the 80% Rule (disparate impact threshold), what is the assessment?

Correct — B. The disparate impact ratio = Group B rate / Group A rate = 45% / 60% = 0.75. The 80% Rule (four-fifths rule) flags potential discrimination when this ratio falls below 0.80. Since 0.75 < 0.80, disparate impact is indicated. This is a quantitative metric the AAISM exam expects candidates to be able to calculate and interpret.

Question 9 of 10

An organization uses federated learning to train a fraud detection model across 50 regional bank branches without centralizing customer transaction data. A security review identifies that a malicious branch participant could manipulate gradient updates to degrade model performance. Which federated learning security control BEST mitigates this threat?

Correct — C. Gradient poisoning from malicious federated participants (Byzantine participants) is mitigated by Byzantine-robust aggregation algorithms (e.g., coordinate-wise median, Krum) that reduce the influence of outlier gradient updates, combined with anomaly detection to flag suspicious participants. Encryption at rest (A) and framework standardization (B) don't address gradient manipulation. Differential privacy (D) addresses membership privacy, not Byzantine robustness.

Question 10 of 10

An organization's MLOps pipeline automatically promotes any model that achieves an accuracy improvement over the current production model. A security manager recommends adding security gates before promotion. Which of the following represents a comprehensive set of security gates appropriate for a high-risk AI deployment?

Correct — D. A comprehensive MLSecOps security gate for a high-risk AI deployment must include: adversarial robustness testing, fairness/bias metrics within acceptable thresholds, red team clearance, complete model card documentation for auditability, and artifact signature verification to ensure integrity. Options A–C represent performance or operational checks, not security gates appropriate for AAISM-level risk management of high-risk AI systems.

Flashcards

Click any card to flip it and reveal the answer. 8 cards covering core AAISM Domain 3 concepts.

AI Security Architecture

What does the "E" in the STRIDE threat model stand for, and how does it manifest in an LLM-based AI system?

Elevation of Privilege. In LLMs, this manifests as prompt injection — crafting user inputs that override system prompt restrictions and grant the attacker capabilities beyond their intended privilege level (e.g., bypassing content filters, revealing internal instructions, or enabling tool calls the user shouldn't have access to).

Click to flip back

Zero Trust AI

How does Zero Trust architecture differ from traditional perimeter-based security when applied to AI inference endpoints?

Zero Trust assumes no implicit trust based on network location. Every inference request is authenticated and authorized regardless of origin (internal or external). Model outputs are always validated before downstream use. Microsegmentation isolates inference from training environments. Compare to perimeter-based: once inside the network, requests were trusted — creating risk from insider threats and lateral movement.

Click to flip back

Secure Training

What is differential privacy in ML training, and what does the privacy budget ε control?

Differential privacy adds calibrated mathematical noise (Gaussian or Laplace) to model gradients during training. The privacy budget ε (epsilon) controls the trade-off: smaller ε = stronger privacy guarantee = greater accuracy loss. It prevents membership inference attacks — an attacker cannot determine whether any specific individual's record was in the training set.

Click to flip back

Data Poisoning

What are the three primary defenses against data poisoning attacks in ML training pipelines?

1. Data provenance tracking — immutable lineage records linking every training sample to its origin source.
2. Anomaly detection on training batches — statistical analysis to flag outlier or mislabeled samples before ingestion.
3. Multi-source verification — cross-referencing data from independent sources to detect manipulation in any single source.

Click to flip back

Model Validation

A model's approval rate for Group A is 70% and Group B is 49%. Does this model pass the 80% Rule (disparate impact test)?

No — it fails. Disparate impact ratio = 49% ÷ 70% = 0.70, which is below the 0.80 threshold (four-fifths rule). This indicates potential discriminatory impact on Group B. The model would require fairness remediation (resampling, reweighting, or post-processing threshold adjustment) before deployment in high-stakes contexts.

Click to flip back

Model Red Teaming

How does model red teaming differ from standard security penetration testing, and when in the AI lifecycle must it occur?

Model red teaming is structured adversarial probing specifically targeting AI failure modes: harmful outputs, prompt injection, jailbreaks, fairness violations, and data leakage — not just traditional vulnerability exploits. It occurs before production deployment as a gate in the MLSecOps pipeline. Unlike pentesting, red team findings feed into go/no-go deployment decisions and model card documentation.

Click to flip back

MLSecOps

What is model artifact signing in MLSecOps, and which security objective does it primarily serve?

Model artifact signing is the cryptographic signing of model weights and configuration files upon completion of training. Before any deployment, the signature is verified — any post-training tampering invalidates the signature and blocks deployment. It primarily serves integrity — ensuring the model that was trained and validated is identical to the model being deployed.

Click to flip back

Federated Learning Security

Why is Byzantine-robust aggregation necessary in federated learning, and what problem does it solve?

Byzantine-robust aggregation protects the central aggregation server from malicious participants (Byzantine nodes) that send manipulated gradient updates designed to degrade model performance or embed backdoors. Algorithms like coordinate-wise median or Krum reduce the influence of outlier updates — unlike simple averaging, which is vulnerable to even a single malicious participant skewing the aggregated gradient significantly.

Click to flip back

Study Advisor

Select a topic area for targeted exam preparation tips

HIGH PRIORITY · ~12–14 exam questions

🏗️ AI Security Architecture Study Tips

Know STRIDE's six categories cold and be able to map each to a concrete ML attack vector — expect scenario questions that describe an attack and ask which STRIDE category it falls under
Understand Zero Trust's three core principles (verify explicitly, least privilege, assume breach) and how each applies differently to AI systems versus traditional IT
Memorize the five layers of defense in depth for AI: input validation → model guardrails → output filtering → monitoring → incident response — questions test layer ordering and function
Know the difference between prompt injection defense mechanisms: input sanitization (before the model), system prompt privilege separation (within the model context), and output validation (after the model)
For LLM architectures, understand why the "Elevation of Privilege" STRIDE category is the most commonly tested — it maps directly to prompt injection
Know what microsegmentation means for ML workloads: training environments must be network-isolated from inference environments and corporate networks
Understand federated learning's unique security challenge: the aggregation server is the high-value target, and gradient poisoning is the primary training-time attack
Be prepared for architecture diagram questions that ask WHERE to place a specific security control (input validation layer, API gateway, output filter)

MEDIUM PRIORITY · ~4–6 exam questions

🎯 Secure Model Selection Study Tips

Model cards are a key AAISM concept: know that they document limitations, biases, intended use cases, and performance boundaries — deploying without reviewing them is a significant risk
Open-source model risk centers on weight integrity: downloaded weights should be hash-verified against published checksums before use in any pipeline
For commercial model APIs, understand the fine-tuning data risk: when fine-tuning on proprietary data via a third-party API, data may be retained and used by the vendor — this requires contractual and technical controls
Vendor security posture evaluation should include SOC 2 Type II attestation, data handling policies, and breach history — not just technical capability evaluation
Capability assessment is a security activity: understanding what a model CAN do (code generation, tool calling, file access) defines the attack surface that must be controlled
Remember: model selection is the first lifecycle phase where security decisions are made — wrong choices here compound throughout the lifecycle

HIGH PRIORITY · ~10–12 exam questions

🔬 Secure Training Study Tips

Data poisoning is the most commonly tested training-time attack — know the three defenses: provenance tracking, anomaly detection, multi-source verification
Differential privacy: know that ε (epsilon) is the privacy budget, smaller ε = stronger privacy but lower accuracy — expect calculation or interpretation questions
Adversarial training improves robustness by training on adversarial examples (FGSM, PGD) but introduces an accuracy-robustness trade-off — expect questions testing whether you know this trade-off exists
Immutable training audit logs must record: dataset version, hyperparameters, environment hash, user ID, timestamps — "immutable" is the key adjective for exam purposes
Federated learning enables training without centralizing data but introduces gradient poisoning as the primary threat — Byzantine-robust aggregation is the mitigation
Hyperparameter tuning APIs can leak information about training data — this is a subtle point that appears in advanced AAISM questions
Secure training environments require isolated compute + least privilege for ML engineers + audit logging — all three are needed; questions may ask which is missing
Know TensorFlow Privacy and Opacus as the primary differential privacy training frameworks

HIGH PRIORITY · ~8–10 exam questions

✅ Model Validation Study Tips

Model red teaming is a structured, pre-deployment adversarial activity — not ad hoc testing. It produces documented findings that feed into go/no-go decisions
Know the five red team test domains: scope violations, adversarial inputs, fairness gaps, jailbreak resistance, data leakage
Disparate impact (80% Rule / four-fifths rule): ratio = minority group rate ÷ majority group rate; if <0.80, disparate impact is flagged — be ready to calculate this on the exam
Know the three fairness metrics: Disparate Impact (overall rates), Equalized Odds (equal TPR and FPR), Demographic Parity (equal positive prediction rates)
Explainability tools for security review: SHAP (global and local, any model), LIME (local surrogates), Attention Maps (transformer-specific) — know which tool applies to which use case
Canary deployments route a small traffic percentage to a new model version; blue/green deployments maintain the previous version for rollback — know both patterns
Security gates in MLOps: adversarial robustness score + fairness metrics + red team clearance + model card completeness + artifact signature verification — questions may ask which gate addresses which risk
Regression testing for security ensures model updates don't re-introduce previously patched vulnerabilities — often overlooked but exam-tested

MEDIUM-HIGH PRIORITY · ~6–8 exam questions

⚙️ MLSecOps Study Tips

Model artifact signing primarily addresses integrity (not confidentiality) — know the distinction; signing ≠ encryption
Feature store security: RBAC prevents unauthorized write access; feature versioning enables rollback; anomaly detection on feature distributions catches feature poisoning
Secrets management in ML pipelines: API keys and credentials must be stored in dedicated secrets managers (HashiCorp Vault, AWS Secrets Manager) — never hardcoded in pipeline configs or container images
IaC security tools: Checkov and tfsec for Terraform; OPA/Gatekeeper for Kubernetes admission control — know what they scan for (misconfigurations, not vulnerabilities)
Container security for ML: image scanning (Trivy) at build time; runtime security (Falco) monitors for unexpected process execution during training runs
Model registry access control: training pipelines have write access; inference environments have read-only access — separation of concerns prevents unauthorized promotion
Immutable audit logs are write-once records covering the full pipeline: dataset version → training parameters → model version → deployment target — all stages must be logged
Know that MLSecOps applies DevSecOps principles to ML — "shift left" security into data ingestion and training, not just deployment

Key Resources

Authoritative references for AAISM Domain 3 exam preparation

Official Certification Body

ISACA AAISM Exam Resources

Official exam content outline, candidate guide, and AAISM study resources from ISACA — the authoritative source for domain weightings and objectives.

isaca.org/credentialing/aaism →

NIST Framework

NIST AI Risk Management Framework (AI RMF 1.0)

The NIST AI RMF provides structured guidance on identifying, assessing, and managing AI risks — directly referenced in AAISM domain content for governance and controls.

airc.nist.gov/RMF →

Threat Framework

MITRE ATLAS — Adversarial Threat Landscape for AI Systems

MITRE ATLAS catalogs adversarial ML attack techniques (data poisoning, model evasion, model extraction) — essential for AAISM security architecture questions.

atlas.mitre.org →

Security Standard

NIST SP 800-207: Zero Trust Architecture

The foundational Zero Trust Architecture standard — defines principles and deployment models that the AAISM exam applies to AI system design contexts.

csrc.nist.gov →

ML Security Standard

OWASP Machine Learning Security Top 10

OWASP's top ML security risks including data poisoning, model theft, and adversarial examples — a practical complement to AAISM's theoretical framework with real-world attack context.

owasp.org →

Practice Exams

FlashGenius AAISM Practice Tests

Full-length AAISM practice exams with 90 questions across all five domains, detailed answer explanations, and domain-level performance analytics to guide focused study.

flashgenius.net/register →

Domain 3A & 3B — What the Exam Tests

Core Concepts at a Glance

Security by Design for AI

Zero Trust for AI Systems

Defense in Depth for AI

Secure Training Pipeline

Model Validation & Red Teaming

MLSecOps Pipeline Security

AAISM Domain 3 Sub-Area Comparison

AI System Components to Secure

AI Security Architecture & Design

STRIDE Applied to Machine Learning Systems

Zero Trust Architecture for AI

Defense in Depth for AI Systems

Prompt Injection Defense (LLMs)

MLOps Platform Security (Kubeflow, MLflow, SageMaker)

Federated Learning Security Architecture

Secure Model Selection

Evaluating Third-Party and Open-Source Models

Secure Model Training

Data Poisoning Prevention

Differential Privacy in Training

Adversarial Training for Robustness

Secure Training Environment Controls

Model Validation & Security Testing

Model Red Teaming

Bias & Fairness Testing Metrics

Explainability for Security Review

MLOps Security Gates & Staging Deployments

MLSecOps Pipeline Security

Securing ML CI/CD Pipelines

Model Artifact Signing & Registry Security

IaC Security for ML Platforms

Memory Hooks

STRIDE for ML — Threat Modeling Hook

Zero Trust for AI — "VALVE" Framework

Data Poisoning Defenses — "PADV"

Differential Privacy — The ε Trade-off

Model Red Teaming — "SAFJD" Gate

MLSecOps Pipeline — "DRIFT" Controls

Practice Quiz

Flashcards

Study Advisor

🏗️ AI Security Architecture Study Tips

🎯 Secure Model Selection Study Tips

🔬 Secure Training Study Tips

✅ Model Validation Study Tips

⚙️ MLSecOps Study Tips

Key Resources

ISACA AAISM Exam Resources

NIST AI Risk Management Framework (AI RMF 1.0)

MITRE ATLAS — Adversarial Threat Landscape for AI Systems

NIST SP 800-207: Zero Trust Architecture

OWASP Machine Learning Security Top 10

FlashGenius AAISM Practice Tests

Ready to Pass the AAISM Exam?