SecAI+ Domain 2: Securing AI Systems

Securing AI Systems
The biggest domain — 40%

Threat-modeling frameworks, model & gateway controls, access controls, data security, monitoring & auditing, and the full catalog of AI-specific attacks with their compensating controls.

40%

Exam Weight

Key Concept Areas

Flashcards

Quiz Questions

What Domain 2 Covers

Domain 2 is the heart of SecAI+ at 40% of the exam. It covers how to design and enforce controls that protect AI models, data, agents, and integrations — and the full range of AI-specific attacks you're expected to recognize and counter.

Objective A

Implement Security Controls

Protect AI systems, data, and models using technical safeguards — model controls, gateway controls, and access controls.

Objective B

Secure Deployment Environments

Apply best practices across on-premises, cloud, and hybrid AI infrastructure, with monitoring and auditing built in.

Objective C

Mitigate Adversarial Risks

Defend against attacks targeting AI models, data pipelines, and inference layers using compensating controls.

MITRE ATLAS OWASP LLM Top 10 Guardrails & Prompt Firewalls Least Privilege Encryption in Use Monitoring & Auditing Prompt Injection Model Inversion Excessive Agency

💡

How to use this page: Because this domain is worth 40%, give it the most study time. Work through all six Key Concepts cards — especially the attacks & compensating controls table — drill the flashcards, then take the 10-question Knowledge Check.

🔗

Connect the dots: Every attack in this domain has at least one compensating control. The exam loves "given this attack, which control stops it?" questions — study them as pairs, not separate lists.

Key Concept Areas

Click each card to expand the explanation and study tip.

1. Threat-Modeling Resources & Frameworks ▾

OWASP Top 10 for LLM Applications — catalogs the most critical risks specific to LLM-based apps (prompt injection, training data poisoning, supply chain, etc.).
OWASP Machine Learning Security Top 10 — a broader ML risk catalog covering the entire pipeline (data, model, infrastructure).
MIT AI Risk Repository — a comprehensive, categorized database of AI risks drawn from academic and industry sources.
MITRE ATLAS — an adversarial threat-modeling framework (modeled on MITRE ATT&CK) cataloging real-world tactics and techniques used against AI systems.
CVE AI Working Group — extends standard vulnerability tracking and disclosure practices to AI-specific vulnerabilities.

💡

Study tip: Know which framework to reach for: ATLAS = adversary tactics/techniques, OWASP LLM/ML Top 10 = app/pipeline risk catalogs, MIT AI Risk Repository = broad risk taxonomy, CVE AI WG = vulnerability disclosure.

2. Model & Gateway Security Controls ▾

Model Controls

Evaluation — testing model behavior/outputs before and after deployment.
Guardrails — rules that constrain model inputs/outputs.
Prompt templates — constrain and standardize input structure.

Gateway Controls

Prompt firewalls — inspect and filter prompts before they reach the model.
Rate & token limits — cap usage to prevent abuse and cost overruns.
Input quotas by size/quantity and modality limits (restrict input/output types — text/image/audio).
Endpoint access controls — restrict which systems/users can reach the model API.

Validation

Guardrail testing & validation — continuously test guardrails against known jailbreak/bypass techniques.

💡

Study tip: Think "defense in depth" — model-level controls + gateway-level controls + ongoing validation, layered together.

3. Access Controls for AI Systems ▾

Apply least privilege across four surfaces:

Models — who can query, fine-tune, or modify a model.
Data — who can access training and inference data.
Agents — what actions an AI agent is authorized to take.
APIs & networks — network segmentation and endpoint authentication.

Agents need particularly tight, scoped permissions — an over-permissioned agent is the root cause of "excessive agency" (see card 6).

💡

Study tip: For any access-control question, ask: "is this the minimum access needed for this specific function?"

4. Data Security Controls ▾

Encryption in transit — protects data moving between systems.
Encryption at rest — protects stored data.
Encryption in use — protects data while it's being actively processed; the hardest to achieve (relates to confidential computing).
Anonymization — removing identifying information so individuals can't be re-identified.
Labels / classification — tagging data by sensitivity (public, internal, confidential) to drive handling rules.
Redaction / masking — hiding or replacing sensitive values (e.g., showing only the last 4 digits of an account number).
Minimization — collecting and retaining only the data actually needed.

💡

Study tip: Map each control to a data lifecycle stage — collection (minimization), storage (encryption at rest, classification), processing (encryption in use), output (redaction/masking).

5. Monitoring & Auditing ▾

Prompt/query/response monitoring — logging what users ask and what the model returns.
Log monitoring, sanitization & protection — ensure logs don't leak sensitive data and are protected from tampering.
Response confidence monitoring — tracking how "certain" a model is, to flag low-confidence (possibly hallucinated) outputs.
Rate & cost monitoring — tracking prompt, storage, response, and processing costs to detect abuse or runaway spend.
Auditing for hallucinations, accuracy, bias/fairness, and access — periodic structured reviews.

💡

Study tip: Monitoring = real-time/ongoing observation. Auditing = periodic structured review. The exam tests whether you can tell these apart.

6. AI-Specific Attacks & Compensating Controls ▾

This is the longest list on the exam — study attacks and their compensating controls as pairs.

Attack	What It Does	Compensating Control(s)
Prompt injection	Malicious instructions embedded in input override system intent	Prompt firewalls, templates, guardrails
Model / data poisoning	Corrupts training data or model weights	Data integrity checks, provenance tracking, access controls
Jailbreaking	Bypasses a model's safety guardrails	Guardrail testing, layered controls
Input manipulation	Crafted inputs cause unintended behavior	Input validation, rate limiting
Bias introduction	Deliberately skews training data or outputs	Data auditing, fairness testing
Guardrail circumvention	Finds gaps in safety rules	Continuous guardrail validation
Integration abuse	Exploits how AI connects to other systems/plug-ins	Least privilege, scoped agent permissions
Model inversion / theft	Extracts training data or parameters via queries	Rate limiting, output filtering, access controls
Supply chain / transfer learning attack	Compromised pre-trained models or dependencies	Provenance verification, vetted sources
Model skewing	Gradually shifts model behavior via crafted inputs over time	Monitoring/auditing for drift
Output integrity attack	Tampers with model outputs in transit	Encryption, integrity checks
Membership inference	Determines whether specific data was in the training set	Differential privacy, access controls
Insecure output handling	Blindly trusting/executing model output (e.g., as code)	Output validation, sandboxing
Model denial of service (DoS)	Overwhelms a model with costly queries	Rate/token limits, quotas
Sensitive data disclosure	Model reveals confidential training data or PII	Data minimization, redaction, guardrails
Insecure plug-ins	Vulnerable third-party extensions	Least privilege, vetting, sandboxing
Excessive agency / overreliance	AI agent given too much autonomy or trust	Least privilege, human-in-the-loop oversight

💡

Study tip: Notice how often least privilege, guardrails, and rate limiting reappear — these three controls cover a huge share of this table.

Securing AI Systems
The biggest domain — 40%

What Domain 2 Covers

Implement Security Controls

Secure Deployment Environments

Mitigate Adversarial Risks

Key Concept Areas

Model Controls

Gateway Controls

Validation

Flashcards

Knowledge Check

Exam Ready

Next up — Domain 3: AI-Assisted Security (24%)

Securing AI SystemsThe biggest domain — 40%

What Domain 2 Covers

Implement Security Controls

Secure Deployment Environments

Mitigate Adversarial Risks

Key Concept Areas

Model Controls

Gateway Controls

Validation

Flashcards

Knowledge Check

Exam Ready

Next up — Domain 3: AI-Assisted Security (24%)

Securing AI Systems
The biggest domain — 40%