AI Auditing Tools & Techniques
Domain 3 focuses on how auditors plan, execute, report, and follow up on AI-specific audits — using both traditional IT audit disciplines and AI-native testing approaches.
AI Audit Lifecycle — 4 Phases
Plan → Execute → Report → Follow-Up. Planning sets a risk-based scope, execution gathers evidence and tests controls, reporting communicates findings, and follow-up verifies that remediation actually occurred.
ITAF — IT Assurance Framework (ISACA's Audit Standards)
ITAF organizes ISACA's professional audit standards into three categories: General (1000 series: auditor independence, objectivity, and competence), Performance (1200 series: how audits are planned, evidenced, and executed), and Reporting (1400 series: how results are communicated and followed up).
CCCE Audit Finding Framework
A complete, actionable finding documents four elements: Condition (what the auditor found), Criteria (what should be), Cause (why the gap exists), and Effect (the impact on the organization).
AI-Specific Audit Challenges
AI systems break several assumptions of traditional IT audit. The table below summarizes the main challenges and the auditor's response to each.
| Challenge | Why It's Hard | Auditor's Response |
|---|---|---|
| Black Box Models | Deep learning decisions cannot be attributed to specific inputs | Use LIME/SHAP for explainability; require XAI documentation in governance policy |
| Non-Determinism | Same input may produce different outputs in probabilistic models | Test on holdout datasets; compare output distributions rather than exact values (see the sketch after this table) |
| Rapid Model Change | Models retrain frequently; change management may not keep pace | Audit model versioning, change authorization, and rollback capabilities |
| Data Complexity | Training data may be massive, siloed, or poorly documented | Data lineage audit; test data quality dimensions; sample using CAATs |
| Algorithmic Bias | Bias in training data propagates to model outputs at scale | Stratified testing across demographic groups; compare fairness metrics |
| Third-Party AI | Vendor models offer limited auditability | Review SLAs, model cards, third-party assessments; right-to-audit clauses |
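The Non-Determinism row above suggests comparing output distributions rather than exact values. Here is a minimal sketch of that idea, assuming the auditor can run the model twice on the same holdout set; the model_predict stub and its noise level are hypothetical stand-ins, not a real system.

```python
# Sketch: test a non-deterministic model by comparing output distributions
# across two runs, rather than asserting exact output equality.
# model_predict is a hypothetical stand-in for the audited model.
import numpy as np
from scipy.stats import ks_2samp

rng = np.random.default_rng(42)

def model_predict(inputs):
    """Stand-in for a probabilistic model: scores jitter slightly between runs."""
    logits = inputs @ np.array([0.4, -0.2]) + rng.normal(0, 0.05, len(inputs))
    return 1 / (1 + np.exp(-logits))

holdout = rng.normal(size=(500, 2))              # auditor-controlled holdout set
run_a, run_b = model_predict(holdout), model_predict(holdout)

# Two-sample Kolmogorov–Smirnov test on the two output distributions.
stat, p_value = ks_2samp(run_a, run_b)
print(f"KS statistic = {stat:.3f}, p-value = {p_value:.3f}")
# A small KS statistic (large p-value) indicates run-to-run variation is
# distributional noise, not a behavioral change in the model.
```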
AI Audit Planning
A well-planned AI audit begins with a risk-based scope, clear objectives, and an understanding of the AI systems being audited.
Risk Concepts in AI Audit Planning
Inherent risk is the risk that exists before controls are applied (driven by model complexity, data sensitivity, and decision impact); residual risk is what remains after controls. Auditors assess whether residual risk sits within tolerance, and plan testing using the audit risk model: Audit Risk = Inherent Risk × Control Risk × Detection Risk.
Types of AI Audits
| Audit Type | Focus Area | Key Procedures |
|---|---|---|
| AI Governance Audit | Policies, oversight structure, accountability | Review AI policy, governance committee, roles/responsibilities, escalation paths |
| AI Model Audit | Model development, validation, and performance | Holdout testing, bias testing, model documentation review, version control |
| AI Data Audit | Training data quality and lineage | Data quality dimensions testing, lineage documentation, consent/GDPR review |
| AI Controls Audit | Control design and operating effectiveness | Control walkthroughs, reperformance of key controls, HITL override rate analysis |
| AI Ethics/Bias Audit | Fairness and non-discrimination | Disparate impact analysis, LIME/SHAP output review, demographic parity testing |
| Third-Party AI Audit | Vendor AI systems and SLAs | Model card review, right-to-audit clauses, SOC 2 reports, vendor questionnaires |
An audit program documents the specific procedures the auditor will perform. For AI audits, the program should address each AI risk area with tailored testing steps.
| Program Component | AI-Specific Content |
|---|---|
| Audit Objectives | Define what the audit will opine on: governance adequacy, model reliability, data integrity, or control effectiveness |
| Scope & Boundaries | Identify which AI systems, models, data pipelines, and business processes are in scope; explicit exclusions |
| Risk Assessment | Score AI systems by inherent risk; prioritize highest-risk systems for in-depth testing |
| Testing Procedures | Specific steps: model testing, bias analysis, change management walkthroughs, monitoring review, data lineage tracing |
| Evidence Requirements | Specify what documentation, output logs, test data, and approvals must be collected to support conclusions |
| Materiality Threshold | Define what level of error, bias deviation, or control failure is material (e.g., demographic disparity >5%) |
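To make the Materiality Threshold row concrete, here is a minimal sketch of testing the example threshold (demographic disparity >5%); the groups and approval counts are invented for illustration.

```python
# Sketch: flag a material demographic disparity in model approval rates.
# The 5% threshold mirrors the example in the Materiality Threshold row;
# groups and counts are illustrative.
approvals = {            # group -> (approved, total decisions)
    "group_a": (480, 600),
    "group_b": (300, 400),
    "group_c": (140, 200),
}

MATERIALITY = 0.05       # >5 percentage-point spread is material

rates = {group: approved / total for group, (approved, total) in approvals.items()}
spread = max(rates.values()) - min(rates.values())

for group, rate in sorted(rates.items()):
    print(f"{group}: approval rate {rate:.1%}")
print(f"max disparity = {spread:.1%} -> "
      f"{'MATERIAL' if spread > MATERIALITY else 'within tolerance'}")
# Here group_a approves 80.0% and group_c 70.0%: a 10.0% spread, material.
```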
Audit Evidence & AI Testing
Auditors collect evidence to support their conclusions. For AI, evidence extends beyond documents to model outputs, test datasets, and algorithmic fairness metrics.
6 Types of Audit Evidence
The commonly cited evidence types are inquiry, observation, inspection, confirmation, reperformance, and analytical procedures. Reliability increases as the auditor moves from management's verbal assertions toward evidence obtained directly: reperformance is the gold standard, and inquiry alone is never sufficient.
AI Model Testing Types
Core testing approaches include holdout testing (a fixed, unseen test set, preferably one the auditor controls), bias testing (stratified across demographic groups, comparing error rates), and drift testing (comparing production input distributions against training data, e.g., via PSI).
Data Quality Dimensions — AI Training Data Audit
Training data should be tested across six dimensions: Accuracy, Completeness, Consistency, Timeliness, Validity, and Uniqueness. Several of these checks can be automated.
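A hedged sketch of automating three of the six dimensions over a training extract; the column names and the plausible age range are a hypothetical schema, not from any specific standard.

```python
# Sketch: automated checks for three of the six data quality dimensions
# (completeness, uniqueness, validity) over a training-data extract.
import pandas as pd

train = pd.DataFrame({
    "applicant_id": [1, 2, 2, 4],
    "age":          [34, None, 29, 210],     # a null and an impossible value
    "income":       [52_000, 48_000, 61_000, 39_000],
})

report = {
    "completeness": 1 - train["age"].isna().mean(),       # share of non-null ages
    "uniqueness":   train["applicant_id"].is_unique,      # duplicate ID check
    "validity":     train["age"].between(0, 120).mean(),  # share in a plausible range
}
print(report)   # -> flags the null age, the duplicate ID 2, and the age of 210
```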
Data Lineage Auditing
Data lineage traces the complete journey of data — from original source through every transformation, to its final use in AI model training or inference. A gap in lineage = an unverified assumption in the model.
| Lineage Stage | Audit Questions | Evidence Sought |
|---|---|---|
| Data Origin | Where does the training data come from? Is consent obtained? | Source system documentation, data use agreements, consent records |
| Data Ingestion | How is data collected and ingested? Are there transformation errors? | ETL pipeline logs, ingestion audit trails, error handling documentation |
| Data Transformation | How is data preprocessed, normalized, or feature-engineered? | Transformation scripts, version history, data transformation documentation |
| Feature Engineering | Which features are derived? Could any be proxies for protected attributes? | Feature catalog, feature importance analysis, proxy variable testing |
| Training Split | Is holdout data truly unseen? Is there data leakage? | Train/test split documentation; evidence of temporal separation |
| Model Input | Is production data the same distribution as training data? | PSI monitoring reports, distribution comparison charts |
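The Model Input row cites PSI monitoring reports. A minimal sketch of how a Population Stability Index can be computed, using the standard binned formula PSI = Σ (actual% − expected%) × ln(actual% / expected%); the data and bin count are illustrative.

```python
# Sketch: Population Stability Index (PSI) between training ("expected")
# and production ("actual") distributions of one model input feature.
import numpy as np

def psi(expected, actual, bins=10):
    # Bin both samples on edges derived from the training distribution.
    # (Production values outside the training range fall out of the bins;
    # a fuller implementation would add open-ended edge bins.)
    edges = np.histogram_bin_edges(expected, bins=bins)
    e_pct = np.histogram(expected, bins=edges)[0] / len(expected)
    a_pct = np.histogram(actual, bins=edges)[0] / len(actual)
    # Floor the proportions to avoid log(0) and division by zero.
    e_pct, a_pct = np.clip(e_pct, 1e-6, None), np.clip(a_pct, 1e-6, None)
    return float(np.sum((a_pct - e_pct) * np.log(a_pct / e_pct)))

rng = np.random.default_rng(0)
training = rng.normal(0.0, 1.0, 10_000)
production = rng.normal(0.3, 1.1, 10_000)   # drifted production data

# Common rule of thumb: PSI < 0.1 stable; 0.1–0.25 moderate shift; > 0.25 significant drift.
print(f"PSI = {psi(training, production):.3f}")
```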
AI Audit Tools & Techniques
Modern AI auditing combines traditional CAATs with AI-native testing tools, continuous auditing capabilities, and statistical sampling methods.
Computer-Assisted Audit Techniques (CAATs)
CAATs let auditors analyze entire populations instead of samples: data extraction and analytics over model decision logs, exception reporting, automated control testing (e.g., verifying approvals exist), and continuous metrics such as PSI.
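A hedged sketch of the exception-reporting idea: scan the full decision-log population for high-impact decisions lacking a recorded human approval. The log schema (decision_id, impact, approved_by) is hypothetical, not a specific product's format.

```python
# Sketch: full-population exception reporting over AI decision logs.
# Control expectation: every high-impact decision carries a documented approver.
import pandas as pd

decision_log = pd.DataFrame({
    "decision_id": [101, 102, 103, 104],
    "impact":      ["high", "low", "high", "high"],
    "approved_by": ["j.smith", None, None, "a.jones"],
})

# Exception rule: high-impact decisions with no recorded approver.
exceptions = decision_log[(decision_log["impact"] == "high") &
                          (decision_log["approved_by"].isna())]
print(exceptions)   # -> decision 103 surfaces as a control exception
```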
Audit Sampling Methods
When full-population testing is impractical, auditors sample. Within statistical sampling, the four selection methods are random, systematic, stratified, and cluster; the table below contrasts statistical and judgmental approaches.
| Dimension | Statistical Sampling | Judgmental Sampling |
|---|---|---|
| Selection basis | Random (probability-based) | Auditor's professional judgment |
| Sampling risk | Can be quantified and projected | Cannot be statistically quantified |
| Projection | Results can be projected to whole population | Projection to population is not statistically valid |
| Bias risk | Minimal — selection is objective | Higher — auditor may unconsciously select familiar items |
| When preferred | Large homogeneous populations; regulatory requirements | Targeted testing of high-risk items; small populations |
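Since stratified sampling is the recommended method for bias testing (see Memory Hooks below), here is a minimal sketch assuming simple per-stratum random selection; the population sizes and group labels are invented.

```python
# Sketch: stratified sampling for bias testing. Sampling independently from
# each demographic stratum guarantees representation of small minority groups.
import random

random.seed(7)
population = ([("group_a", i) for i in range(900)] +
              [("group_b", i) for i in range(80)] +
              [("group_c", i) for i in range(20)])   # small minority stratum

strata = {}
for group, record in population:
    strata.setdefault(group, []).append(record)

SAMPLE_PER_STRATUM = 10
sample = {group: random.sample(records, min(SAMPLE_PER_STRATUM, len(records)))
          for group, records in strata.items()}

for group, records in sample.items():
    print(f"{group}: {len(records)} sampled of {len(strata[group])}")
# A simple random sample of 30 from this population would include, in
# expectation, fewer than one group_c record; stratification guarantees 10.
```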
AI Audit Report Structure
The report communicates scope, objectives, and the overall conclusion, with each finding structured using the CCCE framework (Condition, Criteria, Cause, Effect) so that it is complete and actionable.
Memory Hooks
High-yield mnemonics and patterns to lock in AI Auditing Tools & Techniques for the AAIA.
| Fact | Answer |
|---|---|
| ITAF category governing how the audit is conducted | Performance Standards (1200 series) |
| CCCE element that describes what the auditor found | Condition (what IS happening) |
| Best sampling method for AI bias auditing | Stratified sampling — ensures all demographic groups are represented |
| What CAATs stands for | Computer-Assisted Audit Techniques |
| Owner of continuous auditing | Internal audit (Line 3) — provides independent assurance |
| Owner of continuous monitoring | Management (Line 1/2 operational activity) |
| Most reliable type of audit evidence | Evidence obtained directly by the auditor (reperformance, physical inspection) |
| Least reliable type of audit evidence | Inquiry (verbal) — must be corroborated with other evidence |
| COBIT vs. ITAF distinction | COBIT = governance framework (what org should do); ITAF = audit standards (how auditor works) |
| Key challenge of black-box AI models for auditors | Cannot explain decisions; SHAP/LIME used for post-hoc explainability |
Flashcards
Question-and-answer pairs covering the core Domain 3 concepts.
What is ITAF and what are its three standard categories?
ITAF = IT Assurance Framework (ISACA's professional audit standards). Three categories: General (auditor qualifications — independence, competence), Performance (how to conduct audits — planning, evidence, testing), Reporting (communicating results — findings, opinions, follow-up).
What are the 4 elements of the CCCE audit finding framework?
Condition — what the auditor FOUND (the deviation). Criteria — what SHOULD BE (the standard). Cause — WHY the gap exists (root cause). Effect — the IMPACT on the organization. All four required for a complete, actionable finding.
How does continuous auditing differ from continuous monitoring?
Continuous Auditing = performed by internal audit; provides independent assurance on controls and transactions on an ongoing basis. Continuous Monitoring = performed by management (Line 1/2); operational oversight of KPIs and thresholds. Same tools; different owners.
What is the "black box" problem in AI auditing?
Deep learning models cannot be directly interpreted — decisions cannot be attributed to specific input variables. Auditors cannot trace why a particular output was produced. This challenges GDPR Article 22 compliance, bias auditing, and due process. Mitigation: use SHAP/LIME for post-hoc explanations.
What is data lineage auditing, and why does it matter for AI?
Data lineage traces the complete journey of data: source → ingestion → preprocessing → feature engineering → training → production inference. It matters for AI because untracked transformations can introduce bias, violate consent, or create distribution mismatches between training and production data.
What is stratified sampling and when is it preferred for AI bias auditing?
Stratified sampling divides the population into subgroups (strata) and independently randomly samples from each. Preferred for bias auditing because it guarantees representation of small demographic minorities that simple random sampling might miss — enabling statistically valid conclusions about fairness across all groups.
What are CAATs and what AI-specific tasks can they support?
CAATs = Computer-Assisted Audit Techniques. For AI auditing: extract and analyze model decision logs (exception reporting), automate control testing (verify approvals exist), run continuous PSI calculations, perform demographic stratification, and integrate SHAP outputs for bias evidence across large populations.
What is the difference between inherent risk and residual risk in AI audit planning?
Inherent risk = risk that exists BEFORE controls are applied (driven by model complexity, data sensitivity, decision impact). Residual risk = risk that REMAINS after controls are applied. Auditors assess whether residual risk is within acceptable tolerance, not whether inherent risk is zero (it never is).
Exam Strategy — Domain 3
- Distinguish ITAF vs COBIT: ITAF = how YOU audit; COBIT = what the ORGANIZATION should be doing. If a question asks about audit standards, the answer is ITAF.
- CCCE order matters: Condition comes before Criteria — you observe first, then compare to the standard. Cause always requires root-cause analysis, not just symptom description.
- Sampling for bias: Any question about testing demographic fairness → Stratified sampling. It's the only method that guarantees representation of small groups.
- Continuous auditing owner: If a question says "management monitors..." that's continuous monitoring, not auditing. Audit owns continuous AUDITING.
- Evidence reliability: Auditor-obtained always beats management-provided. Reperformance is the gold standard — the auditor does the test themselves.
Common Mistakes to Avoid
- Mixing up CCCE: "Criteria" is the STANDARD (what should be), not what the auditor found. "Condition" is what they found. This is the #1 mix-up on the exam.
- COBIT ≠ Audit Standard: COBIT is a governance reference framework, not an audit standard. Auditors use it as a benchmark, not to govern how they conduct the audit (ITAF does that).
- Aggregate accuracy hides bias: Don't conclude a model is fair based on overall accuracy. Bias only surfaces when you test across demographic subgroups.
- Inquiry is the weakest evidence: Management telling you a control works is not sufficient alone. Must corroborate with inspection, reperformance, or observation.
- Data lineage ≠ model monitoring: Lineage is about data flow before model training; monitoring is about model performance after deployment. They're different audit procedures.
Quick Review — Key Facts
- ITAF 3 categories: General (1000s) → Performance (1200s) → Reporting (1400s)
- CCCE: Condition → Criteria → Cause → Effect
- 4 sampling types: Random, Systematic, Stratified (bias), Cluster
- CAATs: Computer-Assisted Audit Techniques — data analytics, exception reporting, automated testing
- CA vs CM: Continuous Auditing = audit function; Continuous Monitoring = management
- Data quality 6 dims: Accuracy, Completeness, Consistency, Timeliness, Validity, Uniqueness
- Evidence hierarchy: Auditor-obtained > External > Written internal > Inquiry
- Bias testing method: Stratify by demographic, compare error rates across groups
Deep Dive — Advanced Concepts
- Audit risk model: Audit Risk = Inherent Risk × Control Risk × Detection Risk. To reduce overall audit risk, increase substantive testing (lowers detection risk) when inherent or control risk is high.
- Data lineage and proxy variables: Feature engineering can inadvertently create proxy variables for protected attributes (e.g., zip code → race). Data lineage audit maps every transformation to identify proxies before they contaminate model training.
- Holdout vs. cross-validation: Holdout testing uses a fixed unseen test set (common for audit verification). Cross-validation reuses data across folds (used in model development). Auditors should prefer holdout sets they themselves control.
- Right-to-audit clauses: For third-party AI vendors, right-to-audit contract clauses allow the organization (and auditors) to inspect vendor AI systems, model cards, and test results — critical for governance where internal access is unavailable.
- Materiality in AI context: For bias audits, a common materiality threshold is the 80% rule (the 4/5ths rule from the US EEOC) — if the selection rate for a protected group is <80% of the highest group's rate, disparate impact may be material (see the sketch below).
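A minimal sketch of applying the 4/5ths rule from the item above; the selection counts are invented for illustration.

```python
# Sketch: the 4/5ths (80%) rule for disparate impact. A group whose selection
# rate is below 80% of the highest group's rate is flagged as potentially
# material.
selections = {               # group -> (selected, applicants)
    "group_a": (60, 100),
    "group_b": (50, 100),
    "group_c": (40, 100),
}

rates = {group: sel / total for group, (sel, total) in selections.items()}
benchmark = max(rates.values())          # highest group's selection rate

for group, rate in sorted(rates.items()):
    ratio = rate / benchmark
    flag = "POTENTIAL DISPARATE IMPACT" if ratio < 0.8 else "ok"
    print(f"{group}: rate {rate:.0%}, ratio vs. benchmark {ratio:.2f} -> {flag}")
# group_c: 40% / 60% = 0.67 < 0.80, so it is flagged under the 4/5ths rule.
```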
Practice Tips for Domain 3
- Scenario-based questions: Domain 3 questions often describe a situation and ask "what should the auditor do FIRST?" — the answer is usually planning or risk assessment before any testing begins.
- Watch for "most appropriate" phrasing: When asking for evidence, the most appropriate is usually reperformance or analytical — not inquiry alone. When asking about sampling for bias, stratified beats all others.
- ITAF vs COBIT traps: Exam may ask about "standards the auditor follows" (ITAF) vs "standards the organization should meet" (COBIT). Know both frameworks — they're frequently tested together.
- Audit lifecycle sequence: Plan → Execute → Report → Follow-Up. Questions about "what happens after findings are documented?" → Report. "What happens after the report?" → Follow-Up/remediation.
- Flashcard drill: Use the CCCE framework on every real-world AI scenario you read — identify all four elements. This builds exam intuition quickly.