CY0-001 Practice Questions: Securing AI systems Domain

Published: June 14, 2026 | 20 min read

Test your CY0-001 knowledge with 10 practice questions from the Securing AI systems domain. Includes detailed explanations and answers.

CY0-001 Practice Questions

Master the Securing AI systems Domain

Test your knowledge in the Securing AI systems domain with these 10 practice questions. Each question is designed to help you prepare for the CY0-001 certification exam with detailed explanations to reinforce your learning.

Question 1

A security team is fine-tuning an internal code assistant using pull request comments, issue tickets, and developer wiki pages. Some sources can be edited by contractors, and the team is concerned that malicious examples could teach the assistant to recommend insecure code patterns. Which control best reduces this risk?

A) Track dataset provenance, verify integrity, and require review before fine-tuning.

B) Use a larger model so individual malicious examples have less influence.

C) Block the assistant from answering questions about authentication code.

D) Run the model only on internal network segments after deployment.

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): Data poisoning risk is reduced by knowing where training data came from, validating integrity, reviewing questionable sources, and controlling approval into fine-tuning datasets. These controls address the training inputs before they influence model behavior.

Why the other options are wrong:
- Option B: This may appear to dilute bad data, but model size does not provide reliable protection against poisoning.
- Option C: This may reduce one category of output risk, but it does not prevent poisoned examples from entering the dataset.
- Option D: This may limit exposure, but network placement does not address malicious fine-tuning data.

Question 2

A fraud detection model was deployed last night. Since deployment, alerts have dropped sharply. The model registry shows: Version: fraud-v8.2 Validation accuracy: 96.9% Approval: Pending Artifact signature: Missing Source pipeline: experimental-branch Previous version: fraud-v8.1, signed, approved, deployed for 42 days What should the AI security engineer do first?

A) Keep v8.2 deployed because its validation accuracy is higher.

B) Roll back to v8.1 and investigate the v8.2 deployment path.

C) Retrain v8.2 immediately using the newest production alerts.

D) Disable the registry approval workflow for emergency updates.

Show Answer & Explanation

Correct Answer: B

Explanation:

Correct answer (B): The deployed model lacks approval and artifact signing and came from an experimental branch. Rolling back to the last trusted signed version reduces production risk while the team investigates model provenance, deployment controls, and possible compromise.

Why the other options are wrong:
- Option A: Accuracy is not proof of model integrity or authorized deployment. A high score cannot compensate for missing approval and signature evidence.
- Option C: Retraining before containment and investigation may destroy evidence or introduce poisoned data into a new model.
- Option D: Disabling approval gates would weaken secure MLOps controls and make unauthorized deployments easier.

Question 3

A development team stores an external model API key in a prompt template file so their AI application can call the model during testing. The prompt template is committed to the same repository used by the CI/CD pipeline. What is the best remediation?

A) Move the key to an approved secrets manager and rotate the exposed key.

B) Rename the prompt template so it is less obvious to repository users.

C) Keep the key in the file but restrict the model's maximum token count.

D) Add a comment warning developers not to share the prompt template.

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): Secrets should not be embedded in prompts, templates, repositories, logs, training data, or model configuration files. The exposed key should be rotated, and future access should use an approved secrets management mechanism integrated with the application or pipeline.

Why the other options are wrong:
- Option B: This relies on obscurity and does not prevent access by anyone who can read the repository.
- Option C: Token limits affect model output length, not protection of credentials stored in files.
- Option D: A warning is not an effective control for a secret already committed to a shared repository.

Question 4

A team fine-tunes a malware triage model weekly. A recent dataset update shows: Dataset: malware-triage-finetune-weekly New records: 84,000 Source: community-submitted samples Write access: shared service account Hash manifest: not provided Reviewer: none Observed behavior: model increasingly labels suspicious macros as benign Which control would BEST reduce the risk shown in this record?

A) Require source validation, hash manifests, restricted writes, and review before ingestion.

B) Increase the training frequency so the model adapts faster to new samples.

C) Use a larger model so it can learn from mislabeled records more effectively.

D) Encrypt the fine-tuning dataset after it is loaded into the training environment.

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): The artifact indicates possible data poisoning or unauthorized manipulation: unvalidated community data, shared write access, no manifest, no reviewer, and a suspicious behavior shift. Provenance tracking, integrity checks, least-privilege write access, and review workflows are appropriate controls for training and fine-tuning data integrity.

Why the other options are wrong:
- Option B: More frequent training can spread poisoned or low-quality data faster if ingestion controls remain weak.
- Option C: A larger model does not make untrusted or poisoned labels safe and may still learn malicious patterns.
- Option D: Encryption helps confidentiality, but it does not validate the source or integrity of the ingested data.

Question 5

A team fine-tunes a malware triage model weekly. After a performance drop, the dataset lineage record shows: Dataset: triage-finetune-week34 New records: 18,000 Sources: analyst-reviewed tickets, community uploads Write access: all SOC contractors Validation: schema check only Label review: skipped due to deadline Which control would most directly reduce the risk shown?

A) Add trusted-source validation, label review, and restricted write access.

B) Increase model size so it can learn around incorrect labels.

C) Remove lineage records after training to reduce data exposure.

D) Use a longer training window to smooth weekly fluctuations.

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): The record shows weak provenance, broad write access, and missing review, which create data poisoning risk. Trusted sources, dataset lineage, validation, review workflows, anomaly checks, and restricted write permissions are key controls for training and fine-tuning data integrity.

Why the other options are wrong:
- Option B: A larger model may not correct malicious or mislabeled data and can still learn poisoned patterns.
- Option C: Deleting lineage reduces accountability and makes investigation harder; lineage should be protected, not removed.
- Option D: A longer training window may hide symptoms but does not validate source integrity or prevent malicious records.

Question 6

A SOC copilot analyzes endpoint alerts and generates remediation scripts for analysts. The organization plans to let analysts run generated scripts from the console. A sample output includes: Recommendation: Remove suspected persistence. Generated command: delete all scheduled tasks where publisher is unknown; restart host immediately. Confidence: 87% Which control is BEST before allowing use of this feature in production?

A) Require sandbox testing, policy validation, and analyst approval before execution.

B) Allow direct execution when the copilot confidence score is above 85%.

C) Restrict scripts to endpoints that have critical alerts from the SIEM.

D) Display the generated command in a larger font for analyst review.

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): AI-generated commands can be incorrect, unsafe, or manipulated. Before production use, outputs that cause operational changes should be validated against policy, tested or sandboxed where possible, and approved by a human analyst before execution, especially when the action could disrupt a host.

Why the other options are wrong:
- Option B: A confidence score is not an authorization control and does not prove the command is safe or compliant.
- Option C: Limiting scope to critical-alert endpoints reduces exposure but still allows unvalidated commands to run on production systems.
- Option D: Improving readability may help review, but it does not provide validation, sandboxing, or approval enforcement.

Question 7

An HR chatbot uses retrieval-augmented generation to answer employee policy questions. A standard employee asks, "What severance terms apply to executives?" The retrieval log shows: Retrieved chunk 1: /hr/policies/public/severance_overview.pdf Retrieved chunk 2: /hr/legal/executive_agreements/CFO_contract.pdf Metadata for chunk 2: classification=restricted, allowed_group=HR-Legal User group: Employees-General The chatbot summarizes both documents. What is the BEST control to prevent this exposure?

A) Enforce user authorization before vector retrieval and source document access.

B) Encrypt the vector database volume using enterprise-managed keys.

C) Add a response disclaimer that answers may include sensitive information.

D) Tune the model to refuse questions containing the word "executive."

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): The failure is unauthorized retrieval of restricted content. A RAG system must enforce access control at retrieval time so users only retrieve embeddings, chunks, metadata, and source documents they are authorized to access. Content filtering, disclaimers, or encryption at rest do not solve authorized-but-inappropriate retrieval.

Why the other options are wrong:
- Option B: Encryption at rest protects stored data from some unauthorized storage access, but it does not stop the application from retrieving restricted chunks for an authenticated user.
- Option C: A disclaimer may warn users, but it does not prevent disclosure of sensitive source material.
- Option D: Keyword-based refusal is brittle and may block legitimate questions while failing to enforce document-level authorization.

Question 8

A SOC team wants an AI agent to help triage alerts. The proposed agent can read alerts, query asset inventory, create tickets, disable user accounts, and block IP addresses at the firewall. The team wants faster triage but must avoid unsafe autonomous containment. Which design is the best security approach?

A) Allow all actions but review a sample of agent decisions weekly.

B) Permit read-only triage and ticketing, requiring approval for containment.

C) Disable all tool access and use the agent only for free-text summaries.

D) Give the agent administrator access so it can resolve incidents quickly.

Show Answer & Explanation

Correct Answer: B

Explanation:

Correct answer (B): AI agents should operate with least privilege and explicit boundaries. Read-only investigation and low-impact ticket creation can improve workflow, while high-impact actions such as disabling accounts or changing firewall rules should require human approval or policy checks.

Why the other options are wrong:
- Option A: This may seem efficient, but after-the-fact sampling does not prevent unsafe account or firewall changes.
- Option C: This reduces risk, but it unnecessarily removes useful low-risk tool access needed for triage.
- Option D: This improves speed, but it creates excessive agency and allows privileged changes without safeguards.

Question 9

A company indexes confidential legal and HR documents into a vector database for an internal assistant. The application currently checks user identity only after the model has already retrieved the top matching chunks. Security testing shows users can receive summaries based on documents they are not permitted to open. What should the security team implement FIRST?

A) Encrypt the vector database storage volume and keep the current retrieval flow.

B) Apply document-level authorization before retrieval and separate sensitive indexes.

C) Increase the embedding dimension to reduce reconstruction of original documents.

D) Use a larger language model to improve the quality of access-denial responses.

Show Answer & Explanation

Correct Answer: B

Explanation:

Correct answer (B): The main weakness is that unauthorized content is retrieved before the application enforces access. RAG systems must apply authorization at retrieval time, and sensitive embeddings should be protected with access controls, segmentation or tenant separation, minimization, retention controls, and monitoring.

Why the other options are wrong:
- Option A: Encryption at rest protects stored data from storage-level compromise, but it does not prevent the application from retrieving unauthorized chunks for a logged-in user.
- Option C: Embedding dimension is a model design parameter, not an access-control boundary. It does not stop unauthorized retrieval.
- Option D: Better denial wording does not help if restricted content has already been retrieved and used to generate the response.

Question 10

A legal research assistant stores user prompts, generated answers, retrieved document chunks, and embeddings for troubleshooting. The logs include client identifiers and confidential case details. Several developers have broad access to the troubleshooting store. Which action BEST reduces the security and privacy risk while preserving operational support?

A) Restrict access, minimize stored fields, redact sensitive data, and set retention limits.

B) Keep all logs unchanged but encrypt the troubleshooting store at rest.

C) Disable all logging for the assistant to remove sensitive data exposure.

D) Move the logs to the model registry so they inherit model versioning.

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): AI logs, retrieved chunks, and embeddings can contain sensitive information. The best approach preserves supportability while applying access control, data minimization, redaction, monitoring, and retention controls. Encryption alone does not prevent authorized users or workflows from accessing excessive sensitive content.

Why the other options are wrong:
- Option B: Encryption at rest is valuable, but broad developer access still exposes sensitive logs after decryption.
- Option C: Disabling all logging may reduce exposure but harms auditability, incident response, and troubleshooting.
- Option D: A model registry is not the appropriate control for sensitive prompt and retrieval logs.

Ready to Accelerate Your CY0-001 Preparation?

Join thousands of professionals who are advancing their careers through expert certification preparation with FlashGenius.

✅ Unlimited practice questions across all CY0-001 domains
✅ Full-length exam simulations with real-time scoring
✅ AI-powered performance tracking and weak area identification
✅ Personalized study plans with adaptive learning
✅ Mobile-friendly platform for studying anywhere, anytime
✅ Expert explanations and study resources

Start Free Practice Now

Already have an account? Sign in here

About CY0-001 Certification

The CY0-001 certification validates your expertise in securing ai systems and other critical domains. Our comprehensive practice questions are carefully crafted to mirror the actual exam experience and help you identify knowledge gaps before test day.