SNOWPRO-CORE-C03 Practice Questions: Snowflake AI Data Cloud Features & Architecture Domain
Test your SNOWPRO-CORE-C03 knowledge with 10 practice questions from the Snowflake AI Data Cloud Features & Architecture domain. Includes detailed explanations and answers.
SNOWPRO-CORE-C03 Practice Questions
Master the Snowflake AI Data Cloud Features & Architecture Domain
Test your knowledge in the Snowflake AI Data Cloud Features & Architecture domain with these 10 practice questions. Each question is designed to help you prepare for the SNOWPRO-CORE-C03 certification exam with detailed explanations to reinforce your learning.
Question 1
A company plans to host both HR and Sales analytics in the same Snowflake account. They want clear logical separation of objects, including independent management of tables and views, but still want to share some reference data between the two areas. Which Snowflake object is the best primary boundary to separate HR and Sales data within this single account?
Show Answer & Explanation
Correct Answer: B
Correct answer (B): Within a single Snowflake account, databases are the primary top-level logical containers for objects like schemas, tables, and views. Using separate databases for HR and Sales provides clear separation while still allowing controlled sharing of objects across databases if needed. Warehouses, stages, and cloud storage buckets do not provide the same logical object hierarchy separation for data modeling and governance inside Snowflake.
Why the other options are wrong:
- Option A: Virtual warehouses control compute for queries but do not logically separate the underlying data. HR and Sales could still accidentally access each other’s tables if they share the same database and schema structure.
- Option C: Cloud storage buckets exist outside Snowflake’s object hierarchy. While they matter for loading or external data, they do not provide the primary logical separation of databases and schemas inside a Snowflake account.
- Option D: Stages are used for data loading/unloading, not as the primary logical container for production tables and views. They are not the best object to establish HR vs. Sales boundaries.
Question 2
A data science team wants to build a reusable feature engineering pipeline in Python that operates directly on Snowflake tables without moving data out of the platform. They want their Python code to execute on Snowflake compute for scalability and governance, and they will later feed the engineered features into various ML tools. Which Snowflake capability is the best fit for this requirement?
Show Answer & Explanation
Correct Answer: C
Correct answer (C): Snowpark is the developer framework that lets engineers write Python (and other language) code that runs inside Snowflake compute, operating directly on Snowflake data. It is ideal for feature engineering and data processing pipelines that must stay close to the data.
Why the other options are wrong:
- Option A: Cortex is a managed AI and LLM service focused on applying models and LLMs to data, not a general-purpose framework for arbitrary Python feature engineering logic.
- Option B: Snowflake ML simplifies many ML lifecycle tasks, but the scenario specifically requires running custom Python feature engineering code on Snowflake compute, which is Snowpark’s role.
- Option D: Marketplace is for sharing data products and services, not for running internal Python feature engineering pipelines.
Question 3
A company wants to keep a curated reporting table continuously updated from a raw events table as new data lands. They prefer a managed approach that automatically keeps the derived table in sync over time without them having to build and orchestrate streams and tasks for incremental processing. Which Snowflake feature best meets this requirement?
Show Answer & Explanation
Correct Answer: A
Correct answer (A): Dynamic Tables are specifically designed to maintain a target table in sync with changes in source data based on a defining query. They provide managed, incremental transformations over time without requiring you to explicitly create and orchestrate streams and tasks, matching the desire for a managed continuous transformation.
Why the other options are wrong:
- Option B: Incorrect. A standard view does not materialize data or manage refresh; every query re-reads the raw table, which does not provide a curated, maintained physical table for performance and operational needs.
- Option C: Incorrect. Streams plus manual batch operations can implement incremental processing, but the requirement is to avoid managing streams and tasks explicitly. Dynamic Tables are the managed alternative.
- Option D: Incorrect. Iceberg tables are about interoperating on Apache Iceberg-format data, often in external storage. They do not by themselves provide a managed incremental transformation from one table into another.
Question 4
A multinational enterprise wants a single Snowflake strategy across two cloud providers and multiple regions. They must: (1) keep customer data in-region for residency, (2) enable some global analytics that combine data from multiple regions, and (3) centrally manage governance and billing while allowing regional account autonomy. Which high-level Snowflake architecture best satisfies these requirements?
Show Answer & Explanation
Correct Answer: A
Correct answer (A): Using a single Snowflake organization with multiple accounts in different regions and clouds allows centralized governance and billing while giving regions autonomy. Data residency is addressed by storing and processing data in-region in each account. Global analytics can be enabled selectively via data replication (for physical copies where needed) and secure data sharing across accounts, all within the same organization.
Why the other options are wrong:
- Option B: Incorrect. Snowflake does not automatically replicate data across regions; explicit replication or sharing is required. A single-region account cannot satisfy strict in-region residency across multiple regions.
- Option C: Incorrect. Reader accounts query provider-managed data and do not create local regional copies, which can violate residency and performance requirements for local processing.
- Option D: Incorrect. Multiple organizations complicate central governance and billing. External storage alone does not replace Snowflake-native replication and secure data sharing patterns.
Question 5
An analyst runs a SELECT query every morning to generate a report. For several days, the query returns almost instantly. Today, after a major data load into the underlying table, the same query suddenly takes much longer to run, even though the SQL text and warehouse are unchanged. What Snowflake behavior most likely explains this change?
Show Answer & Explanation
Correct Answer: A
Correct answer (A): Snowflake's result cache stores results for previously run queries and can return them instantly if the same query is issued and the underlying data has not changed. A major data load changes the table and invalidates the cached result. As a result, Snowflake must re-execute the query, causing the longer runtime observed.
Why the other options are wrong:
- Option B: Incorrect. Resizing a warehouse affects compute resources but does not 'turn off' caching globally. The key trigger here is the data change, not warehouse size.
- Option C: Incorrect. Micro-partitions are managed automatically; there is no manual rebuild step that would explain this behavior.
- Option D: Incorrect. Queries are executed in the region of the account; data is not transparently routed to another region, and there is no automatic slow replication that would cause this specific symptom.
Question 6
A data engineer is migrating a large fact table to Snowflake and asks how to manually create and maintain partitions to optimize query performance. As the Snowflake architect, how should you respond?
Show Answer & Explanation
Correct Answer: C
Correct answer (C): Snowflake automatically organizes table data into columnar, compressed micro-partitions and maintains the metadata needed for pruning at query time. Users do not manually create or manage partitions; optimization happens through Snowflake’s storage engine and metadata.
Why the other options are wrong:
- Option A: Snowflake does not require user-defined partitions for pruning. While clustering keys can help in some cases, explicit physical partitions are not created or maintained by users.
- Option B: Splitting tables across schemas does not create physical partitions and would add unnecessary complexity. It also does not align with how Snowflake’s storage engine works.
- Option D: Virtual warehouses are compute resources and cannot be used to physically partition data. They all access the same shared storage layer.
Question 7
A media company maintains petabytes of data in a cloud data lake using the Apache Iceberg table format. Their governance team requires that raw data files remain in the existing object storage, but analysts want to query these Iceberg tables from Snowflake as if they were native tables, without copying the data into Snowflake storage. What Snowflake feature best satisfies these requirements?
Show Answer & Explanation
Correct Answer: B
Correct answer (B): Snowflake Iceberg tables allow Snowflake to manage metadata and query data stored in external object storage using the Apache Iceberg format, without copying that data into native Snowflake storage. This meets the requirement to leave data in the data lake while querying it as Snowflake tables.
Why the other options are wrong:
- Option A: COPY INTO would ingest and store data in Snowflake, violating the requirement to keep raw data in the existing object storage without copying.
- Option C: Exporting results from Snowflake does not enable querying the existing Iceberg tables from within Snowflake. It shifts more work to external systems and breaks the desired unified query experience.
- Option D: Dynamic Tables automate transformations within Snowflake but assume data is in Snowflake tables. They are not a mechanism for directly querying externally stored Iceberg data without ingestion.
Question 8
A data engineering team currently runs complex transformations in a separate Python environment, pulling data out of Snowflake, processing it, and then writing results back. They want to reduce data movement and run the transformations directly inside Snowflake while still using Python as their main language. Which Snowflake capability best fits this requirement?
Show Answer & Explanation
Correct Answer: A
Correct answer (A): Snowpark is designed for data processing in languages like Python, executed inside Snowflake’s compute layer. It lets the team define transformations in Python while keeping execution close to the data, leveraging Snowflake’s scalability and governance and avoiding data movement in and out of external environments. Cortex focuses on AI/LLM use cases, Streamlit on interactive apps, and Snowflake ML on model training/inference, none of which directly address general-purpose Python transformations on data.
Why the other options are wrong:
- Option B: Cortex LLM functions provide AI capabilities such as text generation and summarization, not automatic code translation of arbitrary Python transformations into SQL for production pipelines.
- Option C: Streamlit in Snowflake is for building interactive apps and UIs, not for running backend batch transformation pipelines at scale inside the database.
- Option D: Snowflake ML is for building and serving machine learning models. It does not replace arbitrary transformation logic and is not intended as a general-purpose ETL framework.
Question 9
A business intelligence team wants to give non-technical business users the ability to ask natural language questions like "What were last quarter’s sales by region?" and receive answers and visualizations generated directly from governed data in Snowflake. The team does not want to build and manage their own LLM infrastructure. Which Snowflake capability is designed for this use case?
Show Answer & Explanation
Correct Answer: A
Correct answer (A): Cortex Analyst is specifically designed to allow business users to ask natural language questions about governed Snowflake data and receive answers and visualizations. It leverages Snowflake’s metadata, security, and query engine and does not require the team to manage LLM infrastructure, matching the described use case.
Why the other options are wrong:
- Option B: Cortex LLM functions are powerful for LLM-powered transformations inside SQL, but they require more technical setup and are not a pre-built conversational analytics experience for business users.
- Option C: Snowflake Notebooks are suited for analysts and engineers mixing SQL, Python, and narrative text. They are not a turnkey natural language Q&A interface for non-technical users.
- Option D: Streamlit in Snowflake can be used to build custom interactive apps, but the team would need to design and implement the full natural language interface and logic themselves. Cortex Analyst provides this functionality out of the box.
Question 10
A global retailer is standardizing its analytics platform on Snowflake. They have three main workloads: - Nightly ELT that performs large batch transformations with predictable timing. - High-concurrency BI dashboards accessed by hundreds of users during business hours. - Occasional ad-hoc data science exploration that can be compute-intensive but is used by a small team. The company wants good performance for each workload, cost control when workloads are idle, and minimal data duplication. Which Snowflake architecture best meets these requirements?
Show Answer & Explanation
Correct Answer: B
Correct answer (B): Using separate warehouses for each workload leverages Snowflake’s shared data architecture so all workloads access one copy of the data. A multi-cluster warehouse for BI handles high concurrency, while dedicated ELT and data science warehouses isolate heavy batch and exploratory workloads. Auto-suspend/resume provides cost control when workloads are idle, and no data duplication is required.
Why the other options are wrong:
- Option A: Incorrect. A single large always-on warehouse prevents workload isolation, risks contention between workloads, and wastes compute cost when some workloads are idle.
- Option C: Incorrect. Separate accounts and databases fragment governance and require data replication between accounts, which contradicts the goal of minimal data duplication.
- Option D: Incorrect. Copying tables into multiple databases duplicates data and increases maintenance overhead; assigning small warehouses does not solve concurrency needs for BI or isolate intense ELT from other workloads as well as dedicated warehouses.
Ready to Accelerate Your SNOWPRO-CORE-C03 Preparation?
Join thousands of professionals who are advancing their careers through expert certification preparation with FlashGenius.
- ✅ Unlimited practice questions across all SNOWPRO-CORE-C03 domains
- ✅ Full-length exam simulations with real-time scoring
- ✅ AI-powered performance tracking and weak area identification
- ✅ Personalized study plans with adaptive learning
- ✅ Mobile-friendly platform for studying anywhere, anytime
- ✅ Expert explanations and study resources
Already have an account? Sign in here
About SNOWPRO-CORE-C03 Certification
The SNOWPRO-CORE-C03 certification validates your expertise in snowflake ai data cloud features & architecture and other critical domains. Our comprehensive practice questions are carefully crafted to mirror the actual exam experience and help you identify knowledge gaps before test day.