FlashGenius Logo FlashGenius
Login Sign Up

SNOWPRO-CORE-C03 Practice Questions: Snowflake AI Data Cloud Features & Architecture Domain

Test your SNOWPRO-CORE-C03 knowledge with 10 practice questions from the Snowflake AI Data Cloud Features & Architecture domain. Includes detailed explanations and answers.

SNOWPRO-CORE-C03 Practice Questions

Master the Snowflake AI Data Cloud Features & Architecture Domain

Test your knowledge in the Snowflake AI Data Cloud Features & Architecture domain with these 10 practice questions. Each question is designed to help you prepare for the SNOWPRO-CORE-C03 certification exam with detailed explanations to reinforce your learning.

Question 1

A company plans to host both HR and Sales analytics in the same Snowflake account. They want clear logical separation of objects, including independent management of tables and views, but still want to share some reference data between the two areas. Which Snowflake object is the best primary boundary to separate HR and Sales data within this single account?

A) Separate virtual warehouses for HR and Sales

B) Separate databases for HR and Sales

C) Separate cloud provider storage buckets for HR and Sales

D) Separate internal stages for HR and Sales

Show Answer & Explanation

Correct Answer: B

Explanation:

Correct answer (B): Within a single Snowflake account, databases are the primary top-level logical containers for objects like schemas, tables, and views. Using separate databases for HR and Sales provides clear separation while still allowing controlled sharing of objects across databases if needed. Warehouses, stages, and cloud storage buckets do not provide the same logical object hierarchy separation for data modeling and governance inside Snowflake.

Why the other options are wrong:
- Option A: Virtual warehouses control compute for queries but do not logically separate the underlying data. HR and Sales could still accidentally access each other’s tables if they share the same database and schema structure.
- Option C: Cloud storage buckets exist outside Snowflake’s object hierarchy. While they matter for loading or external data, they do not provide the primary logical separation of databases and schemas inside a Snowflake account.
- Option D: Stages are used for data loading/unloading, not as the primary logical container for production tables and views. They are not the best object to establish HR vs. Sales boundaries.

Question 2

A data science team wants to build a reusable feature engineering pipeline in Python that operates directly on Snowflake tables without moving data out of the platform. They want their Python code to execute on Snowflake compute for scalability and governance, and they will later feed the engineered features into various ML tools. Which Snowflake capability is the best fit for this requirement?

A) Cortex, to run low-level Python feature engineering code inside LLM models.

B) Snowflake ML, to train models and avoid writing any custom feature engineering code.

C) Snowpark, to write and run Python code that processes data directly in Snowflake.

D) Snowflake Marketplace, to publish the feature engineering code as a data product.

Show Answer & Explanation

Correct Answer: C

Explanation:

Correct answer (C): Snowpark is the developer framework that lets engineers write Python (and other language) code that runs inside Snowflake compute, operating directly on Snowflake data. It is ideal for feature engineering and data processing pipelines that must stay close to the data.

Why the other options are wrong:
- Option A: Cortex is a managed AI and LLM service focused on applying models and LLMs to data, not a general-purpose framework for arbitrary Python feature engineering logic.
- Option B: Snowflake ML simplifies many ML lifecycle tasks, but the scenario specifically requires running custom Python feature engineering code on Snowflake compute, which is Snowpark’s role.
- Option D: Marketplace is for sharing data products and services, not for running internal Python feature engineering pipelines.

Question 3

A company wants to keep a curated reporting table continuously updated from a raw events table as new data lands. They prefer a managed approach that automatically keeps the derived table in sync over time without them having to build and orchestrate streams and tasks for incremental processing. Which Snowflake feature best meets this requirement?

A) A Dynamic Table defined on top of the raw events table.

B) A standard view that selects from the raw events table.

C) A Stream object on the raw table combined with manual batch COPY INTO operations.

D) An Iceberg table linked to the raw events data in external storage.

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): Dynamic Tables are specifically designed to maintain a target table in sync with changes in source data based on a defining query. They provide managed, incremental transformations over time without requiring you to explicitly create and orchestrate streams and tasks, matching the desire for a managed continuous transformation.

Why the other options are wrong:
- Option B: Incorrect. A standard view does not materialize data or manage refresh; every query re-reads the raw table, which does not provide a curated, maintained physical table for performance and operational needs.
- Option C: Incorrect. Streams plus manual batch operations can implement incremental processing, but the requirement is to avoid managing streams and tasks explicitly. Dynamic Tables are the managed alternative.
- Option D: Incorrect. Iceberg tables are about interoperating on Apache Iceberg-format data, often in external storage. They do not by themselves provide a managed incremental transformation from one table into another.

Question 4

A multinational enterprise wants a single Snowflake strategy across two cloud providers and multiple regions. They must: (1) keep customer data in-region for residency, (2) enable some global analytics that combine data from multiple regions, and (3) centrally manage governance and billing while allowing regional account autonomy. Which high-level Snowflake architecture best satisfies these requirements?

A) Create a Snowflake organization with separate accounts in each required region and cloud, then use replication and secure data sharing between accounts as needed.

B) Create one large Snowflake account in a single region and cloud, and rely on automatic cross-region data replication for global analytics.

C) Use only reader accounts in other regions that query data from a single central provider account to avoid any cross-region replication costs.

D) Create a separate Snowflake organization per region and use external cloud storage buckets to share data between them.

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): Using a single Snowflake organization with multiple accounts in different regions and clouds allows centralized governance and billing while giving regions autonomy. Data residency is addressed by storing and processing data in-region in each account. Global analytics can be enabled selectively via data replication (for physical copies where needed) and secure data sharing across accounts, all within the same organization.

Why the other options are wrong:
- Option B: Incorrect. Snowflake does not automatically replicate data across regions; explicit replication or sharing is required. A single-region account cannot satisfy strict in-region residency across multiple regions.
- Option C: Incorrect. Reader accounts query provider-managed data and do not create local regional copies, which can violate residency and performance requirements for local processing.
- Option D: Incorrect. Multiple organizations complicate central governance and billing. External storage alone does not replace Snowflake-native replication and secure data sharing patterns.

Question 5

An analyst runs a SELECT query every morning to generate a report. For several days, the query returns almost instantly. Today, after a major data load into the underlying table, the same query suddenly takes much longer to run, even though the SQL text and warehouse are unchanged. What Snowflake behavior most likely explains this change?

A) The result cache was invalidated because the underlying table data changed, so Snowflake had to recompute the result.

B) The warehouse was resized to a smaller size, which automatically disabled all caching features.

C) Micro-partitions had to be manually rebuilt after the load, causing a one-time performance penalty.

D) The query was routed to a different region where data is automatically replicated more slowly.

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): Snowflake's result cache stores results for previously run queries and can return them instantly if the same query is issued and the underlying data has not changed. A major data load changes the table and invalidates the cached result. As a result, Snowflake must re-execute the query, causing the longer runtime observed.

Why the other options are wrong:
- Option B: Incorrect. Resizing a warehouse affects compute resources but does not 'turn off' caching globally. The key trigger here is the data change, not warehouse size.
- Option C: Incorrect. Micro-partitions are managed automatically; there is no manual rebuild step that would explain this behavior.
- Option D: Incorrect. Queries are executed in the region of the account; data is not transparently routed to another region, and there is no automatic slow replication that would cause this specific symptom.

Question 6

A data engineer is migrating a large fact table to Snowflake and asks how to manually create and maintain partitions to optimize query performance. As the Snowflake architect, how should you respond?

A) Define range partitions on the main date column, because Snowflake requires explicit partition keys for pruning to work

B) Create many small schemas and distribute table rows across them to simulate partitioning

C) Explain that Snowflake automatically stores data in micro-partitions and uses metadata for pruning, so no manual partition management is required

D) Use virtual warehouses as partitions by assigning different date ranges to different warehouses

Show Answer & Explanation

Correct Answer: C

Explanation:

Correct answer (C): Snowflake automatically organizes table data into columnar, compressed micro-partitions and maintains the metadata needed for pruning at query time. Users do not manually create or manage partitions; optimization happens through Snowflake’s storage engine and metadata.

Why the other options are wrong:
- Option A: Snowflake does not require user-defined partitions for pruning. While clustering keys can help in some cases, explicit physical partitions are not created or maintained by users.
- Option B: Splitting tables across schemas does not create physical partitions and would add unnecessary complexity. It also does not align with how Snowflake’s storage engine works.
- Option D: Virtual warehouses are compute resources and cannot be used to physically partition data. They all access the same shared storage layer.

Question 7

A media company maintains petabytes of data in a cloud data lake using the Apache Iceberg table format. Their governance team requires that raw data files remain in the existing object storage, but analysts want to query these Iceberg tables from Snowflake as if they were native tables, without copying the data into Snowflake storage. What Snowflake feature best satisfies these requirements?

A) Set up external stages and use COPY INTO to load Iceberg data into native Snowflake tables

B) Create Iceberg tables in Snowflake that reference the existing Iceberg data in object storage

C) Export Snowflake query results to the data lake so analysts can join them with Iceberg tables externally

D) Use Dynamic Tables to automatically synchronize data from the data lake into Snowflake on a schedule

Show Answer & Explanation

Correct Answer: B

Explanation:

Correct answer (B): Snowflake Iceberg tables allow Snowflake to manage metadata and query data stored in external object storage using the Apache Iceberg format, without copying that data into native Snowflake storage. This meets the requirement to leave data in the data lake while querying it as Snowflake tables.

Why the other options are wrong:
- Option A: COPY INTO would ingest and store data in Snowflake, violating the requirement to keep raw data in the existing object storage without copying.
- Option C: Exporting results from Snowflake does not enable querying the existing Iceberg tables from within Snowflake. It shifts more work to external systems and breaks the desired unified query experience.
- Option D: Dynamic Tables automate transformations within Snowflake but assume data is in Snowflake tables. They are not a mechanism for directly querying externally stored Iceberg data without ingestion.

Question 8

A data engineering team currently runs complex transformations in a separate Python environment, pulling data out of Snowflake, processing it, and then writing results back. They want to reduce data movement and run the transformations directly inside Snowflake while still using Python as their main language. Which Snowflake capability best fits this requirement?

A) Use Snowpark to write Python transformations that execute inside Snowflake’s compute layer.

B) Use Cortex LLM functions to automatically convert Python transformations into SQL.

C) Use Streamlit in Snowflake so Python code runs on the BI dashboards instead of in the database.

D) Use Snowflake ML to automatically create models that replace all Python transformation logic.

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): Snowpark is designed for data processing in languages like Python, executed inside Snowflake’s compute layer. It lets the team define transformations in Python while keeping execution close to the data, leveraging Snowflake’s scalability and governance and avoiding data movement in and out of external environments. Cortex focuses on AI/LLM use cases, Streamlit on interactive apps, and Snowflake ML on model training/inference, none of which directly address general-purpose Python transformations on data.

Why the other options are wrong:
- Option B: Cortex LLM functions provide AI capabilities such as text generation and summarization, not automatic code translation of arbitrary Python transformations into SQL for production pipelines.
- Option C: Streamlit in Snowflake is for building interactive apps and UIs, not for running backend batch transformation pipelines at scale inside the database.
- Option D: Snowflake ML is for building and serving machine learning models. It does not replace arbitrary transformation logic and is not intended as a general-purpose ETL framework.

Question 9

A business intelligence team wants to give non-technical business users the ability to ask natural language questions like "What were last quarter’s sales by region?" and receive answers and visualizations generated directly from governed data in Snowflake. The team does not want to build and manage their own LLM infrastructure. Which Snowflake capability is designed for this use case?

A) Cortex Analyst

B) Cortex LLM functions used inside ad-hoc SQL queries

C) Snowflake Notebooks with embedded SQL and Python cells

D) Streamlit in Snowflake apps connecting to Snowflake tables

Show Answer & Explanation

Correct Answer: A

Explanation:

Correct answer (A): Cortex Analyst is specifically designed to allow business users to ask natural language questions about governed Snowflake data and receive answers and visualizations. It leverages Snowflake’s metadata, security, and query engine and does not require the team to manage LLM infrastructure, matching the described use case.

Why the other options are wrong:
- Option B: Cortex LLM functions are powerful for LLM-powered transformations inside SQL, but they require more technical setup and are not a pre-built conversational analytics experience for business users.
- Option C: Snowflake Notebooks are suited for analysts and engineers mixing SQL, Python, and narrative text. They are not a turnkey natural language Q&A interface for non-technical users.
- Option D: Streamlit in Snowflake can be used to build custom interactive apps, but the team would need to design and implement the full natural language interface and logic themselves. Cortex Analyst provides this functionality out of the box.

Question 10

A global retailer is standardizing its analytics platform on Snowflake. They have three main workloads: - Nightly ELT that performs large batch transformations with predictable timing. - High-concurrency BI dashboards accessed by hundreds of users during business hours. - Occasional ad-hoc data science exploration that can be compute-intensive but is used by a small team. The company wants good performance for each workload, cost control when workloads are idle, and minimal data duplication. Which Snowflake architecture best meets these requirements?

A) Use a single large always-on warehouse for all workloads so they share compute and storage efficiently

B) Create three separate warehouses (ELT, BI as a multi-cluster warehouse, and data science) that all access the same shared database, with auto-suspend/resume configured

C) Create separate Snowflake accounts and databases for each workload so they never share compute or data

D) Copy all core tables into three different databases and assign one small warehouse to each database

Show Answer & Explanation

Correct Answer: B

Explanation:

Correct answer (B): Using separate warehouses for each workload leverages Snowflake’s shared data architecture so all workloads access one copy of the data. A multi-cluster warehouse for BI handles high concurrency, while dedicated ELT and data science warehouses isolate heavy batch and exploratory workloads. Auto-suspend/resume provides cost control when workloads are idle, and no data duplication is required.

Why the other options are wrong:
- Option A: Incorrect. A single large always-on warehouse prevents workload isolation, risks contention between workloads, and wastes compute cost when some workloads are idle.
- Option C: Incorrect. Separate accounts and databases fragment governance and require data replication between accounts, which contradicts the goal of minimal data duplication.
- Option D: Incorrect. Copying tables into multiple databases duplicates data and increases maintenance overhead; assigning small warehouses does not solve concurrency needs for BI or isolate intense ELT from other workloads as well as dedicated warehouses.

Ready to Accelerate Your SNOWPRO-CORE-C03 Preparation?

Join thousands of professionals who are advancing their careers through expert certification preparation with FlashGenius.

  • ✅ Unlimited practice questions across all SNOWPRO-CORE-C03 domains
  • ✅ Full-length exam simulations with real-time scoring
  • ✅ AI-powered performance tracking and weak area identification
  • ✅ Personalized study plans with adaptive learning
  • ✅ Mobile-friendly platform for studying anywhere, anytime
  • ✅ Expert explanations and study resources
Start Free Practice Now

Already have an account? Sign in here

About SNOWPRO-CORE-C03 Certification

The SNOWPRO-CORE-C03 certification validates your expertise in snowflake ai data cloud features & architecture and other critical domains. Our comprehensive practice questions are carefully crafted to mirror the actual exam experience and help you identify knowledge gaps before test day.