FlashGenius Logo FlashGenius
AWS SAA-C03 ยท Page 3 of 5

Design High-Performing Architectures

SAA-C03 ยท Domain 3 ยท 24% of Exam

Study with Practice Tests โ†’

Domain 3: Design High-Performing Architectures

The second-largest SAA-C03 domain at 24% of the exam. Covers storage selection, compute optimization, managed databases, content delivery, and data ingestion pipelines.

Domain 3 ยท 24% of Exam ยท 5 Task Statements

SAA-C03 Domain Breakdown

DomainTopicWeight
1Design Secure Architectures30%
2Design Resilient Architectures26%
3Design High-Performing Architectures (This Page)24%
4Design Cost-Optimized Architectures20%

Task Statements

TaskDescriptionKey Services
3.1High-performing and/or scalable storage solutionsS3, EBS, EFS, FSx, Storage Gateway
3.2High-performing and elastic compute solutionsEC2, Batch, EMR, placement groups
3.3High-performing database solutionsAurora, RDS, DynamoDB, DAX, ElastiCache, Redshift
3.4High-performing and/or scalable network architecturesCloudFront, Global Accelerator, Direct Connect, ELB
3.5High-performing data ingestion and transformationKinesis, Glue, Athena, Lake Formation, DataSync

What You'll Master

Storage (Task 3.1)

  • S3 storage classes and lifecycle policies
  • S3 Transfer Acceleration
  • EBS types: gp3, io2, st1, sc1
  • EFS performance and throughput modes
  • FSx variants: Lustre, Windows, ONTAP
  • AWS Storage Gateway

Compute (Task 3.2)

  • EC2 instance families (M, C, R, I, P)
  • Placement groups (Cluster/Spread/Partition)
  • Enhanced networking and ENA
  • Spot, On-Demand, Reserved purchasing
  • AWS Batch for HPC workloads
  • Amazon EMR (Hadoop/Spark)

Databases (Task 3.3)

  • Aurora (Global DB, Serverless v2)
  • RDS Multi-AZ vs Read Replicas
  • DynamoDB: GSI, LSI, Streams, TTL
  • DAX microsecond caching
  • ElastiCache Redis vs Memcached
  • Amazon Redshift for analytics

Networking & Data (Tasks 3.4โ€“3.5)

  • CloudFront origins and behaviors
  • Global Accelerator vs CloudFront
  • Direct Connect vs VPN
  • Kinesis Streams vs Firehose
  • AWS Glue ETL and Data Catalog
  • Athena, Lake Formation, DataSync

Exam Tips: Domain 3 is 24% of the exam. Focus on: (1) Storage class selection โ€” know WHEN each class is appropriate. (2) Database selection โ€” Aurora vs DynamoDB vs RDS vs Redshift use cases. (3) CloudFront vs Global Accelerator distinction โ€” this is a classic distractor pair.

Key Services at a Glance

Amazon S3 Amazon EBS Amazon EFS Amazon FSx AWS Storage Gateway Amazon EC2 AWS Batch Amazon EMR Amazon Aurora Amazon RDS Amazon DynamoDB Amazon DAX Amazon ElastiCache Amazon Redshift Amazon CloudFront AWS Global Accelerator Amazon Kinesis AWS Glue Amazon Athena AWS Lake Formation Amazon QuickSight AWS DataSync

Core Concepts

Eight deep-dive concept areas covering every major service family tested in Domain 3.

1. S3 Storage Classes & Performance

Storage ClassUse CaseRetrievalMin DurationAZs
S3 StandardFrequent access; defaultImmediateNone3+
S3 Standard-IABackups accessed <once/monthImmediate; retrieval fee30 days3+
S3 One Zone-IAReproducible data (thumbnails)Immediate; retrieval fee30 days1
Glacier InstantArchive; quarterly accessMilliseconds90 days3+
Glacier FlexibleArchival; 1โ€“12 hr OK1โ€“12 hours; bulk free90 days3+
Glacier Deep ArchiveRegulatory long-term12โ€“48 hours180 days3+
Intelligent-TieringUnpredictable access patternsImmediate; no retrieval feeNone3+
  • S3 Transfer Acceleration: uses CloudFront edge network for faster uploads from geographically distant clients; enabled per bucket; ideal for international uploads
  • S3 Intelligent-Tiering: small monthly monitoring fee per object; automatically moves between Frequent and Infrequent tiers based on access patterns; no retrieval fees
  • One Zone-IA risk: data is lost if the single AZ fails โ€” only use for reproducible or derived data

2. EBS Volume Types & File Systems

TypeClassMax IOPSMax ThroughputUse Case
gp3SSD16,0001,000 MB/sBoot volumes; default; IOPS independent of size
gp2SSD16,000250 MB/sLegacy; IOPS tied to size (3/GB)
io2 Block ExpressSSD256,0004,000 MB/sCritical DBs (SAP HANA, Oracle); 99.999% durable
io1SSD64,0001,000 MB/sI/O-intensive databases; older generation
st1HDD500 IOPS500 MB/sBig data, data warehouses, log processing
sc1HDD250 IOPS250 MB/sLowest cost; infrequent large data; no boot
  • Key gp3 advantage: IOPS and throughput provisioned independently from capacity โ€” gp2 tied IOPS to disk size (3 IOPS/GB); st1 and sc1 cannot be used as boot volumes
  • EFS: shared NFS file system; concurrent access from thousands of EC2s; auto-scales; multi-AZ; performance modes: General Purpose (default) vs Max I/O; throughput modes: Elastic (auto), Provisioned, Bursting
  • FSx for Lustre: HPC, ML, genomics โ€” hundreds of GB/s, millions of IOPS, native S3 integration
  • FSx for Windows File Server: SMB protocol; Active Directory integration; for Windows-based workloads

3. EC2 Instance Families & Placement Groups

FamilyPrefixesOptimized ForUse Cases
General PurposeM, TBalanced compute/memory/networkWeb servers, dev environments, small DBs
Compute OptimizedCHigh CPU performanceBatch processing, HPC, ML inference, gaming
Memory OptimizedR, XLarge memory-to-compute ratioIn-memory databases, SAP HANA, real-time analytics
Storage OptimizedI, DHigh sequential I/O, NVMeNoSQL databases, data warehousing, distributed file systems
Accelerated ComputingP, G, InfGPU/ML/graphics accelerationDeep learning training, graphics rendering, video encoding
Placement GroupStrategyBenefitLimit
ClusterSame rack, same AZ10 Gbps+ network; lowest latencySingle AZ; rack failure = all fail
SpreadDifferent hardware per instanceReduces correlated failuresMax 7 instances per AZ
PartitionGroups on separate racksFailure isolation per partitionFor Hadoop, Kafka, Cassandra
  • T-series (burstable): accumulate CPU credits; T3 unlimited mode avoids credit exhaustion (charges may apply); ideal for dev/test with variable load
  • Enhanced Networking (ENA): SR-IOV for high PPS, lower latency; up to 100 Gbps; available on most current-gen instances; no extra cost
  • AWS Batch: fully managed batch compute on EC2/Fargate; job queues and compute environments; ideal for genomics, financial modeling, image processing
  • Amazon EMR: managed Hadoop/Spark; use Spot for task nodes (up to 90% savings); EMRFS uses S3 as persistent storage

4. Databases: Aurora & RDS

  • Amazon Aurora: AWS-proprietary MySQL/PostgreSQL-compatible; 5x faster than MySQL, 3x than PostgreSQL; 6 copies of data across 3 AZs (storage-level replication); up to 15 read replicas with auto-failover
  • Aurora Serverless v2: auto-scales in fine-grained ACU increments; ideal for intermittent/unpredictable workloads; pay per ACU-second; supports read and write scaling
  • Aurora Global Database: one primary + up to 5 secondary regions; replication lag <1 second; promote secondary in <1 minute for DR; ideal for global reads
  • RDS Multi-AZ: synchronous standby replica; automatic failover 60โ€“120s; standby NOT readable โ€” HA only, not for read scale
  • RDS Read Replicas: asynchronous replication; up to 15 replicas; same or cross-region; must be manually promoted for failover; use for READ scaling and cross-region DR
  • RDS Proxy: connection pooler; reduces Lambda-to-RDS connection overhead; faster failover; enforces IAM authentication; solves "too many connections" errors from Lambda at scale

Aurora vs RDS decision: Choose Aurora when you need MySQL/PostgreSQL compatibility with higher performance, more replicas, or cross-region active-active. Choose RDS when you need specific engine versions, Oracle, or SQL Server.

5. DynamoDB & DAX

  • DynamoDB: serverless NoSQL key-value and document; single-digit millisecond at any scale; primary key = partition key (+ optional sort key); automatically partitioned across nodes
  • Provisioned mode: set RCU/WCU; auto-scaling available; cheaper for predictable load; read = 4KB per RCU; write = 1KB per WCU
  • On-Demand mode: pay per request; instant scale; no capacity planning; ideal for spiky or unknown workloads
  • GSI (Global Secondary Index): different partition and sort key than base table; has own RCU/WCU; can be added any time; enables flexible queries on non-key attributes
  • LSI (Local Secondary Index): same partition key, different sort key; must be created at table creation; shares capacity with base table; up to 5 per table
  • DynamoDB Streams: captures INSERT/MODIFY/REMOVE events; 24-hour retention; triggers Lambda for event-driven processing; use for replication, aggregation, notifications
  • DAX (DynamoDB Accelerator): in-memory cache; microsecond latency; write-through; no app code changes (drop-in DAX client); ideal for read-heavy or hot-key workloads
  • DynamoDB TTL: automatically deletes expired items; no RCU/WCU consumed for deletion; use for sessions, temporary data, event logs

6. ElastiCache: Redis vs Memcached

FeatureRedisMemcached
Data structuresRich: sorted sets, lists, hashes, bitmaps, geospatialSimple strings only
PersistenceYes (RDB snapshots, AOF)No
Multi-AZ / ReplicationYes; automatic failoverNo replication
Pub/SubYesNo
Multi-threadedNo (single-threaded core)Yes
Use casesLeaderboards, sessions, real-time analytics, pub/subSimple object caching, horizontal scale
  • Exam rule: persistence, replication, complex data structures, pub/sub, Multi-AZ โ†’ Redis. Simple fast caching, multi-threaded โ†’ Memcached
  • Cache-aside (lazy loading): app checks cache โ†’ miss โ†’ query DB โ†’ write to cache โ†’ serve. Most common pattern
  • Write-through: write to cache AND DB simultaneously; cache always current; higher write latency; no stale reads
  • Session caching: store user sessions in Redis/Memcached; stateless app servers scale freely; sessions survive instance termination

7. CloudFront vs Global Accelerator & Networking

FeatureCloudFrontGlobal Accelerator
ProtocolHTTP/HTTPS onlyTCP, UDP (any)
CachingYes โ€” 450+ edge locationsNo caching
IP addressesDynamic (DNS-based)Static Anycast IPs (2)
Network pathInternet + AWS edgeAWS global backbone only
LayerLayer 7Layer 3/4
Use casesCDN, static/dynamic content, Lambda@EdgeGaming, IoT, VoIP, non-HTTP, static IP needs
  • CloudFront behaviors: path patterns map to different origins (e.g., /images/* โ†’ S3, /api/* โ†’ ALB); cache policies control headers/query strings/cookies; Lambda@Edge modifies requests/responses at edge
  • Direct Connect: dedicated private connection from on-prem to AWS; 1 or 10 Gbps; consistent latency; not encrypted by default โ€” add VPN over DX for encryption; weeks to provision
  • VPN vs Direct Connect: VPN = encrypted, over internet, quick setup, variable latency; Direct Connect = unencrypted (unless + VPN), dedicated, consistent latency, compliant for regulated workloads

8. Data Ingestion & Analytics

ServiceTypeKey CapabilityDestination / Use
Kinesis Data StreamsReal-time streamingOrdered; multi-consumer; replay; 24hโ€“365 days retentionLambda, KDA, custom EC2
Kinesis Data FirehoseManaged deliveryBatch & deliver; Lambda transforms; โ‰ฅ60s bufferS3, Redshift, OpenSearch, Splunk
AWS GlueServerless ETLData Catalog; crawlers; Spark/Python jobs; job bookmarksAthena, EMR, Redshift
Amazon AthenaServerless querySQL on S3 (Presto); pay per TB scannedGlue Data Catalog integration
AWS Lake FormationData lake governanceColumn/row-level security; centralized access controlsGlue, Athena, Redshift Spectrum
AWS DataSyncOnline transferOn-prem/other cloud to AWS storage; up to 10 GbpsNFS, SMB โ†’ S3, EFS, FSx
  • Kinesis vs SQS: Kinesis = ordered streaming, multiple consumers, replay, real-time analytics; SQS = message queue, one consumer per message, no replay after deletion, decoupling
  • Athena cost optimization: partition data by date/region; use columnar formats (Parquet, ORC) to reduce TB scanned
  • Glue job bookmarks: track previously processed data for incremental ETL โ€” avoids reprocessing on re-run
  • Amazon Redshift: columnar data warehouse; Redshift Spectrum queries S3 directly; use for OLAP analytics at petabyte scale

Memory Hooks

Six targeted mnemonics to lock in the trickiest Domain 3 distinctions for exam day.

๐Ÿ’พ
EBS Quick Reference
Boot=gp3 ยท DB=io2 ยท Big Sequential=st1 ยท Cheap Archive=sc1
gp3 is the new default โ€” it decouples IOPS from disk size (gp2 did not). io2 Block Express for sub-ms critical DB. st1 for large throughput workloads. sc1 for the lowest cost cold storage. Neither st1 nor sc1 can be used as boot volumes.
๐Ÿ—‚๏ธ
DynamoDB Indexes
GSI = Global (any key, anytime). LSI = Local (same partition, at creation).
GSI = Global โ€” any partition key, any sort key, own RCU/WCU, add at any time. LSI = Local โ€” must share the same partition key, defined only at table creation. Need flexible querying on new attributes? Always add a GSI.
โšก
Redis vs Memcached
Redis = Rich. Memcached = Minimalist.
Redis = Rich features: persistence, replication, pub/sub, sorted sets, Multi-AZ failover. Memcached = Minimalist: pure cache, multi-threaded, no persistence. Exam shortcut: any mention of replication, persistence, or complex data โ†’ Redis.
๐ŸŒ
CloudFront vs Global Accelerator
CloudFront = Content. Global Accelerator = Connections.
CloudFront = HTTP caching at 450+ edge locations (static/dynamic content). Global Accelerator = TCP/UDP routing over AWS backbone, static Anycast IPs, no caching. Not HTTP? Need static IP? โ†’ Global Accelerator.
๐ŸŒŠ
Kinesis vs SQS
Kinesis = Stream (replay). SQS = Queue (one-and-done).
Kinesis = ordered stream, multiple independent consumers, replay available, real-time analytics. SQS = queue, one consumer per message, message deleted after processing, async decoupling. Need replay or multiple consumers? โ†’ Kinesis Data Streams.
๐Ÿ”ต
Aurora Advantage
6 copies ยท 3 AZs ยท 15 replicas ยท <1s cross-region
Aurora's storage layer maintains 6 copies across 3 AZs automatically. Up to 15 read replicas with sub-10ms lag. Aurora Global Database replicates cross-region in <1 second for global reads and sub-minute DR failover. Choose Aurora = MySQL/PostgreSQL + HA.

Practice Quiz โ€” Domain 3

10 scenario-based questions mirroring the SAA-C03 exam style. Select one answer per question then submit to see your score.

1. A company stores images that are accessed frequently for the first 30 days, then rarely accessed after that. They want to minimize costs. Which S3 configuration is BEST?
2. A database workload requires 64,000 IOPS and sub-millisecond latency. Which EBS volume type should be selected?
3. An application needs to cache DynamoDB read results to achieve microsecond latency. The cache must support complex data structures and survive AZ failures. Which solution should be used?
4. A media company needs a file storage system that multiple EC2 instances across different AZs can mount simultaneously and that automatically scales capacity. Which service fits?
5. A company's web application is used globally. Static assets (images, CSS) should be delivered from edge locations, while dynamic API requests should be routed to the origin ALB. Which service handles this?
6. A streaming analytics platform needs to ingest millions of IoT events per second, allow multiple consumers to independently process the same data stream, and replay events from the past 7 days. Which service is correct?
7. A company runs a read-heavy MySQL workload on RDS. They want to offload read traffic while keeping the primary available for writes. The read replicas should be in the same region. What should they implement?
8. A data team needs to run ad-hoc SQL queries against CSV files stored in S3 without loading them into a database. Which service is MOST appropriate?
9. A company needs a content delivery network that also requires a static Anycast IP address for whitelisting by corporate firewalls and serves non-HTTP UDP traffic for their gaming platform. Which service should they use?
10. An application uses Aurora MySQL and experiences slowdowns during peak hours due to too many database connections from thousands of Lambda functions. What is the RECOMMENDED solution?

Flashcards

Click any card to reveal the answer. Eight high-yield concepts for Domain 3.

Tap a card to flip it

S3 Intelligent-Tiering
When should you use it?
Use when access patterns are unpredictable. Automatically moves objects between Frequent and Infrequent tiers. Small monitoring fee per object; no retrieval fees. Eliminates guesswork about which storage class to choose.
gp3 vs gp2 EBS
What is the key difference?
gp3 decouples IOPS from capacity โ€” provision up to 16,000 IOPS independent of disk size. gp2 IOPS scaled with size (3 IOPS/GB). gp3 is cheaper and more flexible โ€” always prefer gp3 for new workloads.
DynamoDB On-Demand vs Provisioned
Which capacity mode fits which workload?
On-Demand = pay per request, instant scale, no planning โ€” best for spiky or unknown workloads. Provisioned = set RCU/WCU, auto-scaling available, cheaper for predictable consistent load.
Kinesis Data Streams vs Kinesis Data Firehose
What is the key distinction?
Streams = real-time, custom consumers, replay available, manage shards manually. Firehose = fully managed delivery to S3/Redshift/OpenSearch, near-real-time (โ‰ฅ60s buffer), no custom consumers, Lambda transforms.
Aurora vs RDS Read Replicas
How do they differ?
Aurora = up to 15 replicas, <10ms replica lag, share same storage cluster as primary, auto-failover promotion. RDS = up to 5โ€“15 replicas (engine-dependent), async replication, manual promotion required.
ElastiCache Write-Through Pattern
How does it work?
On every database write, update the cache simultaneously. Cache always has current data. Slightly higher write latency but no stale reads. Opposite of lazy loading (cache-aside), which only populates on cache miss.
FSx for Lustre
What is the primary use case?
High-performance computing (HPC): ML training, financial simulations, genomics, video rendering. Integrates natively with S3. Delivers hundreds of GB/s throughput and millions of IOPS โ€” the fastest file system on AWS.
CloudFront Signed URLs vs Signed Cookies
Which to use when?
Signed URL = access to a single file (individual download, one object). Signed Cookie = access to multiple files (entire premium section, video streaming library). Signed URLs override cache behaviors; use cookies for subscription content areas.

Study Advisor

Select your current level for a targeted Domain 3 study plan.

Beginners โ€” Build Your Foundation

  • Draw the S3 storage class chart: cost on Y-axis, retrieval speed on X-axis โ€” place each class visually
  • Learn EBS types by use case: boot โ†’ gp3, high IOPS DB โ†’ io2, big sequential reads โ†’ st1, cheap cold storage โ†’ sc1
  • Understand the DynamoDB primary key concept: every query must include the partition key; sort key enables range queries
  • Practice explaining Multi-AZ vs Read Replicas to someone: Multi-AZ = HA with sync standby; Read Replicas = async for read scale
  • Watch an AWS CloudFront demo to visualize edge caching before memorizing the distinctions

Resources

Authoritative AWS documentation and whitepapers for Domain 3 deep dives.

Official AWS Documentation

AWS Whitepapers

Exam Guide

The official AWS Certified Solutions Architect โ€“ Associate (SAA-C03) Exam Guide V1.1 lists all task statements and knowledge areas for each domain. Download it directly from aws.amazon.com/certification to ensure your study plan covers all tested objectives.

Ready to Ace SAA-C03 Domain 3?

Build confidence with full-length practice exams and instant explanations

Start Practicing Free โ†’