AWS AI Practitioner Learning Hub · AIF-C01

Pass AIF-C01
without the overwhelm.

A focused, free prep hub for the AWS Certified AI Practitioner (AIF-C01). Concept walkthroughs, an interactive Bedrock + RAG architecture, 30+ Q&A, a 15-question timed mock exam, hands-on labs, real customer use cases, and full-syllabus cheat-sheet PDFs — all in one page.

Quick Revision PDF

Core Topics

30+

Q & A

Mock MCQs

Hands-on Labs

Use Cases

PDF Guides

AWS · AIF-C01 65 questions (50 scored) 90 minutes 700/1000 to pass $100 exam fee Foundational level

Exam Domain Blueprint

AIF-C01 · 5 domains · weighted by % of scored questions on the exam

D1 · Fundamentals of AI & ML

20%

D2 · Fundamentals of Generative AI

24%

D3 · Applications of Foundation Models

28%

D4 · Guidelines for Responsible AI

14%

D5 · Security, Compliance & Governance

14%

Built & passed by

Deven Kalathiya

DevOps & AI engineer. After clearing CLF-C02, the same notes-first approach got me through AIF-C01 — now packaged here, free, in the format I wish I'd had on day one.

Portfolio GitHub LinkedIn

01 / LEARN

Core Topics

Click any topic to expand. Each panel covers what it is, when AWS expects you to use it, and the exam-relevant details. Read top-to-bottom for a clean walk-through of the entire AIF-C01 syllabus.

TOPIC 01 · DOMAIN 1

Fundamentals of AI & ML

AI > ML > Deep Learning > GenAI hierarchy, learning types, ML lifecycle & SageMaker inference modes.

Foundational20% of exam

TOPIC 02 · DOMAIN 2

Generative AI Fundamentals

Foundation models, tokens, embeddings, vector DBs, transformers vs diffusion vs GANs.

Core24% of exam

TOPIC 03 · BEDROCK

Amazon Bedrock Deep Dive

The star of the exam. Single-API access to FMs, Knowledge Bases, Agents, Guardrails, pricing modes.

CriticalHigh-yield

TOPIC 04 · SAGEMAKER

SageMaker & ML Lifecycle

Data Wrangler, Feature Store, Training Jobs, Endpoints, JumpStart, Clarify, Model Monitor.

HeavyAll 5 domains

TOPIC 05 · DOMAIN 3

Applications of FMs

Customization spectrum: prompt engineering → RAG → fine-tuning → continued pre-training → from scratch.

Highest weight28% of exam

TOPIC 06 · AI SERVICES

Amazon Q & Managed AI

Q Business / Developer / QuickSight / Connect, plus Comprehend, Rekognition, Transcribe, Polly, Translate.

FrequentService mapping

TOPIC 07 · DOMAIN 4

Responsible AI

Fairness, bias, transparency vs explainability, Clarify, Guardrails, Service Cards, Model Cards, A2I.

Conceptual14% of exam

TOPIC 08 · DOMAIN 5

Security & Governance

IAM, KMS, VPC endpoints, CloudTrail, Bedrock Model Invocation Logging, Macie, GenAI Scoping Matrix.

Critical14% of exam

TOPIC 09 · METRICS

Evaluation & Inference Tuning

BLEU, ROUGE, BERTScore, Perplexity, classification metrics. Temperature, Top-p, Top-k.

MemorizePattern-match

Fundamentals of AI & ML

Domain 1 is 20% of AIF-C01. AWS expects you to know the AI hierarchy, the four learning types, the four SageMaker inference modes, and the ML lifecycle stages.

The AI hierarchy (concentric layers)

The umbrella — any machine that simulates human intelligence.

Subset of AI that learns from data rather than following hand-coded rules.

Deep Learning

Subset of ML using neural networks — powers vision, speech, NLP.

Generative AI

Subset of DL that creates new content — text, images, code, audio.

Agentic AI

Systems that plan, reason, and take actions autonomously (e.g., Bedrock Agents).

Types of Machine Learning

Type	What it learns from	AWS examples
Supervised	Labeled data → classification or regression	Comprehend (sentiment), Rekognition, Fraud Detector
Unsupervised	Unlabeled data → clustering, anomaly detection	SageMaker K-Means, Random Cut Forest
Reinforcement	Reward signals from an environment	DeepRacer, SageMaker RL
Self-supervised	Generates its own labels from unlabeled data	How foundation models are pre-trained

SageMaker inference types HIGH-YIELD

Type	Latency	Payload	Use case
Real-time	Low (ms)	Small	Persistent endpoint, always-on traffic (chatbots, fraud)
Serverless	Low–Med	Small	Intermittent / unpredictable traffic, scales to zero
Asynchronous	High	Up to 1 GB	Long processing (up to 1 hr), large payloads, queued
Batch Transform	Highest	Very large	Bulk offline jobs, no endpoint, scheduled

ML lifecycle on AWS

Business problem framing → Data collection (S3, Kinesis) → Data prep (Data Wrangler, Glue) → Feature engineering (Feature Store) → Training (SageMaker Training Jobs) → Evaluation → Deployment (SageMaker Endpoints) → Monitoring (Model Monitor, Clarify).

Key terminology — memorize the contrasts

Bias vs Variance

Bias = systematic error favouring an outcome (detected by SageMaker Clarify). Variance = sensitivity to training-data fluctuations.

Overfitting vs Underfitting

Overfitting: high train accuracy, low test (memorized data). Underfitting: low on both (model too simple).

Hyperparameters vs Parameters

Hyperparameters are set before training (LR, epochs) — tuned via Automatic Model Tuning. Parameters are the weights/biases learned during training.

Exam tip: "Creating new content" → GenAI. "Multi-step workflow / takes actions" → Agentic AI. "Lots of unlabeled domain data" → self-supervised / continued pre-training.

Don't pick ML when… rules are deterministic and well-known, data is insufficient, you need 100% accuracy with no human review, or the cost of errors is unacceptable.

Generative AI Fundamentals

Domain 2 is 24% of the exam. The big building blocks: foundation models, tokens, embeddings, vector DBs, and the flavours of generative architectures.

Foundation Models on Bedrock

A foundation model is a large model pre-trained on massive datasets via self-supervised learning, adaptable to many downstream tasks. Fine-tuning a FM is far cheaper than training from scratch. On Bedrock you'll find:

Anthropic Claude

Strong reasoning & long context (Claude family — Haiku, Sonnet, Opus tiers).

Amazon Titan / Nova

AWS's own FMs — text, embeddings, image; integrated billing & security.

Meta Llama

Open-weight models, customizable.

Mistral · Cohere · AI21

Specialized providers — multilingual, summarization, embeddings.

Stability AI

Image generation (Stable Diffusion family).

LLM building blocks

Concept	Definition
Tokens	Sub-word chunks (~4 chars in English). Bedrock pricing is per input + output token.
Embeddings	Numerical vector representations of text/images. Used for semantic search & RAG. Services: `Titan Embeddings`, `Cohere Embed`.
Vector database	Stores embeddings for similarity search.
Context window	Max tokens a model processes at once.
Temperature	Controls randomness — 0 = deterministic, 1 = creative.

Vector databases on AWS

OpenSearch Serverless

Default for Bedrock Knowledge Bases.

Aurora PostgreSQL

With the pgvector extension.

Neptune Analytics

Graph + vector for relationship-rich queries.

MemoryDB (Redis)

Fastest — in-memory vector search.

DocumentDB

MongoDB-compatible with vector support.

CLASSIC TRAP: DynamoDB is not a vector database. Don't pick it for embedding storage.

Types of generative models

Architecture	What it's used for
Transformers	Attention-based — backbone of most modern LLMs.
GANs	Generative Adversarial Networks — image generation, two-network duel.
VAEs	Variational Autoencoders — data generation from a learned latent space.
Diffusion models	Image generation (Stable Diffusion) — denoise from random noise.

Bedrock pricing modes

On-demand

Pay per token. No commitment. Use for variable / low traffic.

Provisioned throughput

Reserved capacity — cheaper per token at high, predictable volume.

Batch inference

50% cheaper than on-demand for non-urgent jobs.

Smaller models

Cheaper & faster (Claude Haiku < Sonnet < Opus).

GenAI limitations to recognize

Hallucinations (confident but false — mitigate with RAG/grounding/Guardrails), knowledge cutoff (mitigate with RAG), cost (long contexts add up), latency (bigger models = slower), bias (reflects training data), non-determinism (same input → different output when temperature > 0).

Amazon Bedrock — The Star of the Exam

Bedrock is fully managed and serverless, providing single-API access to multiple FMs with no infrastructure to manage. If a question says "no infrastructure", "fully managed", or "API-only", the answer is almost always Bedrock.

Bedrock features you must know

Knowledge Bases

Managed RAG — connects FMs to your S3 data, handles chunking → embeddings → retrieval automatically.

Agents

Multi-step task execution. FM + Action Groups (Lambda / OpenAPI) + Knowledge Bases + Memory.

Guardrails

Filter harmful content, block topics, redact PII, contextual grounding checks.

Model Evaluation

Compare FMs for your specific use case — automated + human evaluation.

Custom Models

Fine-tune or do continued pre-training.

Provisioned Throughput

Reserved capacity for predictable workloads.

Bedrock vs SageMaker JumpStart CLASSIC TRAP

Aspect	Amazon Bedrock	SageMaker JumpStart
Infrastructure	Fully managed, serverless	You deploy on YOUR SageMaker
Access	Single API call	Deploy endpoint first
Control	Limited, abstracted	Full (instances, hyperparams)
Use case	Fast, hands-off GenAI	Deep customization needed

Trigger phrases: "no infrastructure" → Bedrock. "Full control" / "custom infrastructure" → JumpStart.

Bedrock data privacy KEY FACT

Customer prompts and outputs on Bedrock are NOT used to train base FMs. Your data stays yours. Bedrock is HIPAA-eligible, SOC compliant, GDPR-ready, PCI DSS supported.

SageMaker & the ML Lifecycle

SageMaker is the all-in-one ML platform — every lifecycle stage has a SageMaker tool. The exam tests whether you can match a stage to the right service.

SageMaker tools by lifecycle stage

Stage	Service	What it does
Data prep	Data Wrangler	Visual data prep — 300+ built-in transforms, no code needed.
Feature engineering	Feature Store	Centralized store for features — share across training & inference.
Training	Training Jobs	Managed training on the instance type you pick.
Hyperparameter tuning	Automatic Model Tuning	Bayesian / random search across hyperparameter ranges.
Pre-built models	JumpStart	Deploy open-source FMs & hundreds of pretrained models.
Deployment	Endpoints	Real-time / serverless / async / batch — see Domain 1 panel.
Bias & explainability	Clarify	Detects bias in data & trained models. Provides SHAP explainability.
Drift monitoring	Model Monitor	Detects data drift & concept drift in production.
Human-in-loop	Augmented AI (A2I)	Sends low-confidence predictions to human reviewers.
Model documentation	Model Cards	You document YOUR custom models — training data, intended use, etc.

Pattern matching cheat-sheet

"Detect bias / feature importance" → SageMaker Clarify (SHAP values).

"Data drift / model degradation" → SageMaker Model Monitor.

"Low confidence + human review" → Amazon A2I.

"Train from a known good starting point" → SageMaker JumpStart.

Applications of Foundation Models

Domain 3 is the highest-weighted domain at 28%. Master the customization spectrum and prompt engineering — they together cover most of the questions.

FM customization spectrum (cheapest → most expensive)

Method	Cost	Use when…
Prompt engineering	Free	Simple tasks, no special data
RAG	Cheap	Private/recent factual data, reduce hallucinations
Fine-tuning (instruction)	Moderate	Specific style/format/tone, with labeled examples
Domain adaptation FT	Moderate	Limited domain-specific labeled data
Continued pre-training	Expensive	Lots of UNLABELED domain data
Train from scratch	V. expensive	Rarely needed — only for fundamentally new architectures

Decision rule: Style/format/tone → Fine-tuning. Knowledge/facts → RAG. Domain expertise + huge unlabeled corpus → Continued pre-training.

TRAP: AWS prefers RAG when data is "private", "recent", or "frequently updated". Don't pick fine-tuning unless the question emphasizes style/tone or labeled examples.

RAG — the 5-step pipeline

Ingest documents from a source (S3, Confluence, SharePoint, web crawler).
Chunk into smaller pieces.
Embed chunks into vectors (Titan/Cohere Embeddings).
Store in a vector DB (OpenSearch Serverless, etc.).
Retrieve & Generate: query → similarity search → top chunks injected into prompt → FM answers.

Bedrock Knowledge Bases handles all 5 steps automatically. Connects to S3, Confluence, Salesforce, SharePoint, web crawler.

Prompt engineering techniques

Zero-shot

Ask the model with no examples — relies entirely on its pretraining.

Few-shot / single-shot

Provide 1+ examples in the prompt to anchor format and style.

Chain-of-thought (CoT)

Instruct the model to reason step by step. Boosts math/logic dramatically.

Prompt templates

Reusable structures with placeholders — used in Knowledge Bases & Agents.

Prompt risks & mitigation

Prompt injection (malicious user input overrides system instructions), prompt leaking (tricking the model into revealing its system prompt), jailbreaking (bypassing guardrails), poisoning (harmful data during fine-tuning), hijacking (taking control of model behaviour).

Mitigation: Bedrock Guardrails, input validation, content filtering.

Inference parameters

Parameter	Effect
Temperature (0–1)	Randomness. Low = factual, deterministic. High = creative.
Top-p (nucleus)	Considers tokens whose probabilities sum to p.
Top-k	Considers the top-k most likely tokens.
Max tokens	Caps the output length.
Stop sequences	Strings that halt generation when emitted.

Bedrock Agents

Agents enable LLMs to take actions, not just generate text. Components:

Foundation model

The brain (e.g., Claude, Titan).

Instructions

System prompt defining the agent's role & goals.

Action Groups

Lambda functions or OpenAPI schemas the agent can call.

Knowledge Bases

Optional RAG sources to ground responses.

Memory

Optional, multi-turn context retention.

Pattern: "Multi-step task + take actions + use private data" → Bedrock Agents.

Amazon Q & Managed AI Services

The exam tests whether you can match an AI use case to the right pre-built AWS service. Memorize the Q variants and the classic managed-AI line-up.

Amazon Q variants

Q Business

Enterprise GenAI assistant connecting to company data (SharePoint, Salesforce, S3). Uses IAM Identity Center for user identity.

Q Developer

Coding assistant in IDEs (was CodeWhisperer). VS Code, JetBrains, AWS console.

Q in QuickSight

BI / analytics natural-language queries against your data.

Q in Connect

Contact-center agent assistant. Real-time recommendations during calls.

Managed AI services — pick the right one

Use case	Service
Sentiment, entity, key-phrase, language detection in text	Amazon Comprehend
Image & video analysis (faces, labels, text in images, moderation)	Amazon Rekognition
Speech → text	Amazon Transcribe
Text → speech	Amazon Polly
Translation between languages	Amazon Translate
Extract text + tables + forms from documents	Amazon Textract
Detect online fraud (account takeover, payment fraud)	Amazon Fraud Detector
Personalised recommendations	Amazon Personalize
Forecast time-series data	Amazon Forecast (now via SageMaker Canvas)
Conversational chatbots	Amazon Lex
Healthcare entity extraction (HIPAA-eligible)	Amazon Comprehend Medical
Industrial vision quality inspection	Amazon Lookout for Vision

Pattern: "Code assistant in IDE" → Q Developer. "Contact center agent assist" → Q in Connect. "Search the company knowledge base in natural language" → Q Business.

Responsible AI

Domain 4 is 14%. Mostly conceptual — know the 8 responsible-AI principles, the transparency vs explainability distinction, and which AWS service implements each.

The 8 responsible-AI principles

Fairness

AI treats all groups equitably.

Explainability

Can describe HOW a model reached a decision.

Privacy & Security

Protect data used in & by AI.

Safety

AI behaves reliably and avoids harm.

Controllability

Humans can oversee, intervene, override.

Veracity & Robustness

Accurate outputs, resilient to attacks.

Governance

Policies, accountability, documentation.

Transparency

Openness about what the AI does.

Transparency vs Explainability CLASSIC TRAP

Aspect	Transparency	Explainability
Focus	The whole system	A specific prediction
Question answered	What data, what model, what limits, who built it?	Which features contributed to this output and how?
Form	Documentation (Model Cards, Service Cards)	Feature attribution (SHAP, LIME)
Memory aid	Visible from outside (glass box)	Model explains its reasoning

AWS tools for Responsible AI

SageMaker Clarify

Detects bias in data & trained models. Provides explainability via SHAP values. THE answer for bias-detection & feature-importance questions.

Bedrock Guardrails

Content filters (hate, violence, sexual, insults), denied topics, word filters, PII redaction, contextual grounding checks. Implements safety for GenAI.

AI Service Cards

Documentation about AWS-managed AI services (Rekognition, Textract, Bedrock FMs). Implements transparency.

SageMaker Model Cards

YOU document YOUR custom models. Customer equivalent of AI Service Cards.

Amazon A2I

Adds human review to AI predictions for low-confidence cases. Implements controllability.

SageMaker Model Monitor

Detects data & concept drift in production. Implements veracity & robustness.

Model drift

Data drift: input distribution changes (new product categories appear). Concept drift: the relationship between inputs and outputs changes (what counts as fraud evolves). Mitigation: SageMaker Model Monitor → detect drift → trigger retraining.

Sources of bias & mitigation

Sources: data, algorithmic, sampling, confirmation, measurement bias.

Mitigation: diverse balanced datasets, audit across demographic slices, SageMaker Clarify, fairness metrics (demographic parity, equalized odds), continuous post-deployment monitoring.

Legal context: GDPR (right to explanation), CCPA, HIPAA, EU AI Act, copyright concerns for GenAI training data & outputs.

Security, Compliance & Governance

Domain 5 is 14%. AWS reuses its standard security stack (IAM, KMS, VPC, CloudTrail) and adds a few AI-specific pieces (Bedrock Model Invocation Logging, Guardrails, the GenAI Scoping Matrix).

Core security services for AI workloads

Service	What it does for AI
IAM	Least privilege; use roles (not long-term keys) for Bedrock/SageMaker/Q. SCPs for org guardrails. IAM Identity Center → user identity for Q Business.
KMS	Encryption at rest for S3, EBS, SageMaker volumes. Customer-managed keys (CMKs) supported.
Secrets Manager	Stores API keys, DB credentials.
VPC + VPC Endpoints (PrivateLink)	Access Bedrock / SageMaker / Q WITHOUT going over the public internet.
SageMaker VPC mode	Run training/inference inside your own VPC.
CloudTrail	Logs all API calls — who invoked which model, when.
CloudWatch	Metrics & logs (latency, errors, token usage).
AWS Config	Tracks configuration changes & compliance.
Bedrock Model Invocation Logging	Logs PROMPTS and RESPONSES to S3 / CloudWatch — required if the question says "audit prompts AND responses".
Macie	Detects PII / sensitive data in S3 training corpora.
Lake Formation / Glue Catalog / DataZone	Fine-grained access & governance over data lakes.

Bedrock data privacy KEY FACT

Customer prompts and outputs on Bedrock are NOT used to train base FMs. Your data stays yours. Bedrock is HIPAA-eligible, SOC, GDPR-ready, PCI DSS supported.

GenAI Security Scoping Matrix (5 scopes)

Scope	What it is	Control vs responsibility
Scope 1	Consumer apps (e.g., public ChatGPT)	Least control, least responsibility
Scope 2	Enterprise SaaS with GenAI features	↓
Scope 3	Pre-trained FMs (Bedrock on-demand)	↓
Scope 4	Fine-tuned FMs with your data	↓
Scope 5	Self-trained models from scratch	Most control, most responsibility

AI-specific security risks

Prompt injection (mitigate with Bedrock Guardrails), data leakage through prompts (users pasting confidential info), model theft / extraction (adversaries copying via API), training-data poisoning (corrupted data biases the model), membership inference (determining if specific data was in training set), PII exposure in outputs.

Compliance & cost-governance tools

AWS Artifact (SOC, ISO, HIPAA, PCI DSS, GDPR reports), Audit Manager, AWS Config, SCPs (org-wide), AWS Budgets (cost alerts), Cost Explorer, tagging by project, provisioned vs on-demand, Bedrock Batch (50% cheaper).

Shared Responsibility for AI

AWS — security OF the cloud

Infrastructure, FM availability, hypervisor, physical security.

Customer — security IN the cloud

IAM policies, prompts, fine-tuning data, Guardrails configuration, output handling.

Pattern: "Without exposing data to public internet" → VPC Endpoints / PrivateLink. "Detect PII in S3 training data" → Macie. "Audit prompts AND responses" → Bedrock Model Invocation Logging (NOT just CloudTrail).

Evaluation Metrics & Inference Tuning

Easy points if you memorize them. The exam loves single-word triggers ("translation" → BLEU; "summarization" → ROUGE).

Generative model evaluation metrics

Metric	Used for
BLEU	Translation quality
ROUGE	Summarization quality
BERTScore	Semantic similarity using BERT embeddings
Perplexity	How well a language model predicts a sample (lower = better)
Human evaluation	Gold standard, expensive — best for nuanced quality
Bedrock Model Evaluation	Automated + human evaluation service for comparing FMs

Classification metrics (supervised ML)

Metric	What it measures
Accuracy	Overall correctness — misleading on imbalanced data.
Precision	Of predicted positives, how many were correct? Use when false positives are costly.
Recall	Of actual positives, how many did we catch? Use when false negatives are costly (e.g., disease detection).
F1	Harmonic mean of precision & recall.
AUC-ROC	Trade-off across thresholds.

Choosing a foundation model — selection factors

Capability (reasoning / coding / multilingual / vision), cost per token, latency, context window, modalities (text vs multimodal), customization support, regional availability.

Business metrics for AI

Beyond model quality: task completion rate, user satisfaction (CSAT), cost per inference, time to resolution, conversion rate, error/hallucination rate, ROI.

02 / ARCHITECTURE

Interactive AWS AI/ML Architecture

Click any component below to see what it does, where it sits in a typical Bedrock + RAG production stack, and which AIF-C01 questions it answers. Top-down: data sources → embedding pipeline → vector store → FM serving → app, with security and observability cutting across every layer.

USER & CLIENT LAYER

End User / App i

Amazon Q / Custom UI i

↓natural language prompt

FOUNDATION MODEL LAYER

Amazon Bedrock i

Bedrock Agents i

Bedrock Guardrails i

↓retrieval & tool calls

RETRIEVAL & KNOWLEDGE

Knowledge Bases i

OpenSearch Serverless i

Aurora pgvector i

↓training data & raw documents

DATA & ML PIPELINE

S3 (data lake) i

SageMaker i

SageMaker Clarify i

Amazon A2I i

↓cross-cutting: every prompt and response is governed

SECURITY · OBSERVABILITY · GOVERNANCE

IAM & Identity Center i

KMS Encryption i

VPC Endpoints i

Model Invocation Logging i

Macie (PII) i

Why this matters: AIF-C01 questions almost always describe a scenario and ask you to pick the right service. If you internalize this layered model — User → FM → Retrieval → Data → Cross-cutting Security/Observability — you can map most questions to the right answer in seconds.

03 / Q & A

Exam-Style Q & A

30 questions in AIF-C01 style, mixed across all five domains. Click to expand. Filter by difficulty.

What's the difference between AI, ML, Deep Learning, and Generative AI?Easy▼

AI is the umbrella — any machine that simulates human intelligence. ML is a subset of AI that learns from data. Deep Learning is a subset of ML using neural networks. Generative AI is a subset of DL that creates new content. Agentic AI sits on top of GenAI — systems that plan, reason, and take actions autonomously.

What are the four types of machine learning?Easy▼

Supervised (labeled data → classification or regression — Comprehend, Rekognition, Fraud Detector). Unsupervised (no labels → clustering, anomaly detection — K-Means, Random Cut Forest). Reinforcement (rewards — DeepRacer, SageMaker RL). Self-supervised (generates own labels — how foundation models are pre-trained).

Bedrock vs SageMaker JumpStart — when do you pick which?Easy▼

Bedrock = fully managed, serverless, single API call, no infra to manage. Use when you want fast, hands-off GenAI. SageMaker JumpStart = you deploy on YOUR SageMaker infrastructure, full control over instances and hyperparameters. Use when you need deep customization. Trigger phrases: "no infrastructure" → Bedrock; "full control" → JumpStart.

What is RAG and what problem does it solve?Easy▼

Retrieval Augmented Generation grounds an LLM's responses in your private/recent data without retraining. Pipeline: (1) ingest documents from S3, (2) chunk, (3) embed into vectors, (4) store in a vector DB, (5) at query time retrieve top chunks + inject into prompt + FM answers. Solves hallucinations and stale knowledge cutoff. Bedrock Knowledge Bases handles all 5 steps automatically.

Name the four SageMaker inference types and one use case for each.Easy▼

Real-time — persistent endpoint, low ms latency (chatbots, fraud). Serverless — intermittent traffic, scales to zero. Asynchronous — long processing up to 1 hr, payloads up to 1 GB, queued. Batch Transform — bulk offline jobs, no endpoint, scheduled.

Which AWS service detects bias in a trained model?Easy▼

SageMaker Clarify. It detects bias in both data and trained models, and provides explainability via SHAP values (feature attribution). It's the answer for any "detect bias" or "feature importance" exam question.

What is a foundation model?Easy▼

A large model pre-trained on massive datasets via self-supervised learning, adaptable to many downstream tasks. FMs can be fine-tuned far cheaper than training from scratch. On Bedrock you'll find Anthropic Claude, Amazon Titan/Nova, Meta Llama, Mistral, Cohere, AI21 Jurassic, and Stability AI.

What is a prompt and what is a token?Easy▼

A prompt is the input text you send to a model. A token is a sub-word chunk (~4 characters in English) — both prompts and outputs are measured in tokens, and Bedrock pricing is per input + output token.

Which Q variant assists developers in their IDE?Easy▼

Amazon Q Developer (formerly CodeWhisperer). Other variants: Q Business (enterprise data assistant), Q in QuickSight (BI/analytics natural-language), Q in Connect (contact-center agent assist).

What is the AWS exam fee, format, and pass score for AIF-C01?Easy▼

$100 USD. 65 questions (50 scored + 15 unscored), 90 minutes, pass mark 700/1000 on a compensatory model. Validity: 3 years. Online proctored or test centre.

When would you use fine-tuning vs RAG?Medium▼

Fine-tuning when you need to teach a model a specific style, format, or tone with labeled examples. RAG when you need it to answer based on private, recent, or frequently updated facts. AWS prefers RAG when data is private/recent — don't pick fine-tuning unless the question emphasizes style/tone or labeled examples. Decision rule: Style/tone → FT. Knowledge/facts → RAG.

What's the difference between transparency and explainability?Medium▼

Transparency = openness about the whole system (what data, what model, what limits, who built it) — documentation-focused (Service Cards, Model Cards). Explainability = ability to explain a specific prediction (which features contributed, how) — feature attribution like SHAP. Memory aid: transparency = glass box from outside; explainability = the model explains its own reasoning.

Which vector databases does AWS support for storing embeddings?Medium▼

OpenSearch Serverless (default for Bedrock Knowledge Bases), Aurora PostgreSQL with pgvector, Neptune Analytics, MemoryDB (Redis-based, fastest), and DocumentDB. Trap: DynamoDB is NOT a vector database — don't pick it for embedding storage.

Name 5 capabilities of Bedrock Guardrails.Medium▼

(1) Content filters (hate, violence, sexual, insults), (2) denied topics (custom blocked subjects), (3) word filters (block specific words/profanity), (4) PII redaction (mask SSNs, credit cards, etc.), (5) contextual grounding checks (detect hallucinations vs source documents).

What is the GenAI Security Scoping Matrix?Medium▼

AWS defines 5 scopes based on how you use FMs (more control = more responsibility): Scope 1 Consumer apps (e.g., public ChatGPT) — least control. Scope 2 Enterprise SaaS with GenAI features. Scope 3 Pre-trained FMs (Bedrock on-demand). Scope 4 Fine-tuned models. Scope 5 Self-trained from scratch — most control + responsibility.

Which evaluation metric do you use for translation? For summarization?Medium▼

BLEU for translation. ROUGE for summarization. Other generative metrics: BERTScore (semantic similarity), Perplexity (how well an LM predicts a sample — lower = better), and human evaluation (gold standard, expensive).

What does temperature do in an LLM, and how do top-p and top-k differ?Medium▼

Temperature (0–1) controls randomness — 0 = deterministic, 1 = creative. Top-p (nucleus sampling) considers tokens whose probabilities sum to p. Top-k considers only the top-k most likely tokens. You usually pick one of top-p or top-k, not both.

What is data drift vs concept drift, and which AWS service detects it?Medium▼

Data drift: the input distribution changes (new product categories appear). Concept drift: the relationship between inputs and outputs changes (what counts as fraud evolves). SageMaker Model Monitor detects both and can trigger retraining.

How do you keep Bedrock traffic off the public internet?Medium▼

Use VPC Endpoints (PrivateLink). They let you reach Bedrock, SageMaker, and Q from inside a VPC without traversing the internet. Pattern: "without exposing data to public internet" → VPC Endpoints / PrivateLink.

Which AWS service detects PII in S3 training data?Medium▼

Amazon Macie. It uses ML to identify, classify, and protect sensitive data (PII, financial info) stored in S3 buckets — the right answer when a question mentions "detect PII" or "sensitive data discovery" in S3.

Walk through the components of a Bedrock Agent.Hard▼

An Agent enables an LLM to take actions, not just generate text. Components: Foundation model (the brain — e.g., Claude); Instructions (system prompt defining the agent's role); Action Groups (Lambda functions or OpenAPI schemas the agent can call); Knowledge Bases (optional RAG sources to ground responses); Memory (optional, multi-turn context). Pattern: "multi-step task + take actions + use private data" → Bedrock Agents.

If a customer wants to log every prompt and every response for auditing, which AWS service do you recommend?Hard▼

Bedrock Model Invocation Logging — it logs prompts AND responses to S3 or CloudWatch Logs. CloudTrail alone only logs the API call metadata (who/when/which model), not the prompt/response content. The combination of CloudTrail + Model Invocation Logging gives a complete audit trail.

A bank wants a chatbot that answers from internal policy PDFs that change weekly. What AWS approach do you recommend?Hard▼

RAG via Bedrock Knowledge Bases. Store the PDFs in S3, point a Knowledge Base at the bucket, pick OpenSearch Serverless as the vector store, and call Retrieve and Generate from a Bedrock model. RAG (not fine-tuning) because the data is private and changes frequently. Add Bedrock Guardrails for safety and use VPC endpoints to keep traffic private.

When should you choose continued pre-training over fine-tuning?Hard▼

When you have a large unlabeled corpus of domain-specific text (e.g., thousands of medical or legal documents) and want the model to absorb domain language and concepts. Fine-tuning wants labeled input/output pairs to teach style or specific tasks. Continued pre-training is more expensive and shifts the base behaviour; pick it only when prompt engineering and RAG aren't enough.

Customer prompts on Bedrock — are they used to train the base FMs?Hard▼

No. Customer prompts and outputs on Bedrock are NOT used to train base FMs. Your data stays yours. Bedrock is HIPAA-eligible, SOC compliant, GDPR-ready, PCI DSS supported. This is a key fact AWS expects you to know.

How do you detect and respond to model drift in production?Hard▼

SageMaker Model Monitor compares inference traffic to a baseline you established at deployment. It detects data quality drift, model quality drift, bias drift, and feature attribution drift. When it fires, route to CloudWatch alarms / EventBridge → trigger an automated retraining pipeline (typically a SageMaker Pipelines job that re-runs training, evaluation, and conditional deployment).

Compare Bedrock pricing modes and when to pick each.Hard▼

On-demand — pay per token, no commitment, variable workloads. Provisioned throughput — reserved capacity, cheaper per token, for high & predictable volume. Batch inference — 50% cheaper than on-demand for non-urgent jobs. Smaller models (Claude Haiku vs Opus) — cheaper & faster for simpler tasks. Pick based on volume, latency tolerance, and budget.

What is the difference between Service Cards and Model Cards?Hard▼

AWS AI Service Cards — AWS publishes them for managed AI services (Rekognition, Textract, Bedrock FMs). They cover intended use, limitations, performance, fairness. They implement transparency. SageMaker Model Cards — YOU create them for YOUR custom models, documenting training data, intended use, evaluation, ethical considerations. They are the customer equivalent of AI Service Cards.

What's the role of Amazon A2I in a responsible AI workflow?Hard▼

Amazon Augmented AI (A2I) sends low-confidence predictions from Rekognition, Textract, or your custom models to human reviewers. This implements controllability and safety in the responsible-AI principles — humans can intervene before a low-confidence output is acted on. It comes with built-in workflows for Rekognition and Textract, plus custom workflows for your own models.

A solution must use a private FM, store embeddings, retrieve company policies, and log every interaction to satisfy audit. List the AWS services.Hard▼

Bedrock (private FM via API; data not used for training), Bedrock Knowledge Bases (RAG over your S3 corpus), OpenSearch Serverless (vector store), S3 (raw documents) with KMS at-rest encryption, VPC endpoints / PrivateLink (keep traffic off the public internet), Bedrock Model Invocation Logging (capture prompts & responses), CloudTrail (API audit), and Bedrock Guardrails (safety filters).

Mock Exam

15 multiple-choice questions in AIF-C01 style. Timed at 22 minutes (same per-question pace as the real 65-question, 90-minute exam). Pass mark: 70%. You'll see an explanation after each question.

Ready to test yourself?

A mix of easy, medium, and hard questions sampled across all five AIF-C01 domains: AI/ML Fundamentals, Generative AI, Applications of FMs, Responsible AI, and Security. You'll get an explanation after each question and a full breakdown at the end.

15 questions

22 minutes

70% to pass

Hands-On Labs

Six guided labs you can run in the AWS Free Tier (most use Bedrock — request model access first in the console). Console click-paths and AWS CLI snippets side by side. The console click-paths are stable; the CLI snippets show the equivalent API calls.

Lab 01 · Your first Bedrock prompt

Enable model access for Anthropic Claude on Bedrock and send your first prompt from the Playground and the CLI.

Console: Bedrock → Model access → Manage model access → request Anthropic Claude (approval is usually instant).
Playground: Bedrock → Playgrounds → Chat → pick Claude → "Explain RAG in 3 sentences".
CLI: aws bedrock-runtime invoke-model --model-id anthropic.claude-3-haiku-20240307-v1:0 --body '{"messages":[{"role":"user","content":"Explain RAG"}],"max_tokens":256,"anthropic_version":"bedrock-2023-05-31"}' out.json
Inspect out.json — note the input_tokens and output_tokens fields (this is what you're billed on).

Bedrock · Free with low usage

Lab 02 · Build a RAG app with Knowledge Bases

Index a folder of PDFs in S3, point a Bedrock Knowledge Base at the bucket, and query it.

Upload 5–10 PDFs to s3://your-aiprep-bucket/docs/.
Console: Bedrock → Knowledge Bases → Create. Pick S3 as data source, OpenSearch Serverless as vector store, Titan Embeddings as the embedding model.
Sync the data source — embeddings are generated automatically.
Test: Test knowledge base → Retrieve and generate and ask a question grounded in the PDFs.
Click "Show source details" to see which chunks were retrieved — that's your RAG working.

Bedrock + S3 + OpenSearch · ~$0.50/day for tiny corpus

Lab 03 · Build a Bedrock Agent

Create an agent that calls a Lambda function as an action group — multi-step task with real action.

Write a Python Lambda get_weather(city) that returns a stub response.
Define an OpenAPI schema describing the action.
Console: Bedrock → Agents → Create. Pick Claude as the FM, paste the OpenAPI schema, link the Lambda.
Test the agent with: "What's the weather in Mumbai today?" — observe it call the Lambda.
Optional: attach a Knowledge Base so the agent can ground in your S3 docs too.

Bedrock + Lambda · Pay per invocation

Lab 04 · Configure Bedrock Guardrails

Block sensitive topics and redact PII from FM outputs.

Console: Bedrock → Guardrails → Create guardrail.
Set content filters (hate, violence, sexual, insults) to High.
Add a denied topic: "Investment advice" with examples.
Enable PII filters (block emails, mask credit-card numbers).
Test in the Guardrail playground with prompts that should trigger each filter; observe the redacted output.
Reference the guardrail ID in your InvokeModel calls to apply it at runtime.

Bedrock · Free to create, pay per request

Lab 05 · Bias detection with SageMaker Clarify

Run a Clarify processing job on a tabular dataset to surface bias and feature importance.

Upload the UCI Adult Income CSV to s3://your-aiprep-bucket/clarify/.
Train a simple XGBoost model in SageMaker JumpStart (or use a pre-trained one).
Configure a Clarify processing job — set facet_name=sex and label_values_or_threshold=1.
Run pre-training bias analysis (Class Imbalance, DPL) and post-training (DPPL, AD).
Open the Clarify report — note SHAP feature attributions.

SageMaker · ~$5 for an end-to-end run

Lab 06 · Drift detection with Model Monitor

Deploy a model, capture inference data, and create a monitoring schedule.

Deploy a SageMaker real-time endpoint with data_capture_config enabled (writes to S3).
Generate baseline statistics from your training set with DefaultModelMonitor.
Schedule an hourly Model Monitor job that compares captured inferences to the baseline.
Send some out-of-distribution traffic to the endpoint to trigger drift.
Inspect the violations report in S3 and set a CloudWatch alarm on the metric.

SageMaker · Endpoint runs charge hourly — clean up after

Real-World Use Cases

Eight production patterns AWS likes to test as scenarios. Read each as: what the customer wants → which AWS services solve it → why.

FinServ Bank

Internal Q&A over private docs

Scenario: 10,000 internal policy and procedure PDFs, updated weekly. Employees waste hours searching SharePoint. The bank wants a chatbot that answers from the latest documents — without retraining a model every week.

AWS solution: PDFs land in S3. Bedrock Knowledge Bases ingests, chunks, embeds (Titan Embeddings) and stores vectors in OpenSearch Serverless. Bedrock (Claude) answers via RetrieveAndGenerate. Guardrails redact PII; VPC endpoints keep traffic off the public internet. RAG (not fine-tuning) because data is private and changes constantly.

BedrockKnowledge BasesOpenSearch ServerlessS3GuardrailsVPC Endpoints

Telco Inc

Contact-centre agent assist

Scenario: Live agents handle thousands of calls a day. Average handle-time is too high; new hires struggle to find the right policy or upsell offer. Need real-time recommendations during the call.

AWS solution: Calls flow through Amazon Connect. Q in Connect listens, retrieves the right answer from the knowledge base, and surfaces it to the agent's screen as the customer talks. Comprehend tags real-time sentiment so a supervisor sees escalations early. Transcribe writes call transcripts to S3 for QA.

Q in ConnectAmazon ConnectComprehendTranscribeBedrock

DevHouse

Coding assistant for engineers

Scenario: 200 engineers split across VS Code and JetBrains IDEs. The CTO wants AI code suggestions that are aware of internal libraries — and SSO-controlled per team.

AWS solution: Amazon Q Developer (formerly CodeWhisperer) plugs straight into both IDEs. IAM Identity Center federates SSO and gates access by team. Q Developer suggests code, explains existing functions, writes unit tests, and flags security issues — all per-license, no infra to operate.

Q DeveloperIAM Identity CenterCloudTrail

PaySmart

Real-time fraud detection

Scenario: A payments fintech needs to score every transaction in < 100 ms. They have history but limited fraud-labelled data. They don't want to train a custom ML model from scratch.

AWS solution: Transactions stream into Kinesis Data Streams. Lambda consumers call Amazon Fraud Detector (purpose-built managed ML, pre-trained on Amazon's own data plus yours). Scores write to DynamoDB for the rules engine; alerts to EventBridge → SNS. No SageMaker tuning required.

Fraud DetectorKinesisLambdaDynamoDBEventBridge

AP Auto

Invoice IDP pipeline

Scenario: An accounts-payable team receives thousands of invoices weekly — mix of scanned PDFs, emailed photos, and supplier portals. Manual data entry is slow and error-prone.

AWS solution: Invoices land in S3. Amazon Textract extracts forms and tables with confidence scores; rows above threshold flow to DynamoDB. Below-threshold rows go to Amazon A2I for human review — text-book Controllability. Bedrock (Claude) optionally normalizes free-text "memo" fields. EventBridge orchestrates the pipeline.

TextractA2IBedrockDynamoDBLambdaEventBridge

GlobalSaaS

Multilingual feedback analysis

Scenario: A global SaaS company collects product reviews in 12 languages. PMs need a single dashboard showing sentiment, themes, and trends — without hiring 12 translators.

AWS solution: Reviews stream into S3. Amazon Translate normalizes them to English. Comprehend extracts sentiment, entities, and key-phrases. Aggregates land in QuickSight; Q in QuickSight lets PMs ask "what are users saying about checkout in Brazil this month?" in plain English.

TranslateComprehendQuickSightQ in QuickSightS3

SocialPlat

Content moderation at scale

Scenario: A social platform accepts millions of image and video uploads daily. Need to flag policy-violating content fast, but with human review for grey areas — and document the responsible-AI process for regulators.

AWS solution: Uploads write to S3. EventBridge triggers Lambda which calls Rekognition (content moderation labels: Explicit, Suggestive, Violence, etc.). High-confidence violations auto-block; ambiguous items go to A2I for human reviewers. SageMaker Model Cards document model behaviour, AI Service Cards cover Rekognition transparency.

RekognitionA2IEventBridgeLambdaModel Cards

FreshMart

Demand forecasting per SKU

Scenario: A grocery chain needs weekly demand forecasts for each SKU at each store. The merchandising team has business analysts but no in-house ML engineers.

AWS solution: Historical sales sit in S3. SageMaker Canvas lets the analyst build a no-code time-series forecast (or Amazon Forecast for a fully managed alternative). EventBridge re-runs the pipeline weekly. QuickSight visualises by region and category. Model Monitor watches for data drift if customer behaviour shifts.

SageMaker CanvasAmazon ForecastEventBridgeQuickSightModel Monitor

Cheat Sheet

Grab the PDFs to revise offline, then memorize the decision patterns below — they'll solve a big chunk of the exam by pattern-matching alone.

Download the PDF guides

Quick Revision · 4 pages

AWS AI Practitioner — Quick Revision Cheat Sheet

All 5 domains in one short read · ~9 KB

Detailed Guide · 12 pages

AWS AI Practitioner — Detailed Study Guide

Full walkthrough with traps, tables & exam-day strategy · ~26 KB

Top decision patterns

If the question mentions…	Answer is likely…
"No infrastructure" / "fully managed" / "API"	Amazon Bedrock
"Full control" / "your own infrastructure"	SageMaker JumpStart
"Private data" / "company documents" / "no retraining"	RAG / Bedrock Knowledge Bases
"Specific style/tone" + "labeled examples"	Instruction fine-tuning
"Lots of UNLABELED domain data"	Continued pre-training
"Multi-step task" + "take actions" + "API calls"	Bedrock Agents
"Detect bias" / "feature importance" / "SHAP"	SageMaker Clarify
"Low confidence" + "human review"	Amazon A2I
"Data drift" / "model degradation"	SageMaker Model Monitor
"Detect PII in S3"	Amazon Macie
"Block topics, filter PII in outputs"	Bedrock Guardrails
"Audit prompts AND responses"	Bedrock Model Invocation Logging
"Without public internet"	VPC Endpoints / PrivateLink
"Vector storage / semantic search"	OpenSearch Serverless (NOT DynamoDB)
"Translation evaluation"	BLEU
"Summarization evaluation"	ROUGE
"Code assistant in IDE"	Amazon Q Developer
"Contact center agent assist"	Amazon Q in Connect
"Enterprise data assistant"	Amazon Q Business
"Detect online fraud"	Amazon Fraud Detector
"OCR / extract tables & forms from PDFs"	Amazon Textract
"Image labels / face detection / moderation"	Amazon Rekognition
"Speech to text"	Amazon Transcribe
"Text to speech"	Amazon Polly
"Translate text"	Amazon Translate
"Personalization / recommendations"	Amazon Personalize
"Conversational chatbot"	Amazon Lex

SageMaker inference modes

Mode	Latency	Payload	Pick when…
Real-time	Low (ms)	Small	Always-on, persistent endpoint, low-latency required
Serverless	Low–Med	Small	Intermittent / unpredictable traffic, scales to zero
Asynchronous	High	Up to 1 GB	Long processing (up to 1 hr), large payloads, queued
Batch Transform	Highest	Very large	Bulk offline jobs, no endpoint, scheduled

FM customization spectrum

Method	Cost	Use when
Prompt engineering	Free	Simple tasks, no special data
RAG	Cheap	Private/recent data, reduce hallucinations
Fine-tuning (instruction)	Moderate	Specific style/format/tone, labeled examples
Domain adaptation FT	Moderate	Limited domain-specific labeled data
Continued pre-training	Expensive	Lots of UNLABELED domain data
Train from scratch	V. expensive	Rarely needed

Final exam-day tips

1. Read every question twice.

Keywords like "private", "no infrastructure", "unlabeled", "managed" pick the answer.

2. Eliminate clearly wrong answers first.

Usually 2 of the 4 options are obviously wrong.

3. No penalty for guessing.

Answer every question, never leave blank.

4. Flag & review.

Mark uncertain ones; come back at the end.

5. Trust your first instinct.

Don't second-guess unless you spot a clear error.

6. ~80 sec/question pace.

90 min ÷ 65 questions. Don't get stuck.

Pass AIF-C01without the overwhelm.

Exam Domain Blueprint

Core Topics

The AI hierarchy (concentric layers)

Types of Machine Learning

SageMaker inference types HIGH-YIELD

ML lifecycle on AWS

Key terminology — memorize the contrasts

Foundation Models on Bedrock

LLM building blocks

Vector databases on AWS

Types of generative models

Bedrock pricing modes

GenAI limitations to recognize

Bedrock features you must know

Bedrock vs SageMaker JumpStart CLASSIC TRAP

Bedrock data privacy KEY FACT

SageMaker tools by lifecycle stage

Pattern matching cheat-sheet

FM customization spectrum (cheapest → most expensive)

RAG — the 5-step pipeline

Prompt engineering techniques

Prompt risks & mitigation

Inference parameters

Bedrock Agents

Amazon Q variants

Managed AI services — pick the right one

The 8 responsible-AI principles

Transparency vs Explainability CLASSIC TRAP

AWS tools for Responsible AI

Model drift

Sources of bias & mitigation

Core security services for AI workloads

Bedrock data privacy KEY FACT

GenAI Security Scoping Matrix (5 scopes)

AI-specific security risks

Compliance & cost-governance tools

Shared Responsibility for AI

Generative model evaluation metrics

Classification metrics (supervised ML)

Choosing a foundation model — selection factors

Business metrics for AI

Interactive AWS AI/ML Architecture

Exam-Style Q & A

Mock Exam

Ready to test yourself?

—

Hands-On Labs

Lab 01 · Your first Bedrock prompt

Lab 02 · Build a RAG app with Knowledge Bases

Lab 03 · Build a Bedrock Agent

Lab 04 · Configure Bedrock Guardrails

Lab 05 · Bias detection with SageMaker Clarify

Lab 06 · Drift detection with Model Monitor

Real-World Use Cases

Cheat Sheet

Download the PDF guides

Top decision patterns

SageMaker inference modes

FM customization spectrum

Final exam-day tips

—

Pass AIF-C01
without the overwhelm.