Free Practice Questions for AWS Certified Machine Learning - Specialty (MLS-C01) Certification

🔄 Last checked for updates April 19th, 2026

Study with 360 exam-style practice questions designed to help you prepare for the AWS Certified Machine Learning - Specialty (MLS-C01).

Random Questions

Practice with randomly mixed questions from all topics

Question MixAll Topics

FormatRandom Order

Domain Mode

Practice questions from a specific topic area

Select Domain

Quiz History

Exam Details

Key information about AWS Certified Machine Learning - Specialty (MLS-C01)

Official study guide

View

Question formats CertSafari offers

Multiple choice

exam format:

Multiple choice, Multiple response

passing score:

750 out of 1,000

prerequisites:

Experience performing basic hyperparameter optimization and with ML/deep learning frameworks.

target audience:

Individuals in an AI/ML development or data science role with 2+ years of experience developing, architecting, and running ML/deep learning workloads in the AWS Cloud.

number of questions:

50 scored questions, 15 unscored questions

Exam Topics & Skills Assessed

Skills measured (from the official study guide)

Domain 1: Data Engineering

Subdomain 1.1: Create data repositories for ML

- Identify data sources (for example, content and location, primary sources such as user data). - Determine storage mediums (for example, databases, Amazon S3, Amazon Elastic File System [Amazon EFS], Amazon Elastic Block Store [Amazon EBS]).

Subdomain 1.2: Identify and implement a data ingestion solution

- Identify data job styles and job types (for example, batch load, streaming). - Orchestrate data ingestion pipelines (batch-based ML workloads and streaming-based ML workloads). - Amazon Kinesis - Amazon Data Firehose - Amazon EMR - AWS Glue - Amazon Managed Service for Apache Flink - Schedule jobs.

Subdomain 1.3: Identify and implement a data transformation solution

- Transform data in transit (ETL, AWS Glue, Amazon EMR, AWS Batch). - Handle ML-speciﬁc data by using MapReduce (for example, Apache Hadoop, Apache Spark, Apache Hive).

Domain 2: Exploratory Data Analysis

Subdomain 2.1: Sanitize and prepare data for modeling

- Identify and handle missing data, corrupt data, and stop words. - Format, normalize, augment, and scale data. - Determine whether there is suﬃcient labeled data. - Identify mitigation strategies. - Use data labelling tools (for example, Amazon Mechanical Turk).

Subdomain 2.2: Perform feature engineering

- Identify and extract features from datasets, including from data sources such as text, speech, images, and public datasets. - Analyze and evaluate feature engineering concepts (for example, binning, tokenization, outliers, synthetic features, one-hot encoding, reducing dimensionality of data).

Subdomain 2.3: Analyze and visualize data for ML

- Create graphs (for example, scatter plots, time series, histograms, box plots). - Interpret descriptive statistics (for example, correlation, summary statistics, p-value). - Perform cluster analysis (for example, hierarchical, diagnosis, elbow plot, cluster size).

Domain 3: Modeling

Subdomain 3.1: Frame business problems as ML problems

- Determine when to use and when not to use ML. - Know the diﬀerence between supervised and unsupervised learning. - Select from among classiﬁcation, regression, forecasting, clustering, recommendation, and foundation models.

Subdomain 3.2: Select the appropriate model(s) for a given ML problem

- XGBoost, logistic regression, k-means, linear regression, decision trees, random forests, RNN, CNN, ensemble, transfer learning, and large language models (LLMs) - Express the intuition behind models.

Subdomain 3.3: Train ML models

- Split data between training and validation (for example, cross validation). - Understand optimization techniques for ML training (for example, gradient descent, loss functions, convergence). - Choose appropriate compute resources (for example GPU or CPU, distributed or non-distributed). - Choose appropriate compute platforms (Spark or non-Spark). - Update and retrain models. - Batch or real-time/online

Subdomain 3.4: Perform hyperparameter optimization

- Perform regularization. - Dropout - L1/L2 - Perform cross-validation. - Initialize models. - Understand neural network architecture (layers and nodes), learning rate, and activation functions. - Understand tree-based models (number of trees, number of levels). - Understand linear models (learning rate).

Subdomain 3.5: Evaluate ML models

- Avoid overﬁtting or underﬁtting. - Detect and handle bias and variance. - Evaluate metrics (for example, area under curve [AUC]-receiver operating characteristics [ROC], accuracy, precision, recall, Root Mean Square Error [RMSE], F1 score). - Interpret confusion matrices. - Perform oﬄine and online model evaluation (A/B testing). - Compare models by using metrics (for example, time to train a model, quality of model, engineering costs). - Perform cross-validation.

Domain 4: Machine Learning Implementation and Operations

Subdomain 4.1: Build ML solutions for performance, availability, scalability, resiliency, and fault tolerance

- Log and monitor AWS environments. - AWS CloudTrail and Amazon CloudWatch - Build error monitoring solutions. - Deploy to multiple AWS Regions and multiple Availability Zones. - Create AMIs and golden images. - Create Docker containers. - Deploy Auto Scaling groups. - Rightsize resources (for example, instances, Provisioned IOPS, volumes). - Perform load balancing. - Follow AWS best practices.

Subdomain 4.2: Recommend and implement the appropriate ML services and features for a given problem

- ML on AWS (application services), for example: - Amazon Polly - Amazon Lex - Amazon Transcribe - Amazon Q - Understand AWS service quotas. - Determine when to build custom models and when to use Amazon SageMaker built-in algorithms. - Understand AWS infrastructure (for example, instance types) and cost considerations. - Use Spot Instances to train deep learning models by using AWS Batch.

Subdomain 4.3: Apply basic AWS security practices to ML solutions

- AWS Identity and Access Management (IAM) - S3 bucket policies - Security groups - VPCs - Encryption and anonymization

Subdomain 4.4: Deploy and operationalize ML solutions

- Expose endpoints and interact with them. - Understand ML models. - Perform A/B testing. - Retrain pipelines. - Debug and troubleshoot ML models. - Detect and mitigate drops in performance. - Monitor performance of the model.

Techniques & products

Amazon Athena

Amazon Data Firehose

Amazon EMR

AWS Glue

Amazon Kinesis

Amazon Kinesis Data Streams

AWS Lake Formation

Amazon Managed Service for Apache Flink

Amazon OpenSearch Service

Amazon QuickSight

AWS Batch

Amazon EC2

AWS Lambda

Amazon Elastic Container Registry (ECR)

Amazon Elastic Container Service (ECS)

Amazon Elastic Kubernetes Service (EKS)

AWS Fargate

Amazon Redshift

AWS IoT Greengrass

Amazon Bedrock

Amazon Comprehend

AWS Deep Learning AMIs (DLAMI)

Amazon Forecast

Amazon Fraud Detector

Amazon Lex

Amazon Kendra

Amazon Mechanical Turk

Amazon Polly

Amazon Q

Amazon Rekognition

Amazon SageMaker

Amazon Textract

Amazon Transcribe

Amazon Translate

AWS CloudTrail

Amazon CloudWatch

Amazon VPC

AWS Identity and Access Management (IAM)

Amazon Elastic Block Store (EBS)

Amazon Elastic File System (EFS)

Amazon FSx

Amazon S3

Ingestion and collection

Processing and ETL

Data analysis and visualization

Model training

Model deployment and inference

Operationalizing ML

AWS ML application services

Python

Java

Scala

SQL

Notebooks

Integrated Development Environments (IDEs)

XGBoost

Logistic regression

K-means

Linear regression

Decision trees

Random forests

Recurrent Neural Networks (RNN)

Convolutional Neural Networks (CNN)

Ensemble learning

Transfer learning

Large Language Models (LLMs)

MapReduce

Apache Hadoop

Apache Spark

Apache Hive

Hyperparameter optimization

Regularization (Dropout, L1/L2)

Cross-validation

Neural network architecture

Learning rate

Activation functions

Gradient descent

Loss functions

Overfitting

Underfitting

Bias and variance

AUC-ROC

Accuracy

Precision

Recall

RMSE

F1 score

Confusion matrices

A/B testing

Multi-Region deployment

Multi-Availability Zone deployment

AMIs

Docker containers

Auto Scaling groups

Load balancing

Security groups

Encryption

Anonymization

Start Practicing