Free Practice Questions for AWS Certified Machine Learning - Specialty (MLS-C01) Certification

    šŸ”„ Last checked for updates March 16th, 2026

    Study with 360 exam-style practice questions designed to help you prepare for the AWS Certified Machine Learning - Specialty (MLS-C01).

    Start Practicing

    Random Questions

    Practice with randomly mixed questions from all topics

    Question MixAll Topics
    FormatRandom Order

    Domain Mode

    Practice questions from a specific topic area

    Exam Information

    Exam Details

    Key information about AWS Certified Machine Learning - Specialty (MLS-C01)

    Official study guide

    View

    Question formats CertSafari offers
    • Multiple choice
    exam format:

    Multiple choice, Multiple response

    passing score:

    750 out of 1,000

    prerequisites:

    Experience performing basic hyperparameter optimization and with ML/deep learning frameworks.

    target audience:

    Individuals in an AI/ML development or data science role with 2+ years of experience developing, architecting, and running ML/deep learning workloads in the AWS Cloud.

    number of questions:

    50 scored questions, 15 unscored questions

    Exam Topics & Skills Assessed

    Skills measured (from the official study guide)

    Domain 1: Data Engineering

    Subdomain 1.1: Create data repositories for ML

    - Identify data sources (for example, content and location, primary sources such as user data). - Determine storage mediums (for example, databases, Amazon S3, Amazon Elastic File System [Amazon EFS], Amazon Elastic Block Store [Amazon EBS]).

    Subdomain 1.2: Identify and implement a data ingestion solution

    - Identify data job styles and job types (for example, batch load, streaming). - Orchestrate data ingestion pipelines (batch-based ML workloads and streaming-based ML workloads). - Amazon Kinesis - Amazon Data Firehose - Amazon EMR - AWS Glue - Amazon Managed Service for Apache Flink - Schedule jobs.

    Subdomain 1.3: Identify and implement a data transformation solution

    - Transform data in transit (ETL, AWS Glue, Amazon EMR, AWS Batch). - Handle ML-specific data by using MapReduce (for example, Apache Hadoop, Apache Spark, Apache Hive).

    Domain 2: Exploratory Data Analysis

    Subdomain 2.1: Sanitize and prepare data for modeling

    - Identify and handle missing data, corrupt data, and stop words. - Format, normalize, augment, and scale data. - Determine whether there is sufficient labeled data. - Identify mitigation strategies. - Use data labelling tools (for example, Amazon Mechanical Turk).

    Subdomain 2.2: Perform feature engineering

    - Identify and extract features from datasets, including from data sources such as text, speech, images, and public datasets. - Analyze and evaluate feature engineering concepts (for example, binning, tokenization, outliers, synthetic features, one-hot encoding, reducing dimensionality of data).

    Subdomain 2.3: Analyze and visualize data for ML

    - Create graphs (for example, scatter plots, time series, histograms, box plots). - Interpret descriptive statistics (for example, correlation, summary statistics, p-value). - Perform cluster analysis (for example, hierarchical, diagnosis, elbow plot, cluster size).

    Domain 3: Modeling

    Subdomain 3.1: Frame business problems as ML problems

    - Determine when to use and when not to use ML. - Know the difference between supervised and unsupervised learning. - Select from among classification, regression, forecasting, clustering, recommendation, and foundation models.

    Subdomain 3.2: Select the appropriate model(s) for a given ML problem

    - XGBoost, logistic regression, k-means, linear regression, decision trees, random forests, RNN, CNN, ensemble, transfer learning, and large language models (LLMs) - Express the intuition behind models.

    Subdomain 3.3: Train ML models

    - Split data between training and validation (for example, cross validation). - Understand optimization techniques for ML training (for example, gradient descent, loss functions, convergence). - Choose appropriate compute resources (for example GPU or CPU, distributed or non-distributed). - Choose appropriate compute platforms (Spark or non-Spark). - Update and retrain models. - Batch or real-time/online

    Subdomain 3.4: Perform hyperparameter optimization

    - Perform regularization. - Dropout - L1/L2 - Perform cross-validation. - Initialize models. - Understand neural network architecture (layers and nodes), learning rate, and activation functions. - Understand tree-based models (number of trees, number of levels). - Understand linear models (learning rate).

    Subdomain 3.5: Evaluate ML models

    - Avoid overfitting or underfitting. - Detect and handle bias and variance. - Evaluate metrics (for example, area under curve [AUC]-receiver operating characteristics [ROC], accuracy, precision, recall, Root Mean Square Error [RMSE], F1 score). - Interpret confusion matrices. - Perform offline and online model evaluation (A/B testing). - Compare models by using metrics (for example, time to train a model, quality of model, engineering costs). - Perform cross-validation.

    Domain 4: Machine Learning Implementation and Operations

    Subdomain 4.1: Build ML solutions for performance, availability, scalability, resiliency, and fault tolerance

    - Log and monitor AWS environments. - AWS CloudTrail and Amazon CloudWatch - Build error monitoring solutions. - Deploy to multiple AWS Regions and multiple Availability Zones. - Create AMIs and golden images. - Create Docker containers. - Deploy Auto Scaling groups. - Rightsize resources (for example, instances, Provisioned IOPS, volumes). - Perform load balancing. - Follow AWS best practices.

    Subdomain 4.2: Recommend and implement the appropriate ML services and features for a given problem

    - ML on AWS (application services), for example: - Amazon Polly - Amazon Lex - Amazon Transcribe - Amazon Q - Understand AWS service quotas. - Determine when to build custom models and when to use Amazon SageMaker built-in algorithms. - Understand AWS infrastructure (for example, instance types) and cost considerations. - Use Spot Instances to train deep learning models by using AWS Batch.

    Subdomain 4.3: Apply basic AWS security practices to ML solutions

    - AWS Identity and Access Management (IAM) - S3 bucket policies - Security groups - VPCs - Encryption and anonymization

    Subdomain 4.4: Deploy and operationalize ML solutions

    - Expose endpoints and interact with them. - Understand ML models. - Perform A/B testing. - Retrain pipelines. - Debug and troubleshoot ML models. - Detect and mitigate drops in performance. - Monitor performance of the model.

    Techniques & products

    Amazon Athena
    Amazon Data Firehose
    Amazon EMR
    AWS Glue
    Amazon Kinesis
    Amazon Kinesis Data Streams
    AWS Lake Formation
    Amazon Managed Service for Apache Flink
    Amazon OpenSearch Service
    Amazon QuickSight
    AWS Batch
    Amazon EC2
    AWS Lambda
    Amazon Elastic Container Registry (ECR)
    Amazon Elastic Container Service (ECS)
    Amazon Elastic Kubernetes Service (EKS)
    AWS Fargate
    Amazon Redshift
    AWS IoT Greengrass
    Amazon Bedrock
    Amazon Comprehend
    AWS Deep Learning AMIs (DLAMI)
    Amazon Forecast
    Amazon Fraud Detector
    Amazon Lex
    Amazon Kendra
    Amazon Mechanical Turk
    Amazon Polly
    Amazon Q
    Amazon Rekognition
    Amazon SageMaker
    Amazon Textract
    Amazon Transcribe
    Amazon Translate
    AWS CloudTrail
    Amazon CloudWatch
    Amazon VPC
    AWS Identity and Access Management (IAM)
    Amazon Elastic Block Store (EBS)
    Amazon Elastic File System (EFS)
    Amazon FSx
    Amazon S3
    Ingestion and collection
    Processing and ETL
    Data analysis and visualization
    Model training
    Model deployment and inference
    Operationalizing ML
    AWS ML application services
    Python
    Java
    Scala
    R
    SQL
    Notebooks
    Integrated Development Environments (IDEs)
    XGBoost
    Logistic regression
    K-means
    Linear regression
    Decision trees
    Random forests
    Recurrent Neural Networks (RNN)
    Convolutional Neural Networks (CNN)
    Ensemble learning
    Transfer learning
    Large Language Models (LLMs)
    MapReduce
    Apache Hadoop
    Apache Spark
    Apache Hive
    Hyperparameter optimization
    Regularization (Dropout, L1/L2)
    Cross-validation
    Neural network architecture
    Learning rate
    Activation functions
    Gradient descent
    Loss functions
    Overfitting
    Underfitting
    Bias and variance
    AUC-ROC
    Accuracy
    Precision
    Recall
    RMSE
    F1 score
    Confusion matrices
    A/B testing
    Multi-Region deployment
    Multi-Availability Zone deployment
    AMIs
    Docker containers
    Auto Scaling groups
    Load balancing
    Security groups
    Encryption
    Anonymization

    CertSafari is not affiliated with, endorsed by, or officially connected to Amazon Web Services, Inc.. Full disclaimer