Free Practice Questions for Microsoft Azure Data Scientist Associate (DP-100) Certification
Study with 362 exam-style practice questions designed to help you prepare for the Microsoft Azure Data Scientist Associate (DP-100) exam. All questions are aligned with the latest exam guide and include detailed explanations to help you master the material.
Random Questions: practice with randomly mixed questions from all topics.
Domain Mode: practice questions from a specific topic area.
Exam Information
Key information about the Microsoft Azure Data Scientist Associate (DP-100) exam:
Level: Associate (intermediate)
Skills measured: knowledge and experience in data science using Azure Machine Learning, MLflow, Azure AI services (including Azure AI Search), and Azure AI Foundry.
Audience: candidates with subject matter expertise in applying data science and machine learning to implement and run machine learning workloads on Azure, and with knowledge of optimizing language models for AI applications using Azure AI.
Last updated: April 11, 2025
Exam Topics & Skills Assessed
Skills measured (from the official study guide)
Domain 1: Design and prepare a machine learning solution
Subdomain 1.1: Design a machine learning solution
- Identify the structure and format for datasets
- Determine the compute specifications for a machine learning workload
- Select the development approach to train a model
Subdomain 1.2: Create and manage resources in an Azure Machine Learning workspace
- Create and manage a workspace
- Create and manage datastores
- Create and manage compute targets
- Set up Git integration for source control
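For example, a minimal sketch using the Azure Machine Learning Python SDK v2 (azure-ai-ml) to connect to a workspace and create a compute cluster; the subscription, resource group, and all resource names are placeholders:

```python
from azure.identity import DefaultAzureCredential
from azure.ai.ml import MLClient
from azure.ai.ml.entities import AmlCompute

# Connect to an existing workspace (IDs are placeholders).
ml_client = MLClient(
    credential=DefaultAzureCredential(),
    subscription_id="<subscription-id>",
    resource_group_name="<resource-group>",
    workspace_name="<workspace-name>",
)

# Create (or update) a CPU cluster that scales down to zero when idle.
cluster = AmlCompute(
    name="cpu-cluster",
    size="Standard_DS3_v2",
    min_instances=0,
    max_instances=4,
    idle_time_before_scale_down=120,  # seconds
)
ml_client.compute.begin_create_or_update(cluster).result()
```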
Subdomain 1.3: Create and manage assets in an Azure Machine Learning workspace
- Create and manage data assets
- Create and manage environments
- Share assets across workspaces by using registries
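Continuing the sketch above (reusing `ml_client`), registering a versioned data asset and a custom environment might look like this; paths and names are illustrative:

```python
from azure.ai.ml.entities import Data, Environment
from azure.ai.ml.constants import AssetTypes

# Register a versioned data asset that points into a datastore.
data_asset = Data(
    name="diabetes-data",
    version="1",
    type=AssetTypes.URI_FILE,
    path="azureml://datastores/training_data/paths/data/diabetes.csv",
)
ml_client.data.create_or_update(data_asset)

# Register a custom environment: base Docker image plus a conda spec.
env = Environment(
    name="sklearn-env",
    image="mcr.microsoft.com/azureml/openmpi4.1.0-ubuntu20.04",
    conda_file="./environments/conda.yml",
)
ml_client.environments.create_or_update(env)
```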
Domain 2: Explore data, and run experiments
Subdomain 2.1: Use automated machine learning to explore optimal models
- Use automated machine learning for tabular data
- Use automated machine learning for computer vision
- Use automated machine learning for natural language processing
- Select and understand training options, including preprocessing and algorithms
- Evaluate an automated machine learning run, including responsible AI guidelines
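A hedged sketch of submitting an AutoML classification job for tabular data with the SDK v2; the data asset, target column, and limits are assumptions:

```python
from azure.ai.ml import automl, Input
from azure.ai.ml.constants import AssetTypes

# AutoML for tabular data expects an MLTable input.
classification_job = automl.classification(
    compute="cpu-cluster",
    experiment_name="automl-diabetes",
    training_data=Input(type=AssetTypes.MLTABLE, path="azureml:diabetes-training:1"),
    target_column_name="Diabetic",
    primary_metric="accuracy",
    n_cross_validations=5,
    enable_model_explainability=True,  # supports responsible AI review of the best model
)
classification_job.set_limits(
    timeout_minutes=60,
    max_trials=20,
    enable_early_termination=True,
)
ml_client.jobs.create_or_update(classification_job)
```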
Subdomain 2.2: Use notebooks for custom model training
- Use the terminal to configure a compute instance
- Access and wrangle data in notebooks
- Wrangle data interactively with attached Synapse Spark pools and serverless Spark compute
- Retrieve features from a feature store to train a model
- Track model training by using MLflow
- Evaluate a model, including responsible AI guidelines
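For instance, tracking a notebook training run with MLflow (assuming the azureml-mlflow package is installed; `train_model` and the data variables are stand-ins for your own code):

```python
import mlflow

# On an Azure ML compute instance the tracking URI is preconfigured;
# elsewhere, point MLflow at the workspace explicitly.
mlflow.set_tracking_uri(
    ml_client.workspaces.get(ml_client.workspace_name).mlflow_tracking_uri
)
mlflow.set_experiment("diabetes-notebook")

with mlflow.start_run():
    mlflow.sklearn.autolog()  # auto-log params, metrics, and the model
    model = train_model(X_train, y_train)  # hypothetical training helper
    mlflow.log_metric("test_accuracy", model.score(X_test, y_test))
```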
Subdomain 2.3: Automate hyperparameter tuning
- Select a sampling method
- Define the search space
- Define the primary metric
- Define early termination options
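These four choices map directly onto a sweep job in the SDK v2. A minimal sketch (script arguments and metric names are assumptions; the primary metric must match one your script logs):

```python
from azure.ai.ml import command
from azure.ai.ml.sweep import Choice, Uniform, BanditPolicy

# Base training job with hyperparameters exposed as inputs.
base_job = command(
    code="./src",
    command="python train.py --learning_rate ${{inputs.learning_rate}} "
            "--batch_size ${{inputs.batch_size}}",
    inputs={"learning_rate": 0.01, "batch_size": 32},
    environment="azureml:sklearn-env:1",
    compute="cpu-cluster",
)

# Search space: continuous for learning rate, discrete for batch size.
job_for_sweep = base_job(
    learning_rate=Uniform(min_value=0.0001, max_value=0.1),
    batch_size=Choice(values=[16, 32, 64]),
)

sweep_job = job_for_sweep.sweep(
    sampling_algorithm="random",   # alternatives: "grid", "bayesian"
    primary_metric="accuracy",     # must be logged by train.py via MLflow
    goal="Maximize",
)
sweep_job.early_termination = BanditPolicy(
    evaluation_interval=2, slack_factor=0.1, delay_evaluation=5
)
sweep_job.set_limits(max_total_trials=20, max_concurrent_trials=4)
ml_client.jobs.create_or_update(sweep_job)
```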
Domain 3: Train and deploy models
Subdomain 3.1: Run model training scripts
- Consume data in a job
- Configure compute for a job run
- Configure an environment for a job run
- Track model training with MLflow in a job run
- Define parameters for a job
- Run a script as a job
- Use logs to troubleshoot job run errors
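A sketch of running a script as a command job that consumes a registered data asset; the script name and its arguments are assumptions:

```python
from azure.ai.ml import command, Input
from azure.ai.ml.constants import AssetTypes

job = command(
    code="./src",  # folder containing train.py
    command="python train.py --training_data ${{inputs.training_data}} "
            "--reg_rate ${{inputs.reg_rate}}",
    inputs={
        "training_data": Input(type=AssetTypes.URI_FILE, path="azureml:diabetes-data:1"),
        "reg_rate": 0.01,
    },
    environment="azureml:sklearn-env:1",
    compute="cpu-cluster",
    experiment_name="train-diabetes",
)
returned_job = ml_client.jobs.create_or_update(job)
print(returned_job.studio_url)  # open the run in the studio to inspect logs
```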
Subdomain 3.2: Implement training pipelines
- Create custom components
- Create a pipeline
- Pass data between steps in a pipeline
- Run and schedule a pipeline
- Monitor and troubleshoot pipeline runs
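For example, wiring two YAML-defined components into a pipeline with the dsl decorator; the component files and port names (input_data, output_data, model_output) are illustrative:

```python
from azure.ai.ml import load_component, Input
from azure.ai.ml.constants import AssetTypes
from azure.ai.ml.dsl import pipeline

prep_data = load_component(source="./components/prep_data.yml")
train_model = load_component(source="./components/train_model.yml")

@pipeline(default_compute="cpu-cluster")
def diabetes_pipeline(pipeline_input):
    # Data flows between steps via component outputs.
    prep_step = prep_data(input_data=pipeline_input)
    train_step = train_model(training_data=prep_step.outputs.output_data)
    return {"trained_model": train_step.outputs.model_output}

pipeline_job = diabetes_pipeline(
    pipeline_input=Input(type=AssetTypes.URI_FILE, path="azureml:diabetes-data:1")
)
ml_client.jobs.create_or_update(pipeline_job, experiment_name="diabetes-pipeline")
```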
Subdomain 3.3: Manage models
- Define the signature in the MLmodel file
- Package a feature retrieval specification with the model artifact
- Register an MLflow model
- Assess a model by using responsible AI principles
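As a sketch, the signature can be inferred and written into the MLmodel file at logging time, and the logged model registered from a job's output afterwards; the job name and asset names are placeholders:

```python
import mlflow
from mlflow.models.signature import infer_signature
from azure.ai.ml.entities import Model
from azure.ai.ml.constants import AssetTypes

# Infer the input/output schema stored in the MLmodel file.
signature = infer_signature(X_train, model.predict(X_train))
mlflow.sklearn.log_model(model, artifact_path="model", signature=signature)

# Register the MLflow model produced by a completed job.
ml_client.models.create_or_update(
    Model(
        name="diabetes-classifier",
        path="azureml://jobs/<job-name>/outputs/artifacts/paths/model/",
        type=AssetTypes.MLFLOW_MODEL,
    )
)
```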
Subdomain 3.4: Deploy a model
- Configure settings for online deployment
- Deploy a model to an online endpoint
- Test an online deployed service
- Configure compute for a batch deployment
- Deploy a model to a batch endpoint
- Invoke the batch endpoint to start a batch scoring job
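A minimal sketch of a managed online deployment for a registered MLflow model; endpoint and model names are placeholders, and endpoint names must be unique within a region:

```python
from azure.ai.ml.entities import ManagedOnlineEndpoint, ManagedOnlineDeployment

# Create the endpoint, then a deployment behind it.
endpoint = ManagedOnlineEndpoint(name="diabetes-endpoint", auth_mode="key")
ml_client.online_endpoints.begin_create_or_update(endpoint).result()

deployment = ManagedOnlineDeployment(
    name="blue",
    endpoint_name="diabetes-endpoint",
    model="azureml:diabetes-classifier:1",  # MLflow model: no scoring script needed
    instance_type="Standard_DS3_v2",
    instance_count=1,
)
ml_client.online_deployments.begin_create_or_update(deployment).result()

# Route all traffic to the new deployment, then test it.
endpoint.traffic = {"blue": 100}
ml_client.online_endpoints.begin_create_or_update(endpoint).result()
response = ml_client.online_endpoints.invoke(
    endpoint_name="diabetes-endpoint",
    deployment_name="blue",
    request_file="sample-request.json",
)
```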
Domain 4: Optimize language models for AI applications
Subdomain 4.1: Prepare for model optimization
- Select and deploy a language model from the model catalog
- Compare language models using benchmarks
- Test a deployed language model in the playground
- Select an optimization approach
Subdomain 4.2: Optimize through prompt engineering and prompt flow
- Test prompts with manual evaluation
- Define and track prompt variants
- Create prompt templates
- Define chaining logic with the prompt flow SDK
- Use tracing to evaluate your flow
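For instance, chaining and tracing with the prompt flow SDK might look like the sketch below; `call_llm` stands in for whatever model client you use:

```python
from promptflow.tracing import start_trace, trace

@trace
def build_prompt(question: str) -> str:
    # One prompt variant; alternatives would swap this template.
    return f"Answer concisely.\nQuestion: {question}"

@trace
def chat(question: str) -> str:
    prompt = build_prompt(question)
    return call_llm(prompt)  # hypothetical LLM call

start_trace()  # emitted traces can be inspected in the prompt flow trace UI
print(chat("What does the AUC metric measure?"))
```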
Subdomain 4.3: Optimize through Retrieval Augmented Generation (RAG)
- Prepare data for RAG, including cleaning, chunking, and embedding
- Configure a vector store
- Configure an Azure AI Search-based index store
- Evaluate your RAG solution
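A hedged end-to-end sketch: naive chunking, embedding with Azure OpenAI, and uploading to an Azure AI Search index. The endpoints, keys, embedding deployment name, and index schema (with its vector field) are all assumptions:

```python
from openai import AzureOpenAI
from azure.core.credentials import AzureKeyCredential
from azure.search.documents import SearchClient

def chunk(text: str, size: int = 500, overlap: int = 50):
    """Naive fixed-size character chunking with overlap."""
    step = size - overlap
    return [text[i : i + size] for i in range(0, len(text), step)]

aoai = AzureOpenAI(
    azure_endpoint="<aoai-endpoint>", api_key="<key>", api_version="2024-02-01"
)
chunks = chunk(open("docs/guide.txt", encoding="utf-8").read())
embeddings = aoai.embeddings.create(model="text-embedding-ada-002", input=chunks)

search = SearchClient(
    endpoint="<search-endpoint>",
    index_name="rag-index",  # assumed to exist with a vector field "contentVector"
    credential=AzureKeyCredential("<key>"),
)
search.upload_documents(
    [
        {"id": str(i), "content": c, "contentVector": e.embedding}
        for i, (c, e) in enumerate(zip(chunks, embeddings.data))
    ]
)
```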
Subdomain 4.4: Optimize through fine-tuning
- Prepare data for fine-tuning
- Select an appropriate base model
- Run a fine-tuning job
- Evaluate your fine-tuned model
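For chat models fine-tuned through Azure OpenAI, training data is typically JSONL in chat format; a minimal sketch of preparing such a file:

```python
import json

# One JSON object per line; each example is a full chat exchange.
examples = [
    {
        "messages": [
            {"role": "system", "content": "You are a support assistant for Contoso."},
            {"role": "user", "content": "How do I reset my password?"},
            {"role": "assistant", "content": "Open Settings > Account > Reset password."},
        ]
    },
    # ... more examples; quality and coverage matter more than raw volume
]

with open("train.jsonl", "w", encoding="utf-8") as f:
    for example in examples:
        f.write(json.dumps(example) + "\n")
```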