
A data engineer has realized that they made a mistake when making a daily update to a table. They need to use Delta time travel to restore the table to a version that is 3 days old. However, when the data engineer attempts to time travel to the older version, they are unable to restore the data because the data files have been deleted. Which of the following explains why the data files are no longer present?
Correct. VACUUM permanently deletes obsolete data files from storage. Delta time travel can only access versions whose referenced files still exist. If VACUUM is run with a retention period shorter than the desired rollback window (or retention checks are bypassed), files needed for older versions can be removed, making restoration to that version impossible.
Incorrect. Time travel (querying a table AS OF VERSION/TIMESTAMP) is a read operation that uses the transaction log to access older snapshots. It does not delete data files. If time travel fails, it’s because the referenced files are missing (typically due to VACUUM), not because time travel itself removed anything.
Incorrect. “DELETE HISTORY” is not a typical Delta Lake command used to remove historical versions and physical files in Databricks. Delta history is maintained in the transaction log, and physical cleanup is handled by VACUUM. While you can limit log retention via configuration, the standard mechanism that deletes data files is still VACUUM.
Incorrect. OPTIMIZE compacts many small files into fewer larger files for performance. It creates new optimized files and marks old files as removed in the transaction log, but those old files are not physically deleted immediately. They remain available for time travel until VACUUM is run and the retention period has elapsed.
Incorrect. HISTORY (DESCRIBE HISTORY) only displays the table’s commit history and operation metadata. It is purely informational and does not modify the table, transaction log, or underlying data files. It cannot cause data files to disappear or prevent restoring an older version.
Core Concept: Delta Lake time travel lets you query or restore a Delta table to a previous version (by version number or timestamp). This works because Delta maintains a transaction log (_delta_log) plus the underlying Parquet data files referenced by those log entries. Time travel requires that the older data files still exist.

Why the Answer is Correct: If the engineer cannot restore to a version from 3 days ago because the data files are missing, the most likely cause is that VACUUM was run. VACUUM physically deletes data files that are no longer needed by the current table state (i.e., files made obsolete by updates/deletes/overwrites). Once those files are removed, time travel to versions that reference them will fail because the transaction log points to files that no longer exist.

Key Features / Configurations: Delta provides a retention window to protect time travel. By default, VACUUM uses a 7-day retention threshold (168 hours). If someone ran VACUUM with a shorter retention (or lowered the table property delta.deletedFileRetentionDuration), files older than that threshold can be deleted, breaking time travel for those versions. Databricks also enforces a safety check (spark.databricks.delta.retentionDurationCheck.enabled) that blocks overly aggressive vacuuming unless it is explicitly disabled. Best practice is to keep retention long enough to meet recovery/audit requirements and avoid disabling the retention duration check in production.

Common Misconceptions: Time travel itself does not delete files; it only reads older snapshots. OPTIMIZE rewrites files for performance but does not remove history in a way that breaks time travel (it creates new files and marks old ones as removed; those old files remain until VACUUM). “DELETE HISTORY” is not a standard Delta Lake command; history is managed via the transaction log and file retention.

Exam Tips: Remember the division of responsibilities: the transaction log stores versions/metadata; the data files store actual rows. Time travel depends on both. If older versions are unavailable due to missing files, think VACUUM/retention settings. Also know the default 7-day retention and that reducing it can prevent restoring even recent versions if vacuuming is aggressive.
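The commands above can be sketched in Databricks SQL. This is a minimal illustration, assuming a Delta table named `sales`; the version number and retention value are placeholders:

```sql
-- List table versions and the operations that produced them
DESCRIBE HISTORY sales;

-- Restore the table to an older version (by version number or timestamp);
-- this fails if the referenced data files have already been vacuumed away
RESTORE TABLE sales TO VERSION AS OF 3;

-- Physically delete files no longer referenced by the current state that
-- are older than the retention threshold (default 168 hours = 7 days).
-- A shorter retention here is what breaks time travel to older versions.
VACUUM sales RETAIN 168 HOURS;
```

Note the asymmetry: DESCRIBE HISTORY and RESTORE read the transaction log, while VACUUM is the only command in this group that deletes physical files.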
Which of the following benefits is provided by the array functions from Spark SQL?
Spark SQL array functions are not primarily about working with “a variety of types at once.” They operate on array-typed columns (arrays of a single element type, possibly complex like struct). While arrays can contain complex elements, the benefit is not generic multi-type processing; it’s targeted manipulation of array data (access, explode, transform, filter).
Working within partitions and windows is the domain of window functions (for example: row_number, rank, lag/lead, sum over(partition by ... order by ...)). Array functions do not define partitioning semantics or window frames. You might use arrays in combination with window results, but the partition/window capability itself is not provided by array functions.
Time-related intervals are handled by Spark SQL datetime functions and time-window constructs (date_add, add_months, datediff, timestampadd, window for event-time aggregations). Array functions do not provide interval arithmetic or time bucketing. This option confuses array operations with temporal processing features in Spark SQL.
Correct. Array functions are designed to manipulate and query array-typed columns, which commonly appear when ingesting nested/semi-structured data such as JSON (e.g., arrays of items, tags, events). They enable filtering, transforming, searching, and aggregating within arrays efficiently using built-in, optimized Spark SQL functions.
Spark SQL does not provide an “array of tables” concept for procedural automation via array functions. Automation is typically done with Databricks Workflows/Jobs, notebooks, DLT pipelines, or external orchestration tools. Array functions operate on array columns within a dataset, not on collections of tables as procedural objects.
Core concept: Spark SQL array functions are part of Spark’s built-in functions for working with complex data types (arrays, maps, structs). They enable element-wise manipulation, searching, filtering, aggregation, and transformation of array-typed columns directly in SQL or the DataFrame API.

Why the answer is correct: A primary benefit of array functions is the ability to efficiently work with nested/complex data that commonly comes from semi-structured sources like JSON (and also Avro/Parquet with nested schemas). When JSON is ingested, fields often become arrays of structs (e.g., an order with an array of line items). Array functions such as transform, filter, exists, aggregate, array_contains, element_at, explode/posexplode (often grouped with array handling), and arrays_zip allow you to query and reshape these nested arrays without writing UDFs. This keeps execution optimized by Catalyst and Tungsten, improving performance and maintainability.

Key features and best practices:
- Use higher-order functions (transform/filter/exists/aggregate) to avoid explode when you don’t need to increase row counts.
- Prefer built-in functions over UDFs to preserve predicate pushdown opportunities and Spark’s query optimization.
- Combine with struct functions (named_struct, struct) and JSON parsing (from_json) to normalize nested payloads into analytics-friendly tables.

Common misconceptions: Option A sounds plausible because arrays can hold “multiple values,” but Spark arrays are homogeneous (all elements share a data type). Working with “a variety of types at once” is more aligned with structs (multiple fields of different types) or variant/semi-structured handling, not array functions specifically. Option B describes window functions/partitioning (OVER/PARTITION BY). Option C describes date/time functions (date_add, window, timestampadd) rather than array functions.
Exam tips: On the Databricks Data Engineer Associate exam, map the function family to the data type: arrays/maps/structs → complex/nested data transformations; OVER clauses → windows; date/time functions → temporal logic. If the question mentions JSON or nested fields, think complex types and their function sets (including array functions).
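A few of these array and higher-order functions in one Spark SQL sketch. The table and columns are illustrative, assuming an `orders` table with a `tags` column of type ARRAY&lt;STRING&gt; and an `items` column of type ARRAY&lt;STRUCT&lt;price: DOUBLE, qty: INT&gt;&gt;:

```sql
SELECT
  order_id,
  array_contains(tags, 'priority')                         AS is_priority,     -- search
  filter(items, i -> i.price > 100)                        AS expensive_items, -- filter elements
  transform(items, i -> i.price * i.qty)                   AS line_totals,     -- per-element map
  aggregate(items, 0D, (acc, i) -> acc + i.price * i.qty)  AS order_total      -- fold to scalar
FROM orders;
```

Note that filter/transform/aggregate here operate within each row's array without exploding, so the row count is unchanged; explode would be needed only to turn array elements into rows.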
Which of the following describes the relationship between Gold tables and Silver tables?
Correct. Gold tables are typically curated for analytics and BI consumption and therefore often contain precomputed aggregations (KPIs, rollups, summary tables) and denormalized models. Silver tables are usually cleaned and conformed but remain relatively granular to support multiple downstream use cases, making aggregations more characteristic of Gold than Silver.
Incorrect. While Gold tables are often more directly aligned to business use cases, “valuable” is subjective and not a defining property of the Medallion layers. Silver can be extremely valuable as the reusable, conformed foundation for many products. The architecture defines refinement and purpose, not an absolute measure of value.
Incorrect. Gold is generally more refined and more purpose-built than Silver, not less. Silver is the cleaned/conformed layer, whereas Gold is the curated serving layer (often with business logic, dimensional modeling, and aggregates). A “less refined view” would more closely describe Bronze, not Gold.
Incorrect. Gold does not necessarily contain more data than Silver. Because Gold frequently includes aggregations and curated subsets for specific use cases, it often has fewer rows than Silver. Silver commonly retains detailed, conformed records that can feed many Gold tables, so it can be larger in volume.
Incorrect. “Truthful” is not guaranteed by being in Gold. Data quality and correctness depend on validation rules, expectations, and governance applied across the pipeline. Silver is often where many quality controls are enforced; Gold may add business logic and aggregation but is not inherently more truthful than Silver.
Core concept: This question tests the Medallion Architecture (Bronze/Silver/Gold) used in Databricks Lakehouse implementations. Bronze is raw ingestion, Silver is cleaned/conformed data, and Gold is curated data products optimized for analytics and business consumption.

Why the answer is correct: Gold tables are commonly built from Silver tables and are designed for downstream reporting, dashboards, and high-value analytical use cases. To support these, Gold tables frequently include aggregations (for example, daily revenue by region, customer lifetime value, or KPI rollups) and business-level dimensional models (star schemas). Silver tables, by contrast, typically represent a refined but still fairly granular, conformed view of the data (deduplicated, standardized schemas, applied quality rules) that can serve multiple Gold outputs. Because Gold is tailored to specific business questions and performance needs, it is more likely to contain precomputed aggregates than Silver.

Key features and best practices: In Databricks, these layers are often implemented as Delta tables. Silver transformations may include schema enforcement, data quality checks/expectations, CDC merges, and normalization. Gold transformations often include joins across domains, denormalization for BI, aggregations, and creation of serving-friendly tables. Tools like Delta Live Tables (DLT) can encode these steps as pipelines with expectations and lineage, but the conceptual distinction remains: Silver = refined foundation; Gold = curated consumption.

Common misconceptions: It’s tempting to think Gold means “more truthful” or “more valuable,” but truthfulness and value are not guaranteed by the layer name; they depend on applied quality controls and business context. Also, Gold is not necessarily “more data” than Silver—aggregations often reduce row counts. Finally, Gold is not less refined than Silver; it is typically more refined and more purpose-built.
Exam tips: When you see Gold vs Silver, map them to “business-ready serving layer” (Gold) vs “cleaned/conformed detailed layer” (Silver). If an option mentions aggregations, KPIs, dimensional models, or BI optimization, it usually points to Gold. If it mentions cleansing, deduplication, standardization, or conformance at detail level, it points to Silver.
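The Silver-to-Gold relationship can be sketched as a simple aggregation in Databricks SQL. Table and column names here are illustrative, assuming a row-level `silver_orders` table:

```sql
-- Gold: a business-ready rollup built from the granular Silver layer.
-- Note the row count shrinks relative to silver_orders.
CREATE OR REPLACE TABLE gold_daily_revenue AS
SELECT
  order_date,
  region,
  SUM(amount)                 AS total_revenue,    -- precomputed KPI
  COUNT(DISTINCT customer_id) AS unique_customers
FROM silver_orders                                  -- cleaned, conformed, detail-level
GROUP BY order_date, region;
```

One Silver table like this can feed many such Gold tables (revenue rollups, customer metrics, BI star schemas), which is why Silver stays granular while Gold carries the aggregates.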
Which of the following tools is used by Auto Loader to process data incrementally?
Checkpointing stores progress and state information so a streaming query can recover after failures and avoid reprocessing data. It is essential for reliability and exactly-once-style behavior, but it is not the engine Auto Loader uses to process data incrementally. The question asks for the tool used by Auto Loader, and that underlying tool is Spark Structured Streaming.
Spark Structured Streaming is the underlying engine Auto Loader uses to process data incrementally. Auto Loader is invoked through streaming APIs such as readStream with the cloudFiles source, and it ingests newly discovered files in incremental micro-batches. This makes Structured Streaming the correct answer when the question asks which tool Auto Loader uses for incremental processing. Checkpointing is still important, but it is a supporting state-management feature within Structured Streaming rather than the primary tool itself.
Data Explorer is a user interface feature in Databricks for browsing catalogs, schemas, tables, and files. It helps users inspect data assets, but it does not ingest files or run incremental processing jobs. Therefore, it has no role as the processing tool behind Auto Loader.
Unity Catalog is Databricks' governance layer for managing data access, lineage, and object organization. It can secure the tables and storage locations involved in an Auto Loader pipeline, but it does not perform ingestion or streaming computation. As a result, it is not the tool Auto Loader uses for incremental processing.
Databricks SQL is designed for SQL analytics, dashboards, and querying data stored in the lakehouse. It is not a streaming ingestion framework and does not power Auto Loader’s file-by-file incremental processing. It may be used after ingestion to analyze the data, but it is not part of the ingestion engine.
Core concept: Databricks Auto Loader incrementally ingests new files from cloud object storage by building on Spark Structured Streaming. Auto Loader is a file ingestion capability that uses the Structured Streaming engine to continuously or incrementally process newly arriving data. Checkpointing is important for maintaining state and recovery, but it is not itself the tool Auto Loader uses to perform incremental processing.

Why correct: The question asks which tool is used by Auto Loader to process data incrementally. Auto Loader operates through Spark Structured Streaming APIs such as readStream and writeStream, and this streaming framework is what enables incremental micro-batch or continuous-style ingestion of new files. In other words, Structured Streaming is the underlying processing model, while checkpointing is one configuration mechanism used within that model.

Key features: Auto Loader uses the cloudFiles source with Spark Structured Streaming to detect and ingest new files efficiently. It supports scalable file discovery, schema inference and evolution, and fault-tolerant ingestion into Delta tables or other sinks. Checkpoint locations store stream progress, and schema locations store schema metadata, but the actual incremental processing engine is Structured Streaming.

Common misconceptions: A common mistake is to confuse the mechanism that tracks progress with the engine that performs incremental processing. Checkpointing helps Auto Loader remember what has already been processed and recover from failures, but it does not independently ingest data. Governance and analytics tools like Unity Catalog, Data Explorer, and Databricks SQL are also unrelated to the ingestion engine itself.

Exam tips: If a question asks what Auto Loader is built on or what it uses to process data incrementally, think Spark Structured Streaming. If the question instead asks what enables recovery, exactly-once semantics, or progress tracking, then checkpointing is likely the answer. Distinguish between the streaming engine and the metadata/state mechanism used by that engine.
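A minimal PySpark sketch of Auto Loader's shape, showing how the cloudFiles source plugs into the Structured Streaming readStream/writeStream APIs. This runs only in a Databricks environment with an active SparkSession; the paths and table name are hypothetical:

```python
# Auto Loader = the cloudFiles source running on Spark Structured Streaming.
(spark.readStream
    .format("cloudFiles")                                 # Auto Loader source
    .option("cloudFiles.format", "json")                  # format of arriving files
    .option("cloudFiles.schemaLocation", "/tmp/schemas/events")  # schema inference/evolution state
    .load("/landing/events")                              # cloud storage input path (hypothetical)
    .writeStream
    .option("checkpointLocation", "/tmp/checkpoints/events")     # progress tracking, not the engine
    .table("bronze_events")                               # Delta sink (hypothetical name)
)
```

The division of labor the question tests is visible here: readStream/writeStream (Structured Streaming) do the incremental processing, while checkpointLocation only records what has already been processed.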
A data engineer has configured a Structured Streaming job to read from a table, manipulate the data, and then perform a streaming write into a new table. The code block used by the data engineer is below:
(spark.table("sales")
.withColumn("avg_price", col("sales") / col("units"))
.writeStream
.option("checkpointLocation", checkpointPath)
.outputMode("complete")
.
.table("new_sales")
)
If the data engineer only wants the query to execute a micro-batch to process data every 5 seconds, which of the following lines of code should the data engineer use to fill in the blank?
trigger("5 seconds") is not the correct PySpark Structured Streaming syntax. The trigger API expects named parameters such as processingTime, once, or continuous rather than a bare positional string. Because of that, this option does not properly express a 5-second micro-batch schedule. On the exam, remember that time-based micro-batching must be written as processingTime="...".
trigger() with no arguments uses Spark's default trigger behavior rather than a fixed 5-second interval. In practice, Spark will run micro-batches as soon as possible based on data availability and system capacity. That means it does not guarantee execution every 5 seconds. Therefore it does not satisfy the explicit scheduling requirement in the prompt.
trigger(once="5 seconds") is invalid because the once trigger is not configured with a time string. The once option is used to run the query a single time and then stop, typically expressed as once=True. It does not create recurring micro-batches every 5 seconds. This makes it incompatible with the requirement for periodic execution.
trigger(processingTime="5 seconds") is the correct choice because it configures Structured Streaming to run in micro-batch mode at a fixed 5-second interval. This is the standard API for scheduled micro-batch execution in PySpark. As long as the query remains active, Spark will attempt to process new available data every 5 seconds. This directly matches the requirement to execute a micro-batch every 5 seconds.
trigger(continuous="5 seconds") refers to continuous processing mode, which is different from standard micro-batch execution. The question explicitly asks for a micro-batch every 5 seconds, so continuous processing is the wrong execution model. Continuous mode also has different support limitations compared with normal micro-batching. Therefore this option is not the correct way to schedule 5-second micro-batches.
Core concept: This question tests Structured Streaming trigger semantics in Databricks/Spark, specifically how to schedule micro-batch execution at a fixed interval using the .trigger(...) option on writeStream.

Why the answer is correct: The requirement is to execute a micro-batch every 5 seconds. trigger(processingTime="5 seconds") puts the query in micro-batch mode on a fixed schedule: while the query remains active, Spark starts a new micro-batch every 5 seconds (or as soon as the previous batch finishes, if one runs long). This is the standard PySpark API for time-based micro-batch scheduling and matches the prompt exactly.

Key features / best practices:
- Triggers are configured on writeStream via .trigger(...), and checkpointing (checkpointLocation) is required for reliable progress tracking.
- With no trigger specified, Spark runs each micro-batch as soon as the previous one completes, which does not guarantee a fixed interval.
- trigger(once=True) runs a single micro-batch and stops; trigger(availableNow=True) processes all currently available data in as many batches as required and then stops; trigger(continuous="...") enables the separate continuous processing engine, which has its own limitations and is not micro-batching.

Common misconceptions:
- Passing a bare positional string, as in trigger("5 seconds"), is invalid; the interval must be supplied via the named parameter processingTime.
- once and availableNow are boolean-style flags, not time intervals, so forms like trigger(once="5 seconds") are invalid.
- Continuous mode is sometimes mistaken for fast micro-batching, but it is a different execution model.

Exam tips: Map the phrasing to the trigger: “every N seconds” → processingTime; “run one micro-batch and stop” → once; “process all available data in as many batches as required, then stop” → availableNow.
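With the blank filled in, the completed write could look like the sketch below. This assumes the surrounding context of the question (an active SparkSession, a `sales` table, and a `checkpointPath` variable) and runs only in a Spark/Databricks environment:

```python
from pyspark.sql.functions import col

(spark.table("sales")
    .withColumn("avg_price", col("sales") / col("units"))
    .writeStream
    .option("checkpointLocation", checkpointPath)
    .outputMode("complete")
    .trigger(processingTime="5 seconds")   # run a micro-batch every 5 seconds
    .table("new_sales")
)
```

As noted in the option explanations, the trigger choice is independent of the output mode; the same .trigger(processingTime="5 seconds") call applies regardless of whether the query uses complete, append, or update mode.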
A data analysis team has noticed that their Databricks SQL queries are running too slowly when connected to their always-on SQL endpoint. They claim that this issue is present when many members of the team are running small queries simultaneously. They ask the data engineering team for help. The data engineering team notices that each of the team’s queries uses the same SQL endpoint. Which of the following approaches can the data engineering team use to improve the latency of the team’s queries?
Increasing the cluster size of the SQL endpoint can reduce runtime for CPU/IO-intensive queries by giving each query more resources. However, for many small queries running simultaneously, the dominant issue is often queueing due to limited parallel capacity. A larger single cluster does not always increase the number of concurrent queries proportionally, so latency under high concurrency may remain high.
Increasing the maximum bound of the scaling range enables the SQL warehouse to scale out to more clusters under load. This increases parallelism and reduces query queue time, which is typically the dominant factor when many users run small queries simultaneously. It directly targets the concurrency bottleneck described and is a standard Databricks SQL tuning approach for interactive BI workloads with bursty, multi-user demand.
Auto Stop shuts down the SQL endpoint after inactivity to save cost. It does not improve latency when the endpoint is already always-on and actively serving many users. In fact, if enabled aggressively, it can worsen user experience by introducing cold-start delays when the warehouse has to restart. This option addresses cost optimization, not peak concurrency latency.
Turning on Serverless can improve operational simplicity and can reduce startup time and some management overhead. However, the question’s problem occurs while using an always-on endpoint under high concurrency. The most direct fix is to increase scale-out capacity (max clusters). Serverless alone does not guarantee sufficient concurrency unless scaling settings also allow additional capacity.
Spot Instance Policy settings apply to classic (non-serverless) compute choices and cost/reliability tradeoffs; serverless abstracts infrastructure selection away from the user. Even if interpreted broadly, changing spot policy is not a primary lever for reducing latency from concurrent small queries. The core issue is insufficient scale-out capacity, which is addressed by scaling range rather than spot/reliability tuning.
Core concept: This question tests Databricks SQL warehouse (SQL endpoint) performance under concurrent, small queries. The key idea is concurrency management via scaling (adding more clusters) versus vertical sizing (bigger cluster) and operational features like Auto Stop or Serverless.

Why the answer is correct: The symptom is slow latency specifically when many users run small queries simultaneously on the same always-on SQL endpoint. This is a classic concurrency bottleneck: a single warehouse cluster can only execute a limited number of queries in parallel before queuing occurs. Increasing the maximum bound of the SQL endpoint’s scaling range allows the warehouse to scale out to more clusters as concurrency rises, reducing queue time and improving perceived latency for many small, concurrent queries. This directly addresses the described workload pattern.

Key features / best practices: Databricks SQL warehouses support autoscaling within a configured range (min to max clusters). For BI-style workloads with many short queries, scale-out is often more effective than simply making one cluster larger because it increases parallelism and reduces contention. Best practice is to set an appropriate max scaling bound based on peak concurrency and cost constraints, and to monitor query queuing, concurrency, and warehouse utilization to tune the range.

Common misconceptions: It’s tempting to “increase cluster size” (vertical scaling), but that primarily increases resources for individual queries and may not proportionally increase concurrent query throughput; queuing can still dominate latency. Auto Stop helps cost, not latency, and can worsen latency due to cold start. Serverless can improve startup and operational simplicity, but the question’s bottleneck is concurrent execution capacity on a shared endpoint; scaling range is the most direct and deterministic fix.
Exam tips: When you see “many users,” “small queries,” and “same SQL endpoint,” think concurrency and queueing. The first lever is autoscaling (increase max clusters). Use vertical scaling when single-query performance is the issue (complex/large queries), and use Auto Stop for cost optimization rather than performance.
Which of the following is a benefit of the Databricks Lakehouse Platform embracing open source technologies?
Cloud-specific integrations are not a core benefit of embracing open source. Open source typically emphasizes portability and broad compatibility across environments. Databricks does provide deep integrations with AWS/Azure/GCP services, but those are platform and cloud partnership features, not a direct consequence of using open source technologies.
Simplified governance is primarily delivered through Databricks platform capabilities such as Unity Catalog (centralized permissions, lineage, auditing) and related governance tooling. Open source alone does not inherently simplify governance; governance depends on consistent identity, access controls, policies, and metadata management across the platform.
Ability to scale storage is mainly a benefit of using cloud object storage (like S3, ADLS, GCS) and decoupling storage from compute. While open formats can help keep data accessible, the elastic scaling of storage is a cloud infrastructure characteristic, not specifically a benefit of open source adoption.
Ability to scale workloads is driven by Databricks’ elastic compute (autoscaling clusters, job clusters, serverless options) and distributed processing (Spark). Spark is open source, but the exam framing asks for a benefit of embracing open source technologies; the more canonical, differentiating benefit is portability and reduced lock-in rather than generic scalability.
Avoiding vendor lock-in is a direct benefit of embracing open source and open standards. Using open engines and open data formats reduces dependence on proprietary technologies, making it easier to migrate workloads, integrate third-party tools, and keep data accessible outside a single vendor’s ecosystem. This is a common Lakehouse positioning point for Databricks.
Core Concept: This question tests a key Lakehouse Platform principle: Databricks’ embrace of open source and open standards (for example Apache Spark, Delta Lake, MLflow, and open table formats/standards). In certification context, “open” typically maps to interoperability and portability across tools and clouds.

Why the Answer is Correct: A primary benefit of building on open source technologies is avoiding vendor lock-in. When your data is stored in open formats (for example Delta Lake tables on cloud object storage) and your processing uses widely adopted open engines/APIs (Spark, SQL), you can move workloads, tools, or even platforms with less rework. You are not forced into proprietary storage formats or closed execution engines that make migration expensive or technically difficult.

Key Features / Architectural Principles: Databricks Lakehouse commonly stores data in cloud object storage (S3/ADLS/GCS) using open formats and transaction layers (Delta Lake). This decouples compute from storage and keeps data accessible outside a single proprietary system. Open source also enables a broad ecosystem: connectors, libraries, and community-driven innovation. In practice, this means you can integrate with multiple BI tools, orchestration systems, and data science frameworks without being constrained to a single vendor’s end-to-end stack.

Common Misconceptions: Options about scaling storage or workloads (C, D) are real benefits of cloud architectures and Databricks’ compute model, but they are not specifically a benefit of “embracing open source.” Similarly, simplified governance (B) is more closely tied to Unity Catalog and platform governance features, not open source itself. Cloud-specific integrations (A) can exist, but “open source” generally implies the opposite: portability rather than cloud-specific dependence.
Exam Tips: When you see “open source/open standards” in Databricks exam questions, look for answers about interoperability, portability, ecosystem flexibility, and reduced lock-in. When you see “scale storage/compute,” think cloud object storage + elastic clusters/serverless, which are platform/cloud benefits rather than open source benefits.
Which of the following is stored in the Databricks customer's cloud account?
The Databricks web application is part of the Databricks control plane, operated by Databricks. It includes the UI and backend services that manage workspaces, jobs, and APIs. Customers access it over the internet or private connectivity, but it is not stored in the customer’s cloud account. This is a common control plane vs data plane distinction tested on the exam.
Cluster management metadata (e.g., cluster definitions, configurations, job/cluster state) is generally stored in the Databricks control plane as workspace/management metadata. While the actual compute resources (VMs/instances) run in the customer’s cloud account (data plane), the metadata about managing those clusters is maintained by Databricks services, not stored as customer-owned cloud storage objects.
Repos are a workspace feature that integrates with external Git providers (GitHub, Azure DevOps, GitLab, Bitbucket). The repo content is synchronized from the Git provider and represented as a workspace artifact. This is not typically described as being stored in the customer’s cloud account; it is managed through the Databricks control plane/workspace layer and the external Git system.
Data is stored in the customer’s cloud account, usually in object storage like S3, ADLS Gen2, or GCS. Databricks reads and writes data there (including Delta Lake tables). Even for managed tables, the underlying storage location is in cloud storage controlled by the customer account. This is the clearest example of a data-plane asset and the expected correct answer.
Notebooks are workspace artifacts stored in the Databricks control plane (workspace storage/metadata). Although notebooks can be exported to files or stored in Git via Repos, the default notebook objects you create and edit in the Databricks workspace are not stored in the customer’s cloud account. The exam often contrasts notebooks (control plane) with data (customer cloud storage).
Core concept: This question tests the Databricks shared responsibility model and the separation between the Databricks control plane and the data plane. The control plane (managed by Databricks) hosts the web application and most workspace/management metadata, while the data plane runs compute in (or connected to) the customer’s cloud account and accesses customer-owned storage.

Why the answer is correct: Customer data is stored in the customer’s cloud account, typically in cloud object storage such as Amazon S3, Azure Data Lake Storage (ADLS Gen2), or Google Cloud Storage (GCS). Databricks is designed so that your datasets (raw, bronze/silver/gold, Delta tables, files) remain in your cloud storage under your security controls (IAM, encryption keys, network policies). Even when using managed tables, the underlying storage location is still in your cloud account (e.g., a workspace-managed S3 bucket or ADLS container provisioned in your account).

Key features and best practices: Data access is governed via cloud IAM and/or Unity Catalog permissions, with encryption at rest (cloud-provider managed keys or customer-managed keys) and in transit. Network controls (VPC/VNet injection, PrivateLink/Private Service Connect) further ensure data traffic stays within your cloud boundary. For exam purposes, remember: “data plane = your cloud account,” and persistent datasets live in your object storage.

Common misconceptions: Notebooks, Repos, and cluster metadata feel like “your assets,” but they are primarily workspace artifacts stored in the Databricks control plane (with some configurations and credentials referencing your cloud resources). The Databricks web application is also part of the control plane. Another trap is assuming DBFS implies data is stored by Databricks; in reality, DBFS often maps to storage in your cloud account, but the exam typically expects the simpler statement: customer data resides in customer cloud storage.
Exam tips: When asked “what is stored in the customer’s cloud account,” think of durable data and compute resources (clusters/VMs, disks, object storage). When asked about “workspace objects” (notebooks, repos, workspace metadata), think control plane. If Unity Catalog is mentioned, remember metadata is in the metastore (control plane), while the actual data files remain in cloud storage.
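To make the data-plane point concrete, here is a hedged Spark SQL sketch of an external Delta table whose data files live in customer-owned object storage. The bucket path, table name, and columns are all hypothetical, not from the exam material.

```sql
-- Hypothetical example: the table's metadata is registered in the metastore
-- (control plane), but its data files are written to the customer's own
-- object storage (data plane). The S3 path below is illustrative.
CREATE TABLE sales_external (
  order_id BIGINT,
  amount   DOUBLE
)
USING DELTA
LOCATION 's3://my-company-datalake/sales';
```

A managed table omits the LOCATION clause, but its files still land in storage provisioned in the customer’s cloud account; the distinction is who manages the path, not where the bytes live.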
A data engineer wants to create a relational object by pulling data from two tables. The relational object does not need to be used by other data engineers in other sessions. In order to save on storage costs, the data engineer wants to avoid copying and storing physical data.
Which of the following relational objects should the data engineer create?
A Spark SQL table is a persisted object registered in a catalog/metastore. Creating a table from two tables (e.g., CTAS) typically writes the result to storage, increasing storage costs. Even if it references external data, it is designed for reuse and persistence across sessions, which is unnecessary given the question’s constraints.
A (persistent) view stores only the SQL definition and does not copy physical data, which matches the storage requirement. However, it is persisted in the metastore and intended to be accessible across sessions (and potentially by other users, subject to permissions). Since the object is not needed beyond the current session, a temporary view is a better fit.
A database in Databricks/Spark SQL is a namespace used to organize tables and views. It does not represent a relational object created by joining two tables, nor does it directly address the requirement to avoid copying data. Creating a database would not provide the requested joined relational object.
A temporary view is session-scoped and stores only the query definition in the Spark session catalog, not a physical copy of the data. It can be built from two tables (e.g., a join) and queried like a relational object. Because it disappears when the session ends and is not shared across sessions, it perfectly matches the requirements.
A Delta table is a storage-backed table format (Parquet + transaction log) and represents persisted data. Creating a Delta table from a join generally materializes the result and writes files, increasing storage usage. Delta is ideal when you need ACID transactions, time travel, and reliable persistence, but it conflicts with the goal of avoiding physical data copies.
Core concept: This question tests understanding of logical vs. physical relational objects in Databricks/Spark SQL, especially objects that avoid materializing (copying) data and their session scope. In Databricks, tables (including Delta tables) are physical datasets stored on DBFS/cloud storage and registered in a metastore, while views and temporary views are logical definitions (saved queries) that reference underlying data without storing a new copy.

Why the answer is correct: The engineer wants (1) a relational object created from two tables (typically a join), (2) an object not needed by other engineers in other sessions, and (3) to avoid copying/storing physical data to save storage costs. A temporary view is session-scoped and stores only the query definition in the Spark session catalog. When queried, Spark re-runs the underlying query against the source tables. This meets all requirements: it is relational, avoids physical storage, and is not shared across sessions.

Key features and best practices: Temporary views are created with CREATE TEMP VIEW (or DataFrame.createOrReplaceTempView). They live only for the lifetime of the Spark session and are not persisted in the Hive/Unity Catalog metastore. They are ideal for ad hoc analysis, intermediate transformations, and modularizing complex SQL (e.g., joining two tables, filtering, projecting) without creating new tables. If you need cross-session reuse but still no data copy, a (non-temporary) view is appropriate; if you need performance, consider caching the view’s results in memory (CACHE TABLE) rather than writing a new table.

Common misconceptions: Many learners pick “View” because it also avoids storing physical data. However, a standard view is persisted in the metastore and is accessible to other users/sessions (subject to permissions). The prompt explicitly says the object does not need to be used by other engineers in other sessions, pointing to a temporary view. Another misconception is thinking a “Spark SQL Table” is lightweight; tables generally imply persisted metadata and typically persisted data (managed or external), and Delta tables definitely store data files.

Exam tips: Look for keywords: “avoid copying physical data” implies a view; “not used by others/other sessions” implies a TEMPORARY view. If the question mentions “shared,” “reusable,” or “governed,” lean toward a standard view or table in Unity Catalog; if it mentions “session-only,” choose a temporary view.
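As a sketch of the recommended approach, the temporary view described above might be created like this in Spark SQL (the table and column names are illustrative, not from the question):

```sql
-- Session-scoped relational object over a join of two tables; only the
-- query definition is stored, no data files are written.
CREATE OR REPLACE TEMPORARY VIEW order_details AS
SELECT o.order_id, o.order_date, c.customer_name
FROM orders AS o
JOIN customers AS c
  ON o.customer_id = c.customer_id;

-- Query it like any relational object within the same session:
SELECT * FROM order_details WHERE order_date >= '2024-01-01';
```

Because only the SELECT definition is stored, the join is re-evaluated each time the view is queried, and the view disappears when the session ends — exactly the session-only, zero-storage behavior the question asks for.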
In which of the following scenarios should a data engineer use the MERGE INTO command instead of the INSERT INTO command?
Incorrect. Changing the location of data is not a reason to choose MERGE INTO. A table’s storage location is determined by its definition (managed vs. external, CREATE TABLE ... LOCATION), and data is relocated by rewriting it (e.g., CTAS or a deep clone) or reloading it (e.g., COPY INTO). MERGE and INSERT are DML operations that write rows; they don’t change a table’s storage location.
Incorrect. Whether a table is external does not determine MERGE vs INSERT. The key requirement is that the target table must be a Delta table to use MERGE INTO. External Delta tables can still be merged into, and external non-Delta tables cannot. INSERT INTO works for many table types, but being external is not, by itself, the deciding factor.
Incorrect. Deleting the source table is unrelated to the choice between MERGE and INSERT. MERGE is chosen based on needing conditional updates/inserts (upserts) into the target. Source lifecycle management (dropping staging tables, cleaning files) is an operational concern and can apply regardless of whether you used INSERT or MERGE.
Correct. If the target table cannot contain duplicate logical records (typically defined by a business key), MERGE INTO is the appropriate command because it can match incoming rows to existing rows and update them, inserting only when no match exists. INSERT INTO would append all incoming rows and can easily create duplicates unless additional deduplication logic is applied.
Incorrect. The source does not need to be a Delta table for MERGE INTO; it can be a view, a subquery, or data read from Parquet/CSV and transformed into a DataFrame/view. The critical requirement is that the target table is Delta so it can provide ACID transactions and row-level operations needed for MERGE.
Core concept: This question tests when to use Delta Lake’s MERGE INTO (upsert) versus INSERT INTO (append-only). INSERT INTO adds new rows to a target table; it does not reconcile incoming data with existing records. MERGE INTO performs conditional logic (MATCHED / NOT MATCHED) to update existing rows and insert new ones in a single atomic transaction.

Why the answer is correct: Use MERGE INTO when you must ensure the target table does not end up with duplicate logical records (for example, duplicates by a business key such as customer_id, order_id, or device_id). With MERGE, you can match on a key and define behavior:
- WHEN MATCHED THEN UPDATE (overwrite the existing record)
- WHEN NOT MATCHED THEN INSERT (add the new record)
This pattern is the standard approach for ingesting CDC (change data capture) feeds, late-arriving updates, and “latest state” dimension tables (SCD Type 1). INSERT INTO would simply append the incoming rows, potentially creating multiple versions of the same key and violating the intended uniqueness semantics.

Key features and best practices:
- MERGE INTO is supported for Delta tables and provides ACID guarantees (atomicity and isolation) for the combined update/insert operation.
- It enables idempotent ingestion when combined with deterministic match conditions (e.g., merging on a primary key) and appropriate deduplication of the source.
- For performance, ensure the merge condition uses selective keys and consider partitioning or Z-ORDER on merge keys to reduce file scanning.

Common misconceptions: Many assume “no duplicates” is enforced automatically by the table. Delta does not enforce primary keys by default; uniqueness is typically maintained by pipeline logic (MERGE) or by constraints where applicable. Another misconception is that MERGE is needed for external tables or non-Delta sources; in reality, only the target must be a Delta table.

Exam tips: If the requirement mentions upserts, CDC, late updates, or maintaining one row per key, think MERGE INTO. If the requirement is purely appending new data (no updates to existing keys), think INSERT INTO. Also remember: MERGE is about conditional update/insert logic, not about changing storage locations or table types.
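A minimal MERGE INTO sketch for the upsert pattern described above, assuming a Delta target table customers and a staged source customer_updates (all table and column names are illustrative):

```sql
-- Upsert: keep exactly one row per customer_id in the target.
MERGE INTO customers AS target
USING customer_updates AS source
  ON target.customer_id = source.customer_id
WHEN MATCHED THEN
  UPDATE SET target.email      = source.email,
             target.updated_at = source.updated_at
WHEN NOT MATCHED THEN
  INSERT (customer_id, email, updated_at)
  VALUES (source.customer_id, source.email, source.updated_at);
```

If the source can itself contain multiple rows per key, deduplicate it first (e.g., keep only the latest row per customer_id), because MERGE raises an error when more than one source row matches the same target row.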

