
Simulate the real exam experience with 45 questions and a 90-minute time limit. Practice with AI-verified answers and detailed explanations.
AI-Powered
Every answer is cross-verified by 3 leading AI models to ensure maximum accuracy. Get detailed per-option explanations and in-depth question analysis.
A data scientist wants to tune a set of hyperparameters for a machine learning model. They have wrapped a Spark ML model in the objective function objective_function and they have defined the search space search_space. As a result, they have the following code block:
num_evals = 100
trials = SparkTrials()
best_hyperparam = fmin(
    fn=objective_function,
    space=search_space,
    algo=tpe.suggest,
    max_evals=num_evals,
    trials=trials
)
Which of the following changes do they need to make to the above code block in order to accomplish the task?
Correct. SparkTrials distributes trials across Spark executors, which is problematic when each trial trains a Spark ML model (a Spark job) because it can create nested Spark execution (Spark job launched from within a Spark task). Switching to Trials() runs trials on the driver, and each trial can safely call estimator.fit() and submit Spark jobs normally.
Incorrect. There is no requirement to keep max_evals under 10. The number of evaluations affects runtime and search quality, not correctness. If anything, more evaluations can improve the chance of finding good hyperparameters, assuming the tuning process is configured correctly for the type of objective function.
Incorrect. Hyperopt provides fmin (minimization). There is no fmax API in standard Hyperopt usage. To maximize a metric (e.g., AUC), you convert it to a loss by negating it or using (1 - metric), then still call fmin.
Incorrect. Removing trials=trials would make Hyperopt use the default Trials() object implicitly, which could work, but it does not represent the necessary explicit change asked for. The key fix is to avoid SparkTrials for Spark ML objectives; you should explicitly use Trials() for clarity and correctness.
Incorrect. algo=tpe.suggest selects the TPE Bayesian optimization algorithm and is valid for both Trials and SparkTrials. Removing it would fall back to a default algorithm (often random search), which changes optimization behavior but does not address the core issue of Spark ML incompatibility with SparkTrials.
Core concept: This question tests Hyperopt-based hyperparameter tuning on Databricks/Spark, specifically the difference between local (single-driver) execution and distributed execution. In Databricks, Hyperopt can run trials in parallel across the Spark cluster using SparkTrials, but that only works when the objective function is compatible with distributed execution.

Why the answer is correct: The objective function wraps a Spark ML model. Spark ML training is itself a distributed Spark job and relies on the SparkContext/driver to coordinate executors. Hyperopt’s SparkTrials runs each trial as a Spark task (distributed across executors). Starting a Spark job from within a Spark task is generally not supported (nested Spark jobs / SparkContext usage from executors), and in practice Spark ML estimators are not compatible with being trained inside SparkTrials workers. Therefore, to tune Spark ML models with Hyperopt, you typically run trials on the driver using Trials() (sequentially), letting each trial submit its own Spark job normally. That requires changing SparkTrials() to Trials().

Key features / best practices:
- Use Trials() when each trial launches Spark jobs (e.g., Spark ML fit) to avoid nested Spark execution issues.
- Use SparkTrials when the objective function is “pure Python” or otherwise executor-safe (e.g., training non-Spark models on local data per trial) and benefits from parallelism.
- Keep algo=tpe.suggest for Bayesian optimization; it is a standard and recommended choice.

Common misconceptions: Many assume SparkTrials is always better because it parallelizes trials. But with Spark ML, parallelizing trials via SparkTrials can fail or behave poorly due to nested Spark actions. Another misconception is that fmin should be changed to fmax; Hyperopt always minimizes, and you instead negate the metric if you want to maximize.

Exam tips: If you see “Spark ML model inside objective function,” think “trial runs a Spark job.” For Hyperopt, that usually implies Trials() (driver-based) rather than SparkTrials(). Remember: Hyperopt minimizes; maximize by returning a negative loss.
Which of the following statements describes a Spark ML estimator?
Incorrect. A hyperparameter grid is typically built with ParamGridBuilder and used by tuning components like CrossValidator or TrainValidationSplit. The grid itself is not an Estimator; it’s a configuration object listing parameter combinations to try. Estimators are the algorithms being tuned (e.g., LogisticRegression), not the grid describing candidate settings.
Incorrect. Chaining multiple algorithms together to specify an ML workflow describes a Pipeline (an Estimator) or a PipelineModel (a Transformer) depending on whether it is fit yet. While a Pipeline is indeed an Estimator, the statement is not the definition of an Estimator in general; it describes a specific workflow container rather than the core Estimator concept.
Incorrect. This describes a Transformer (often a trained model) because it takes a DataFrame and outputs a new DataFrame with additional columns such as predictions, probability, or transformed features. In Spark ML, the trained model returned by an Estimator’s fit method is usually a Transformer (e.g., LogisticRegressionModel).
Correct. This is the canonical Spark ML definition: an Estimator is an algorithm that can be fit on a DataFrame to produce a Transformer. The Estimator encapsulates the training procedure (fit), and the resulting Transformer encapsulates the learned parameters and can be applied to new data via transform.
Incorrect. An evaluation tool is an Evaluator (e.g., RegressionEvaluator, MulticlassClassificationEvaluator, BinaryClassificationEvaluator). Evaluators compute metrics from predictions and labels but do not train models and do not produce Transformers. They are commonly used alongside Estimators in tuning workflows like CrossValidator.
Core Concept: This question tests Spark ML’s Pipeline API abstractions: Estimator and Transformer. In Spark ML, machine learning workflows are built by composing stages (Estimators and Transformers) into a Pipeline. Understanding the fit/transform lifecycle is fundamental for training and scoring at scale in Databricks.

Why the Answer is Correct: An Estimator in Spark ML is an algorithm or learning procedure that can be fit on a DataFrame to produce a Transformer. The Estimator implements a fit(dataset) method. During fit, Spark computes the necessary parameters from the input DataFrame (e.g., coefficients for LogisticRegression, splits for DecisionTree, or statistics for StringIndexer). The output of fit is a model object (e.g., LogisticRegressionModel), and that model is a Transformer that implements transform(dataset) to add prediction-related columns (or feature columns) to a DataFrame.

Key Features / Best Practices:
- Estimator vs Transformer lifecycle: Estimator.fit() -> Transformer; Transformer.transform() -> DataFrame.
- In Pipelines: Pipeline itself is an Estimator; Pipeline.fit() returns a PipelineModel (a Transformer).
- Many “Model” classes in Spark ML are Transformers (e.g., RandomForestClassificationModel), while their corresponding algorithm classes are Estimators (e.g., RandomForestClassifier).
- This separation supports reproducibility and scalable scoring: train once (fit) and apply many times (transform) on new data.

Common Misconceptions: Learners often confuse “Estimator” with “trained model.” In Spark ML, the trained model is typically the Transformer returned by fit (often named *Model*). Another confusion is mixing up hyperparameter tuning tools (ParamGridBuilder) or evaluators (BinaryClassificationEvaluator) with Estimators.

Exam Tips: Memorize the rule: “Estimator fits, Transformer transforms.” If you see wording like “trained model that makes predictions,” that’s a Transformer. If you see “algorithm that can be fit,” that’s an Estimator. Also remember Pipeline is an Estimator and PipelineModel is a Transformer; this pattern appears frequently in certification questions.
A data scientist has a Spark DataFrame spark_df. They want to create a new Spark DataFrame that contains only the rows from spark_df where the value in column discount is less than or equal to 0. Which of the following code blocks will accomplish this task?
Incorrect. .loc is a pandas DataFrame indexer used for label-based selection and boolean masking. Spark DataFrames do not support .loc, because Spark DataFrames are not indexed the same way and operate via distributed query plans. This option reflects pandas syntax, not PySpark/Databricks Spark DataFrame operations.
Incorrect as the expected answer, though technically valid. In PySpark, DataFrame.__getitem__ supports passing a Column expression, so spark_df[spark_df["discount"] <= 0] can return a filtered DataFrame. filter()/where() is the more idiomatic and exam-expected syntax, but claiming this bracket form is invalid or not a Spark API pattern would be technically misleading.
Correct. filter() is a standard Spark DataFrame transformation for row-level filtering. col("discount") creates a Spark Column reference, and (col("discount") <= 0) builds a boolean Column expression. Spark returns a new DataFrame containing only rows meeting the predicate, and the operation is lazily evaluated and optimized by Spark’s Catalyst optimizer.
Incorrect. This is another pandas-style .loc usage (and the parentheses/argument structure also resembles pandas). Spark DataFrames do not provide .loc for row/column selection. In Spark, you filter rows with filter()/where() and select columns with select(), not with pandas indexers.
Core concept: This question tests how to filter rows in a PySpark Spark DataFrame. In Spark, row filtering is done by applying a boolean Column expression to the DataFrame, most commonly with filter() or where(), but bracket syntax with a Column condition is also supported. The goal is to return a new DataFrame containing only rows where discount <= 0.

Why the answer is correct: Option C is correct because filter(col("discount") <= 0) is the canonical and most explicit Spark syntax for row filtering. The expression col("discount") <= 0 produces a Spark Column of boolean values, and filter() keeps only rows where that expression evaluates to true. Spark builds this into the logical plan and optimizes it lazily before execution.

Key features / best practices:
- filter() and where() are equivalent for Spark DataFrames and are the clearest row-filtering APIs.
- A condition such as col("discount") <= 0 or spark_df["discount"] <= 0 creates a Spark Column expression, not a Python boolean.
- Spark DataFrames are immutable, so filtering returns a new DataFrame rather than modifying the original.
- .loc is a pandas construct and is not available on Spark DataFrames.

Common misconceptions: A frequent mistake is assuming all DataFrame syntax is interchangeable between pandas and Spark. While .loc is pandas-only, bracket syntax with a boolean Column condition can work in PySpark, even though filter()/where() is more idiomatic and more commonly tested. Learners should distinguish unsupported pandas indexers from valid Spark Column-expression filtering.

Exam tips:
- Prefer filter() or where() when you see Spark DataFrame row filtering questions.
- Recognize .loc as pandas syntax and eliminate those options for Spark questions.
- Remember that Spark conditions must be Column expressions, not plain Python booleans.
- If multiple answers were allowed, both filter(...) and df[df["col"] <= value] could be considered valid in PySpark, but exams often expect the canonical filter()/where() form.
A data scientist has defined a Pandas UDF function predict to parallelize the inference process for a single-node model:
@pandas_udf("double")
def predict(iterator: Iterator[pd.DataFrame]) -> Iterator[pd.Series]:
    model_path = f"runs:/{run.info.run_id}/model"
    model = mlflow.sklearn.load_model(model_path)
    for features in iterator:
        pdf = pd.concat(features, axis=1)
        yield pd.Series(model.predict(pdf))
They have written the following incomplete code block to use predict to score each record of Spark DataFrame spark_df:
prediction_df = spark_df.withColumn(
    "prediction",
    ____
)
Which of the following lines of code can be used to complete the code block to successfully complete the task?
Correct. The UDF is being used in withColumn, so the expression must be a Spark Column expression produced by invoking the Pandas UDF over the DataFrame’s feature columns. Among the options, A is the only one that matches the intended pattern of passing all columns into the UDF so Spark can batch them and return one prediction per row. Strictly speaking in PySpark you would typically write predict(*[col(c) for c in spark_df.columns]), but A is clearly the exam’s intended equivalent.
Incorrect. mapInPandas is a DataFrame method used as df.mapInPandas(func, schema) to transform partitions and return pandas DataFrames, not a Column expression usable inside withColumn. Also, mapInPandas requires an explicit output schema and is not invoked as a standalone function in this context.
Incorrect. Iterator(spark_df) is not a valid way to supply Spark DataFrame data to a Pandas UDF. Spark controls the iterator of pandas batches internally during execution. You never manually wrap a Spark DataFrame in an Iterator to call a Pandas UDF; instead you pass Column expressions and Spark handles batching and distribution.
Incorrect. This mixes APIs incorrectly. mapInPandas is not used inside withColumn, and predict(spark_df.columns) is not a valid call because spark_df.columns is a Python list of strings, not Spark Columns. Even if predict were called, it would not return a Spark Column expression suitable for withColumn.
Incorrect. Passing spark_df.columns provides a single Python list argument (of strings) to the UDF, not multiple Spark Column arguments. Scalar Pandas UDFs require Spark Column inputs; they cannot accept a Python list of column names as a single argument in a DataFrame expression. The correct approach is to expand the list: predict(*spark_df.columns).
Core Concept: This question tests how to apply a Pandas UDF as a column expression in a Spark DataFrame transformation. A Pandas UDF used with withColumn must be invoked with Spark Column expressions, and Spark handles batching rows into pandas objects behind the scenes. The key distinction is between column-level Pandas UDF usage and partition-level APIs like mapInPandas.

Why the Answer is Correct: Option A is the intended answer because it is the only choice that represents calling the Pandas UDF in a withColumn expression over all input columns. In practice, the safe PySpark form is to pass actual Column objects, typically with predict(*[col(c) for c in spark_df.columns]); the option uses the shorthand most exam questions intend when unpacking all columns. This matches the UDF’s structure, where each batch contains the feature columns and the function returns one prediction per input row.

Key Features / Best Practices:
- Pandas UDFs used in withColumn return a Spark Column expression and operate row-wise in vectorized batches.
- Spark, not the user, constructs the iterator of pandas batches during execution.
- mapInPandas is a DataFrame-level transformation that returns whole pandas DataFrames and requires an explicit schema.

Common Misconceptions: A common mistake is confusing mapInPandas with scalar or iterator-style Pandas UDFs used in select/withColumn. Another is thinking you manually pass iterators or DataFrames into the UDF; Spark manages that automatically. It is also easy to overlook that UDF calls should conceptually receive Spark Columns, not ordinary Python containers.

Exam Tips:
- If the code uses withColumn, the missing expression must evaluate to a Spark Column.
- Eliminate any option involving mapInPandas unless the code is transforming an entire DataFrame with a schema.
- When a UDF should consume many feature columns, look for the option that unpacks all columns into the UDF call.
A machine learning engineer has created a Feature Table new_table using Feature Store Client fs. When creating the table, they specified a metadata description with key information about the Feature Table. They now want to retrieve that metadata programmatically. Which of the following lines of code will return the metadata description?
Incorrect. Databricks Feature Store exposes feature table metadata programmatically. The FeatureStoreClient.get_table method returns a FeatureTable object that includes metadata such as the description. This option may seem plausible if you assume descriptions are only visible in the UI, but they are accessible via the API.
Incorrect. create_training_set is used to create a TrainingSet object by specifying feature lookups and a label DataFrame. It does not retrieve feature table metadata and does not accept a feature table name alone as shown. It’s for assembling training data, not inspecting table properties.
Correct. fs.get_table("new_table") returns a FeatureTable metadata object, and the .description attribute returns the stored metadata description provided when the table was created. This is the direct, programmatic way to retrieve the description without loading the table’s data.
Incorrect. load_df() loads the feature table’s data into a Spark DataFrame. A DataFrame contains rows/columns of feature values, not the table’s metadata description. While you could inspect schema from the DataFrame, you cannot retrieve the Feature Store description from it.
Incorrect (incomplete). fs.get_table("new_table") returns the FeatureTable object, but by itself it does not “return the metadata description”; it returns the whole metadata object. To specifically return the description string, you must access the .description property (as in option C).
Core concept: This question tests Databricks Feature Store table metadata access. In Databricks Feature Store, a Feature Table is a managed entity registered in the metastore (Unity Catalog or workspace metastore depending on configuration). When you create a feature table, you can provide a human-readable description (and other metadata) that is stored with the table’s definition. Programmatic retrieval is done by fetching the table’s metadata object via the Feature Store client.

Why the answer is correct: fs.get_table("new_table") returns a FeatureTable object (a metadata handle) describing the registered feature table. That object includes properties such as name, primary keys, timestamp keys (if any), and the table’s description. Therefore, accessing the .description attribute on the returned FeatureTable object (fs.get_table("new_table").description) returns the metadata description that was set at creation time.

Key features / best practices:
- Use FeatureStoreClient.get_table to retrieve metadata without loading the full dataset.
- Use metadata (description, tags, ownership conventions) to make features discoverable and reusable across teams.
- Distinguish between “metadata retrieval” (cheap, control-plane) and “data retrieval” (loads a DataFrame, potentially expensive).

Common misconceptions:
- Confusing get_table (metadata) with load_df (data). load_df returns a Spark DataFrame of feature values, not the description.
- Thinking metadata is not accessible programmatically (it is).
- Confusing create_training_set (used to build training datasets from feature lookups) with table inspection.

Exam tips:
- Remember the pattern: get_table() returns a FeatureTable metadata object; load_df() returns the underlying data.
- If a question asks for “description/metadata,” look for property access on the metadata object (e.g., .description) rather than methods that create datasets or load data.
- Be precise about return types: FeatureTable vs DataFrame vs TrainingSet.
Which of the following is a benefit of using vectorized pandas UDFs instead of standard PySpark UDFs?
Incorrect. Type hints are not the defining advantage of pandas UDFs. Spark requires explicit return types for UDFs (and pandas UDFs) via Spark SQL types, and while Python type hints may be used in code style, they are not the core benefit tested. The key differentiator is vectorized execution with Arrow, not typing support.
Correct. pandas UDFs are vectorized: Spark sends data to Python in columnar batches (pandas Series/DataFrames) using Apache Arrow. This reduces per-row Python call overhead and serialization costs compared with standard PySpark UDFs, which execute row-by-row. The batch model is the main reason pandas UDFs typically perform significantly better.
Partially true but not the best answer. pandas UDFs do allow you to write logic using pandas/NumPy operations inside the function because inputs arrive as pandas objects. However, the exam question asks for a benefit compared to standard UDFs; the primary, canonical benefit is vectorized batch processing (and Arrow-based transfer), not merely that pandas APIs can be used.
Incorrect. Both standard PySpark UDFs and pandas UDFs operate on distributed Spark DataFrames; distribution is a property of Spark, not a unique advantage of pandas UDFs. pandas UDFs still run per partition/executor like other Spark transformations, so this does not distinguish them from standard UDFs.
Incorrect. pandas UDFs do not inherently guarantee in-memory processing or prevent spilling to disk. Spilling is determined by Spark’s execution plan, shuffle operations, partition sizes, and memory configuration. While Arrow can improve transfer efficiency, it does not change Spark’s fundamental memory management or eliminate disk spill behavior.
Core Concept: This question tests understanding of PySpark UDF execution models and why pandas (vectorized) UDFs—also called Arrow-optimized UDFs—are typically faster than standard (row-at-a-time) Python UDFs in Databricks/Spark.

Why the Answer is Correct: Vectorized pandas UDFs process data in batches (as pandas Series/DataFrames) rather than invoking Python once per row. Spark uses Apache Arrow to efficiently transfer columnar batches between the JVM (Spark engine) and Python. This reduces per-row serialization/deserialization overhead and Python function call overhead, which are the main performance bottlenecks of standard PySpark UDFs. Therefore, the key benefit is batch (vectorized) processing.

Key Features / Best Practices:
- Uses Apache Arrow for columnar data transfer, enabling efficient JVM↔Python interchange.
- Operates on pandas Series/DataFrames, enabling vectorized operations (NumPy/pandas) that are faster than Python loops.
- Commonly used for scalar pandas UDFs, iterator pandas UDFs, and grouped map operations (depending on Spark version/features).
- Best practice: prefer built-in Spark SQL functions first; if custom logic is needed, prefer pandas UDFs over standard Python UDFs for performance, and ensure Arrow is enabled/compatible.

Common Misconceptions: Several options describe properties that are not unique benefits. For example, both standard UDFs and pandas UDFs run on distributed DataFrames (Spark executes them across partitions). Also, “pandas API use inside the function” is possible with pandas UDFs, but the exam-relevant performance benefit is specifically vectorization/batching via Arrow. “In-memory rather than spilling to disk” is not a defining characteristic of pandas UDFs; spilling depends on Spark execution, shuffles, and memory pressure.

Exam Tips: When you see “vectorized pandas UDF,” associate it with “batch processing + Arrow columnar transfer + reduced Python overhead.” If the question asks for the primary benefit versus standard PySpark UDFs, pick the option about processing data in batches (vectorization), not generic statements about distribution or memory behavior.
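The vectorization intuition can be shown with pandas alone, no Spark required: the row-at-a-time loop below models how a standard PySpark UDF pays one Python call per row, while the vectorized line models how a pandas UDF processes a whole Arrow batch at once. The doubling operation is an arbitrary stand-in.

```python
# Pure-pandas sketch of the row-at-a-time vs. vectorized execution models.
import numpy as np
import pandas as pd

batch = pd.Series(np.arange(100_000, dtype="float64"))

# Row-at-a-time: roughly what a standard PySpark UDF does -- one Python
# function call (plus serialization overhead) per row.
row_at_a_time = pd.Series([x * 2.0 for x in batch])

# Vectorized: what a pandas UDF does per Arrow batch -- one NumPy-backed
# operation over the whole column, with no per-row Python overhead.
vectorized = batch * 2.0
```

Both produce identical results; the vectorized form is dramatically faster because the loop stays in compiled code.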
A data scientist wants to explore the Spark DataFrame spark_df. The data scientist wants visual histograms displaying the distribution of numeric features to be included in the exploration. Which of the following lines of code can the data scientist run to accomplish the task?
Incorrect. spark_df.describe() is a Spark DataFrame method that returns a new DataFrame containing basic descriptive statistics (count, mean, stddev, min, max) for numeric columns (and count/min/max for non-numeric). It does not generate visualizations such as histograms. It’s useful for quick numeric summaries but does not meet the requirement for “visual histograms.”
Incorrect. dbutils.data(spark_df).summarize() is not the correct API shape. In Databricks, dbutils.data is a utility namespace and summarize is called as a function (dbutils.data.summarize(df)), not by calling dbutils.data(df) as if it were a constructor returning an object with methods. This option is therefore syntactically/semantically incorrect for the intended summarize feature.
Incorrect. This task can be accomplished in a single line in Databricks notebooks using the built-in summarize helper. While in pure Spark (outside Databricks) you would typically need multiple steps (compute bins, then plot), Databricks provides a one-line EDA summary that includes histograms. So it is incorrect to claim it cannot be done in one line.
Incorrect. spark_df.summary() is another Spark DataFrame method that extends describe() by allowing additional statistics (e.g., percentiles) depending on parameters. Like describe(), it returns a DataFrame of summary statistics and does not automatically render histograms or other visual distribution plots. It helps with numeric profiling but not with visual histogram output.
Correct. dbutils.data.summarize(spark_df) is the Databricks notebook EDA utility that produces an interactive summary of the DataFrame, including histograms for numeric columns and distribution summaries. It is specifically designed for exploratory analysis and meets the requirement of generating visual histograms in a single line of code.
Core concept: This question tests Databricks’ built-in exploratory data analysis (EDA) utilities for Spark DataFrames, specifically the ability to generate visual summaries (including histograms) directly from a DataFrame in a single call. In Databricks notebooks, there is a convenience function commonly used for quick EDA: a “summarize” capability that produces descriptive statistics and visualizations (histograms for numeric columns, bar charts for categorical columns, missing value counts, etc.).

Why the answer is correct: The only option that corresponds to the Databricks EDA visualization helper is calling the summarize function on the DataFrame via the dbutils.data namespace. In Databricks, summarize is designed to create an interactive summary panel with distributions, which includes histograms for numeric features. Therefore, dbutils.data.summarize(spark_df) is the single-line call that accomplishes “visual histograms displaying the distribution of numeric features.”

Key features / best practices:
- summarize is intended for fast, notebook-based exploration and profiling, not for production pipelines.
- It works best on reasonably sized samples; for very large datasets, consider sampling first to reduce compute and improve responsiveness.
- For programmatic/portable profiling outside Databricks notebooks, you’d typically compute histograms with Spark (e.g., approxQuantile, bucketization) or use pandas/plotting libraries after sampling.

Common misconceptions: Many learners confuse Spark’s describe()/summary() with Databricks’ visualization-oriented summarize. Spark’s describe/summary return tabular statistics only (count/mean/std/min/max and optional percentiles) and do not render histograms. Another trap is thinking this cannot be done in one line; in Databricks notebooks, summarize is explicitly a one-liner.

Exam tips:
- Remember: Spark DataFrame methods like describe() and summary() produce numeric aggregates, not charts.
- If the question explicitly asks for “visual histograms” in Databricks, look for summarize (or the notebook UI “Data Profile”/display-based options), not describe/summary.
- Watch for exact function names and parentheses placement; dbutils.data.summarize(df) is the canonical pattern among the provided choices.
A machine learning engineer is trying to perform batch model inference. They want to get predictions using the linear regression model saved at the path model_uri for the DataFrame batch_df. batch_df has the following schema:

customer_id STRING

The machine learning engineer runs the following code block to perform inference on batch_df using the linear regression model at model_uri:

predictions = fs.score_batch(
    model_uri,
    batch_df
)
In which situation will the machine learning engineer’s code block perform the desired inference?
Correct. fs.score_batch can automatically look up and join the required features at inference time only when the model was logged with Feature Store training set metadata (feature lookups). With that metadata present, providing just the entity key column (customer_id) is enough for Feature Store to retrieve features and score the batch.
Incorrect. Supplying all features in a Spark DataFrame is not what enables fs.score_batch to work with only customer_id. If you already have all feature columns, you could score with standard MLflow/Spark methods. fs.score_batch’s key value is automatic feature retrieval, which requires Feature Store model logging metadata.
Incorrect as the best answer. If the model truly uses only customer_id as its sole input feature, then the provided batch_df would indeed contain the required input column and scoring could work. However, this is a narrow special case and not the Feature Store-specific condition being tested by fs.score_batch. The generally correct condition is that the model was logged with Feature Store feature metadata, which allows Databricks to automatically look up additional features using customer_id.
Incorrect. The code can perform the desired inference in a valid and common situation: when the model is Feature Store–logged with feature lookups. In that case, fs.score_batch will enrich batch_df with the required features and return predictions.
Incorrect. Features do not need to be in a single Feature Store table. Feature Store supports training sets built from multiple feature tables via multiple lookups. What matters is that the model was logged with the feature lookup metadata so Feature Store knows which tables/features to retrieve and how to join them.
Core Concept: This question tests Databricks Feature Store batch scoring with fs.score_batch and how feature lookups are resolved at inference time. In Feature Store, a model can be logged with feature metadata (training set lineage), enabling automatic feature retrieval during batch inference when only entity keys are provided. Why the Answer is Correct: fs.score_batch(model_uri, batch_df) will perform the desired inference when the model at model_uri was logged with Feature Store training set information (feature lookups). If the model was logged via Feature Store (for example, using FeatureStoreClient.log_model with a TrainingSet), the model artifact contains metadata describing which Feature Store tables and features were used and how to join them (keys). Then, at inference time, providing a DataFrame with the entity key(s) (here, customer_id) is sufficient: Feature Store will look up the required features, assemble the feature vector, and apply the model to produce predictions. Key Features / Best Practices: - Feature Store “feature-aware” models: logging a model with feature metadata enables consistent training-serving feature computation. - Entity key-based scoring: batch_df can contain only the join keys (customer_id) as long as those keys match the feature lookup keys. - Prevents training/serving skew: the same feature definitions and transformations are reused. Common Misconceptions: A common mistake is assuming fs.score_batch works like plain Spark/MLflow scoring where you must supply all feature columns. That is true for generic MLflow pyfunc/spark_udf scoring, but Feature Store scoring can fetch missing features automatically only if the model was logged with Feature Store metadata. Exam Tips: - If you see fs.score_batch and the input DataFrame contains only entity IDs, the model must be Feature Store–logged with feature lookups. 
- If the model was not logged with Feature Store metadata, you must provide a DataFrame containing all feature columns in the correct schema/order expected by the model (and typically you would not use fs.score_batch for that).
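The key-only scoring pattern described above can be sketched as follows. This is a non-runnable sketch: it only executes inside a Databricks workspace with Feature Store configured, and the table name, model URI, and key column are placeholders, not values from the question.

```python
from databricks.feature_store import FeatureStoreClient

fs = FeatureStoreClient()

# batch_df needs only the entity key(s); features are looked up automatically.
# "prod.customers" and "customer_id" are hypothetical names for illustration.
batch_df = spark.table("prod.customers").select("customer_id")

# This works only because the model at this URI was logged via fs.log_model(...)
# with a TrainingSet, so the artifact carries the feature lookup metadata.
predictions_df = fs.score_batch("models:/customer_churn/Production", batch_df)
```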
A data scientist uses 3-fold cross-validation when optimizing model hyperparameters for a regression problem. The following root-mean-squared-error values are calculated on each of the validation folds:
• 10.0
• 12.0
• 17.0
Which of the following values represents the overall cross-validation root-mean-squared error?
13.0 is the arithmetic mean of the three fold RMSE values. In standard k-fold cross-validation reporting, the overall CV metric is computed as the average of the per-fold metrics: (10 + 12 + 17)/3 = 13. This is the value used to compare hyperparameter settings in most tuning workflows.
17.0 is the RMSE from the worst-performing validation fold only. Cross-validation is designed to summarize performance across all folds, not to report only the maximum error. While the worst fold can be useful for diagnosing instability, it is not the overall CV RMSE.
12.0 is the RMSE from one specific fold (and also the median of the three values). Although median can be used as a robust summary in some analyses, the conventional and expected definition of overall cross-validation RMSE for exam questions is the mean across folds, not the median or a single fold value.
39.0 is the sum of the fold RMSE values (10 + 12 + 17). Summing metrics across folds is not a standard way to report cross-validation performance because it scales with the number of folds and is not directly interpretable as an error measure for the model.
10.0 is the RMSE from the best-performing validation fold only. Selecting the minimum fold metric would overstate expected generalization performance and defeats the purpose of cross-validation. The overall CV RMSE should reflect performance across all folds, typically via the mean.
Core Concept: This question tests how to aggregate cross-validation (CV) metrics across folds. In k-fold CV, you train k models (each leaving out a different validation fold) and compute a validation metric per fold. The overall CV performance is typically summarized by the mean (and often also the standard deviation) of the fold metrics. In Databricks/MLflow contexts, hyperparameter tuning compares parameter sets using the average metric across folds.

Why the Answer is Correct: The fold RMSE values are 10.0, 12.0, and 17.0. The overall cross-validation RMSE is the arithmetic mean of the fold RMSEs: (10.0 + 12.0 + 17.0) / 3 = 39.0 / 3 = 13.0. Therefore, 13.0 is the correct overall CV RMSE.

Key Features / Best Practices:
- CV produces multiple estimates of generalization error; averaging reduces sensitivity to a single lucky/unlucky split.
- For model selection, you generally choose the hyperparameters that minimize the mean CV RMSE.
- Many tools (including Spark ML's CrossValidator and common tuning workflows) report the average metric across folds; practitioners often also inspect variability (e.g., standard deviation) to assess stability.

Common Misconceptions:
- Confusing "overall RMSE" with the sum of RMSEs (39.0). Summing is not a standard summary statistic for CV.
- Picking the best (10.0) or worst (17.0) fold. Those are single-fold results, not the cross-validated estimate.
- Selecting the median (12.0) or assuming the "middle" value represents overall performance. While the median can be used for robustness, the standard CV summary for exam purposes is the mean.

Exam Tips: When you see k-fold CV metrics listed per fold, default to the mean unless the question explicitly asks for something else (e.g., a weighted average by fold size, the median, or a pooled RMSE computed from all out-of-fold predictions). For typical certification questions, "overall cross-validation metric" means the average across folds.
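The aggregation above is a one-line computation; a minimal plain-Python illustration using the fold values from the question:

```python
# Per-fold validation RMSEs from the 3-fold cross-validation in the question
fold_rmses = [10.0, 12.0, 17.0]

# The overall CV RMSE is the arithmetic mean of the per-fold values
overall_cv_rmse = sum(fold_rmses) / len(fold_rmses)
print(overall_cv_rmse)  # 13.0
```

Note that this is the mean of per-fold RMSEs, not a pooled RMSE over all out-of-fold predictions; the two can differ slightly, but the per-fold mean is the conventional summary.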
A data scientist wants to use Spark ML to impute missing values in their PySpark DataFrame features_df. They want to replace missing values in all numeric columns in features_df with each respective numeric column’s median value. They have developed the following code block to accomplish this task:
imputer = Imputer(
strategy="median",
inputCols=input_columns,
outputCols=output_columns
)
imputed_features_df = imputer.transform(features_df)
The code block is not accomplishing the task. Which of the following reasons describes why the code block is not accomplishing the imputation task?
This is not why the code fails. While it is best practice to fit the imputer on the training set and then transform both training and test sets (to prevent leakage and ensure consistent medians), the immediate issue is that the code cannot impute even a single DataFrame because it never fits. The question asks why the block is not accomplishing imputation at all, not about workflow hygiene across splits.
inputCols and outputCols do not need to be exactly the same. Spark ML allows outputCols to be different so you can preserve original columns and create imputed versions (e.g., col -> col_imputed). You may choose to overwrite by setting outputCols equal to inputCols, but it’s optional. A mismatch between these lists only matters if their lengths differ or names are invalid, not because they must match exactly.
Calling fit() is required, but not “instead of transform” in the overall process. The correct sequence is fit() to create an ImputerModel, then transform() using that model. So transform is still needed to apply the learned medians to the DataFrame. The option is misleading because it implies transform should not be used, when in fact it must be used after fitting.
Correct. Imputer is an Estimator and must be fit on a DataFrame to compute the median for each numeric column and produce an ImputerModel. Only the ImputerModel has the learned statistics and can transform a DataFrame to replace missing values. Without calling fit(), there is no model and no medians to apply, so the code cannot perform the intended imputation.
Core Concept: This question tests understanding of Spark ML's Estimator/Model pattern and how the Imputer works in the Spark ML pipeline API. In Spark ML, an Estimator (like Imputer) must be fit on a DataFrame to compute the required statistics (median/mean) and produce a Model (ImputerModel). Only the resulting Model can transform data.

Why the Answer is Correct: The provided code calls imputer.transform(features_df) directly on an Imputer Estimator. For median imputation, Spark must first scan the data to compute the median for each input column. That computation happens during fit(), which returns an ImputerModel containing the per-column medians. Without fitting, there is no learned median to apply, so the code cannot perform the imputation. The correct flow is:
1) imputer_model = imputer.fit(features_df)
2) imputed_features_df = imputer_model.transform(features_df)

Key Features / Best Practices:
- Imputer is an Estimator; ImputerModel is the fitted transformer.
- inputCols specifies which columns to impute; outputCols specifies where to write the results (you can overwrite by using the same names, or create new columns).
- In ML workflows, you typically fit on training data only, then transform both train and test with the same fitted model to avoid data leakage.

Common Misconceptions: Many learners assume transform() can be called on any ML object. In Spark ML, transform() is for Transformers/Models, not Estimators. Another misconception is that inputCols and outputCols must match; they do not, since Spark allows writing to new columns.

Exam Tips: When you see Spark ML components, immediately classify them as Estimator vs. Transformer. If the algorithm needs to "learn" anything from data (statistics, weights, index mappings), you must call fit() first. For imputation, scaling, indexing, and encoding, always fit on training data and reuse the fitted model for consistent transformations across datasets.

