Free Databricks Certified Machine Learning Associate Exam Databricks-Machine-Learning-Associate Exam Practice Test

UNLOCK FULL
Databricks-Machine-Learning-Associate Exam Features
In Just $59 You can Access
  • All Official Question Types
  • Interactive Web-Based Practice Test Software
  • No Installation or 3rd Party Software Required
  • Customize your practice sessions (Free Demo)
  • 24/7 Customer Support
Page: 1 / 15
Total Questions: 74
  • Which of the following approaches can be used to view the notebook that was run to create an MLflow run?

    Answer: C Next Question
  • A health organization is developing a classification model to determine whether or not a patient currently has a specific type of infection. The organization's leaders want to maximize the number of positive cases identified by the model.Which of the following classification metrics should be used to evaluate the model?

    Answer: E Next Question
  • A data scientist has produced three new models for a single machine learning problem. In the past, the solution used just one model. All four models have nearly the same prediction latency, but a machine learning engineer suggests that the new solution will be less time efficient during inference. In which situation will the machine learning engineer be correct?

    Answer: D Next Question
  • A machine learning engineer is trying to scale a machine learning pipeline by distributing its feature engineering process.Which of the following feature engineering tasks will be the least efficient to distribute?

    Answer: D Next Question
  • Which of the following machine learning algorithms typically uses bagging?

    Answer: C Next Question
  • A machine learning engineering team has a Job with three successive tasks. Each task runs a single notebook. The team has been alerted that the Job has failed in its latest run.Which of the following approaches can the team use to identify which task is the cause of the failure?

    Answer: B Next Question
  • A data scientist has been given an incomplete notebook from the data engineering team. The notebook uses a Spark DataFrame spark_df on which the data scientist needs to perform further feature engineering. Unfortunately, the data scientist has not yet learned the PySpark DataFrame API.Which of the following blocks of code can the data scientist run to be able to use the pandas API on Spark?

    Answer: A Next Question
  • A data scientist has created a linear regression model that uses log(price) as a label variable. Using this model, they have performed inference and the predictions and actual label values are in Spark DataFrame preds_df.They are using the following code block to evaluate the model: regression_evaluator.setMetricName("rmse").evaluate(preds_df)Which of the following changes should the data scientist make to evaluate the RMSE in a way that is comparable with price?

    Answer: D Next Question
  • A data scientist is using Spark SQL to import their data into a machine learning pipeline. Once the data is imported, the data scientist performs machine learning tasks using Spark ML.Which of the following compute tools is best suited for this use case?

    Answer: B Next Question
  • A data scientist is wanting to explore the Spark DataFrame spark_df. The data scientist wants visual histograms displaying the distribution of numeric features to be included in the exploration.Which of the following lines of code can the data scientist run to accomplish the task?

    Answer: E Next Question
Page: 1 / 15
Total Questions: 74