Summer Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: exc65

A machine learning engineer has developed a model and registered it using the FeatureStoreClient fs. The model has model URI model_uri. The engineer now needs to perform batch inference on customer-level Spark DataFrame spark_df, but it is missing a few of the static features that were used when training the model. The customer_id column is the primary key of spark_df and the training set used when training and logging the model.

Which of the following code blocks can be used to compute predictions for spark_df when the missing feature values can be found in the Feature Store by searching for features by customer_id?

A.

df = fs.get_missing_features(spark_df, model_uri)

fs.score_model(model_uri, df)

B.

fs.score_model(model_uri, spark_df)

C.

df = fs.get_missing_features(spark_df, model_uri)

fs.score_batch(model_uri, df)

df = fs.get_missing_features(spark_df)

D.

fs.score_batch(model_uri, df)

E.

fs.score_batch(model_uri, spark_df)

Which of the following describes the purpose of the context parameter in the predict method of Python models for MLflow?

A.

The context parameter allows the user to specify which version of the registered MLflowModel should be used based on the given application's current scenario

B.

The context parameter allows the user to document the performance of a model after it has been deployed

C.

The context parameter allows the user to include relevant details of the business case to allow downstream users to understand the purpose of the model

D.

The context parameter allows the user to provide the model with completely custom if-else logic for the given application's current scenario

E.

The context parameter allows the user to provide the model access to objects like preprocessing models or custom configuration files

Which of the following lists all of the model stages are available in the MLflow Model Registry?

A.

Development. Staging. Production

B.

None. Staging. Production

C.

Staging. Production. Archived

D.

None. Staging. Production. Archived

E.

Development. Staging. Production. Archived

A data scientist has developed a scikit-learn random forest model model, but they have not yet logged model with MLflow. They want to obtain the input schema and the output schema of the model so they can document what type of data is expected as input.

Which of the following MLflow operations can be used to perform this task?

A.

mlflow.models.schema.infer_schema

B.

mlflow.models.signature.infer_signature

C.

mlflow.models.Model.get_input_schema

D.

mlflow.models.Model.signature

E.

There is no way to obtain the input schema and the output schema of an unlogged model.

Which of the following tools can assist in real-time deployments by packaging software with its own application, tools, and libraries?

A.

Cloud-based compute

B.

None of these tools

C.

REST APIs

D.

Containers

E.

Autoscaling clusters

Which of the following describes concept drift?

A.

Concept drift is when there is a change in the distribution of an input variable

B.

Concept drift is when there is a change in the distribution of a target variable

C.

Concept drift is when there is a change in the relationship between input variables and target variables

D.

Concept drift is when there is a change in the distribution of the predicted target given by the model

E.

None of these describe Concept drift

A data scientist has developed a model to predict ice cream sales using the expected temperature and expected number of hours of sun in the day. However, the expected temperature is dropping beneath the range of the input variable on which the model was trained.

Which of the following types of drift is present in the above scenario?

A.

Label drift

B.

None of these

C.

Concept drift

D.

Prediction drift

E.

Feature drift

A data scientist has developed and logged a scikit-learn random forest model model, and then they ended their Spark session and terminated their cluster. After starting a new cluster, they want to review the feature_importances_ of the original model object.

Which of the following lines of code can be used to restore the model object so that feature_importances_ is available?

A.

mlflow.load_model(model_uri)

B.

client.list_artifacts(run_id)["feature-importances.csv"]

C.

mlflow.sklearn.load_model(model_uri)

D.

This can only be viewed in the MLflow Experiments UI

E.

client.pyfunc.load_model(model_uri)

A data scientist is using MLflow to track their machine learning experiment. As a part of each MLflow run, they are performing hyperparameter tuning. The data scientist would like to have one parent run for the tuning process with a child run for each unique combination of hyperparameter values.

They are using the following code block:

The code block is not nesting the runs in MLflow as they expected.

Which of the following changes does the data scientist need to make to the above code block so that it successfully nests the child runs under the parent run in MLflow?

A.

Indent the child run blocks within the parent run block

B.

Add the nested=True argument to the parent run

C.

Remove the nested=True argument from the child runs

D.

Provide the same name to the run name parameter for all three run blocks

E.

Add the nested=True argument to the parent run and remove the nested=True arguments from the child runs

Which of the following Databricks-managed MLflow capabilities is a centralized model store?

A.

Models

B.

Model Registry

C.

Model Serving

D.

Feature Store

E.

Experiments