Summer Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: exc65

A data scientist is developing a model to predict the outcome of a vote for a national mascot. The choice is between tigers and lions. The full data set represents feedback from individuals representing 17 professions and 12 different locations. The following rank aggregation represents 80% of the data set:

(Screenshot shows survey rankings for just two professions and a few locations, all voting for "Tigers")

Which of the following is the most likely concern about the model's ability to predict the outcome of the vote?

A.

Interpolated data

B.

Extrapolated data

C.

In-sample data

D.

Out-of-sample data

A data scientist is clustering a data set but does not want to specify the number of clusters present. Which of the following algorithms should the data scientist use?

A.

DBSCAN

B.

k-nearest neighbors

C.

k-means

D.

Logistic regression

Under perfect conditions, E. coli bacteria would cover the entire earth in a matter of days. Which of the following types of models is the best for explaining this type of growth?

A.

Linear

B.

Logarithmic

C.

Polynomial

D.

Exponential

A data scientist is building a model to predict customer credit scores based on information collected from reporting agencies. The model needs to automatically adjust its parameters to adapt to recent changes in the information collected. Which of the following is the best model to use?

A.

Decision tree

B.

Random forest

C.

Linear discriminant analysis

D.

XGBoost

A data scientist is preparing to brief a non-technical audience that is focused on analysis and results. During the modeling process, the data scientist produced the following artifacts:

Which of the following artifacts should the data scientist include in the briefing? (Choose two.)

A.

Final charts and dashboards

B.

Model selection, justification, and purpose

C.

Code documentation

D.

Mathematical descriptions of clustering algorithms included in the selected model

E.

Model performance statistics (accuracy, precision, recall, F1 score, etc.)

F.

Data dictionary