Summer Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: exc65

Which of the following distribution methods or models can most effectively represent the actual arrival times of a bus that runs on an hourly schedule?

A.

Binomial

B.

Exponential

C.

Normal

D.

Poisson

A data scientist needs to:

Build a predictive model that gives the likelihood that a car will get a flat tire.

Provide a data set of cars that had flat tires and cars that did not.

All the cars in the data set had sensors taking weekly measurements of tire pressure similar to the sensors that will be installed in the cars consumers drive.

Which of the following is the most immediate data concern?

A.

Granularity misalignment

B.

Multivariate outliers

C.

Insufficient domain expertise

D.

Lagged observations

A data scientist is working with a data set that has ten predictors and wants to use only the predictors that most influence the results. Which of the following models would be the best for the data scientist to use?

A.

OLS

B.

Ridge

C.

Weighted least squares

D.

LASSO

In a modeling project, people evaluate phrases and provide reactions as the target variable for the model. Which of the following best describes what this model is doing?

A.

Sentiment analysis

B.

Named-entity recognition

C.

TF-IDF vectorization

D.

Part-of-speech tagging

Given the equation:

Xt = δ + ϕ1Xt−1 + ωt, where ωt ∼ N(0, σω²)

Which of the following time series models best represents this process?

A.

ARIMA(1,1,1)

B.

ARMA(1,1)

C.

SARIMA(1,1,1) × (1,1,1)1

D.

AR(1)

Which of the following measures would a data scientist most likely use to calculate the similarity of two text strings?

A.

Word cloud

B.

Edit distance

C.

String indexing

D.

k-nearest neighbors

Which of the following image data augmentation techniques allows a data scientist to increase the size of a data set?

A.

Clipping

B.

Cropping

C.

Masking

D.

Scaling

A data scientist is deploying a model that needs to be accessed by multiple departments with minimal development effort by the departments. Which of the following APIs would be best for the data scientist to use?

A.

SOAP

B.

RPC

C.

JSON

D.

REST

Which of the following techniques enables automation and iteration of code releases?

A.

Virtualization

B.

Markdown

C.

Code isolation

D.

CI/CD

A data scientist is creating a responsive model that will update a product's daily pricing based on the previous day's sales volume. Which of the following resource constraints is the data scientist's greatest concern?

A.

Deployment time

B.

Training time

C.

Development time

D.

Data collection time