The selection criterion used in the forward selection method in the GLMSELECT procedure is:

A.

RSQ

B.

MSE

C.

R-squared

D.

AIC

A financial services manager wants to assess the probability that certain clients will default on their Home Equity Line of Credit (HELOC). A former employee left the code listed below.

The training data set is named HELOC, while a similar data set of more recent clients is named RECENT_HELOC.

Which SAS data steps will calculate the predicted probability of default on recent clients? (Choose two.)

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Refer to the exhibit:

Based upon the comparative ROC plot for two competing models, which is the champion model and why?

A.

Candidate 1, because the area outside the curve is greater

B.

Candidate 2, because the area under the curve is greater

C.

Candidate 1, because it is closer to the diagonal reference curve

D.

Candidate 2, because it shows less overfitting than Candidate 1
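
As background for comparing ROC plots, the area under each curve can be computed from scored data with the trapezoidal rule. The sketch below is only illustrative: the labels and both sets of scores are made-up values, not the data behind the exhibit, and the helper names `roc_points` and `auc` are hypothetical.

```python
def roc_points(labels, scores):
    """Return ordered (FPR, TPR) points, one per distinct score cutoff."""
    pos = sum(labels)
    neg = len(labels) - pos
    points = [(0.0, 0.0)]
    for cutoff in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(labels, scores) if s >= cutoff and y == 1)
        fp = sum(1 for y, s in zip(labels, scores) if s >= cutoff and y == 0)
        points.append((fp / neg, tp / pos))
    return points


def auc(points):
    """Trapezoidal area under a ROC curve given as ordered (FPR, TPR) points."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


# Made-up validation data: 1 = event, 0 = non-event.
labels = [1, 1, 0, 1, 0, 0]
scores_a = [0.9, 0.8, 0.7, 0.6, 0.3, 0.2]  # hypothetical model A
scores_b = [0.9, 0.3, 0.8, 0.6, 0.7, 0.2]  # hypothetical model B

auc_a = auc(roc_points(labels, scores_a))
auc_b = auc(roc_points(labels, scores_b))
print(round(auc_a, 3), round(auc_b, 3))
```

A larger area means the model ranks events above non-events more often; an area of 0.5 corresponds to the diagonal reference line.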

The following LOGISTIC procedure output analyzes the relationship between a binary response and an ordinal predictor variable, wrist_size. Using reference cell coding, the analyst selects Large (L) as the reference level.

What is the estimated logit for a person with large wrist size?

A.

0.0819

B.

0.5663

C.

-3.7727

D.

-1.0415
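
Under reference cell coding, the design variables are all 0 for the reference level, so its estimated logit reduces to the intercept, and every other level adds its own parameter estimate to the intercept. A minimal sketch of that arithmetic follows; the coefficient values and the level names other than L are hypothetical, since the procedure output is not reproduced here.

```python
import math

# Hypothetical estimates, NOT taken from the exhibit.
intercept = -1.0
effects = {"S": 0.4, "M": 0.2}  # parameters for the non-reference levels
reference = "L"


def logit(level):
    """Estimated logit for a level under reference cell coding."""
    if level == reference:
        return intercept  # all design variables are 0
    return intercept + effects[level]


def probability(level):
    """Convert the logit to a predicted probability."""
    return 1.0 / (1.0 + math.exp(-logit(level)))


for level in ("S", "M", "L"):
    print(level, logit(level), round(probability(level), 4))
```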

PROC GLMSELECT was used for building a model predicting the natural log of a baseball player's salary from certain performance and longevity statistics. The model used backward elimination using SBC as its selection criterion. The sequence of steps is summarized in the graphic shown below:

At Step 9, the number of at bats (nAtBat) was removed from the model.

Why was it removed?

A.

Removing nAtBat had the largest effect on the parameter estimate of nHits.

B.

The p-value for nAtBat was the largest.

C.

Removing nAtBat yielded the largest improvement to SBC.

D.

The p-value for nAtBat was the smallest.
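
For reference, when SBC is the selection criterion, each backward step compares the SBC of every candidate model that drops one effect and keeps the one with the lowest value. The sketch below uses the form SBC = n ln(SSE/n) + p ln(n) and assumes each effect has one parameter. The sample size, parameter count, SSE values, and the name `other_effect` are hypothetical and are not read from the graphic.

```python
import math


def sbc(n, sse, p):
    """Schwarz Bayesian criterion for a least-squares model with p parameters."""
    return n * math.log(sse / n) + p * math.log(n)


def best_removal(n, p_current, sse_after_removal):
    """Pick the effect whose removal gives the lowest SBC.

    sse_after_removal maps each effect to the SSE of the model refit
    without it (assumes one parameter per effect).
    """
    scores = {
        effect: sbc(n, sse, p_current - 1)
        for effect, sse in sse_after_removal.items()
    }
    effect = min(scores, key=scores.get)
    return effect, scores[effect]


# Hypothetical refit results for three candidate removals.
candidates = {"nAtBat": 60.1, "nHits": 75.0, "other_effect": 90.2}
removed, new_sbc = best_removal(n=263, p_current=5, sse_after_removal=candidates)
print(removed, round(new_sbc, 2))
```

The selection is driven by the criterion value of the resulting model, not by individual p-values.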

Refer to the exhibit.

These graphs were created using the GLM procedure with the plots(only)=diagnostics option.

Which plot do you use to identify influential observations?

A.

Cook's D by Observation

B.

Residual by Quantile

C.

Residual by Predicted

D.

Fit - Mean and Residual Plot
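
As background, Cook's D combines each observation's residual with its leverage to measure how much the fitted values change when that observation is deleted. The sketch below computes it for a simple linear regression in plain Python. The data are made up, and the 4/n cutoff is a common rule of thumb rather than something stated in the exhibit.

```python
def cooks_distance(x, y):
    """Cook's D for each observation in a simple linear regression of y on x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
    p = 2  # parameters: intercept and slope
    mse = sum(e ** 2 for e in residuals) / (n - p)
    distances = []
    for xi, e in zip(x, residuals):
        h = 1.0 / n + (xi - x_bar) ** 2 / sxx  # leverage of this observation
        distances.append((e ** 2 / (p * mse)) * h / (1.0 - h) ** 2)
    return distances


# Made-up data: the last point sits far from the others in x.
x = [1, 2, 3, 4, 5, 20]
y = [1.1, 1.9, 3.2, 3.9, 5.1, 5.0]

d = cooks_distance(x, y)
flagged = [i for i, di in enumerate(d) if di > 4 / len(x)]  # rule-of-thumb cutoff
print(flagged)
```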

In order to perform honest assessment on a predictive model, what is an acceptable division between training, validation, and testing data?

A.

Training: 50% Validation: 0% Testing: 50%

B.

Training: 100% Validation: 0% Testing: 0%

C.

Training: 0% Validation: 100% Testing: 0%

D.

Training: 50% Validation: 50% Testing: 0%
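
Honest assessment means evaluating a model on observations that were not used to fit it, which requires holding some data out of training. A minimal sketch of a random partition is shown below; the function name `partition`, the seed, and the 50/50 default are illustrative assumptions only.

```python
import random


def partition(rows, train_fraction=0.5, seed=12345):
    """Randomly split rows into disjoint training and validation lists."""
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    shuffled = list(rows)
    rng.shuffle(shuffled)
    cut = int(round(len(shuffled) * train_fraction))
    return shuffled[:cut], shuffled[cut:]


# Hypothetical data set of 100 row identifiers.
train, valid = partition(range(100))
print(len(train), len(valid))
```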

Refer to the ROC curve:

As you move along the curve, what changes?

A.

The priors in the population

B.

The true negative rate in the population

C.

The proportion of events in the training data

D.

The probability cutoff for scoring
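
Each point on a ROC curve pairs the true positive rate with the false positive rate obtained at one classification threshold. The sketch below evaluates the same scored data at several thresholds; the labels, scores, and the helper name `rates_at_cutoff` are made up for illustration and do not come from the exhibit.

```python
def rates_at_cutoff(labels, scores, cutoff):
    """Return (true positive rate, false positive rate) at one cutoff."""
    pos = sum(labels)
    neg = len(labels) - pos
    tp = sum(1 for y, s in zip(labels, scores) if s >= cutoff and y == 1)
    fp = sum(1 for y, s in zip(labels, scores) if s >= cutoff and y == 0)
    return tp / pos, fp / neg


# Made-up scored data: 1 = event, 0 = non-event.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.3, 0.2]

for cutoff in (0.75, 0.5, 0.1):
    tpr, fpr = rates_at_cutoff(labels, scores, cutoff)
    print(cutoff, round(tpr, 3), round(fpr, 3))
```

The data and the model stay fixed throughout; only the threshold changes from one point to the next.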

Assume a $10 cost for soliciting a non-responder and a $200 profit for soliciting a responder. The logistic regression model gives a probability score named P_R on a SAS data set called VALID. The VALID data set contains the responder variable Pinch, a 1/0 variable coded as 1 for responder. Customers will be solicited when their probability score is more than 0.05.

Which SAS program computes the profit for each customer in the data set VALID?

A.

Option A

B.

Option B

C.

Option C

D.

Option D
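
The profit rule described in this question can be sketched outside SAS as follows. This is an illustration, not one of the answer options: it assumes the $200 figure is already net of the solicitation cost, and the function and argument names are hypothetical.

```python
def profit(p_r, responder, cutoff=0.05, gain=200.0, cost=10.0):
    """Profit for one customer given the score P_R and the 1/0 responder flag."""
    solicit = p_r > cutoff  # solicit only when the score exceeds the cutoff
    if not solicit:
        return 0.0  # no solicitation, so no cost and no profit
    return gain if responder == 1 else -cost


# Hypothetical rows: (P_R, responder flag).
valid = [(0.10, 1), (0.10, 0), (0.05, 1), (0.01, 0)]
profits = [profit(p_r, flag) for p_r, flag in valid]
print(profits)  # [200.0, -10.0, 0.0, 0.0]
```

A customer scored at exactly 0.05 is not solicited, because the rule requires a score of more than 0.05.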