Drag the adjustment formulas for oversampling from the left and place them into the correct location in the confusion matrix shown on the right.

An analyst investigates Region (A, B, or C) as an input variable in a logistic regression model.

The analyst discovers that the probability of purchasing a certain item when Region = A is 1.

What problem does this illustrate?

A.

Collinearity

B.

Influential observations

C.

Quasi-complete separation

D.

Problems that arise due to missing values
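A small numeric sketch of what a purchase probability of exactly 1 within one level does to the likelihood. The counts below are made up for illustration and do not come from the question. The point is only that the log-likelihood contribution of the Region = A group keeps rising as its logit grows, so no finite maximum-likelihood estimate exists for that level.

```python
import math

# Hypothetical counts (not from the source): every Region = A customer purchased.
n_a_purchased = 20
n_a_not_purchased = 0

def log_likelihood_region_a(logit_a):
    """Log-likelihood contribution of the Region = A group at a given logit."""
    p = 1.0 / (1.0 + math.exp(-logit_a))
    ll = n_a_purchased * math.log(p)
    if n_a_not_purchased > 0:
        ll += n_a_not_purchased * math.log(1.0 - p)
    return ll

# Evaluate the contribution at increasingly large logits.
logits = [1, 2, 5, 10, 20]
values = [log_likelihood_region_a(b) for b in logits]
for b, ll in zip(logits, values):
    print(f"logit = {b:>2}  log-likelihood = {ll:.6f}")
```

The printed values climb toward 0 without ever reaching it, which is why an iterative fit would keep pushing the estimate for that level outward instead of converging.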

Screening for non-linearity in binary logistic regression can be achieved by visualizing:

A.

A scatter plot of binary response versus a predictor variable.

B.

A trend plot of empirical logit versus a predictor variable.

C.

A logistic regression plot of predicted probability values versus a predictor variable.

D.

A box plot of the odds ratio values versus a predictor variable.
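For reference, an empirical logit can be computed by binning the predictor and taking the log odds of the event within each bin. The sketch below uses made-up data and a simple 0.5 continuity correction; both are assumptions for illustration, and other smoothing constants are in use. Plotting the resulting points against the bin means and checking for a roughly straight line is the screening idea.

```python
import math

# Hypothetical (predictor value, binary response) pairs, not from the source.
data = [(1, 0), (2, 0), (3, 0), (4, 1), (5, 0), (6, 1),
        (7, 0), (8, 1), (9, 1), (10, 1), (11, 1), (12, 1)]

def empirical_logits(pairs, n_bins):
    """Return (mean predictor, empirical logit) for equal-size rank bins.

    Rows left over when len(pairs) is not a multiple of n_bins are ignored
    in this sketch.
    """
    ordered = sorted(pairs)
    size = len(ordered) // n_bins
    result = []
    for i in range(n_bins):
        chunk = ordered[i * size:(i + 1) * size]
        events = sum(y for _, y in chunk)
        nonevents = len(chunk) - events
        # Adding 0.5 to each count keeps the logit finite for all-0 or all-1 bins.
        logit = math.log((events + 0.5) / (nonevents + 0.5))
        mean_x = sum(x for x, _ in chunk) / len(chunk)
        result.append((mean_x, logit))
    return result

points = empirical_logits(data, n_bins=3)
for mean_x, logit in points:
    print(f"mean x = {mean_x:.1f}  empirical logit = {logit:.3f}")
```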

Which method is NOT an appropriate way to score new observations with a known target in a logistic regression model?

A.

Use the SCORE statement in the LOGISTIC procedure.

B.

Augment the training data set with new observations and set their responses to missing.

C.

Augment the training data set with new observations and rerun the LOGISTIC procedure.

D.

Use the saved parameter estimates from the LOGISTIC procedure and score new observations in the SCORE procedure.
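As background for the scoring options, applying saved parameter estimates to new observations amounts to computing the linear predictor and passing it through the inverse logit. The sketch below is not SAS code; the coefficient values and the variable names age and income are hypothetical.

```python
import math

# Hypothetical saved estimates (intercept and slopes); not from the source.
estimates = {"Intercept": -1.5, "age": 0.04, "income": 0.00001}

def score(observation, params):
    """Return the predicted event probability for one observation."""
    linear = params["Intercept"] + sum(
        coef * observation[name]
        for name, coef in params.items()
        if name != "Intercept"
    )
    return 1.0 / (1.0 + math.exp(-linear))

# New observations are scored without refitting the model.
new_observations = [{"age": 30, "income": 40000}, {"age": 55, "income": 90000}]
scores = [score(obs, estimates) for obs in new_observations]
for obs, p in zip(new_observations, scores):
    print(obs, f"predicted probability = {p:.3f}")
```

The estimates stay fixed in this approach, so new observations cannot influence the fitted model.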

Which SAS program correctly uses backward elimination as the selection method within the REG procedure?

A.

Option A

B.

Option B

C.

Option C

D.

Option D

A researcher has several variables that could be possible predictors for the final model. There is interest in checking all 2-way interactions for possible entry to the model. The researcher has decided to use forward selection within PROC LOGISTIC. Fill in the missing code option that will ensure that all 2-way interactions will be considered for entry.

A.

start = 5

B.

include = 4

C.

include = 5

D.

start = 4

Refer to the following odds ratio table:

What is a correct interpretation of the estimate?

A.

The odds of the event are 1.142 greater for each one dollar increase in salary.

B.

The odds of the event are 1.142 greater for each one thousand dollar increase in salary.

C.

The probability of the event is 1.142 greater for each one dollar increase in salary.

D.

The probability of the event is 1.142 greater for each one thousand dollar increase in salary.
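A short numeric sketch of how an odds ratio of 1.142 acts on odds versus probabilities. The odds ratio table is not reproduced here, so the unit of salary is left abstract as "one unit"; the baseline probabilities are hypothetical. An odds ratio is a multiplier on the odds, and the implied change in probability depends on where the baseline sits.

```python
odds_ratio_per_unit = 1.142  # estimate quoted in the answer choices

def odds_after_increase(baseline_probability, units):
    """Odds of the event after the predictor rises by the given number of units."""
    baseline_odds = baseline_probability / (1.0 - baseline_probability)
    return baseline_odds * odds_ratio_per_unit ** units

def probability_from_odds(odds):
    return odds / (1.0 + odds)

# Hypothetical baseline probabilities, chosen only to show the contrast.
rows = []
for p0 in (0.10, 0.50, 0.90):
    baseline_odds = p0 / (1.0 - p0)
    new_odds = odds_after_increase(p0, units=1)
    p1 = probability_from_odds(new_odds)
    rows.append((p0, new_odds / baseline_odds, p1 / p0))
    print(f"p0 = {p0:.2f}  odds multiplier = {new_odds / baseline_odds:.3f}  "
          f"probability multiplier = {p1 / p0:.3f}")
```

The odds multiplier is 1.142 at every baseline, while the probability multiplier varies, which is why the estimate is a statement about odds and not about probability.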

A confusion matrix is created for data that were oversampled due to a rare target.

What values are not affected by this oversampling?

A.

Sensitivity and PV+

B.

Specificity and PV-

C.

PV+ and PV-

D.

Sensitivity and Specificity
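The effect of oversampling on a confusion matrix can be checked numerically. The sketch below uses hypothetical sample counts and an assumed population event rate of 2%, and rebuilds the cell proportions with the usual prior-based adjustment (each cell is the population prior times the relevant conditional rate). The names pi1 and pi0 are illustrative.

```python
# Hypothetical confusion-matrix counts from an oversampled sample (not from the source).
tp, fn, fp, tn = 80, 20, 30, 70

# Assumed population proportions of events and non-events.
pi1 = 0.02
pi0 = 1.0 - pi1

# These rates condition on the actual class, so the class mix does not enter.
sensitivity = tp / (tp + fn)
specificity = tn / (tn + fp)

# Adjusted cell proportions for the population.
adj_tp = pi1 * sensitivity
adj_fn = pi1 * (1.0 - sensitivity)
adj_fp = pi0 * (1.0 - specificity)
adj_tn = pi0 * specificity

sample_ppv = tp / (tp + fp)
adjusted_ppv = adj_tp / (adj_tp + adj_fp)
adjusted_sensitivity = adj_tp / (adj_tp + adj_fn)
adjusted_specificity = adj_tn / (adj_tn + adj_fp)

print(f"sensitivity: sample = {sensitivity:.3f}, adjusted = {adjusted_sensitivity:.3f}")
print(f"specificity: sample = {specificity:.3f}, adjusted = {adjusted_specificity:.3f}")
print(f"PV+:         sample = {sample_ppv:.3f}, adjusted = {adjusted_ppv:.3f}")
```

Under these assumptions the row-wise rates come out the same before and after adjustment, while the positive predicted value drops sharply once the true event rate is restored.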

Given the following GLM procedure output:

Which statement is correct at an alpha level of 0.05?

A.

School*Gender should be removed because it is non-significant.

B.

Gender should be removed because it is non-significant.

C.

School should be removed because it is significant.

D.

Gender should not be removed due to its involvement in the significant interaction.

An analyst generates a model using the LOGISTIC procedure. They are now interested in getting the sensitivity and specificity statistics on a validation data set for a variety of cutoff values.

Which statement and option combination will generate these statistics?

A.

Score data=valid1 out=roc;

B.

Score data=valid1 outroc=roc;

C.

model resp(event='1') = gender region / outroc=roc;

D.

model resp(event"1") = gender region / out=roc;
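To make concrete what "sensitivity and specificity for a variety of cutoff values" means, the sketch below computes both statistics by hand on a small scored validation set. The predicted probabilities and outcomes are hypothetical, and this is plain Python, not the output format of any SAS data set.

```python
# Hypothetical validation data: (predicted probability, actual outcome). Not from the source.
validation = [(0.95, 1), (0.80, 1), (0.70, 0), (0.60, 1),
              (0.40, 0), (0.35, 1), (0.20, 0), (0.10, 0)]

def sensitivity_specificity(scored, cutoff):
    """Classify as an event when probability >= cutoff, then compute both rates."""
    tp = sum(1 for p, y in scored if p >= cutoff and y == 1)
    fn = sum(1 for p, y in scored if p < cutoff and y == 1)
    tn = sum(1 for p, y in scored if p < cutoff and y == 0)
    fp = sum(1 for p, y in scored if p >= cutoff and y == 0)
    return tp / (tp + fn), tn / (tn + fp)

# One row per cutoff: (cutoff, sensitivity, specificity).
roc_rows = [(c, *sensitivity_specificity(validation, c)) for c in (0.25, 0.50, 0.75)]
for cutoff, sens, spec in roc_rows:
    print(f"cutoff = {cutoff:.2f}  sensitivity = {sens:.2f}  specificity = {spec:.2f}")
```

Raising the cutoff trades sensitivity for specificity, which is the relationship an ROC curve traces out.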