Summer Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: exc65

What do first-order and second-order Markov processes have in common concerning next word prediction?

A.

Both use WordNet to model the probability of the next word

B.

Both are unsupervised methods

C.

Both provide the foundation to build a trigram language model

D.

Neither makes assumptions about the probability of the next word

What is an ideal use case for HDFS?

A.

Storing files that are updated frequently

B.

Storing files that are written once and read many times

C.

Storing results between Map steps and Reduce steps

D.

Storing application files in memory

Which HDFS feature protects against user errors causing accidental loss of data?

A.

Encryption

B.

Replication

C.

Namenode federation

D.

Snapshots

In multinomial logistic regression, what is used to calculate the probability of outcome occurring?

A.

Logistic function applied to a linear combination of the input and outcome variables

B.

Linear regression applied to a combination of input variables

C.

Linear regression applied to a combination of input and outcome variables

D.

Logistic function applied to a linear combination of the input variables

What is an important simu-lation design consideration?

    A.

    Ensure model Inputs align with reality

    B.

    Use different seed values to regenerate results

    C.

    For rare event models, minimize number of trials

    D.

    A complex model is better than a simple model

    In a connected, undirected graph of 5 nodes with 10 edges, how many more edges need to be added to make the clustering coefficient of every node equal 1 ?

    A.

    0

    B.

    5

    C.

    10

    D.

    15

    What is an intended application of the MapReduce framework?

    A.

    Processing can be broken into smaller pieces

    B.

    Processing a large number of small files

    C.

    Processing in real time is required

    D.

    Processing a small subset of data

    What is a random subspace of features, as used by Random Forests?

    A.

    A random subset of features that are chosen at each split in the decision tree

    B.

    Filtration of data that does not meet a pre-defined weighting thrsehold

    C.

    The creation of out-of-bag (OOB) data that is used to select features

    D.

    Removal of highly correlated variables to randomize the features

    Which Hadoop Files System shell command copies data from a local file system into HDFS?

    A.

    rm

    B.

    cp

    C.

    put

    D.

    get