Weekend Sale - Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: sntaclus

Which type of Python UDFs let you define Python functions that receive batches of input rows as Pandas DataFrames and return batches of results as Pandas arrays or Series?

A.

MPP Python UDFs

B.

Scaler Python UDFs

C.

Vectorized Python UDFs

D.

Hybrid Python UDFs

Which tools helps data scientist to manage ML lifecycle & Model versioning?

A.

MLFlow

B.

Pachyderm

C.

Albert

D.

CRUX

Which of the following metrics are used to evaluate classification models?

A.

Area under the ROC curve

B.

F1 score

C.

Confusion matrix

D.

All of the above

Which ones are the known limitations of using External function?

A.

Currently, external functions cannot be shared with data consumers via Secure Data Sharing.

B.

Currently, external functions must be scalar functions. A scalar external function re-turns a single value for each input row.

C.

External functions have more overhead than internal functions (both built-in functions and internal UDFs) and usually execute more slowly

D.

An external function accessed through an AWS API Gateway private endpoint can be accessed only from a Snowflake VPC (Virtual Private Cloud) on AWS and in the same AWS region.

Data Scientist used streams in ELT (extract, load, transform) processes where new data inserted in-to a staging table is tracked by a stream. A set of SQL statements transform and insert the stream contents into a set of production tables. Raw data is coming in the JSON format, but for analysis he needs to transform it into relational columns in the production tables. which of the following Data transformation SQL function he can used to achieve the same?

A.

He could not apply Transformation on Stream table data.

B.

lateral flatten()

C.

METADATA$ACTION ()

D.

Transpose()

Mark the Incorrect understanding of Data Scientist about Streams?

A.

Streams on views support both local views and views shared using Snowflake Secure Data Sharing, including secure views.

B.

Streams can track changes in materialized views.

C.

Streams itself does not contain any table data.

D.

Streams do not support repeatable read isolation.

Select the correct mappings:

I. W Weights or Coefficients of independent variables in the Linear regression model --> Model Pa-rameter

II. K in the K-Nearest Neighbour algorithm --> Model Hyperparameter

III. Learning rate for training a neural network --> Model Hyperparameter

IV. Batch Size --> Model Parameter

A.

I,II

B.

I,II,III

C.

III,IV

D.

II,III,IV

Consider a data frame df with 10 rows and index [ 'r1', 'r2', 'r3', 'row4', 'row5', 'row6', 'r7', 'r8', 'r9', 'row10']. What does the expression g = df.groupby(df.index.str.len()) do?

A.

Groups df based on index values

B.

Groups df based on length of each index value

C.

Groups df based on index strings

D.

Data frames cannot be grouped by index values. Hence it results in Error.

Mark the incorrect statement regarding Python UDF?

A.

Python UDFs can contain both new code and calls to existing packages

B.

For each row passed to a UDF, the UDF returns either a scalar (i.e. single) value or, if defined as a table function, a set of rows.

C.

A UDF also gives you a way to encapsulate functionality so that you can call it repeatedly from multiple places in code

D.

A scalar function (UDF) returns a tabular value for each input row

Consider a data frame df with columns ['A', 'B', 'C', 'D'] and rows ['r1', 'r2', 'r3']. What does the ex-pression df[lambda x : x.index.str.endswith('3')] do?

A.

Returns the row name r3

B.

Results in Error

C.

Returns the third column

D.

Filters the row labelled r3