Which type of Python UDFs let you define Python functions that receive batches of input rows as Pandas DataFrames and return batches of results as Pandas arrays or Series?
Which tools helps data scientist to manage ML lifecycle & Model versioning?
Which of the following metrics are used to evaluate classification models?
Which ones are the known limitations of using External function?
Data Scientist used streams in ELT (extract, load, transform) processes where new data inserted in-to a staging table is tracked by a stream. A set of SQL statements transform and insert the stream contents into a set of production tables. Raw data is coming in the JSON format, but for analysis he needs to transform it into relational columns in the production tables. which of the following Data transformation SQL function he can used to achieve the same?
Mark the Incorrect understanding of Data Scientist about Streams?
Select the correct mappings:
I. W Weights or Coefficients of independent variables in the Linear regression model --> Model Pa-rameter
II. K in the K-Nearest Neighbour algorithm --> Model Hyperparameter
III. Learning rate for training a neural network --> Model Hyperparameter
IV. Batch Size --> Model Parameter
Consider a data frame df with 10 rows and index [ 'r1', 'r2', 'r3', 'row4', 'row5', 'row6', 'r7', 'r8', 'r9', 'row10']. What does the expression g = df.groupby(df.index.str.len()) do?
Mark the incorrect statement regarding Python UDF?
Consider a data frame df with columns ['A', 'B', 'C', 'D'] and rows ['r1', 'r2', 'r3']. What does the ex-pression df[lambda x : x.index.str.endswith('3')] do?