Summer Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: exc65

What does the Data Science Service template in Oracle Resource Manager (ORM) NOTautomatically create?

A.

Required user groups

B.

Dynamic groups

C.

Individual Data Science users

D.

Policies for a basic use case

In machine learning, what is the primary difference between supervised and unsupervised learning?

A.

Supervised learning involves data that has been labeled and classified, while unsupervised learning data is unlabeled and unclassified.

B.

Supervised learning is monitored closely by data scientists, while they don’t play a role inunsupervised learning.

C.

Supervised learning is only used for image recognition, while unsupervised learning can be used for various analytics applications.

D.

Supervised learning is created and managed by the Data Engineer.

Which feature of Oracle Cloud Infrastructure Data Science provides an interactive coding environment for building and training machine learning models?

A.

Model Catalog

B.

Jobs

C.

Notebook Sessions

D.

Projects

You realize that your model deployment is about to reach its utilization limit. What would you do to avoid the issue before requests start to fail? Pick THREE.

A.

Update the deployment to add more instances

B.

Delete the deployment

C.

Update the deployment to use fewer instances

D.

Update the deployment to use a larger virtual machine (more CPUs/memory)

E.

Reduce the load balancer bandwidth limit so that fewer requests come in

You are a data scientist with a set of text and image files that need annotation, and you want to use Oracle Cloud Infrastructure (OCI) Data Labeling. Which of the following THREE annotation classes are supported by the tool?

A.

Object detection

B.

Named entity extraction

C.

Classification (single/multi-label)

D.

Key-point and landmark

E.

Polygonal segmentation

F.

Semantic segmentation

You want to make your model more frugal to reduce the cost of collecting and processing data. You plan to do this by removing features that are highly correlated. You would like to create a heatmap that displays the correlation so that you can identify candidate features to remove. Which Accelerated Data Science (ADS) SDK method is appropriate to display the comparability between Continuous and Categorical features?

A.

pearson_plot()

B.

cramersv_plot()

C.

correlation_ratio_plot()

D.

corr()

You have received machine learning model training code, without clear information about the optimal shape to run the training on. How would you proceed to identify the optimal compute shape for your model training that provides a balanced cost and processing time?

A.

Start with a smaller shape and monitor the job run metrics and time required to complete the model training. If the compute shape is not fully utilized, tune the model parameters, and rerun the job. Repeat the process until the shape resources are fully utilized.

B.

Start with the strongest compute shape Jobs support and monitor the job run metrics and time required to complete the model training. Tune the model so that it utilizes as much compute resources as possible, even at an increased cost.

C.

Start with a small shape and monitor the utilization metrics and time required to complete the model training. If the compute shape is fully utilized, change to compute that has more resources and rerun the job. Repeat the process until the processing time does not improve.

D.

Start with a random compute shape and monitor the utilization metrics and time required to finish the model training. Perform model training optimization and performance tests in advance to identify the right compute shape before running the model training as a job.

While working with Git on Oracle Cloud Infrastructure (OCI) Data Science, you notice that two of the operations are taking more time than the others due to your slow internet speed. Which TWO operations would experience the delay?

A.

Moving the changes into staging area for the next commit

B.

Updating the local repo to match the content from a remote repository

C.

Pushing changes to a remote repository

D.

Making a commit that is taking a snapshot of the local repository for the next push

E.

Converting an existing local project folder to a Git repository

You have an embarrassingly parallel or distributed batch job with a large amount of data running using Data Science Jobs. What would be the best approach to run the workload?

A.

Create a job in Data Science Jobs and then start the number of simultaneous job runs required for your workload

B.

Create a new job for every job run that you have to run in parallel, because the Data Science Job service can have only one job per job

C.

Create the job in Data Science Jobs and start a job run. When it is done, start a new job run until you achieve the number of runs required

D.

Reconfigure the job run because Data Science Jobs does not support embarrassingly parallel

You have just started as a data scientist at a healthcare company. You have been asked to analyze and improve a deep neural network model, which was built based on the electrocardiogram records of patients. There are no details about the model framework that was built. What would be the best way to find more details about the machine learning models inside the model catalog?

A.

Refer to the code inside the model

B.

Check for model taxonomy details

C.

Check for metadata tags

D.

Check for provenance details