
You’re upgrading a Hadoop cluster from HDFS and MapReduce version 1 (MRv1) to one running HDFS and MapReduce version 2 (MRv2) on YARN. You want to set and enforce a block size of 128MB for all new files written to the cluster after the upgrade. What should you do?

A.

You cannot enforce this, since client code can always override this value

B.

Set dfs.block.size to 128M on all the worker nodes, on all client machines, and on the NameNode, and set the parameter to final

C.

Set dfs.block.size to 128M on all the worker nodes and client machines, and set the parameter to final. You do not need to set this value on the NameNode

D.

Set dfs.block.size to 134217728 on all the worker nodes, on all client machines, and on the NameNode, and set the parameter to final

E.

Set dfs.block.size to 134217728 on all the worker nodes and client machines, and set the parameter to final. You do not need to set this value on the NameNode
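Worth noting for this question: 134217728 bytes is 128 × 1024 × 1024, i.e. exactly 128MB, and the block size is applied by the writing client at file-creation time, which is why the client-side configuration matters. A minimal hdfs-site.xml sketch of the byte-valued form (in Hadoop 2 the canonical property name is dfs.blocksize, with dfs.block.size kept as a deprecated alias):

    <property>
      <name>dfs.block.size</name>
      <value>134217728</value>   <!-- 128 * 1024 * 1024 bytes -->
      <final>true</final>        <!-- disallow overrides from job or client configs -->
    </property>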

Your cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. What is the result when you execute: hadoop jar SampleJar MyClass on a client machine?

A.

SampleJar.jar is sent to the ApplicationMaster, which allocates a container for SampleJar.jar

B.

SampleJar.jar is placed in a temporary directory in HDFS

C.

SampleJar.jar is sent directly to the ResourceManager

D.

SampleJar.jar is serialized into an XML file which is submitted to the ApplicationMaster
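As background on what hadoop jar does under MRv2, the observable flow is sketched below; the staging location is configuration-dependent, so the path described in the comments is illustrative only:

    # Submit the job from a client machine
    hadoop jar SampleJar MyClass
    # 1. The client obtains a new application ID from the ResourceManager
    # 2. Job resources (the JAR, the job configuration, input split metadata)
    #    are copied into a temporary staging directory in HDFS
    # 3. The application is then submitted to the ResourceManager, which
    #    allocates a container for the ApplicationMaster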

You have recently converted your Hadoop cluster from a MapReduce version 1 (MRv1) architecture to a MapReduce version 2 (MRv2) on YARN architecture. Your developers are accustomed to specifying the number of map and reduce tasks (resource allocation) when they run jobs. A developer wants to know how to specify the number of reduce tasks when a specific job runs. Which method should you tell that developer to implement?

A.

MapReduce version 2 (MRv2) on YARN abstracts resource allocation away from the idea of “tasks” into memory and virtual cores, thus eliminating the need for a developer to specify the number of reduce tasks, and indeed preventing the developer from specifying the number of reduce tasks.

B.

In YARN, resource allocation is a function of megabytes of memory in multiples of 1024MB. Thus, they should specify the amount of memory they need by executing -D mapreduce.reduce.memory.mb=2048

C.

In YARN, the ApplicationMaster is responsible for requesting the resources required for a specific job. Thus, executing -D yarn.applicationmaster.reduce.tasks=2 will specify that the ApplicationMaster launch two task containers on the worker nodes.

D.

Developers specify reduce tasks in the exact same way for both MapReduce version 1 (MRv1) and MapReduce version 2 (MRv2) on YARN. Thus, executing -D mapreduce.job.reduces=2 will specify two reduce tasks.

E.

In YARN, resource allocation is a function of virtual cores specified by the ApplicationMaster making requests to the NodeManager, where a reduce task is handled by a single container (and thus a single virtual core). Thus, the developer needs to specify the number of virtual cores to the NodeManager by executing -D yarn.nodemanager.cpu-vcores=2
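For reference on the syntax appearing in these choices: the generic -D option is handled by GenericOptionsParser, so it only takes effect when the driver runs through ToolRunner. A minimal sketch, where MyJob.jar, MyDriver, and the paths are hypothetical placeholders:

    # Run a job with two reduce tasks; the space after -D follows the
    # hadoop CLI convention for generic options
    hadoop jar MyJob.jar MyDriver -D mapreduce.job.reduces=2 /data/input /data/output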

Table schemas in Hive are:

A.

Stored as metadata on the NameNode

B.

Stored along with the data in HDFS

C.

Stored in the Metastore

D.

Stored in ZooKeeper
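Background for this question: Hive keeps table and partition schemas in its metastore, a relational database behind the metastore service, while the table data itself lives in HDFS. A hive-site.xml sketch assuming a MySQL-backed metastore (the hostname and database name are hypothetical):

    <property>
      <name>javax.jdo.option.ConnectionURL</name>
      <!-- schemas are rows in this RDBMS, not files on the NameNode,
           in HDFS, or in ZooKeeper -->
      <value>jdbc:mysql://metastore-db.example.com:3306/hive_metastore</value>
    </property>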

For each YARN job, the Hadoop framework generates task log files. Where are Hadoop task log files stored?

A.

Cached by the NodeManager managing the job containers, then written to a log directory on the NameNode

B.

Cached in the YARN container running the task, then copied into HDFS on job completion

C.

In HDFS, in the directory of the user who generates the job

D.

On the local disk of the slave node running the task
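For context: while a container runs, its logs live on the local disk of the worker under the NodeManager's log directories; if log aggregation is enabled, they are uploaded into HDFS once the application finishes. A yarn-site.xml sketch using the stock Hadoop 2 property names (the local path shown is a common packaging default, not a universal one):

    <property>
      <name>yarn.nodemanager.log-dirs</name>
      <value>/var/log/hadoop-yarn/containers</value>  <!-- local disk on the worker -->
    </property>
    <property>
      <name>yarn.log-aggregation-enable</name>
      <value>true</value>  <!-- copy task logs into HDFS on application completion -->
    </property>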

A user comes to you, complaining that when she attempts to submit a Hadoop job, it fails. There is a directory in HDFS named /data/input. The JAR is named j.jar, and the driver class is named DriverClass.

She runs the command:

hadoop jar j.jar DriverClass /data/input /data/output

The error message returned includes the line:

PriviledgedActionException as:training (auth:SIMPLE) cause:org.apache.hadoop.mapreduce.lib.input.InvalidInputException:

Input path does not exist: file:/data/input

What is the cause of the error?

A.

The user is not authorized to run the job on the cluster

B.

The output directory already exists

C.

The name of the driver has been spelled incorrectly on the command line

D.

The directory name is misspelled in HDFS

E.

The Hadoop configuration files on the client do not point to the cluster
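The diagnostic detail in the message is the file: scheme in "Input path does not exist: file:/data/input": the client resolved the path against the local filesystem rather than HDFS, which is what happens when the client's configuration files do not point at the cluster. A minimal client-side core-site.xml sketch (the hostname is hypothetical; 8020 is a common NameNode RPC port):

    <property>
      <name>fs.defaultFS</name>
      <value>hdfs://namenode.example.com:8020</value>
      <!-- without this, a path like /data/input resolves to file:/data/input -->
    </property>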

Your Hadoop cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. Can you configure a worker node to run a NodeManager daemon but not a DataNode daemon and still have a functional cluster?

A.

Yes. The daemon will receive data from the NameNode to run Map tasks

B.

Yes. The daemon will get data from another (non-local) DataNode to run Map tasks

C.

Yes. The daemon will receive Map tasks only

D.

Yes. The daemon will receive Reducer tasks only
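Such a compute-only worker is workable: the NodeManager registers with the ResourceManager and runs containers, but with no local DataNode, map tasks scheduled there lose data locality and stream their input blocks from remote DataNodes. A hedged sketch of bringing the node up with the standard Hadoop 2 daemon scripts (script locations vary by distribution):

    # On the compute-only worker: start only the NodeManager
    yarn-daemon.sh start nodemanager
    # Deliberately do NOT start a DataNode here:
    #   hadoop-daemon.sh start datanode    <-- skipped on this host
    # Map tasks that land here read their input over the network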

Given:

You want to clean up this list by removing jobs where the State is KILLED. Which command do you enter?

A.

yarn application -refreshJobHistory

B.

yarn application -kill application_1374638600275_0109

C.

yarn rmadmin -refreshQueue

D.

yarn rmadmin -kill application_1374638600275_0109
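For reference, the Hadoop 2 yarn CLI supports listing applications by state as well as killing them by ID; a short sketch using the application ID shown in the choices:

    # Show applications currently in the KILLED state
    yarn application -list -appStates KILLED
    # Kill a submitted or running application by its ID
    yarn application -kill application_1374638600275_0109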