Exam 1: Cloudera Certified Administrator for Apache Hadoop (CCAH)


Your cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. What is the result when you execute: hadoop jar SampleJar MyClass on a client machine?

Free
(Multiple Choice)
4.8/5
(33)
Correct Answer:
Verified

A

In CDH4 and later, which file contains a serialized form of all the directory and file inodes in the filesystem, giving the NameNode a persistent checkpoint of the filesystem metadata?

Free
(Multiple Choice)
4.8/5
(32)
Correct Answer:
Verified

C

Your cluster is running MapReduce version 2 (MRv2) on YARN. Your ResourceManager is configured to use the FairScheduler. Now you want to configure your scheduler such that a new user on the cluster can submit jobs into their own queue upon application submission. Which configuration should you set?

Free
(Multiple Choice)
4.9/5
(34)
Correct Answer:
Verified

A

You have a cluster running with the FIFO Scheduler enabled. You submit a large job A to the cluster, which you expect to run for one hour. Then, you submit job B to the cluster, which you expect to run for only a couple of minutes. You submit both jobs with the same priority. Which two statements best describe how the FIFO Scheduler arbitrates the cluster resources for a job and its tasks? (Choose two)

(Multiple Choice)
4.7/5
(23)

Which two features does Kerberos security add to a Hadoop cluster? (Choose two)

(Multiple Choice)
4.9/5
(33)

Cluster Summary: 45 files and directories, 12 blocks = 57 total. Heap size is 15.31 MB / 193.38 MB (7%). Refer to the above screenshot. You configure a Hadoop cluster with seven DataNodes and one of your monitoring UIs displays the details shown in the exhibit. What does this tell you?

(Multiple Choice)
5.0/5
(32)

Which is the default scheduler in YARN?

(Multiple Choice)
4.8/5
(35)

Identify two features/issues that YARN is designed to address: (Choose two)

(Multiple Choice)
4.9/5
(27)

You suspect that your NameNode is incorrectly configured, and is swapping memory to disk. Which Linux commands help you to identify whether swapping is occurring? (Select all that apply)

(Multiple Choice)
4.7/5
(37)
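
As a rough illustration of the swapping check in the question above: on Linux, the kernel exposes cumulative swap-in and swap-out page counts as the pswpin and pswpout fields of /proc/vmstat. The sketch below parses those two counters from text in that format. The function name and the sample values are hypothetical and only serve the illustration; it is not a substitute for the command-line tools the question asks about.

```python
def parse_swap_counters(vmstat_text):
    """Return the pswpin/pswpout counters found in /proc/vmstat-style text."""
    counters = {}
    for line in vmstat_text.splitlines():
        parts = line.split()
        # Each line of /proc/vmstat is "<name> <value>".
        if len(parts) == 2 and parts[0] in ("pswpin", "pswpout"):
            counters[parts[0]] = int(parts[1])
    return counters


# Illustrative sample only; on a real host, read the text from /proc/vmstat.
sample = "nr_free_pages 12345\npswpin 0\npswpout 42\n"
swap = parse_swap_counters(sample)

# Non-zero counters mean pages have been swapped at some point since boot.
swapping_seen = any(value > 0 for value in swap.values())
```

Because the counters are cumulative since boot, two readings taken some time apart are needed to tell whether swapping is happening now.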

On a cluster running CDH 5.0 or above, you use the hadoop fs -put command to write a 300 MB file into a previously empty directory using an HDFS block size of 64 MB. Just after this command has finished writing 200 MB of this file, what would another user see when they look in the directory?

(Multiple Choice)
4.8/5
(34)
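
The block arithmetic behind the question above can be sketched as follows. This is only a worked calculation under the stated sizes (300 MB file, 200 MB written, 64 MB blocks); the helper name is hypothetical, and the sketch says nothing about what a directory listing shows while the write is in progress.

```python
import math


def block_layout(file_mb, written_mb, block_mb):
    """Return (total blocks for the file, fully written blocks, MB in the open block)."""
    total_blocks = math.ceil(file_mb / block_mb)
    complete_blocks = written_mb // block_mb
    partial_mb = written_mb % block_mb
    return total_blocks, complete_blocks, partial_mb


# Sizes taken from the question: 300 MB file, 200 MB written so far, 64 MB blocks.
total, complete, partial = block_layout(300, 200, 64)
```

With these inputs the file needs five blocks in total, three of which are full once 200 MB has been written, with 8 MB in the fourth.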

You are running a Hadoop cluster with a NameNode on host mynamenode, a secondary NameNode on host mysecondarynamenode and several DataNodes. Which best describes how you determine when the last checkpoint happened?

(Multiple Choice)
4.7/5
(29)

Assuming a cluster running HDFS and MapReduce version 2 (MRv2) on YARN with all settings at their default, what do you need to do when adding a new slave node to the cluster?

(Multiple Choice)
4.8/5
(41)

A slave node in your cluster has four 2 TB hard drives installed (4 x 2 TB). The DataNode is configured to store HDFS blocks on all disks. You set the value of the dfs.datanode.du.reserved parameter to 100 GB. How does this alter HDFS block storage?

(Multiple Choice)
4.9/5
(43)
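
A small worked calculation may help with the question above. The Hadoop configuration reference describes dfs.datanode.du.reserved as reserved space per volume, so the sketch below applies the 100 GB reservation to each of the four 2 TB drives given in parentheses. It assumes 1 TB = 1024 GB and ignores filesystem overhead; the helper name is hypothetical.

```python
def hdfs_usable_gb(volume_sizes_gb, reserved_per_volume_gb):
    """Space left for HDFS blocks after reserving a fixed amount on every volume."""
    return sum(max(size - reserved_per_volume_gb, 0) for size in volume_sizes_gb)


# Assumption: four volumes of 2 TB each, with 1 TB taken as 1024 GB.
volumes_gb = [2048] * 4
reserved_gb = 100

reserved_total_gb = reserved_gb * len(volumes_gb)   # space kept free for non-HDFS use
usable_gb = hdfs_usable_gb(volumes_gb, reserved_gb)  # space available to HDFS blocks
```

Under these assumptions 400 GB in total is kept free for non-HDFS use and 7792 GB remains available for HDFS blocks.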

What does CDH packaging do on install to facilitate Kerberos security setup?

(Multiple Choice)
4.9/5
(29)

Which process instantiates user code, and executes map and reduce tasks on a cluster running MapReduce v2 (MRv2) on YARN?

(Multiple Choice)
4.9/5
(37)

You have just run a MapReduce job to filter user messages to only those of a selected geographical region. The output for this job is in a directory named westUsers, located just below your home directory in HDFS. Which command gathers these into a single file on your local file system?

(Multiple Choice)
4.8/5
(31)

Which scheduler would you deploy to ensure that your cluster allows short jobs to finish within a reasonable time without starving long-running jobs?

(Multiple Choice)
4.9/5
(37)

You are working on a project where you need to chain together MapReduce and Pig jobs. You also need the ability to use forks, decision points, and path joins. Which ecosystem project should you use to perform these actions?

(Multiple Choice)
4.8/5
(33)

Which two steps must you perform if you are running a Hadoop cluster with a single NameNode and six DataNodes, and you want to change a configuration parameter so that it affects all six DataNodes? (Choose two)

(Multiple Choice)
4.9/5
(33)

Your cluster implements HDFS High Availability (HA). Your two NameNodes are named nn01 and nn02. What occurs when you execute the command: hdfs haadmin -failover nn01 nn02?

(Multiple Choice)
4.9/5
(28)
Showing 1 - 20 of 30