Exam 2: Cloudera Certified Developer for Apache Hadoop (CCDH)

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

You need to move a file titled "weblogs" into HDFS. When you try to copy the file, you can't. You know you have ample space on your DataNodes. Which action should you take to relieve this situation and store more files in HDFS?

(Multiple Choice)
4.9/5
(35)

Indentify which best defines a SequenceFile?

(Multiple Choice)
4.9/5
(36)

Determine which best describes when the reduce method is first called in a MapReduce job?

(Multiple Choice)
4.9/5
(25)

You have the following key-value pairs as output from your Map task: (the, 1) (fox, 1) (faster, 1) (than, 1) (dog, 1) How many keys will be passed to the Reducer's reduce method?

(Multiple Choice)
4.9/5
(33)

You need to perform statistical analysis in your MapReduce job and would like to call methods in the Apache Commons Math library, which is distributed as a 1.3 megabyte Java archive (JAR) file. Which is the best way to make this library available to your MapReducer job at runtime?

(Multiple Choice)
4.9/5
(37)

For each intermediate key, each reducer task can emit:

(Multiple Choice)
4.9/5
(32)

A combiner reduces:

(Multiple Choice)
4.8/5
(40)

A client application creates an HDFS file named foo.txt with a replication factor of 3. Identify which best describes the file access rules in HDFS if the file has a single block that is stored on data nodes A, B and C?

(Multiple Choice)
5.0/5
(30)

In a MapReduce job, you want each of your input files processed by a single map task. How do you configure a MapReduce job so that a single map task processes each input file regardless of how many blocks the input file occupies?

(Multiple Choice)
4.9/5
(30)

MapReduce v2 (MRv2/YARN) splits which major functions of the JobTracker into separate daemons? Select two.

(Multiple Choice)
4.9/5
(31)

When can a reduce class also serve as a combiner without affecting the output of a MapReduce program?

(Multiple Choice)
4.7/5
(39)

In a MapReduce job, the reducer receives all values associated with same key. Which statement best describes the ordering of these values?

(Multiple Choice)
4.8/5
(43)

You write MapReduce job to process 100 files in HDFS. Your MapReduce algorithm uses TextInputFormat: the mapper applies a regular expression over input values and emits key-values pairs with the key consisting of the matching text, and the value containing the filename and byte offset. Determine the difference between setting the number of reduces to one and settings the number of reducers to zero.

(Multiple Choice)
4.8/5
(37)

All keys used for intermediate output from mappers must:

(Multiple Choice)
4.7/5
(37)

You have user profile records in your OLPT database, that you want to join with web logs you have already ingested into the Hadoop file system. How will you obtain these user records?

(Multiple Choice)
4.7/5
(26)

On a cluster running MapReduce v1 (MRv1), a TaskTracker heartbeats into the JobTracker on your cluster, and alerts the JobTracker it has an open map task slot. What determines how the JobTracker assigns each map task to a TaskTracker?

(Multiple Choice)
4.9/5
(33)
Showing 21 - 36 of 36
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)