Multiple Choice
Which best describes how TextInputFormat processes input files and line breaks?
A) Input file splits may cross line breaks. A line that crosses file splits is read by the RecordReader of the split that contains the beginning of the broken line.
B) Input file splits may cross line breaks. A line that crosses file splits is read by the RecordReaders of both splits containing the broken line.
C) The input file is split exactly at the line breaks, so each RecordReader will read a series of complete lines.
D) Input file splits may cross line breaks. A line that crosses file splits is ignored.
E) Input file splits may cross line breaks. A line that crosses file splits is read by the RecordReader of the split that contains the end of the broken line.
Correct Answer:

Verified
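For reference, Hadoop's line reader deals with a line that straddles a split boundary as follows: the reader whose split contains the start of the line reads it through to its end, and the reader of the next split discards its leading partial line. This is the behaviour described in option A. The sketch below is a simplified Python model of that rule, not Hadoop's actual code. The function name `read_split`, the sample data, and the split size are assumptions made for illustration only.

```python
def read_split(data, start, length):
    """Return the lines a reader assigned to bytes [start, start + length) emits.

    Simplified model of the rule, assuming '\n' line endings:
      * a reader that does not start at offset 0 skips up to and including
        the first newline, because that line belongs to the previous reader;
      * a reader keeps reading whole lines as long as the line begins at or
        before the end of its split, even if the line ends beyond it.
    """
    end = start + length
    pos = start
    if start != 0:
        newline = data.find(b"\n", pos)
        if newline == -1:
            return []          # no line begins inside this split
        pos = newline + 1
    lines = []
    while pos <= end and pos < len(data):
        newline = data.find(b"\n", pos)
        if newline == -1:
            newline = len(data)  # last line without a trailing newline
        lines.append(data[pos:newline])
        pos = newline + 1
    return lines


# Hypothetical sample input: four lines, cut into 8-byte splits so that
# the split boundaries fall in the middle of lines.
data = b"alpha\nbravo\ncharlie\ndelta\n"
split_size = 8
per_split = [read_split(data, s, split_size) for s in range(0, len(data), split_size)]
for index, lines in enumerate(per_split):
    print(index, lines)
```

In this model every line is emitted exactly once: "bravo" begins in the first split and is read there even though it ends in the second, and the second reader skips the tail of "bravo" before reading "charlie".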