Multiple Choice
You are planning to migrate your current on-premises Apache Hadoop deployment to the cloud. You need to ensure that the deployment is as fault-tolerant and cost-effective as possible for long-running batch jobs. You want to use a managed service. What should you do?
A) Deploy a Cloud Dataproc cluster. Use a standard persistent disk and 50% preemptible workers. Store data in Cloud Storage, and change references in scripts from hdfs:// to gs://
B) Deploy a Cloud Dataproc cluster. Use an SSD persistent disk and 50% preemptible workers. Store data in Cloud Storage, and change references in scripts from hdfs:// to gs://
C) Install Hadoop and Spark on a 10-node Compute Engine instance group with standard instances. Install the Cloud Storage connector, and store the data in Cloud Storage. Change references in scripts from hdfs:// to gs://
D) Install Hadoop and Spark on a 10-node Compute Engine instance group with preemptible instances. Store data in HDFS. Change references in scripts from hdfs:// to gs://
Correct Answer:

Verified
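Every option above ends by changing references in scripts from hdfs:// to gs://. As a rough illustration of what that step involves, the following minimal sketch rewrites HDFS URIs found in a script's text into Cloud Storage URIs. The function name hdfs_to_gs, the regular expression, and the bucket name example-bucket are all hypothetical and chosen only for this example; a real migration would also need to confirm that each path exists in the target bucket.

```python
import re

# Matches hdfs://<optional authority><optional path>, stopping at
# whitespace or quotes. This pattern is an assumption for illustration.
_HDFS_URI = re.compile(r"hdfs://[^/\s'\"]*(/[^\s'\"]*)?")


def hdfs_to_gs(text: str, bucket: str) -> str:
    """Replace each hdfs:// URI in text with a gs:// URI in the given bucket.

    The NameNode host and port (if any) are dropped, and the path is
    kept unchanged under the bucket root.
    """
    def _swap(match: re.Match) -> str:
        path = match.group(1) or "/"
        return f"gs://{bucket}{path}"

    return _HDFS_URI.sub(_swap, text)


if __name__ == "__main__":
    # "example-bucket" is a placeholder bucket name.
    line = "hadoop fs -ls hdfs://namenode:8020/data/logs"
    print(hdfs_to_gs(line, "example-bucket"))
```

Dropping the NameNode authority reflects that a gs:// URI names a bucket rather than a cluster host, so the data no longer depends on any one cluster's lifetime.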