Multiple Choice
You receive data files in CSV format monthly from a third party. You need to cleanse this data, but every third month the schema of the files changes. Your requirements for implementing these transformations include: Executing the transformations on a schedule Enabling non-developer analysts to modify transformations Providing a graphical tool for designing transformations What should you do?
A) Use Cloud Dataprep to build and maintain the transformation recipes, and execute them on a scheduled basis
B) Load each month's CSV data into BigQuery, and write a SQL query to transform the data to a standard schema. Merge the transformed tables together with a SQL query
C) Help the analysts write a Cloud Dataflow pipeline in Python to perform the transformation. The Python code should be stored in a revision control system and modified as the incoming data's schema changes
D) Use Apache Spark on Cloud Dataproc to infer the schema of the CSV file before creating a Dataframe. Then implement the transformations in Spark SQL before writing the data out to Cloud Storage and loading into BigQuery
Correct Answer:

Verified
Correct Answer:
Verified
Q145: Your weather app queries a database every
Q146: Which of the following statements is NOT
Q147: You have a data pipeline that writes
Q148: Your company is loading comma-separated values (CSV)
Q149: You are building a data pipeline on
Q151: You are designing a cloud-native historical data
Q152: What are all of the BigQuery operations
Q153: You have an Apache Kafka cluster on-prem
Q154: You work for a shipping company that
Q155: Which of the following statements about the