Multiple Choice
You are creating a new pipeline in Google Cloud to stream IoT data from Cloud Pub/Sub through Cloud Dataflow to BigQuery. While previewing the data, you notice that roughly 2% of the data appears to be corrupt. You need to modify the Cloud Dataflow pipeline to filter out this corrupt data. What should you do?
A) Add a SideInput that returns a Boolean if the element is corrupt.
B) Add a ParDo transform in Cloud Dataflow to discard corrupt elements.
C) Add a Partition transform in Cloud Dataflow to separate valid data from corrupt data.
D) Add a GroupByKey transform in Cloud Dataflow to group all of the valid data together and discard the rest.
Correct Answer:

Verified
Correct Answer:
Verified
Q233: When creating a new Cloud Dataproc cluster
Q234: You are developing an application that uses
Q235: Suppose you have a table that includes
Q236: You are building an application to share
Q237: You are building a new data pipeline
Q239: You are using Google BigQuery as your
Q240: You need to create a new transaction
Q241: Your financial services company is moving to
Q242: You have some data, which is shown
Q243: You decided to use Cloud Datastore to