Multiple Choice
You are designing a pipeline that publishes application events to a Pub/Sub topic. Although message ordering is not important, you need to be able to aggregate events across disjoint hourly intervals before loading the results to BigQuery for analysis. What technology should you use to process and load this data to BigQuery while ensuring that it will scale with large volumes of events?
A) Create a Cloud Function to perform the necessary data processing that executes using the Pub/Sub trigger every time a new message is published to the topic.
B) Schedule a Cloud Function to run hourly, pulling all available messages from the Pub/Sub topic and performing the necessary aggregations.
C) Schedule a batch Dataflow job to run hourly, pulling all available messages from the Pub/Sub topic and performing the necessary aggregations.
D) Create a streaming Dataflow job that reads continually from the Pub/Sub topic and performs aggregations using tumbling windows.
Correct Answer:

Verified
Correct Answer:
Verified
Q131: You decided to use Cloud Datastore to
Q132: MJTelco Case Study Company Overview MJTelco is
Q133: You've migrated a Hadoop job from an
Q134: The _ for Cloud Bigtable makes it
Q135: You have a data pipeline with a
Q137: You need to store and analyze social
Q138: You work for a shipping company that
Q139: You have enabled the free integration between
Q140: The marketing team at your organization provides
Q141: You are building new real-time data warehouse