Multiple Choice
A media company wants to perform machine learning and analytics on the data residing in its Amazon S3 data lake. There are two data transformation requirements that will enable the consumers within the company to create reports: Daily transformations of 300 GB of data with different file formats landing in Amazon S3 at a scheduled time. One-time transformations of terabytes of archived data residing in the S3 data lake. Which combination of solutions cost-effectively meets the company's requirements for transforming the data? (Choose three.)
A) For daily incoming data, use AWS Glue crawlers to scan and identify the schema.
B) For daily incoming data, use Amazon Athena to scan and identify the schema.
C) For daily incoming data, use Amazon Redshift to perform transformations.
D) For daily incoming data, use AWS Glue workflows with AWS Glue jobs to perform transformations.
E) For archived data, use Amazon EMR to perform data transformations.
F) For archived data, use Amazon SageMaker to perform data transformations.
Correct Answer:

Verified
Correct Answer:
Verified
Q43: An Amazon Redshift database contains sensitive user
Q44: A company is planning to do a
Q45: A company is planning to create a
Q46: A financial services company needs to aggregate
Q47: A transportation company uses IoT sensors attached
Q49: A company wants to provide its data
Q50: A company owns facilities with IoT devices
Q51: A manufacturing company has been collecting IoT
Q52: A financial company uses Apache Hive on
Q53: A company developed a new elections reporting