Multiple Choice
A data analyst is designing a solution to interactively query datasets with SQL using a JDBC connection. Users will join data stored in Amazon S3 in Apache ORC format with data stored in Amazon Elasticsearch Service (Amazon ES) and Amazon Aurora MySQL. Which solution will provide the MOST up-to-date results?
A) Use AWS Glue jobs to ETL data from Amazon ES and Aurora MySQL to Amazon S3. Query the data with Amazon Athena.
B) Use Amazon DMS to stream data from Amazon ES and Aurora MySQL to Amazon Redshift. Query the data with Amazon Redshift.
C) Query all the datasets in place with Apache Spark SQL running on an AWS Glue developer endpoint.
D) Query all the datasets in place with Apache Presto running on Amazon EMR.
Correct Answer:

Verified
Correct Answer:
Verified
Q106: An operations team notices that a few
Q107: A healthcare company uses AWS data and
Q108: Three teams of data analysts use Apache
Q109: A central government organization is collecting events
Q110: A large company has a central data
Q112: A company wants to provide its data
Q113: A software company hosts an application on
Q114: A company uses Amazon Redshift as its
Q115: A company is sending historical datasets to
Q116: A company has developed an Apache Hive