Exam 9: Big Data, Cloud Computing, and Location Analytics: Concepts and Tools
In most cases, Hadoop is used to replace data warehouses.
False
In what ways can communications companies use geospatial analysis to harness their data effectively?
Communications companies often generate massive amounts of data every day. The ability to analyze that data quickly, with a high level of location-specific granularity, can better identify customer churn and help formulate location-specific strategies for increasing operational efficiency, quality of service, and revenue.
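For illustration, a minimal Python/pandas sketch of this kind of location-level churn analysis; the file name and the column names (customer_id, region, churned) are hypothetical, not from the original text:

```python
# A minimal sketch, assuming a hypothetical per-customer dataset that tags
# each record with the serving location; names here are illustrative only.
import pandas as pd

# Load customer records (hypothetical file with columns:
# customer_id, region, churned where churned is 0 or 1).
cdr = pd.read_csv("customer_records.csv")

# Churn rate per location: the high-granularity view that flags at-risk regions.
churn_by_region = (
    cdr.groupby("region")["churned"]
       .mean()
       .sort_values(ascending=False)
)

# Regions with the highest churn rates become candidates for targeted retention offers.
print(churn_by_region.head(10))
```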
Why are some portions of tape backup workloads being redirected to Hadoop clusters today?
• First, while it may appear inexpensive to store data on tape, the true cost comes with the difficulty of retrieval. Not only is the data stored offline, requiring hours if not days to restore, but tape cartridges themselves are also prone to degradation over time, making data loss a reality and forcing companies to factor in those costs. To make matters worse, tape formats change every couple of years, requiring organizations to either perform massive data migrations to the newest tape format or risk the inability to restore data from obsolete tapes.
• Second, it has been shown that there is value in keeping historical data online and accessible. As in the clickstream example, keeping raw data on a spinning disk for a longer duration makes it easy for companies to revisit data when the context changes and new constraints need to be applied. Searching thousands of disks with Hadoop is dramatically faster and easier than spinning through hundreds of magnetic tapes. Additionally, as disk densities continue to double every 18 months, it becomes economically feasible for organizations to hold many years' worth of raw or refined data in HDFS, as sketched below.
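To make the second point concrete, here is a minimal PySpark sketch of revisiting historical clickstream data kept in HDFS; the HDFS path, the JSON format, and the event_date/page fields are illustrative assumptions, not from the original text:

```python
# A minimal sketch, assuming clickstream logs stored as JSON on HDFS under
# a hypothetical path; the schema and the date filter are illustrative.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("clickstream-revisit").getOrCreate()

# Raw data stays online in HDFS, so years-old sessions remain directly queryable.
clicks = spark.read.json("hdfs:///data/clickstream/")  # hypothetical location

# Re-apply a new constraint to old data without restoring anything from tape.
page_counts = (
    clicks.filter(F.col("event_date") >= "2018-01-01")
          .groupBy("page")
          .count()
          .orderBy(F.col("count").desc())
)

page_counts.show(10)
```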
Over the next few years, the industry will likely move toward more tightly coupled Hadoop and relational DBMS-based data warehouse technologies.
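One way such coupling already shows up in practice is Hadoop-ecosystem engines reading relational warehouse tables directly. A minimal PySpark sketch, assuming a hypothetical JDBC endpoint, table, and credentials (and a matching JDBC driver on the classpath):

```python
# A minimal sketch of Hadoop/RDBMS coupling: Spark, from the Hadoop
# ecosystem, reads a relational warehouse table over JDBC. The URL,
# table name, and credentials below are placeholders, not real values.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("dw-hadoop-bridge").getOrCreate()

# Pull a warehouse table into the Hadoop side for joint analysis.
orders = (
    spark.read.format("jdbc")
         .option("url", "jdbc:postgresql://warehouse-host:5432/sales")  # hypothetical
         .option("dbtable", "orders")
         .option("user", "analyst")
         .option("password", "***")
         .load()
)

# Analyze the relational data alongside whatever else lives in the cluster.
orders.groupBy("region").count().show()
```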
Cloud computing originates from a reference to the Internet as a "cloud" and is a combination of several information technology components as services.
The analytics layer of the Big Data stack is experiencing what type of development currently?
MapReduce can be easily understood by skilled programmers due to its procedural nature.
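The canonical illustration of that procedural nature is word count, where the map and reduce steps are ordinary functions. A minimal pure-Python sketch of the pattern (not an actual Hadoop job; the function names are illustrative):

```python
# A minimal sketch of the MapReduce pattern in plain Python: map emits
# key-value pairs, a sort groups them by key, and reduce aggregates.
from itertools import groupby
from operator import itemgetter

def map_phase(lines):
    """Map step: emit (word, 1) for every word -- a plain procedural loop."""
    for line in lines:
        for word in line.split():
            yield (word.lower(), 1)

def reduce_phase(pairs):
    """Reduce step: sum the counts for each word after sorting by key."""
    for word, group in groupby(sorted(pairs, key=itemgetter(0)), key=itemgetter(0)):
        yield (word, sum(count for _, count in group))

lines = ["Hadoop stores data", "Hadoop processes data in parallel"]
for word, count in reduce_phase(map_phase(lines)):
    print(word, count)
```

In a real Hadoop job the same two functions run on many nodes at once, with the framework handling the sort-and-shuffle between them.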
What are the differences between stream analytics and perpetual analytics? When would you use one or the other?
Hadoop was designed to handle petabytes and exabytes of data distributed over multiple nodes in parallel.
OpenShift is Google's cloud application platform based on a PaaS model.
Managers should be more concerned about whether data is stored in a structured data warehouse or a Hadoop cluster, and less about the insights that can actually be derived from the data.