Exam 13: Big Data and Analytics
Using data to understand customers/clients and business operations to sustain and foster growth and profitability is
C
What is a data scientist and what does the job involve?
A data scientist is a role frequently associated with Big Data or data science. In a very short time it has become one of the most sought-after roles in the marketplace. At present, data scientists' most basic skill is the ability to write code (in the latest Big Data languages and platforms). A more enduring skill will be the ability to communicate in a language that all their stakeholders understand, and to demonstrate the special skills involved in storytelling with data, whether verbally, visually, or, ideally, both. Data scientists use a combination of business and technical skills to investigate Big Data, looking for ways to improve current business analytics practices (from descriptive to predictive and prescriptive) and hence to improve decisions for new business opportunities.
In the Dublin City Council case study, GPS data from the city's buses and CCTV were the only data sources for the Big Data GIS-based application.
False
The Big Data and Analytics in Politics case study makes it clear that the unpredictability of elections makes politics an unsuitable arena for Big Data.
A newly popular unit of data in the Big Data era is the petabyte (PB), which is
In the Big Data and Analytics in Politics case study, what was the analytic system output or goal?
What are the differences between stream analytics and perpetual analytics? When would you use one or the other?
There is a current undersupply of data scientists for the Big Data market.
In the eBay use case study, load ________ helped the company meet its Big Data needs with the extremely fast data handling and application availability requirements.
The data scientist is a profession for a field that is still largely being defined.
Hadoop is primarily a(n) ________ file system and lacks capabilities we'd associate with a DBMS, such as indexing, random access to data, and support for SQL.
In the health sciences, the largest potential source of Big Data comes from
A job ________ is a node in a Hadoop cluster that initiates and coordinates MapReduce jobs, or the processing of the data.
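Study note: the MapReduce processing that such a node coordinates can be illustrated with a small single-process sketch. This is not Hadoop code; the function names (`map_phase`, `shuffle`, `reduce_phase`, `run_job`) and the word-count task are assumptions chosen for illustration, and a real cluster would distribute these phases across many nodes.

```python
from collections import defaultdict


def map_phase(record):
    """Map: emit a (word, 1) pair for every word in one input record."""
    return [(word.lower(), 1) for word in record.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single count."""
    return key, sum(values)


def run_job(records):
    """Run map, shuffle, and reduce in order (hypothetical in-process stand-in
    for the coordination a cluster performs across nodes)."""
    pairs = [pair for record in records for pair in map_phase(record)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


counts = run_job(["big data big insight", "data drives decisions"])
print(counts)
```

Each phase only sees its own slice of the data (one record in the map step, one key in the reduce step), which is what lets the work be spread over many machines.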
HBase, Cassandra, MongoDB, and Accumulo are examples of ________ databases.
List and describe four of the most critical success factors for Big Data analytics.
Under which of the following requirements would it be more appropriate to use Hadoop over a data warehouse?
________ bring together hardware and software in a physical unit that is not only fast but also scalable on an as-needed basis.