Deck 9: Business Intelligence Systems

Full screen (f)
exit full mode
Question
The software component of a business intelligence system is called a business intelligence database.
Use Space or
up arrow
down arrow
to flip the card.
Question
Publish results is the process of delivering business intelligence to the knowledge workers who need it.
Question
Business intelligence enables police departments to better utilise their personnel through predictive-policing.
Question
Using BI for identifying changes in the purchasing patterns of customers is a labour-intensive process.
Question
What are the three primary activities in the business intelligence process?
Question
What is predictive-policing?
Question
Which of the following is a fundamental category of business intelligence (BI)analysis?

A)pull publishing
B)data mining
C)push publishing
D)data acquisition
Question
Data mining is the process of obtaining,cleaning,organising,relating,and cataloging source data.
Question
The data that an organisation purchases from data vendors can act as the source data for a business intelligence system.
Question
Which of the following statements is true of business intelligence (BI)systems?

A)Business intelligence systems are primarily used for developing software systems and data mining applications.
B)The software component of a business intelligence system is called an intelligence database.
C)Business intelligence systems analyse an organisation's past performance to make predictions.
D)The four standard components of business intelligence systems are software,procedures,applications,and programs.
Question
________ requires users to request business intelligence results.

A)Pull publishing
B)Data acquisition
C)Push publishing
D)Data mining
Question
Business intelligence systems are information systems that process operational and other data to analyse past performance and to make predictions.
Question
Which of the following activities in the business intelligence process involves delivering business intelligence to the knowledge workers who need it?

A)publish results
B)BI analysis
C)data acquisition
D)data mining
Question
________ is the process of delivering business intelligence to users without any request from the users.

A)Data mining
B)Pull publishing
C)Push publishing
D)Data acquisition
Question
How does business intelligence help marketers identify changes in the purchasing patterns of customers?
Question
Pull publishing delivers business intelligence to users without any request from the users.
Question
What are business intelligence systems?
Question
Project management is one of the few domains in which business intelligence is rarely used.
Question
________ process operational and other data in organisations to analyse past performance and make predictions.

A)Live migration techniques
B)Windowing systems
C)Business intelligence systems
D)Virtualisation software
Question
________ is the process of obtaining,cleaning,organising,relating,and cataloging source data.

A)Push publishing
B)Pull publishing
C)Data interpretation
D)Data acquisition
Question
Facts about data,such as its source,format,assumptions,and constraints,are called metadata.
Question
Data granularity refers to the level of detail represented by data.
Question
Differentiate between push publishing and pull publishing.
Question
The source,format,assumptions,constraints,and other facts concerning certain data are called ________.

A)metadata
B)microdata
C)data structures
D)network packets
Question
A ________ is a data collection that addresses the needs of a particular department or functional area of a business.

A)datasheet
B)dataspace
C)data room
D)data mart
Question
Users in a data mart obtain data that pertain to a particular business function from a ________.

A)data room
B)datasheet
C)data warehouse
D)data centre
Question
________ refers to the level of detail represented by data.

A)Dimensionality
B)Granularity
C)Aggregation
D)Abstraction
Question
A data warehouse is a facility for managing an organisation's business intelligence data.
Question
The use of an organisation's operational data as the source data for a BI system is not usually recommended because it ________.

A)considers only external data and not internal data regarding an organisation's functions
B)is not possible to create reports based on operational data
C)requires considerable processing and can drastically reduce system performance
D)is not possible to perform business intelligence analyses on operational data
Question
The purpose of a ________ is to extract data from operational systems and other sources,clean the data,and store and catalog that data for processing by business intelligence tools.

A)data mart
B)data centre
C)data room
D)data warehouse
Question
Which of the following statements is true of a data warehouse?

A)A data warehouse functions like a retail store in a supply chain.
B)A data warehouse is larger than a data mart.
C)Users in a data warehouse obtain data pertaining to a business function from a data mart.
D)Data analysts who work with a data warehouse are experts in a particular business function.
Question
Which of the following statements is true of data with granularity?

A)Granularity refers to the level of detail represented by the data.
B)If granularity is too coarse,data can be separated into constituent parts using regression.
C)The granularity of clickstream data is too coarse.
D)If granularity is too coarse,data can be made finer by summing and combining.
Question
External data purchased from outside resources are not included in data warehouses.
Question
If the granularity of certain data is too coarse,the data can be separated into constituent parts using statistical techniques.
Question
Problematic data are termed dirty data.
Question
A ________ is a facility for managing an organisation's business intelligence data.

A)dataspace
B)data warehouse
C)data room
D)datasheet
Question
Which of the following problems is particularly common for data that have been gathered over time?

A)missing values
B)wrong granularity
C)lack of consistency
D)lack of integration
Question
Problematic data are referred to as ________.

A)rough data
B)clickstream data
C)granular data
D)dirty data
Question
The granularity in clickstream data is too coarse.
Question
The more attributes there are in a sample data,the easier it is to build a model that fits the sample data,but that is worthless as a predictor.Which of the following best explains this phenomenon?

A)the tragedy of the commons
B)the curse of dimensionality
C)the zero-sum game
D)the free rider problem
Question
________ are reports produced when something out of predefined bounds occurs.

A)Subscriptions
B)Dynamic reports
C)Exception reports
D)Static reports
Question
What is clickstream data?
Question
The goal of ________,a type of business intelligence analysis,is to create information about past performance.

A)reporting
B)push publishing
C)BigData
D)data mining
Question
Users in a data mart obtain data that pertain to a particular business function from a data warehouse.
Question
What is data granularity?
Question
Explain the curse of dimensionality.
Question
Data marts are usually larger than data warehouses.
Question
An advantage of data warehouses is the low cost required to create,staff,and operate them.
Question
________ techniques emerged from the combined discipline of statistics,mathematics,artificial intelligence,and machine-learning.

A)Data mining
B)Push publishing
C)Exception reporting
D)Pull publishing
Question
Explain the functions of a data warehouse.
Question
________ is the application of statistical techniques to find patterns and relationships among data for classification and prediction.

A)Push publishing
B)Data encryption
C)Data mining
D)Pull publishing
Question
________ is the process of sorting,grouping,summing,filtering,and formatting structured data.

A)Push publishing
B)Reporting analysis
C)Cloud computing
D)Pull publishing
Question
What are the functions of a data warehouse?
Question
The curse of dimensionality states that the more attributes there are,the more difficult it is to build a model that fits the sample data.
Question
How is a data warehouse different from a data mart?
Question
________ is an unsupervised data mining technique in which statistical techniques identify groups of entities that have similar characteristics.

A)Content indexing
B)Regression analysis
C)Cloud computing
D)Cluster analysis
Question
Which of the following statements is true of unsupervised data mining?

A)Analysts apply data mining techniques to estimate the parameters of a developed model.
B)Analysts create hypotheses only after performing an analysis.
C)Data miners develop models prior to performing an analysis.
D)Regression analysis is the most commonly used unsupervised data mining technique.
Question
Data marts are data collections that address the needs of a particular department or functional area of a business.
Question
Which of the following refers to data in the form of rows and columns?

A)problematic data
B)granulated data
C)structured data
D)nonintegrated data
Question
Data analysts who work with data warehouses are not usually experts in a given business function.
Question
The results generated in the Map phase are combined in the ________ phase.

A)control
B)Reduce
C)construct
D)Pig
Question
Reporting analysis is used primarily for classifying and predicting BI data.
Question
Which of the following statements is true of Hadoop?

A)Hadoop is an open-source program that implements MapReduce.
B)Technical skills are not required to run and use Hadoop.
C)Hadoop includes a query language entitled Big.
D)Hadoop is written in C++ and runs on Linux.
Question
In the case of ________,data miners develop models prior to conducting analyses and then apply statistical techniques to data to estimate parameters of the models.

A)unsupervised data mining
B)supervised data mining
C)pull publishing techniques
D)push publishing techniques
Question
________ is an open-source program supported by the Apache Foundation that manages thousands of computers and implements MapReduce.

A)Apache Wave
B)Linux
C)BigData
D)Hadoop
Question
Regression analysis is used in ________.

A)exception reporting
B)supervised data mining
C)unsupervised data mining
D)static reporting
Question
BigData has low velocity and is generated slowly.
Question
What is data mining?
Question
What is unsupervised data mining?
Question
With unsupervised data mining,analysts do not create a model or hypothesis before running the analysis.
Question
BigData refers to data sets that are at least a petabyte in size.
Question
Explain supervised data mining.
Question
BigData has volume,velocity,and variation characteristics that far exceed those of traditional reporting and data mining.
Question
________ is used to measure the impact of a set of variables on another variable during data mining.

A)Regression analysis
B)Cluster analysis
C)Cloud computing
D)Context indexing
Question
Cluster analysis measures the impact of a set of variables on another variable.
Question
Regression analysis is used to identify groups of entities that have similar characteristics.
Question
In the ________ phase,a BigData collection is broken into pieces and hundreds or thousands of independent processors search these pieces for something of interest.

A)Map
B)crash
C)control
D)Pig
Question
Which of the following statements is true of BigData?

A)BigData cannot store graphics,audio,and video files.
B)BigData has low velocity and is generated slowly.
C)BigData refers to data sets that are at least a petabyte in size.
D)BigData contains only structured data.
Question
Structured data is data in the form of rows and columns.
Question
MapReduce is a technique for harnessing the power of thousands of computers working in parallel.
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/92
auto play flashcards
Play
simple tutorial
Full screen (f)
exit full mode
Deck 9: Business Intelligence Systems
1
The software component of a business intelligence system is called a business intelligence database.
False
2
Publish results is the process of delivering business intelligence to the knowledge workers who need it.
True
3
Business intelligence enables police departments to better utilise their personnel through predictive-policing.
True
4
Using BI for identifying changes in the purchasing patterns of customers is a labour-intensive process.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
5
What are the three primary activities in the business intelligence process?
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
6
What is predictive-policing?
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
7
Which of the following is a fundamental category of business intelligence (BI)analysis?

A)pull publishing
B)data mining
C)push publishing
D)data acquisition
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
8
Data mining is the process of obtaining,cleaning,organising,relating,and cataloging source data.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
9
The data that an organisation purchases from data vendors can act as the source data for a business intelligence system.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
10
Which of the following statements is true of business intelligence (BI)systems?

A)Business intelligence systems are primarily used for developing software systems and data mining applications.
B)The software component of a business intelligence system is called an intelligence database.
C)Business intelligence systems analyse an organisation's past performance to make predictions.
D)The four standard components of business intelligence systems are software,procedures,applications,and programs.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
11
________ requires users to request business intelligence results.

A)Pull publishing
B)Data acquisition
C)Push publishing
D)Data mining
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
12
Business intelligence systems are information systems that process operational and other data to analyse past performance and to make predictions.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
13
Which of the following activities in the business intelligence process involves delivering business intelligence to the knowledge workers who need it?

A)publish results
B)BI analysis
C)data acquisition
D)data mining
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
14
________ is the process of delivering business intelligence to users without any request from the users.

A)Data mining
B)Pull publishing
C)Push publishing
D)Data acquisition
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
15
How does business intelligence help marketers identify changes in the purchasing patterns of customers?
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
16
Pull publishing delivers business intelligence to users without any request from the users.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
17
What are business intelligence systems?
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
18
Project management is one of the few domains in which business intelligence is rarely used.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
19
________ process operational and other data in organisations to analyse past performance and make predictions.

A)Live migration techniques
B)Windowing systems
C)Business intelligence systems
D)Virtualisation software
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
20
________ is the process of obtaining,cleaning,organising,relating,and cataloging source data.

A)Push publishing
B)Pull publishing
C)Data interpretation
D)Data acquisition
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
21
Facts about data,such as its source,format,assumptions,and constraints,are called metadata.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
22
Data granularity refers to the level of detail represented by data.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
23
Differentiate between push publishing and pull publishing.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
24
The source,format,assumptions,constraints,and other facts concerning certain data are called ________.

A)metadata
B)microdata
C)data structures
D)network packets
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
25
A ________ is a data collection that addresses the needs of a particular department or functional area of a business.

A)datasheet
B)dataspace
C)data room
D)data mart
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
26
Users in a data mart obtain data that pertain to a particular business function from a ________.

A)data room
B)datasheet
C)data warehouse
D)data centre
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
27
________ refers to the level of detail represented by data.

A)Dimensionality
B)Granularity
C)Aggregation
D)Abstraction
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
28
A data warehouse is a facility for managing an organisation's business intelligence data.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
29
The use of an organisation's operational data as the source data for a BI system is not usually recommended because it ________.

A)considers only external data and not internal data regarding an organisation's functions
B)is not possible to create reports based on operational data
C)requires considerable processing and can drastically reduce system performance
D)is not possible to perform business intelligence analyses on operational data
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
30
The purpose of a ________ is to extract data from operational systems and other sources,clean the data,and store and catalog that data for processing by business intelligence tools.

A)data mart
B)data centre
C)data room
D)data warehouse
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
31
Which of the following statements is true of a data warehouse?

A)A data warehouse functions like a retail store in a supply chain.
B)A data warehouse is larger than a data mart.
C)Users in a data warehouse obtain data pertaining to a business function from a data mart.
D)Data analysts who work with a data warehouse are experts in a particular business function.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
32
Which of the following statements is true of data with granularity?

A)Granularity refers to the level of detail represented by the data.
B)If granularity is too coarse,data can be separated into constituent parts using regression.
C)The granularity of clickstream data is too coarse.
D)If granularity is too coarse,data can be made finer by summing and combining.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
33
External data purchased from outside resources are not included in data warehouses.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
34
If the granularity of certain data is too coarse,the data can be separated into constituent parts using statistical techniques.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
35
Problematic data are termed dirty data.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
36
A ________ is a facility for managing an organisation's business intelligence data.

A)dataspace
B)data warehouse
C)data room
D)datasheet
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
37
Which of the following problems is particularly common for data that have been gathered over time?

A)missing values
B)wrong granularity
C)lack of consistency
D)lack of integration
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
38
Problematic data are referred to as ________.

A)rough data
B)clickstream data
C)granular data
D)dirty data
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
39
The granularity in clickstream data is too coarse.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
40
The more attributes there are in a sample data,the easier it is to build a model that fits the sample data,but that is worthless as a predictor.Which of the following best explains this phenomenon?

A)the tragedy of the commons
B)the curse of dimensionality
C)the zero-sum game
D)the free rider problem
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
41
________ are reports produced when something out of predefined bounds occurs.

A)Subscriptions
B)Dynamic reports
C)Exception reports
D)Static reports
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
42
What is clickstream data?
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
43
The goal of ________,a type of business intelligence analysis,is to create information about past performance.

A)reporting
B)push publishing
C)BigData
D)data mining
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
44
Users in a data mart obtain data that pertain to a particular business function from a data warehouse.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
45
What is data granularity?
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
46
Explain the curse of dimensionality.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
47
Data marts are usually larger than data warehouses.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
48
An advantage of data warehouses is the low cost required to create,staff,and operate them.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
49
________ techniques emerged from the combined discipline of statistics,mathematics,artificial intelligence,and machine-learning.

A)Data mining
B)Push publishing
C)Exception reporting
D)Pull publishing
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
50
Explain the functions of a data warehouse.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
51
________ is the application of statistical techniques to find patterns and relationships among data for classification and prediction.

A)Push publishing
B)Data encryption
C)Data mining
D)Pull publishing
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
52
________ is the process of sorting,grouping,summing,filtering,and formatting structured data.

A)Push publishing
B)Reporting analysis
C)Cloud computing
D)Pull publishing
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
53
What are the functions of a data warehouse?
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
54
The curse of dimensionality states that the more attributes there are,the more difficult it is to build a model that fits the sample data.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
55
How is a data warehouse different from a data mart?
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
56
________ is an unsupervised data mining technique in which statistical techniques identify groups of entities that have similar characteristics.

A)Content indexing
B)Regression analysis
C)Cloud computing
D)Cluster analysis
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
57
Which of the following statements is true of unsupervised data mining?

A)Analysts apply data mining techniques to estimate the parameters of a developed model.
B)Analysts create hypotheses only after performing an analysis.
C)Data miners develop models prior to performing an analysis.
D)Regression analysis is the most commonly used unsupervised data mining technique.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
58
Data marts are data collections that address the needs of a particular department or functional area of a business.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
59
Which of the following refers to data in the form of rows and columns?

A)problematic data
B)granulated data
C)structured data
D)nonintegrated data
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
60
Data analysts who work with data warehouses are not usually experts in a given business function.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
61
The results generated in the Map phase are combined in the ________ phase.

A)control
B)Reduce
C)construct
D)Pig
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
62
Reporting analysis is used primarily for classifying and predicting BI data.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
63
Which of the following statements is true of Hadoop?

A)Hadoop is an open-source program that implements MapReduce.
B)Technical skills are not required to run and use Hadoop.
C)Hadoop includes a query language entitled Big.
D)Hadoop is written in C++ and runs on Linux.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
64
In the case of ________,data miners develop models prior to conducting analyses and then apply statistical techniques to data to estimate parameters of the models.

A)unsupervised data mining
B)supervised data mining
C)pull publishing techniques
D)push publishing techniques
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
65
________ is an open-source program supported by the Apache Foundation that manages thousands of computers and implements MapReduce.

A)Apache Wave
B)Linux
C)BigData
D)Hadoop
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
66
Regression analysis is used in ________.

A)exception reporting
B)supervised data mining
C)unsupervised data mining
D)static reporting
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
67
BigData has low velocity and is generated slowly.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
68
What is data mining?
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
69
What is unsupervised data mining?
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
70
With unsupervised data mining,analysts do not create a model or hypothesis before running the analysis.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
71
BigData refers to data sets that are at least a petabyte in size.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
72
Explain supervised data mining.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
73
BigData has volume,velocity,and variation characteristics that far exceed those of traditional reporting and data mining.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
74
________ is used to measure the impact of a set of variables on another variable during data mining.

A)Regression analysis
B)Cluster analysis
C)Cloud computing
D)Context indexing
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
75
Cluster analysis measures the impact of a set of variables on another variable.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
76
Regression analysis is used to identify groups of entities that have similar characteristics.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
77
In the ________ phase,a BigData collection is broken into pieces and hundreds or thousands of independent processors search these pieces for something of interest.

A)Map
B)crash
C)control
D)Pig
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
78
Which of the following statements is true of BigData?

A)BigData cannot store graphics,audio,and video files.
B)BigData has low velocity and is generated slowly.
C)BigData refers to data sets that are at least a petabyte in size.
D)BigData contains only structured data.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
79
Structured data is data in the form of rows and columns.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
80
MapReduce is a technique for harnessing the power of thousands of computers working in parallel.
Unlock Deck
Unlock for access to all 92 flashcards in this deck.
Unlock Deck
k this deck
locked card icon
Unlock Deck
Unlock for access to all 92 flashcards in this deck.