Deck 9: Business Intelligence Systems

Full screen (f)
exit full mode
Question
The use of business intelligence (BI) for identifying changes in the purchasing patterns of customers is a labor-intensive process.
Use Space or
up arrow
down arrow
to flip the card.
Question
What are the three primary activities in the business intelligence process?
Question
The patterns, relationships, and trends identified by BI systems are called business intelligence.
Question
________ requires users to request business intelligence results.

A) Push publishing
B) Pull publishing
C) Data acquisition
D) Data mining
Question
What are business intelligence systems?
Question
________ is the process of obtaining, cleaning, organizing, relating, and cataloging source data.

A) Data manipulation
B) BI analysis
C) Publish results
D) Data acquisition
Question
How does business intelligence help marketers identify changes in the purchasing patterns of customers?
Question
BI analysis is the process of obtaining, cleaning, organizing, relating, and cataloging source data.
Question
Project management is one of the few domains in which business intelligence is rarely used.
Question
The data that an organization purchases from data vendors can act as the source data for a business intelligence system.
Question
The three fundamental categories of BI analysis are reporting, data mining, and BigData.
Question
As information systems, BI systems have three standard components.
Question
________ is the process of delivering business intelligence to users without any request from the users.

A) Push publishing
B) Pull publishing
C) Data acquisition
D) Data mining
Question
Which of the following statements is TRUE of business intelligence (BI) systems?

A) Business intelligence systems are primarily used for developing software systems and data mining applications.
B) The four standard components of business intelligence systems are software, procedures, applications, and programs.
C) The software component of a business intelligence system is called an intelligence database.
D) Business intelligence systems analyze an organization's past performance to make predictions.
Question
Which of the following is a fundamental category of business intelligence (BI) analysis?

A) data acquisition
B) reporting
C) push publishing
D) pull publishing
Question
Problem solving requires project management.
Question
Push publishing requires a user to request BI results.
Question
Which of the following activities in the business intelligence process involves delivering business intelligence to the knowledge workers who need it?

A) data acquisition
B) BI analysis
C) publish results
D) data mining
Question
________ process operational and other data in organizations to analyze past performance and make predictions.

A) Virtualization techniques
B) Live migration techniques
C) Business intelligence systems
D) Windowing systems
Question
The purchasing pattern of an individual never change.
Question
Data inconsistencies can occur from the nature of a business activity.
Question
A ________ is designed to extract data from operational systems and other sources, clean the data, and store and catalog that data for processing by business intelligence tools.

A) data mart
B) data center
C) data room
D) data warehouse
Question
External data purchased from outside resources are not included in data warehouses.
Question
Data granularity refers to the amount of data represented by data.
Question
Placing business intelligence (BI) applications on operational servers can dramatically reduce system performance.
Question
The granularity in clickstream data is too coarse.
Question
Which of the following statements is TRUE of data with granularity?

A) It can be too fine or too coarse and also have wrong granularity.
B) If granularity is too coarse, data can be made finer by summing and combining.
C) It is not possible to have a wrong granularity for a data.
D) If granularity is too coarse, data can be separated into constituent parts using regression.
Question
Differentiate between push publishing and pull publishing.
Question
A data warehouse is a facility for managing an organization's business intelligence data.
Question
Problematic data are termed ________.

A) random data
B) macro data
C) vague data
D) dirty data
Question
If the granularity of certain data is too coarse, the data can be separated into constituent parts using statistical techniques.
Question
The source, format, assumptions and constraints, and other facts concerning certain data are called ________.

A) metadata
B) data structures
C) microdata
D) network packets
Question
A ________ is a data collection, smaller than the data warehouse that addresses the needs of a particular department or functional area of a business.

A) data mart
B) data room
C) datasheet
D) dataspace
Question
Which of the following statements is TRUE of a data warehouse?

A) A data warehouse is larger than a data mart.
B) A data warehouse functions like a retail store in a supply chain.
C) Users in a data warehouse obtain data pertaining to a business function from a data mart.
D) Data analysts who work with a data warehouse are experts in a particular business function.
Question
A ________ is a facility for managing an organization's business intelligence data.

A) datasheet
B) dataspace
C) data warehouse
D) data table
Question
The use of an organization's operational data as the source data for a business intelligence system is not usually recommended because it ________.

A) is not possible to create reports based on operational data
B) is not possible to perform business intelligence analyses on operational data
C) requires considerable processing and can drastically reduce system performance
D) considers only the external data and not the internal data regarding the organization's functions
Question
________ refers to the level of detail represented by data.

A) Abstraction
B) Granularity
C) Dimensionality
D) Aggregation
Question
Which of the following problems is particularly common for data that have been gathered over time?

A) wrong granularity
B) lack of integration
C) lack of consistency
D) missing values
Question
Users in a data mart obtain data that pertain to a particular business function from a ________.

A) data room
B) data center
C) datasheet
D) data warehouse
Question
The more attributes there are in a sample data, the easier it is to build a model that fits the sample data, but that is worthless as a predictor. Which of the following best explains this phenomenon?

A) the free rider problem
B) the curse of dimensionality
C) the tragedy of the commons
D) the zero-sum game
Question
The curse of dimensionality states that the more attributes there are, the more difficult it is to build a model that fits the sample data.
Question
What is clickstream data?
Question
________ is the process of sorting, grouping, summing, filtering, and formatting structured data.

A) Push publishing
B) Publish results
C) Cloud computing
D) Reporting analysis
Question
Data marts are data collections that address the needs of a particular department or functional area of a business.
Question
Which of the following refers to data in the form of rows and columns?

A) granulated data
B) structured data
C) micro data
D) coarse data
Question
Which of the following statements is TRUE of unsupervised data mining?

A) Analysts apply unsupervised data mining techniques to estimate the parameters of a developed model.
B) Analysts create hypotheses only after performing an analysis.
C) Regression analysis is the most commonly used unsupervised data mining technique.
D) Data miners develop models prior to performing an analysis.
Question
An advantage of data warehouses is the low cost required to create, staff, and operate them.
Question
Explain the curse of dimensionality.
Question
Data marts are usually larger than data warehouses.
Question
Explain the functions of a data warehouse.
Question
Users in a data mart obtain data that pertain to a particular business function from a data warehouse.
Question
________ is the application of statistical techniques to find patterns and relationships among data for classification and prediction.

A) Data encryption
B) Data warehousing
C) Data mining
D) Data decryption
Question
________ are reports produced when something out of predefined bounds occurs.

A) Exception reports
B) Static reports
C) Dynamic reports
D) Subscription reports
Question
________ techniques emerged from the combined discipline of statistics, mathematics, artificial intelligence, and machine-learning.

A) Push publishing
B) Pull publishing
C) Data mining
D) Exception reporting
Question
What is data granularity?
Question
How is a data warehouse different from a data mart?
Question
The goal of ________, a type of business intelligence analysis, is to create information about past performance.

A) push publishing
B) data mining
C) reporting analyses
D) BigData
Question
What are the functions of a data warehouse?
Question
Data analysts who work with data warehouses are experts at data management, data cleaning, data transformation, and data relationships.
Question
________ is an unsupervised data mining technique in which statistical techniques identify groups of entities that have similar characteristics.

A) Cluster analysis
B) Content indexing
C) Regression analysis
D) Cloud computing
Question
Explain supervised data mining.
Question
MapReduce is a technique for harnessing the power of thousands of computers working in parallel.
Question
In the case of ________, data miners develop models prior to conducting analyses and then apply statistical techniques to data to estimate parameters of the models.

A) pull publishing techniques
B) supervised data mining
C) push publishing techniques
D) unsupervised data mining
Question
In the ________ phase, a BigData collection is broken into pieces and hundreds or thousands of independent processors search these pieces for something of interest.

A) crash
B) break
C) reduce
D) map
Question
________ is an open source program supported by the Apache Foundation that manages thousands of computers and that implements MapReduce.

A) Hadoop
B) BigData
C) Linux
D) Apache Wave
Question
BigData refers to data that have great variety and may have structured data as well as different formats.
Question
BigData has volume, velocity, and variation characteristics that far exceed those of traditional reporting and data mining.
Question
BigData has low velocity and is generated slowly.
Question
Reporting analysis is used primarily for classifying and predicting BI data.
Question
Cluster analysis measures the impact of a set of variables on another variable.
Question
Regression analysis is used to identify groups of entities that have similar characteristics.
Question
The results generated in the map phase are combined in the ________ phase.

A) pig
B) control
C) reduce
D) construct
Question
Which of the following statements is TRUE of Hadoop?

A) Hadoop is written in C++ and runs on Linux.
B) Hadoop includes a query language called Big.
C) Hadoop is an open source program that implements MapReduce.
D) Technical skills are not required to run and use Hadoop.
Question
Structured data is data in the form of rows and columns.
Question
________ is used to measure the impact of a set of variables on another variable during data mining.

A) Cluster analysis
B) Context indexing
C) Cloud computing
D) Regression analysis
Question
What is data mining?
Question
Regression analysis is used in ________.

A) progress reporting
B) bug reporting
C) supervised data mining
D) unsupervised data mining
Question
Which of the following statements is TRUE of BigData?

A) BigData contains only structured data.
B) BigData has low velocity and is generated slowly.
C) BigData cannot store graphics, audio, and video files.
D) BigData refers to data sets that are at least a petabyte in size.
Question
With unsupervised data mining, analysts do not create a model or hypothesis before running the analysis.
Question
What is unsupervised data mining?
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/104
auto play flashcards
Play
simple tutorial
Full screen (f)
exit full mode
Deck 9: Business Intelligence Systems
1
The use of business intelligence (BI) for identifying changes in the purchasing patterns of customers is a labor-intensive process.
False
2
What are the three primary activities in the business intelligence process?
The three primary activities in the business intelligence process include: acquire data, perform analysis, and publish results.
Data acquisition is the process of obtaining, cleaning, organizing, relating, and cataloging source data. Business intelligence analysis is the process of creating business intelligence and includes three fundamental categories: reporting, data mining, and BigData. Publish results is the process of delivering business intelligence to the knowledge workers who need it.
3
The patterns, relationships, and trends identified by BI systems are called business intelligence.
True
4
________ requires users to request business intelligence results.

A) Push publishing
B) Pull publishing
C) Data acquisition
D) Data mining
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
5
What are business intelligence systems?
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
6
________ is the process of obtaining, cleaning, organizing, relating, and cataloging source data.

A) Data manipulation
B) BI analysis
C) Publish results
D) Data acquisition
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
7
How does business intelligence help marketers identify changes in the purchasing patterns of customers?
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
8
BI analysis is the process of obtaining, cleaning, organizing, relating, and cataloging source data.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
9
Project management is one of the few domains in which business intelligence is rarely used.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
10
The data that an organization purchases from data vendors can act as the source data for a business intelligence system.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
11
The three fundamental categories of BI analysis are reporting, data mining, and BigData.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
12
As information systems, BI systems have three standard components.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
13
________ is the process of delivering business intelligence to users without any request from the users.

A) Push publishing
B) Pull publishing
C) Data acquisition
D) Data mining
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
14
Which of the following statements is TRUE of business intelligence (BI) systems?

A) Business intelligence systems are primarily used for developing software systems and data mining applications.
B) The four standard components of business intelligence systems are software, procedures, applications, and programs.
C) The software component of a business intelligence system is called an intelligence database.
D) Business intelligence systems analyze an organization's past performance to make predictions.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
15
Which of the following is a fundamental category of business intelligence (BI) analysis?

A) data acquisition
B) reporting
C) push publishing
D) pull publishing
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
16
Problem solving requires project management.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
17
Push publishing requires a user to request BI results.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
18
Which of the following activities in the business intelligence process involves delivering business intelligence to the knowledge workers who need it?

A) data acquisition
B) BI analysis
C) publish results
D) data mining
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
19
________ process operational and other data in organizations to analyze past performance and make predictions.

A) Virtualization techniques
B) Live migration techniques
C) Business intelligence systems
D) Windowing systems
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
20
The purchasing pattern of an individual never change.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
21
Data inconsistencies can occur from the nature of a business activity.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
22
A ________ is designed to extract data from operational systems and other sources, clean the data, and store and catalog that data for processing by business intelligence tools.

A) data mart
B) data center
C) data room
D) data warehouse
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
23
External data purchased from outside resources are not included in data warehouses.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
24
Data granularity refers to the amount of data represented by data.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
25
Placing business intelligence (BI) applications on operational servers can dramatically reduce system performance.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
26
The granularity in clickstream data is too coarse.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
27
Which of the following statements is TRUE of data with granularity?

A) It can be too fine or too coarse and also have wrong granularity.
B) If granularity is too coarse, data can be made finer by summing and combining.
C) It is not possible to have a wrong granularity for a data.
D) If granularity is too coarse, data can be separated into constituent parts using regression.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
28
Differentiate between push publishing and pull publishing.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
29
A data warehouse is a facility for managing an organization's business intelligence data.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
30
Problematic data are termed ________.

A) random data
B) macro data
C) vague data
D) dirty data
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
31
If the granularity of certain data is too coarse, the data can be separated into constituent parts using statistical techniques.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
32
The source, format, assumptions and constraints, and other facts concerning certain data are called ________.

A) metadata
B) data structures
C) microdata
D) network packets
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
33
A ________ is a data collection, smaller than the data warehouse that addresses the needs of a particular department or functional area of a business.

A) data mart
B) data room
C) datasheet
D) dataspace
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
34
Which of the following statements is TRUE of a data warehouse?

A) A data warehouse is larger than a data mart.
B) A data warehouse functions like a retail store in a supply chain.
C) Users in a data warehouse obtain data pertaining to a business function from a data mart.
D) Data analysts who work with a data warehouse are experts in a particular business function.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
35
A ________ is a facility for managing an organization's business intelligence data.

A) datasheet
B) dataspace
C) data warehouse
D) data table
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
36
The use of an organization's operational data as the source data for a business intelligence system is not usually recommended because it ________.

A) is not possible to create reports based on operational data
B) is not possible to perform business intelligence analyses on operational data
C) requires considerable processing and can drastically reduce system performance
D) considers only the external data and not the internal data regarding the organization's functions
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
37
________ refers to the level of detail represented by data.

A) Abstraction
B) Granularity
C) Dimensionality
D) Aggregation
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
38
Which of the following problems is particularly common for data that have been gathered over time?

A) wrong granularity
B) lack of integration
C) lack of consistency
D) missing values
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
39
Users in a data mart obtain data that pertain to a particular business function from a ________.

A) data room
B) data center
C) datasheet
D) data warehouse
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
40
The more attributes there are in a sample data, the easier it is to build a model that fits the sample data, but that is worthless as a predictor. Which of the following best explains this phenomenon?

A) the free rider problem
B) the curse of dimensionality
C) the tragedy of the commons
D) the zero-sum game
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
41
The curse of dimensionality states that the more attributes there are, the more difficult it is to build a model that fits the sample data.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
42
What is clickstream data?
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
43
________ is the process of sorting, grouping, summing, filtering, and formatting structured data.

A) Push publishing
B) Publish results
C) Cloud computing
D) Reporting analysis
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
44
Data marts are data collections that address the needs of a particular department or functional area of a business.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
45
Which of the following refers to data in the form of rows and columns?

A) granulated data
B) structured data
C) micro data
D) coarse data
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
46
Which of the following statements is TRUE of unsupervised data mining?

A) Analysts apply unsupervised data mining techniques to estimate the parameters of a developed model.
B) Analysts create hypotheses only after performing an analysis.
C) Regression analysis is the most commonly used unsupervised data mining technique.
D) Data miners develop models prior to performing an analysis.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
47
An advantage of data warehouses is the low cost required to create, staff, and operate them.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
48
Explain the curse of dimensionality.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
49
Data marts are usually larger than data warehouses.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
50
Explain the functions of a data warehouse.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
51
Users in a data mart obtain data that pertain to a particular business function from a data warehouse.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
52
________ is the application of statistical techniques to find patterns and relationships among data for classification and prediction.

A) Data encryption
B) Data warehousing
C) Data mining
D) Data decryption
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
53
________ are reports produced when something out of predefined bounds occurs.

A) Exception reports
B) Static reports
C) Dynamic reports
D) Subscription reports
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
54
________ techniques emerged from the combined discipline of statistics, mathematics, artificial intelligence, and machine-learning.

A) Push publishing
B) Pull publishing
C) Data mining
D) Exception reporting
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
55
What is data granularity?
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
56
How is a data warehouse different from a data mart?
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
57
The goal of ________, a type of business intelligence analysis, is to create information about past performance.

A) push publishing
B) data mining
C) reporting analyses
D) BigData
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
58
What are the functions of a data warehouse?
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
59
Data analysts who work with data warehouses are experts at data management, data cleaning, data transformation, and data relationships.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
60
________ is an unsupervised data mining technique in which statistical techniques identify groups of entities that have similar characteristics.

A) Cluster analysis
B) Content indexing
C) Regression analysis
D) Cloud computing
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
61
Explain supervised data mining.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
62
MapReduce is a technique for harnessing the power of thousands of computers working in parallel.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
63
In the case of ________, data miners develop models prior to conducting analyses and then apply statistical techniques to data to estimate parameters of the models.

A) pull publishing techniques
B) supervised data mining
C) push publishing techniques
D) unsupervised data mining
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
64
In the ________ phase, a BigData collection is broken into pieces and hundreds or thousands of independent processors search these pieces for something of interest.

A) crash
B) break
C) reduce
D) map
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
65
________ is an open source program supported by the Apache Foundation that manages thousands of computers and that implements MapReduce.

A) Hadoop
B) BigData
C) Linux
D) Apache Wave
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
66
BigData refers to data that have great variety and may have structured data as well as different formats.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
67
BigData has volume, velocity, and variation characteristics that far exceed those of traditional reporting and data mining.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
68
BigData has low velocity and is generated slowly.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
69
Reporting analysis is used primarily for classifying and predicting BI data.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
70
Cluster analysis measures the impact of a set of variables on another variable.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
71
Regression analysis is used to identify groups of entities that have similar characteristics.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
72
The results generated in the map phase are combined in the ________ phase.

A) pig
B) control
C) reduce
D) construct
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
73
Which of the following statements is TRUE of Hadoop?

A) Hadoop is written in C++ and runs on Linux.
B) Hadoop includes a query language called Big.
C) Hadoop is an open source program that implements MapReduce.
D) Technical skills are not required to run and use Hadoop.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
74
Structured data is data in the form of rows and columns.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
75
________ is used to measure the impact of a set of variables on another variable during data mining.

A) Cluster analysis
B) Context indexing
C) Cloud computing
D) Regression analysis
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
76
What is data mining?
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
77
Regression analysis is used in ________.

A) progress reporting
B) bug reporting
C) supervised data mining
D) unsupervised data mining
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
78
Which of the following statements is TRUE of BigData?

A) BigData contains only structured data.
B) BigData has low velocity and is generated slowly.
C) BigData cannot store graphics, audio, and video files.
D) BigData refers to data sets that are at least a petabyte in size.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
79
With unsupervised data mining, analysts do not create a model or hypothesis before running the analysis.
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
80
What is unsupervised data mining?
Unlock Deck
Unlock for access to all 104 flashcards in this deck.
Unlock Deck
k this deck
locked card icon
Unlock Deck
Unlock for access to all 104 flashcards in this deck.