Deck 12: Big Data datawarehouse and Business Intelligence Systems
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Question
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/102
Play
Full screen (f)
Deck 12: Big Data datawarehouse and Business Intelligence Systems
1
Business Intelligence (BI)systems obtain data in three different ways.
True
2
Data warehouses also store the data warehouse metadata.
True
3
A data mart is a collection of data that addresses a particular component of a functional area of a business.
True
4
Business Intelligence (BI)reporting systems are used to filter data,sort data,group data and make simple calculations based on the data.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
5
Dimensional databases use the star schema.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
6
Data warehouses often include data purchased from outside vendors.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
7
Data warehouse data are frequently denormalized.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
8
Business Intelligence (BI)reporting systems summarize the current status of business activities and compare that status with past events,but not with predicted future activities.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
9
Data mining uses sophisticated statistical and mathematical techniques to perform what-if analyses,to make predictions,and to facilitate decision making.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
10
Big Data is the name given to the enormous datasets generated by Web 2.0 applications.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
11
A data warehouse is a database system that has data,programs and personnel specialized in Business Intelligence (BI)processing.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
12
Business Intelligence (BI)systems support operational activities.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
13
Problematic data are called "dirty data."
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
14
Operational databases store historical data.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
15
Business Intelligence (BI)systems are information systems that help users analyze and use data.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
16
Report delivery is more important for data mining than it is for reporting systems.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
17
Dimensional databases are used for analytical data processing.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
18
Metadata about the data's source,format,assumptions and constraints are kept in a data warehouse metadata database.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
19
Data warehouses are populated with data prepared by data extraction,transformation and load (ETL)programs.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
20
Business Intelligence (BI)reporting systems can analyze data using standard SQL.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
21
In a snowflake table,each dimension table is normalized.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
22
In a common form of RFM analysis,customers with an R score of 5 are in the 20% of customers who have the most recent orders.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
23
Microsoft Excel 2013 cannot import SQL Server 2013 data directly into a PivotTable report,but must first place the data into a worksheet.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
24
Operational databases contain a fact table.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
25
An OLAP cube is limited to three axes.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
26
In RFM analysis,M stands for "how much money."
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
27
In RFM analysis,R stands for "how recently."
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
28
Although Microsoft Excel 2013 will create a PivotTable report using SQL Server 2012 data,it does not have formatting tools that can be used with the report.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
29
Business Intelligence (BI)reporting systems are intended to create meaningful information from disparate data sources and to deliver that information to the proper users on a timely basis.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
30
When creating an OLAP report based on SQL Server 2012 data,it is often a good idea to create a view to organize the data needed for the OLAP report.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
31
To create an OLAP report for an SQL Server 2012 database,use the PivotTable tool in SQL Server 2012.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
32
RFM analysis is a way of analyzing and ranking customers based on online survey data.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
33
OLAP provides the ability to sum,count,average and perform other simple arithmetic operations on groups of data.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
34
In a common form of RFM analysis,a score of 1 is "high" or "good" while a score of 5 is "low" of "bad."
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
35
A star schema resembles a star,with a dimension table at the center and fact tables radiating out from the center.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
36
In a common form of RFM analysis,an RFM score of {5 1 1} means that the customer orders frequently and orders items of high monetary value but has not ordered anything for some time.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
37
In a common form of RFM analysis,customers are sorted into five groups and given an associated score depending on their group.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
38
Microsoft Excel 2013 allows us to connect directly to an SQL Server 2012 database when building a PivotTable.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
39
In RFM analysis,F stands for "how frequently."
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
40
The term drill down refers to the capability of seeing the data in smaller and smaller units.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
41
Data mining is the application of mathematical and statistical techniques to find patterns and relationships that can be used to classify and predict future outcomes.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
42
Which of the following is (are)true about data mining applications?
A)They use sophisticated mathematical techniques.
B)They use sophisticated statistical techniques.
C)Their report delivery is more important than report delivery for reporting systems.
D)A and B
E)A,B,and C
A)They use sophisticated mathematical techniques.
B)They use sophisticated statistical techniques.
C)Their report delivery is more important than report delivery for reporting systems.
D)A and B
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
43
Data mining applications are used to accomplish which of the following tasks?
A)Perform what-if analysis
B)Make predications
C)Facilitate decision making
D)A and B
E)A,B,and C
A)Perform what-if analysis
B)Make predications
C)Facilitate decision making
D)A and B
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
44
In the MapReduce process,the Reduce step is followed by the Map step.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
45
Most of NoSQL nonrelational database methodologies are known as structured storage.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
46
We have obtained access to the company's operational data.We examine 50 records for customers with phone numbers that should use the current area code of 345.Of these 50 records,we find 10 that still use an older area code of 567.This is an example of ________.
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "wrong format" problem
E)a "too much data" problem
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "wrong format" problem
E)a "too much data" problem
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
47
NoSQL really stands of "Not only SQL."
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
48
Amazon.com's Dynamo was an early example of structured storage.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
49
Business Intelligence (BI)systems fall into which of the following categories?
A)Processing
B)Reporting
C)Data mining
D)A and B
E)B and C
A)Processing
B)Reporting
C)Data mining
D)A and B
E)B and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
50
We have obtained access to the company's operational data.In one record,we find that a customer's age has been recorded as "337." This is an example of ________.
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "wrong format" problem
E)a "too much data" problem
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "wrong format" problem
E)a "too much data" problem
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
51
The movement that uses different database methods than the relational model and/or SQL is called the NoSQL movement.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
52
Facebook uses the Apache Software Foundation's Cassandra NoSQL database.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
53
Most data mining techniques are simple and easy to use.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
54
Business Intelligence (BI)reporting systems can do which of the following operations?
A)Filter data
B)Group data
C)Modify data
D)A and B
E)A,B,and C
A)Filter data
B)Group data
C)Modify data
D)A and B
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
55
One Business Intelligence (BI)reporting system that uses extensions to SQL is:
A)cluster analysis.
B)OLAP.
C)regression analysis.
D)RFM analysis.
E)A,B,and C
A)cluster analysis.
B)OLAP.
C)regression analysis.
D)RFM analysis.
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
56
Business Intelligence (BI)systems do which of the following?
A)Analyze current and past activities
B)Predict future events
C)Record and process transactions
D)A and B
E)A,B,and C
A)Analyze current and past activities
B)Predict future events
C)Record and process transactions
D)A and B
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
57
Business Intelligence (BI)systems obtain their data by which of the following means?
A)Read and process data from an operational database
B)Process extracts from operational databases
C)Process data purchased from data vendors
D)A and B
E)A,B,and C
A)Read and process data from an operational database
B)Process extracts from operational databases
C)Process data purchased from data vendors
D)A and B
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
58
Google's Bigtable was an early example of structured storage.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
59
Which of the following is a reason that operational data are difficult to read?
A)Dirty data
B)Missing values
C)Nonintegrated data
D)A and B
E)A,B,and C
A)Dirty data
B)Missing values
C)Nonintegrated data
D)A and B
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
60
Structured storage column families are indistinguishable for relational database tables.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
61
The "R" in RFM analysis stands for ________.
A)rank
B)recent
C)relationship
D)readiness
E)random
A)rank
B)recent
C)relationship
D)readiness
E)random
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
62
Dimensional databases are used to track historical data,and therefore must have a ________.
A)time dimension
B)customer dimension
C)sales dimension
D)A or B
E)A and B
A)time dimension
B)customer dimension
C)sales dimension
D)A or B
E)A and B
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
63
In OLAP,the characteristic of a measure is called a ________.
A)level
B)dimension
C)measure
D)member
E)slice
A)level
B)dimension
C)measure
D)member
E)slice
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
64
A data mart differs from a data warehouse in that:
A)it has a smaller database.
B)it deals with a particular component or functional area of the business.
C)data mart users do not have the data management expertise of data warehouse employees.
D)A and B
E)A,B,and C
A)it has a smaller database.
B)it deals with a particular component or functional area of the business.
C)data mart users do not have the data management expertise of data warehouse employees.
D)A and B
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
65
Data warehouses use a(n)________.
A)operational database
B)dimensional database
C)structured storage
D)A or B
E)All of the above
A)operational database
B)dimensional database
C)structured storage
D)A or B
E)All of the above
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
66
RFM analysis analyzes and ranks customers based on:
A)their purchasing patterns.
B)their income status.
C)their residential location.
D)A and B
E)A,B,and C
A)their purchasing patterns.
B)their income status.
C)their residential location.
D)A and B
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
67
A data warehouse database differs from an operational database because:
A)data warehouse data are not stored in tables.
B)data warehouse databases do not have metadata.
C)data warehouse data are often denormalized.
D)B and C
E)A,B,and C
A)data warehouse data are not stored in tables.
B)data warehouse databases do not have metadata.
C)data warehouse data are often denormalized.
D)B and C
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
68
Snowflake schemas have normalized ________.
A)fact tables
B)dimension tables
C)map tables
D)reduce tables
E)None of the above
A)fact tables
B)dimension tables
C)map tables
D)reduce tables
E)None of the above
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
69
RFM scores commonly range from ________,with ________ being the "high" or "most desirable" (from the vendor's point of view)score.
A)0 to 5;0
B)0 to 5;5
C)1 to 5;1
D)1 to 5,5
E)1 to 10,10
A)0 to 5;0
B)0 to 5;5
C)1 to 5;1
D)1 to 5,5
E)1 to 10,10
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
70
OLAP stands for ________.
A)OnLine Analytical Processing
B)OffLine Analytical Processing
C)OnLine Analysis Process
D)OffLine Analysis Process
E)Old,Lazy And Particular
A)OnLine Analytical Processing
B)OffLine Analytical Processing
C)OnLine Analysis Process
D)OffLine Analysis Process
E)Old,Lazy And Particular
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
71
An OLAP cube is called that because some products show OLAP displays on ________ axes.
A)one
B)two
C)three
D)four
E)a number that varies depending upon the analysis being done
A)one
B)two
C)three
D)four
E)a number that varies depending upon the analysis being done
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
72
Which of the following are components of a data warehouse?
A)Data extract,transform,and load (ETL)preparation programs
B)Data warehouse data
C)Data warehouse metadata
D)B and C
E)A,B,and C
A)Data extract,transform,and load (ETL)preparation programs
B)Data warehouse data
C)Data warehouse metadata
D)B and C
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
73
The "M" in RFM analysis stands for ________.
A)money
B)mostly
C)modest
D)modern
E)modem
A)money
B)mostly
C)modest
D)modern
E)modem
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
74
Star schemas have a ________ at the center of the star.
A)fact table
B)dimension table
C)map table
D)reduce table
E)None of the above
A)fact table
B)dimension table
C)map table
D)reduce table
E)None of the above
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
75
A Business Intelligence (BI)reporting system:
A)creates meaningful information from disparate data sources.
B)delivers information to users on a timely basis.
C)uses statistical procedures to predict future events.
D)A and B
E)A,B,and C
A)creates meaningful information from disparate data sources.
B)delivers information to users on a timely basis.
C)uses statistical procedures to predict future events.
D)A and B
E)A,B,and C
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
76
In OLAP,the data item of interest is called a ________.
A)level
B)dimension
C)measure
D)member
E)slice
A)level
B)dimension
C)measure
D)member
E)slice
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
77
The term drill down means the user wants to ________.
A)summarize data
B)get older data
C)sort data
D)get more details
E)aggregate data
A)summarize data
B)get older data
C)sort data
D)get more details
E)aggregate data
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
78
We have done an RFM analysis on our customer data.John Smith has a score of {5 1 1}.This means that John:
A)has ordered recently,and orders a lot when he orders.
B)hasn't ordered recently,but orders a lot when he orders.
C)has ordered recently,but doesn't order a lot when he orders.
D)hasn't ordered recently,and doesn't order a lot when he orders.
E)None of the above is correct.
A)has ordered recently,and orders a lot when he orders.
B)hasn't ordered recently,but orders a lot when he orders.
C)has ordered recently,but doesn't order a lot when he orders.
D)hasn't ordered recently,and doesn't order a lot when he orders.
E)None of the above is correct.
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
79
We have obtained access to the company's operational data.We have been asked to produce a report with an item by item analysis of sales,but the only sales figure available is the total sale value for each order.This is an example of ________.
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "wrong format" problem
E)a "too much data" problem
A)dirty data
B)inconsistent data
C)nonintegrated data
D)a "wrong format" problem
E)a "too much data" problem
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck
80
Slowly changing dimensions are handled by a(n)________.
A)operational database
B)dimensional database
C)structured storage
D)A or B
E)All of the above
A)operational database
B)dimensional database
C)structured storage
D)A or B
E)All of the above
Unlock Deck
Unlock for access to all 102 flashcards in this deck.
Unlock Deck
k this deck