Deck 11: Big Data and Analytics

Question
Big data requires effectively processing:

A) a single data type (numeric).
B) two data types (text and numeric).
C) many data types.
D) a single data type (text).
Question
The three 'v's commonly associated with big data include:

A) viewable, volume, and variety.
B) volume, variety, and velocity.
C) verified, variety, and velocity.
D) vigilant, viewable, and verified.
Question
________ is the most popular key-value store NoSQL database management system.

A) Access
B) Apache Cassandra
C) Neo4j
D) Redis
Question
The NoSQL model that incorporates 'column families' is called a:

A) key-value store.
B) document store.
C) wide-column store.
D) column-SQL database.
Question
Big Data includes:

A) large volumes of data with many different data types that are processed at very high speeds.
B) large volumes of data entry with a single data type processed at very high speeds.
C) large volumes of entity relationship diagrams (ERD) with many different data types that are processed at very high speeds.
D) large volumes of entity relationship diagrams (ERD) with a single data type processed at very high speeds.
Question
An organization that decides to adopt the most popular NoSQL database management system would select:

A) Access.
B) MongoDB.
C) Neo4j.
D) Redis.
Question
At a basic level, analytics refers to:

A) collecting data.
B) conducting a needs analysis.
C) analysis and interpretation of data.
D) normalizing data.
Question
The NoSQL model that is specifically designed to maintain information regarding the relationships (often real-world instances of entities) between data items is called a:

A) key-value store.
B) document store.
C) wide-column store.
D) graph-oriented database.
Question
NoSQL systems allow ________ by incorporating commodity servers that can be easily added to the architectural solution.

A) scaling down
B) scaling out
C) scaling up
D) scaling over
Question
Big data:

A) requires a normalized dataset to 3ʳᵈ Normal Form.
B) does not require a strictly defined data model.
C) requires a strictly defined schema.
D) requires a normalized dataset to BCNF.
Question
________ generally processes the largest quantities of data.

A) Operational databases
B) Transaction processing
C) Big data
D) Data marts
Question
The NoSQL model that includes a simple pair of a key and an associated collection of values is called a:

A) key-value store.
B) document store.
C) wide-column store.
D) graph database.
Question
NoSQL focuses on:

A) avoidance of replication of data.
B) minimizing storage space.
C) normalized data.
D) flexibility.
Question
NoSQL includes data storage and retrieval:

A) based on the relational model.
B) based on normalized tables.
C) not based on the relational model.
D) not based on data.
Question
An organization that requires a sole focus on performance, with the ability for keys to include strings, hashes, lists, and sorted sets, would select the ________ database management system.

A) Access
B) Excel Spreadsheet
C) Neo4j
D) Redis
Question
Apache Cassandra is a leading ________ NoSQL database management system.

A) key-value store
B) wide-column
C) relational
D) graph
Question
According to your text, NoSQL stands for:

A) Numbered Structured Query Language.
B) No Structured Query Language.
C) Not Only Structured Query Language.
D) Numeric Only Structured Query Language.
Question
An organization that requires a graph database that is highly scalable would select the ________ database management system.

A) Access
B) Excel Spreadsheet
C) Neo4j
D) Redis
Question
________ includes the value of speed in a NoSQL database.

A) Velocity
B) Vigilant
C) Verified
D) Variety
Question
________ includes NoSQL accommodation of various data types.

A) Velocity
B) Vigilant
C) Verified
D) Variety
Question
When the organization of data for reporting and analysis is determined at the time the data is used, the approach is called:

A) entity relationship diagram.
B) schema binding.
C) schema on read.
D) cognitive schema.
Question
An organization using HDFS realizes that hardware failure is a(n):

A) norm.
B) irregularity.
C) anomaly.
D) inconsistency.
Question
Hive is a(n) ________ data warehouse software.

A) Oracle
B) Microsoft
C) Macintosh
D) Apache
Question
With HDFS it is less expensive to move the execution of computation to data than to move the:

A) data to hardware.
B) data to systems analysis.
C) data to computation.
D) data to processes.
Question
The Hadoop Distributed File System (HDFS) is the foundation of a ________ infrastructure of Hadoop.

A) relational database management system
B) DBBMS
C) Java
D) data management
Question
Allowing users to dive deeper into the view of data with online analytical processing (OLAP) is an important part of:

A) predictive analytics.
B) descriptive analytics.
C) prescriptive analytics.
D) comparative analytics.
Question
________ is an important scripting language to help reduce the complexity of MapReduce.

A) Pig
B) Horse
C) Dog
D) Cat
Question
Application of statistical and computational methods to predict data events is:

A) predictive analytics.
B) descriptive analytics.
C) prescriptive analytics.
D) comparative analytics.
Question
When an organization must decide on optimization and simulation tools to make things happen, it is using:

A) predictive analytics.
B) descriptive analytics.
C) prescriptive analytics.
D) comparative analytics.
Question
It is true that in an HDFS cluster the DataNodes are the:

A) large number of slaves.
B) single master servers.
C) language libraries.
D) business intelligences.
Question
It is true that in an HDFS cluster the NameNode is the:

A) large number of slaves.
B) single master server.
C) language library.
D) business intelligence.
Question
Regarding big data value, the primary focus is on:

A) usefulness.
B) speed.
C) quantity.
D) variety.
Question
Although volume, variety, and velocity are considered the initial three 'v' dimensions, two additional 'v's of big data were later added. They are:

A) veracity and verified.
B) volume and verified.
C) verified and valuable.
D) veracity and value.
Question
NoSQL systems enable automated ________, which distributes the data among multiple nodes so that each server can operate independently on the data located on it.

A) sharing
B) sharding
C) SQL
D) mongo
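As an illustration of the idea in this card, distributing data among nodes so that each server works only on its own portion can be sketched with hash-based partitioning (sharding). This is a minimal sketch under stated assumptions, not how any particular NoSQL product does it: the function names `shard_for` and `distribute`, the MD5-based placement rule, and the sample records are all hypothetical.

```python
import hashlib

def shard_for(key, num_shards):
    """Pick a shard index for a key (hypothetical placement rule).

    A stable hash is used so the same key always maps to the same shard.
    """
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_shards

def distribute(records, num_shards):
    """Split a dict of records into one dict per shard."""
    shards = [dict() for _ in range(num_shards)]
    for key, value in records.items():
        shards[shard_for(key, num_shards)][key] = value
    return shards

# Hypothetical sample data: each record ends up on exactly one shard.
records = {"user:1": "Ana", "user:2": "Ben", "user:3": "Chen"}
shards = distribute(records, num_shards=2)
for index, shard in enumerate(shards):
    print(index, sorted(shard))
```

Because each key lives on exactly one shard, a server can answer requests for its keys without consulting the others, which is what makes adding commodity servers (scaling out) practical.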
Question
Descriptive, predictive, and ________ are the three main types of analytics.

A) adaptive
B) comparative
C) prescriptive
D) decisive
Question
________ are examples of Business Intelligence and Analytics 3.0 because they generate millions of observations per second.

A) Administrative systems
B) Web-based interaction logs
C) Web-based customer platforms
D) Smartphones
Question
________ includes concern about data quality issues.

A) Velocity
B) Vigilant
C) Veracity
D) Variety
Question
When a data repository (including internal and external data) does NOT follow a predefined schema, this is called a:

A) data dump.
B) data ocean.
C) data lake.
D) data stream.
Question
The oldest form of analytics is:

A) predictive analytics.
B) descriptive analytics.
C) prescriptive analytics.
D) comparative analytics.
Question
The Hadoop framework consists of the ________ algorithm to solve large scale problems.

A) MapSystem
B) MapReduce
C) MapCluster
D) MapComponent
Question
________ tools commonly load data into intermediate hypercube structures.

A) OLAP
B) MOLAP
C) ROLAP
D) TLAP
Question
First degree or complete price discrimination relates to:

A) the minimum price customers are willing to pay.
B) the maximum price customers are willing to pay.
C) the preferred product based on personal preference.
D) the number of products customers are willing to purchase.
Question
NoSQL focuses on avoidance of replication and minimizing storage space.
Question
A business owner that needs carefully normalized tables would likely need a relational database instead of a NoSQL database.
Question
The 'schema on read' approach often incorporates JSON or XML.
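To make the statement above concrete, here is a minimal sketch of schema on read with JSON: records are stored as raw text with no enforced structure, and a structure is imposed only when the data is read for analysis. The field names, the sample records, and the `read_with_schema` helper are assumptions made for illustration.

```python
import json

# Raw records are stored as-is; note the inconsistent fields and types.
raw_records = [
    '{"user": "ana", "amount": "19.99", "channel": "web"}',
    '{"user": "ben", "amount": 5}',
]

def read_with_schema(line):
    """Apply a (hypothetical) schema at read time, not at write time."""
    record = json.loads(line)
    return {
        "user": str(record["user"]),
        "amount": float(record.get("amount", 0.0)),    # coerce text or int
        "channel": record.get("channel", "unknown"),   # fill missing field
    }

rows = [read_with_schema(line) for line in raw_records]
total = sum(row["amount"] for row in rows)
print(rows)
print(round(total, 2))
```

A different analysis could read the same raw lines with a different schema, which is the flexibility that schema on write gives up.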
Question
NoSQL stands for 'Not only SQL.'
Question
Economies of storage indicate data storage costs increase every year.
Question
Transaction processing and management reporting tend to fit big data databases better than relational databases.
Question
________ are not used for querying and analyzing data stored in data warehouses.

A) Word processing programs
B) OLAP tools
C) MOLAP tools
D) Dashboard tools
Question
A researcher trying to explain why sales of garden supplies in Hawaii have decreased would be an example of ________ data mining.

A) explanatory
B) confirmatory
C) exploratory
D) laboratory
Question
Structured Query Language (SQL) is a set of methodologies, processes, architectures, and technologies that transform raw data into meaningful information.
Question
Decision Support Systems (DSS) were a precursor to analytics and business intelligence.
Question
When online analytical processing (OLAP) studies last year's sales, this represents:

A) predictive analytics.
B) descriptive analytics.
C) prescriptive analytics.
D) comparative analytics.
Question
Value (related to the five 'v's of big data) addresses the pursuit of a meaningful goal.
Question
NoSQL databases DO NOT support ACID (atomicity, consistency, isolation, and durability).
Question
All of the following are applications for big data and analytics EXCEPT:

A) business.
B) science and technology.
C) security and public health.
D) personal finances.
Question
Big data allows for two different data types (text and numeric).
Question
The original three 'v's attributed to big data include volume, variety, and velocity.
Question
________ is arguably the most common concern among individuals regarding big data analytics.

A) Saving money
B) Taking up large amounts of computer storage
C) Personal privacy
D) Processing time
Question
The goal of data mining related to analyzing data for unexpected relationships is:

A) explanatory.
B) confirmatory.
C) exploratory.
D) laboratory.
Question
Hive creates MapReduce jobs and executes them on a Hadoop Cluster.
Question
The 'dive in anywhere' characteristic of a data lake overrides constraints related to confidentiality.
Question
JSON is commonly used in conjunction with the 'document store' NoSQL database model.
Question
Server logs are an example of the variety of data types found in big data.
Question
The philosophical underpinnings of big data are based on schema on write.
Question
HDFS is an acronym for Hadoop Distributed File System.
Question
Big data databases tend to sacrifice consistency for availability.
Question
MapReduce is an algorithm for massive parallel processing utilized by Hadoop.
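The map, shuffle, and reduce steps behind this statement can be sketched in a few lines. This is a single-process illustration of the programming model only, assuming a word-count task; it does not use Hadoop, and the function names are hypothetical. In a real cluster the map and reduce calls would run in parallel on many nodes.

```python
from collections import defaultdict

def map_phase(document):
    """Emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in document.split()]

def shuffle(pairs):
    """Group emitted values by key, as happens between the two phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Combine all values for one key into a single result."""
    return key, sum(values)

def word_count(documents):
    mapped = [pair for doc in documents for pair in map_phase(doc)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())

counts = word_count(["big data big value", "data lake"])
print(counts)
```

Each `map_phase` call depends only on its own document and each `reduce_phase` call only on its own key, which is why the work can be spread across many machines.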
Question
HP HAVEn integrates HP technologies with open source big data technologies.
Question
'Collect everything' is a characteristic of a data lake.
Question
Neo4j is a wide-column NoSQL database management system developed by Oracle.
Question
Data in HDFS files cannot be updated.
Question
Graph-oriented databases are designed to maintain information regarding the relationships between data items.
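A small sketch can show what storing relationships as first-class data looks like. The `TinyGraph` class, the relationship names, and the sample people below are hypothetical and are not the API of Neo4j or any other product; the point is only that a query follows relationships directly instead of joining tables.

```python
from collections import defaultdict

class TinyGraph:
    """Toy in-memory graph: nodes are strings, edges carry a relationship name."""

    def __init__(self):
        self.edges = defaultdict(list)

    def relate(self, source, relationship, target):
        """Record a directed, named relationship between two items."""
        self.edges[source].append((relationship, target))

    def neighbors(self, source, relationship):
        """Return every target linked from source by the given relationship."""
        return [t for r, t in self.edges[source] if r == relationship]

def friends_of_friends(graph, person):
    """Traverse two FRIEND_OF hops starting from person."""
    result = []
    for friend in graph.neighbors(person, "FRIEND_OF"):
        result.extend(graph.neighbors(friend, "FRIEND_OF"))
    return result

g = TinyGraph()
g.relate("Alice", "FRIEND_OF", "Bob")
g.relate("Alice", "WORKS_AT", "Acme")
g.relate("Bob", "FRIEND_OF", "Carol")
print(friends_of_friends(g, "Alice"))
```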
Question
Hadoop is considered a relational database management system.
Question
HBase is a wide-column store database that runs on top of HDFS (modeled after Google's Bigtable).
Question
MongoDB is a proprietary NoSQL database management system created by Oracle.
Question
Word processing documents are commonly stored in a 'document store' NoSQL database model.
Question
The schema on write and schema on read are considered synonymous approaches.
Question
The target market for Hadoop is small to medium companies using local area networks.
Question
Apache Cassandra is a wide-column NoSQL database management system.
1
Big data requires effectively processing:

A) a single data type (numeric).
B) two data types (text and numeric).
C) many data types.
D) a single data type (text).
Answer: C
2
The three 'v's commonly associated with big data include:

A) viewable, volume, and variety.
B) volume, variety, and velocity.
C) verified, variety, and velocity.
D) vigilant, viewable, and verified.
Answer: B
3
________ is the most popular key-value store NoSQL database management system.

A) Access
B) Apache Cassandra
C) Neo4j
D) Redis
Answer: D