Exam 4: Data Mining for Business Intelligence

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

The simple split methodology splits the data into two mutually exclusive subsets called a ________ set and a ________ set.

Free
(Multiple Choice)
4.8/5
(35)
Correct Answer:
Verified

A

Mass, length, time, plane angle, energy, and electric charge are examples of physical measures whose data are represented in interval scales.

Free
(True/False)
4.8/5
(37)
Correct Answer:
Verified

False

Sketch a simple taxonomy of data in data mining.

Free
(Short Answer)
5.0/5
(36)
Correct Answer:
Verified

See Figure 4.3 in the textbook.

At the highest level of abstraction, data can be classified as ________ and ________.

(Multiple Choice)
4.8/5
(36)

The ________ index has been used in economics to measure the diversity of a population. The same concept can be used to determine the purity of a specific class as a result of a decision to branch along a particular attribute or variable.

(Short Answer)
4.7/5
(25)

________ means that discovered patterns in a dataset hold true on new data with a sufficient degree of certainty.

(Short Answer)
4.7/5
(25)

On the commercial side, the most common use of data mining has been in ________ sectors.

(Multiple Choice)
4.8/5
(31)

________ data represent the labels of multiple classes used to divide a variable into specific groups.

(Short Answer)
4.9/5
(42)

Compared to the other steps in CRISP-DM, data preprocessing consumes the most time and effort; most believe that this step accounts for roughly 80 percent of the total time spent on a data mining project

(True/False)
4.8/5
(35)

________ data can be readily represented by some sort of probability distribution.

(Short Answer)
4.8/5
(31)

Data mining is tightly positioned at the intersection of many disciplines. Those disciplines include all of the following except:

(Multiple Choice)
4.8/5
(40)

________ data, also known as categorical data, contains both nominal and ordinal data.

(Short Answer)
4.8/5
(38)

Data mining is a prime candidate for better management of companies that are data-rich, but knowledge-poor.

(True/False)
4.8/5
(26)

A good question to ask with respect to the patterns/relationships that association rule mining can discover is "Are all association rules interesting and useful?" In order to answer such a question, association rule mining uses two common metrics ________ and ________.

(Multiple Choice)
4.7/5
(31)

________ are essentially a hierarchy of if-then statements. They are most appropriate for categorical and interval data.

(Multiple Choice)
4.9/5
(30)

Identify and describe the two types of numerical data. Give an example of each.

(Essay)
4.9/5
(38)

The data mining environment is usually a client-server architecture or a Web-based information systems architecture.

(True/False)
4.9/5
(33)

Because of their ability to model highly complex real-world problems, researchers and practitioners have found many uses for ________.

(Multiple Choice)
4.9/5
(36)

Technically speaking, data mining is a process that uses statistical, mathematical, and artificial intelligence techniques to extract and identify useful information and subsequent knowledge from large sets of data.

(True/False)
4.8/5
(37)

Identify and describe the two types of categorical data. Give an example of each.

(Essay)
4.9/5
(30)
Showing 1 - 20 of 70
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)