Exam 5: Data Mining for Business Intelligence

List and briefly explain four of the factors to be considered when assessing a model to be used for classification.

Free
(Essay)
4.8/5
(31)
Correct Answer:
Verified

Predictive accuracy. The model's ability to correctly predict the class label of new or previously unseen data.

Speed. Computational costs involved in generating and using the model, where faster is better.

Robustness. The model's ability to make reasonably accurate predictions, given noisy data or data with missing and erroneous values.

Scalability. The ability to construct a prediction model efficiently given a large amount of data.

Interpretability. The level of understanding and insight provided by the model.
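
As an illustration of the first factor, predictive accuracy can be estimated as the share of previously unseen instances whose class label the model predicts correctly. The sketch below is a minimal example, not part of the original answer; the function name `predictive_accuracy` and the sample labels are assumptions made up for illustration.

```python
def predictive_accuracy(actual, predicted):
    """Return the fraction of instances whose predicted label equals the actual label."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    if not actual:
        raise ValueError("at least one instance is required")
    correct = sum(1 for a, p in zip(actual, predicted) if a == p)
    return correct / len(actual)


# Hypothetical class labels for five previously unseen instances.
actual = ["yes", "no", "yes", "yes", "no"]
predicted = ["yes", "no", "no", "yes", "no"]

accuracy = predictive_accuracy(actual, predicted)
print(accuracy)  # 4 of the 5 labels match
```

The other factors (speed, robustness, scalability, interpretability) are not captured by this single number and would need to be assessed separately.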

________ are essentially a hierarchy of if-then statements. They are most appropriate for categorical and interval data.

Free
(Multiple Choice)
4.8/5
(31)
Correct Answer:
Verified

C

The most commonly used measure to calculate the closeness between pairs of items in cluster analysis is the ________.

Free
(Short Answer)
4.7/5
(43)
Correct Answer:
Verified

distance measure
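
To make the idea of a distance measure concrete, here is a minimal sketch of two widely used ones, Euclidean and Manhattan distance. The answer above does not name a specific measure, so these two are chosen only as examples; the function names and the sample points are assumptions for illustration.

```python
import math


def euclidean_distance(a, b):
    """Straight-line distance between two items with the same number of attributes."""
    if len(a) != len(b):
        raise ValueError("items must have the same number of attributes")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan_distance(a, b):
    """Sum of absolute attribute differences between two items."""
    if len(a) != len(b):
        raise ValueError("items must have the same number of attributes")
    return sum(abs(x - y) for x, y in zip(a, b))


# Two hypothetical items described by two numeric attributes each.
item_a = (1.0, 2.0)
item_b = (4.0, 6.0)

euclid = euclidean_distance(item_a, item_b)
manhattan = manhattan_distance(item_a, item_b)
print(euclid, manhattan)
```

A smaller distance means the two items are closer together and therefore more likely to fall in the same cluster.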

Why has data mining gained the attention of the business world?

(Multiple Choice)
4.8/5
(35)

In order to be applied successfully, a data mining study must be viewed as a set of automated software tools and techniques.

(True/False)
4.8/5
(26)

Data mining requires a separate, dedicated database.

(True/False)
4.8/5
(29)

Using existing and relevant data, data mining builds models to identify ________ among the attributes presented in the dataset.

(Short Answer)
4.8/5
(30)

List and briefly describe three of the major types of patterns that data mining attempts to identify.

(Essay)
4.7/5
(38)

Two types of categorical data are nominal data and ordinal data.

(True/False)
4.8/5
(31)

The model's ability to make reasonably accurate predictions, given noisy data or data with missing and erroneous values, is called ________.

(Short Answer)
4.9/5
(34)

Data mining is tightly positioned at the intersection of many disciplines. Those disciplines include all of the following except:

(Multiple Choice)
4.9/5
(35)

Numeric data represent the numeric values of specific variables, which are ________ variables that can take on an infinite number of fractional values.

(Multiple Choice)
4.8/5
(40)

________ data can be readily represented by some sort of probability distribution.

(Short Answer)
4.9/5
(29)

________ data represent the labels of multiple classes used to divide a variable into specific groups.

(Short Answer)
4.9/5
(36)

A common example of interval scale measurement is temperature on the Celsius scale.

(True/False)
4.9/5
(30)

The simple split methodology splits the data into two mutually exclusive subsets called a ________ set and a ________ set.

(Multiple Choice)
4.8/5
(36)

With ________, a fixed number of instances from the original data is sampled (with replacement) for training and the rest of the dataset is used for testing. This process is repeated as many times as desired.

(Short Answer)
4.9/5
(28)
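
The resampling procedure described in the question above can be sketched in a few lines. This is a minimal illustration under assumptions: the function name `resample_split`, the fixed seed, and the toy dataset are all hypothetical, and instances that are never drawn form the test set.

```python
import random


def resample_split(data, sample_size, rng):
    """Sample indices with replacement for training; never-drawn instances are used for testing."""
    indices = [rng.randrange(len(data)) for _ in range(sample_size)]
    drawn = set(indices)
    train = [data[i] for i in indices]
    test = [row for i, row in enumerate(data) if i not in drawn]
    return train, test


rng = random.Random(0)   # fixed seed so the sketch is repeatable
data = list(range(10))   # hypothetical dataset of ten instances

# Repeat the procedure as many times as desired (three times here).
splits = [resample_split(data, sample_size=10, rng=rng) for _ in range(3)]
for train, test in splits:
    print(len(train), len(test))
```

Because sampling is done with replacement, the training set may contain the same instance more than once, and the size of the test set varies from one repetition to the next.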

The first step in the data mining process is to understand the relevant data from the available databases.

(True/False)
4.9/5
(38)

Associations are a type of pattern that discovers time-ordered events, such as predicting that an existing banking customer who already has a checking account will open a savings account followed by an investment account within a year.

(True/False)
4.9/5
(38)

List the four data preprocessing steps.

(Essay)
4.9/5
(36)
Showing 1 - 20 of 69