Exam 9: Predictive Data Mining

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

A test set is the data set used to

(Multiple Choice)
4.8/5
(32)

A(n) __________ is often displayed as a row of values in a spreadsheet or database in which the columns correspond to the variables.

(Multiple Choice)
4.8/5
(36)

__________ involves descriptive statistics, data visualization, and clustering.

(Multiple Choice)
4.9/5
(30)

The y-axis of a decile chart shows

(Multiple Choice)
4.8/5
(30)

__________ is a generalization of linear regression for predicting a categorical outcome variable.

(Multiple Choice)
4.7/5
(35)

A __________ classifies a categorical outcome variable by splitting observations into groups via a sequence of hierarchical rules.

(Multiple Choice)
4.9/5
(37)

__________ is the manipulation of the data with the goal of putting it in a form suitable for formal modeling.

(Multiple Choice)
4.9/5
(39)

A characteristic or quantity of interest that can take on different values is a(n)

(Multiple Choice)
4.9/5
(33)

Estimation methods are also referred to as

(Multiple Choice)
4.9/5
(37)

__________ is a measure of the heterogeneity of observations in a classification tree.

(Multiple Choice)
4.9/5
(27)

The x-axis of a lift chart shows

(Multiple Choice)
4.8/5
(35)

__________ is the step in data mining that includes addressing missing and erroneous data, reducing the number of variables, defining new variables, and data exploration.

(Multiple Choice)
4.9/5
(36)

Data used to build a data mining model is called

(Multiple Choice)
4.8/5
(40)

_________ attempts to classify a categorical outcome as a linear function of explanatory variables.

(Multiple Choice)
4.8/5
(46)

In the k-nearest neighbors method, when the value of k is set to 1

(Multiple Choice)
4.7/5
(36)

Misclassifying an actual __________ observation as a(n) __________ observation is known as a false positive.

(Multiple Choice)
4.7/5
(37)

Which of the following is a commonly used supervised learning method?

(Multiple Choice)
4.7/5
(34)

__________ refers to the scenario in which the analyst builds a model that does a great job of explaining the sample of data on which it is based but fails to accurately predict outside the sample data.

(Multiple Choice)
4.9/5
(33)
Showing 21 - 38 of 38
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)