Exam 10: Introduction to Data Mining

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

is a data-mining technique used for classifying a set of observations into predefined classes.

Free
(Multiple Choice)
4.9/5
(45)
Correct Answer:
Verified

C

Expected confidence assumes independence between the consequent and the antecedent.

Free
(True/False)
4.8/5
(32)
Correct Answer:
Verified

True

Which of the following is true of hierarchical clustering?

Free
(Multiple Choice)
4.8/5
(34)
Correct Answer:
Verified

C

The market share of a business would be considered a lagging measure in the cause-and- effect modeling of data mining.

(True/False)
4.9/5
(32)

Logistic regression is different from discriminant analysis in that logistic regression .

(Multiple Choice)
4.7/5
(34)

Logistic regression cannot be employed when the dependent variable is binary.

(True/False)
4.8/5
(25)

Which of the following is true of logistic regression as a classifying method?

(Multiple Choice)
4.8/5
(40)

Which of the following features of classification, used in Excel, for a particular database will necessarily be coded to a certain value?

(Multiple Choice)
4.7/5
(47)

Which of the following formulas calculates the Euclidean distance between X and Y?

(Multiple Choice)
4.8/5
(31)

When using logistic regression, where p being the probability that the dependent variable Y = 1, X1, X2 ..., Xk are the independent variables, and β0, β1, β2 ..., βk are unknown regression coefficients, is called the odds of belonging to category 1(Y = 1).

(Multiple Choice)
4.9/5
(38)

Which of the following data sets provides the most realistic estimate of the performance of a model on completely unseen data?

(Multiple Choice)
4.9/5
(34)

Validation data sets differ from training data sets in that validation data sets .

(Multiple Choice)
4.9/5
(36)

In cluster analysis, the objects within clusters should exhibit a high amount of dissimilarity.

(True/False)
4.8/5
(32)

How is the strength of an association measured?

(Essay)
4.8/5
(34)

Which of the following uses the sum of squares between the objects in the cluster when measuring their distances?

(Multiple Choice)
4.9/5
(33)

The accuracy of the model on the test data gives a realistic estimate of the performance of the model on completely unseen data.

(True/False)
4.8/5
(40)

A musical instruments retailer has 10,000 point-of-sale transactions out of which 1500 sales included both items of electric guitars and guitar cases, and out of which 750 had sales of new strings. If the electric guitars are considered A, the guitar cases are considered B, and the strings are considered C, then the associate rule for these sales become "If A and B are purchased, then C is also purchased." Calculate the confidence level, expected confidence level, and lift for this rule, given that total transactions for C is 3000.

(Essay)
4.8/5
(37)

How are objects clustered in agglomerative hierarchical clustering?

(Essay)
4.8/5
(31)

Sendstars is a package delivering company that recently made a study on its customer retention and service renewal metrics. They found that most customers defected from using Sendstars' services due to customer dissatisfaction stemming from delivery personnel being rude or ill-mannered. To curb this issue, Sendstars gave special training to its employees in customer service. Which of the following data mining approaches did Sendstars employ when they decided to train their employees in customer care based on the study?

(Multiple Choice)
4.9/5
(33)

Which of the following typically describes the support for the association rule?

(Multiple Choice)
4.8/5
(32)
Showing 1 - 20 of 53
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)