Exam 10: Introduction to Data Mining


A musical instruments retailer has 10,000 point-of-sale transactions. Of these, 1,500 included both electric guitars and guitar cases, and 750 of those 1,500 also included new strings. If the electric guitars are considered A, the guitar cases are considered B, and the strings are considered C, then the association rule for these sales becomes "If A and B are purchased, then C is also purchased." Calculate the confidence, expected confidence, and lift for this rule, given that a total of 3,000 transactions include C.

(Essay)
Correct Answer:

Total sales = 10,000
Sales of A and B = 1,500
Sales of A, B, and C = 750
The association rule "If A and B are purchased, then C is also purchased" has a support of 750 transactions, out of the 1,500 transactions that include the antecedent (A and B).
Confidence = P(consequent | antecedent) = P(antecedent and consequent) / P(antecedent). Therefore the confidence = support / number of transactions that include the antecedent = 750 / 1,500 = 50%.
The total number of transactions for C is given as 3000.
Expected confidence is the number of transactions that include the consequent divided by the total number of transactions.
Therefore the expected confidence level = 3,000 / 10,000 = 30%.
The lift is calculated as confidence / expected confidence = 50% / 30% = 1.67.
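
The arithmetic above can be checked with a short Python sketch. The function name `rule_metrics` and its parameter names are illustrative assumptions, not part of the original answer. The counts are the ones given in the question.

```python
def rule_metrics(n_total, n_antecedent, n_both, n_consequent):
    """Return (confidence, expected confidence, lift) from transaction counts.

    Hypothetical helper for illustration only.
    n_total      -- all transactions
    n_antecedent -- transactions containing every antecedent item (A and B)
    n_both       -- transactions containing antecedent and consequent (A, B, C)
    n_consequent -- transactions containing the consequent (C)
    """
    confidence = n_both / n_antecedent            # P(consequent | antecedent)
    expected_confidence = n_consequent / n_total  # P(consequent)
    lift = confidence / expected_confidence
    return confidence, expected_confidence, lift


confidence, expected, lift = rule_metrics(
    n_total=10_000, n_antecedent=1_500, n_both=750, n_consequent=3_000
)
print(f"confidence = {confidence:.0%}")
print(f"expected confidence = {expected:.0%}")
print(f"lift = {lift:.2f}")
```

A lift above 1, as here, indicates that buyers of A and B purchase C more often than the overall customer base does.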

Which of the following types of data-mining methods provides probabilistic if-then statements?

(Multiple Choice)
Correct Answer:

A

Lagging and leading measures in cause-and-effect modeling are uncorrelated.

(True/False)
Correct Answer:

False

In cluster analysis, the objects within clusters should exhibit a high amount of dissimilarity.

(True/False)

Which of the following is true of hierarchical clustering?

(Multiple Choice)

Spam filtering for e-mails can be seen as an example of which of the following data-mining approaches?

(Multiple Choice)

Which of the following is true of the lift value of an association rule?

(Multiple Choice)

Expected confidence assumes independence between the consequent and the antecedent.

(True/False)

Which of the following is included in the data mining approach of Cluster Analysis?

(Multiple Choice)

In classification, which of the following would be considered as a categorical variable of interest for a credit approval decision for a requester?

(Multiple Choice)

Which of the following is true of the value of k in the k-Nearest Neighbors algorithm?

(Multiple Choice)

Logistic regression cannot be employed when the dependent variable is binary.

(True/False)

How is the strength of an association measured?

(Essay)

_____ is the ratio of the number of transactions that include all items in the consequent as well as the antecedent to the number of transactions that include all items in the antecedent.

(Multiple Choice)

Briefly explain classification as a data-mining tool with an example.

(Essay)

How are objects clustered in agglomerative hierarchical clustering?

(Essay)

The _____ algorithm is a classification scheme that attempts to find records in a database that are similar to one we wish to classify.

(Multiple Choice)

_____ is a data-mining technique used for classifying a set of observations into predefined classes.

(Multiple Choice)

The market share of a business would be considered a lagging measure in the cause-and-effect modeling of data mining.

(True/False)

Which of the following uses the sum of squares between the objects in the cluster when measuring their distances?

(Multiple Choice)