Exam 4: Descriptive Data Mining

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

Observation refers to the

Free
(Multiple Choice)
4.8/5
(35)
Correct Answer:
Verified

B

Complete linkage can be used to measure the distance between clusters that are the __________ in cluster analysis.

Free
(Multiple Choice)
4.9/5
(29)
Correct Answer:
Verified

B

The process of converting a word to its stem, or root word, is referred to as __________.

Free
(Multiple Choice)
4.8/5
(39)
Correct Answer:
Verified

B

Suppose we had a data set of from a call center where customers were asked to choose between the following three options: hear account information, billing questions, and customer service. Using the given order of the three options, and using 0-1 dummy variables to encode the categorical variables, which of the following combinations would yield an entry "customer service"?

(Multiple Choice)
4.8/5
(32)

__________ is a measure that computes the dissimilarity between a cluster AB and a cluster C by averaging the distance between A and C and the distance between B and C.

(Multiple Choice)
4.9/5
(30)

The endpoint of a k-means clustering algorithm occurs when

(Multiple Choice)
4.8/5
(32)

The process of dividing text into separate terms is referred to as __________.

(Multiple Choice)
4.8/5
(38)

Average linkage is a measure of calculating dissimilarity between two clusters by

(Multiple Choice)
4.7/5
(27)

__________ approaches are designed to describe patterns and relationships in large data sets with many observations of many variables.

(Multiple Choice)
4.7/5
(30)

The data preparation technique used in market segmentation to divide consumers into different homogeneous groups is called

(Multiple Choice)
4.9/5
(34)

In which of the following scenarios would it be appropriate to use hierarchical clustering?

(Multiple Choice)
4.8/5
(30)

Jaccard's coefficient is different from the matching coefficient in that the former

(Multiple Choice)
5.0/5
(29)

__________ is a method of calculating dissimilarity between clusters by calculating the distance between the centroids of the two clusters.

(Multiple Choice)
4.7/5
(31)

An analysis of items frequently co-occurring in transactions is known as

(Multiple Choice)
4.9/5
(40)

Suppose the dissimilarity between clusters A and B has the value 24 and the dissimilarity between cluster B and C has the value 12. Use McQuitty's method to determine the dissimilarity of clusters A and B.

(Multiple Choice)
4.9/5
(37)

Hierarchical clustering using __________ results in a sequence of aggregated clusters that minimizes the loss of information between the individual observation level and the cluster level.

(Multiple Choice)
4.8/5
(34)

Complete linkage can be used to measure the distance between _________ in cluster analysis.

(Multiple Choice)
4.8/5
(37)

The lift ratio of an association rule with a confidence value of 0.45 and in which the consequent occurs in 6 out of 10 cases is

(Multiple Choice)
4.8/5
(40)

The strength of the association rule is known as __________ and is calculated as the ratio of the confidence of an association rule to the benchmark confidence.

(Multiple Choice)
4.7/5
(30)

Which statement is true of an association rule?

(Multiple Choice)
4.9/5
(32)
Showing 1 - 20 of 44
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)