Exam 8: Introduction to Data Mining


Which chart is a bar chart displayed in 10 equal-sized intervals, or every 10% of the observations?

(Multiple Choice)
4.9/5
(36)

The following table displays the weights for computing the principal components and the data for two observations. The mean and standard deviation for x1 are 3.60 and 1.70, respectively. What is the z-score of x1 for Observation 1?

(Multiple Choice)
4.8/5
(33)

The following table displays the weights for computing the principal components and the data for two observations. The mean and standard deviation for x1 are 4.2 and 1.9, respectively. The mean and standard deviation for x2 are 6.2 and 4.8, respectively. Compute the first principal component score for Observation 1.

(Multiple Choice)
4.8/5
(39)
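
As a rough illustration of the calculation this question asks for, the sketch below standardizes each variable with its z-score and then takes the weighted sum using the first component's weights. The means and standard deviations come from the question; the observation values and the weights are hypothetical placeholders, because the original table is not reproduced here.

```python
def pc_score(values, means, std_devs, weights):
    """Weighted sum of z-scores: one principal component score."""
    z_scores = [(v - m) / s for v, m, s in zip(values, means, std_devs)]
    return sum(w * z for w, z in zip(weights, z_scores))

means = [4.2, 6.2]       # from the question: means of x1 and x2
std_devs = [1.9, 4.8]    # from the question: standard deviations of x1 and x2

# Hypothetical values; the real ones are in the table that is not shown.
observation_1 = [6.1, 11.0]
pc1_weights = [0.6, 0.8]

score = pc_score(observation_1, means, std_devs, pc1_weights)
print(round(score, 3))
```

With the real table, only `observation_1` and `pc1_weights` would need to change.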

Consider the partial data set in the table, which shows online hours spent shopping by age and income. Using the min-max transformation to normalize Income, what is the average standard deviation of Income for the chart provided? Use the min-max transformation to normalize the observations for Income.

(Multiple Choice)
4.9/5
(39)
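
For readers unfamiliar with the min-max transformation, here is a minimal sketch: each value is rescaled to the range 0 to 1 by subtracting the minimum and dividing by the range. The income figures are hypothetical, since the question's table is not reproduced here.

```python
def min_max_normalize(values):
    """Rescale values to [0, 1] using (x - min) / (max - min)."""
    lowest, highest = min(values), max(values)
    if highest == lowest:
        raise ValueError("all values are identical; the range is zero")
    return [(v - lowest) / (highest - lowest) for v in values]

# Hypothetical incomes, not the values from the question's table.
incomes = [30000, 45000, 60000]
normalized = min_max_normalize(incomes)
print(normalized)  # [0.0, 0.5, 1.0]
```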

Consider the partial data set in the table, which shows online hours spent shopping by age and income. The average and standard deviation for the full data set are $47,667 and $14,292, respectively. Using z-scores to standardize the observations, what is the average standard deviation of Income for the three observations provided?

(Multiple Choice)
4.7/5
(42)

Cross-Industry Standard Process for Data Mining (CRISP-DM) consists of six phases. Of the six, which one represents the phase where data wrangling occurs?

(Multiple Choice)
4.8/5
(30)

When a predictive model is made overly complex to fit the quirks of the given sample data, it is called ______.

(Multiple Choice)
4.8/5
(40)

Using the Euclidean distance between pairwise observations, which pair of observations is most dissimilar?

(Multiple Choice)
4.9/5
(36)
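
One way to approach this kind of question is to compute the Euclidean distance for every pair and pick the largest, since a larger distance means greater dissimilarity. The sketch below does that with `math.dist` (Python 3.8+); the three observations are hypothetical stand-ins for the missing table.

```python
from itertools import combinations
from math import dist

# Hypothetical observations; the question's actual table is not shown.
observations = {
    "A": (1.0, 2.0),
    "B": (4.0, 6.0),
    "C": (1.0, 3.0),
}

# Euclidean distance for every unordered pair of observations.
distances = {
    (a, b): dist(observations[a], observations[b])
    for a, b in combinations(observations, 2)
}

# The pair with the largest distance is the most dissimilar.
most_dissimilar = max(distances, key=distances.get)
print(most_dissimilar)  # ('A', 'B')
```

If the variables are on very different scales, they would normally be standardized or normalized first so that no single variable dominates the distance.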

The process of applying a set of analytical techniques for the development of machine learning and artificial intelligence is called data mining.

(True/False)
4.8/5
(38)

Using the Euclidean distance between pairwise observations, which pair of observations is most dissimilar?

(Multiple Choice)
4.8/5
(37)

A diagram that represents the information in equal-sized intervals, deciles, is called a cumulative lift chart.

(True/False)
4.9/5
(32)

Common applications of unsupervised learning include dimension reduction and prediction model.

(True/False)
5.0/5
(51)

Molly e-mailed her clients offering a free 30-minute massage for referrals. In the following validation set of 100, Class 1 reflects the clients predicted to provide referrals and Class 0 reflects the clients predicted to not provide referrals. Based on the confusion matrix, what was the True Positive (TP) count of current clients who provided referrals for a free massage?

(Multiple Choice)
4.8/5
(38)

When using PCA, all of the following are disadvantages except:

(Multiple Choice)
4.8/5
(44)

Calculate the misclassification rate for the following confusion matrix.

(Multiple Choice)
4.9/5
(43)
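
The misclassification rate is the share of all cases that the model labeled incorrectly, that is, (FP + FN) divided by the total. A minimal sketch follows; the four counts are hypothetical, as the question's confusion matrix is not reproduced here.

```python
def misclassification_rate(tp: int, fn: int, fp: int, tn: int) -> float:
    """Fraction of observations classified incorrectly: (FP + FN) / total."""
    total = tp + fn + fp + tn
    if total == 0:
        raise ValueError("confusion matrix is empty")
    return (fp + fn) / total

# Hypothetical counts for a validation set of 100 observations.
rate = misclassification_rate(tp=30, fn=10, fp=5, tn=55)
print(rate)  # 0.15
```

Accuracy is the complement of this value, 1 minus the misclassification rate.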

Cameron is performing a study on the IQ of groups in various areas. He has calculated that the average IQ of Group A is 108 with a standard deviation of 8. What is the z-score for someone with an IQ of 112?

(Multiple Choice)
4.9/5
(33)
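
The z-score measures how many standard deviations a value lies from the mean: z = (x - mean) / standard deviation. The short sketch below applies that formula to the figures given in the question; the function name is only illustrative.

```python
def z_score(value: float, mean: float, std_dev: float) -> float:
    """Number of standard deviations that `value` lies from `mean`."""
    if std_dev <= 0:
        raise ValueError("std_dev must be positive")
    return (value - mean) / std_dev

# Figures from the question: mean 108, standard deviation 8, IQ 112.
iq_z = z_score(112, 108, 8)
print(iq_z)  # 0.5
```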

Based on the following confusion matrix with a validation set of 100, Class 1 reflects the members targeted who purchased services and Class 0 reflects the non-targeted respondents who did not purchase services. Calculate the specificity rate.

(Multiple Choice)
4.9/5
(33)
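
Specificity is the proportion of actual Class 0 cases that the model correctly predicts as Class 0, that is, TN / (TN + FP). The sketch below derives it from paired actual and predicted labels; the label lists are hypothetical and much smaller than the question's validation set of 100.

```python
def specificity(actual, predicted):
    """True negative rate: TN / (TN + FP), with 0 as the negative class."""
    tn = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 0)
    fp = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 1)
    if tn + fp == 0:
        raise ValueError("no actual Class 0 observations")
    return tn / (tn + fp)

# Hypothetical labels: 1 = purchased services, 0 = did not purchase.
actual = [1, 1, 0, 0, 0, 0, 1, 0]
predicted = [1, 0, 0, 0, 1, 0, 1, 0]

spec = specificity(actual, predicted)
print(spec)  # 0.8
```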

The following table displays the weights for computing the principal components and the data for two observations. The mean and standard deviation for x1 are 4.2 and 1.4, respectively. What is the z-score of x1 for Observation 1?

(Multiple Choice)
4.8/5
(33)

Molly e-mailed her clients offering a free 30-minute massage for referrals. In the following validation set of 100, Class 1 reflects the clients predicted to provide referrals and Class 0 reflects the clients predicted to not provide referrals. Based on the confusion matrix, what was the False Negative (FN) count of current clients who provided referrals for a free massage?

(Multiple Choice)
5.0/5
(31)

Oversampling involves intentionally selecting more samples from one class than from other classes to adjust the class distribution of a data set.

(True/False)
4.9/5
(41)
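
To make the idea of oversampling concrete, the sketch below uses random oversampling with replacement, which is one common variant and not the only one: rows from the rarer class are drawn repeatedly until both classes are the same size. The function name, the row layout, and the data are all hypothetical.

```python
import random

def oversample_minority(rows, label_key, minority_label, seed=0):
    """Duplicate randomly chosen minority rows until classes are balanced."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    minority = [r for r in rows if r[label_key] == minority_label]
    majority = [r for r in rows if r[label_key] != minority_label]
    if not minority:
        raise ValueError("no rows with the minority label")
    extra_needed = max(0, len(majority) - len(minority))
    extra = [rng.choice(minority) for _ in range(extra_needed)]
    return majority + minority + extra

# Hypothetical data set: eight Class 0 rows and two Class 1 rows.
rows = [{"id": i, "label": 0} for i in range(8)]
rows += [{"id": 8, "label": 1}, {"id": 9, "label": 1}]

balanced = oversample_minority(rows, "label", minority_label=1)
class_1_count = sum(1 for r in balanced if r["label"] == 1)
print(len(balanced), class_1_count)  # 16 8
```

Oversampling is usually applied to the training partition only, so that validation results still reflect the original class distribution.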
Showing 21 - 40 of 54