Exam 9: Supervised Data Mining: K-Nearest Neighbors and Naïve Bayes
The chart below is a summary of the main results of a test data set representing the population observed purchasing a virtual digital assistant. What does the accuracy rate indicate? 

(Multiple Choice)
Correct Answer:
C
For a new observation of (0, 0, 0), which observation is the nearest neighbor when k = 1?

(Multiple Choice)
Correct Answer:
D
A problem with the naïve Bayes classifier arises with rare outcomes, whose estimated probability can be 0. To overcome this problem, the algorithm replaces the zero probability with a nonzero value. This technique is called:
(Multiple Choice)
Correct Answer:
B
What is the Euclidean distance between Observation 2 and the origin point of (0, 0, 0)? 

(Multiple Choice)
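The distance and nearest-neighbor questions above rest on the same calculation. The sketch below shows it in Python under stated assumptions: the original table is not reproduced on this page, so the three observations are hypothetical values chosen for illustration, and the function names are not from the source.

```python
import math

def euclidean_distance(point, other):
    """Return the straight-line distance between two equal-length points."""
    if len(point) != len(other):
        raise ValueError("points must have the same number of coordinates")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(point, other)))

# Hypothetical observations (NOT the values from the exam's table).
observations = {
    "Observation 1": (3.0, 4.0, 12.0),
    "Observation 2": (1.0, 2.0, 2.0),
    "Observation 3": (6.0, 8.0, 0.0),
}
origin = (0.0, 0.0, 0.0)

# Distance from each observation to the new point (0, 0, 0).
distances = {name: euclidean_distance(p, origin) for name, p in observations.items()}

# With k = 1, the nearest neighbor is the observation with the smallest distance.
nearest = min(distances, key=distances.get)
print(distances)
print(nearest)
```

With these made-up values the distances are 13, 3, and 10, so the single nearest neighbor is the second observation.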
A new applicant, age 45, is applying for a loan. Using the table below and k = 4, what is the estimated probability that the loan will be approved?

(Multiple Choice)
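A k-nearest-neighbors probability estimate of this kind is the share of the k closest training cases that carry the outcome of interest. The following is a minimal sketch, assuming a single predictor (age) and a made-up training table; the ages, labels, and function name are hypothetical and do not come from the exam's table.

```python
def knn_class_probability(train, new_value, k, target_label):
    """Estimate P(target_label) as the share of the k nearest neighbors with that label.

    `train` is a list of (predictor_value, label) pairs. Ties in distance are
    broken by the original row order because Python's sort is stable.
    """
    if not 1 <= k <= len(train):
        raise ValueError("k must be between 1 and the number of training rows")
    ranked = sorted(train, key=lambda row: abs(row[0] - new_value))
    neighbors = ranked[:k]
    matches = sum(1 for _, label in neighbors if label == target_label)
    return matches / k

# Hypothetical (age, outcome) pairs, for illustration only.
loans = [
    (25, "default"), (30, "default"), (38, "approved"), (42, "approved"),
    (44, "default"), (47, "approved"), (51, "approved"), (60, "default"),
]

p_approved = knn_class_probability(loans, new_value=45, k=4, target_label="approved")
p_default = knn_class_probability(loans, new_value=32, k=3, target_label="default")
print(p_approved, p_default)
```

In this illustrative table the four neighbors closest to age 45 are ages 44, 47, 42, and 51, three of which are approved, so the estimate is 3/4.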
Of the following options, which does not represent the naïve Bayes method?
(Multiple Choice)
A new applicant, age 32, is applying for a loan. Using the table below and k = 3, what is the estimated probability that the applicant will default on the loan?

(Multiple Choice)
In a decile-wise lift chart, what does the lift value of the leftmost bar imply? 

(Multiple Choice)
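The leftmost bar of a decile-wise lift chart compares the top 10% of cases, ranked by predicted probability, against the overall rate of the target class. A rough sketch of that one bar follows; the scores and labels are invented for illustration, and the function name is hypothetical.

```python
def top_decile_lift(labels, scores):
    """Lift of the first decile: target rate in the top 10% divided by the overall rate."""
    if len(labels) != len(scores) or not labels:
        raise ValueError("labels and scores must be non-empty and the same length")
    overall_rate = sum(labels) / len(labels)
    if overall_rate == 0:
        raise ValueError("lift is undefined when there are no target-class cases")
    # Rank cases from the highest predicted probability to the lowest.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    n_top = max(1, len(ranked) // 10)
    top_rate = sum(label for _, label in ranked[:n_top]) / n_top
    return top_rate / overall_rate

# Hypothetical data: 20 cases, 5 of which belong to the target class (1).
scores = [i / 20 for i in range(20, 0, -1)]
labels = [1, 1, 0, 1, 0, 0, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0]

lift = top_decile_lift(labels, scores)
print(lift)
```

Here both cases in the top decile are target-class cases while the overall rate is 25%, so the model captures four times as many target cases in that decile as random selection would.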
What is the Euclidean distance between Observation 1 and the origin point of (0, 0, 0)? 

(Multiple Choice)
The chart below is a summary of the main results of a test data set representing the population observed purchasing a virtual digital assistant. What is the percent of the results that are incorrectly classified? 

(Multiple Choice)
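Accuracy and the misclassification rate both come straight from the four cells of a confusion matrix. The sketch below uses hypothetical counts, since the chart referenced in the questions is not reproduced here.

```python
def accuracy_and_error(tp, tn, fp, fn):
    """Return (accuracy, misclassification rate) from confusion-matrix counts."""
    total = tp + tn + fp + fn
    if total == 0:
        raise ValueError("the confusion matrix is empty")
    accuracy = (tp + tn) / total   # share of cases classified correctly
    error = (fp + fn) / total      # share of cases classified incorrectly
    return accuracy, error

# Hypothetical counts, for illustration only (not the exam's chart).
accuracy, error = accuracy_and_error(tp=40, tn=45, fp=5, fn=10)
print(accuracy, error)
```

The two rates always sum to 1, so either one can be read off from the other.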
The ROC curve below, produced in R, has AUC = 0.9453. It comes from an analysis of whether current basic members at Costco Wholesale will move to an increased membership level. What does the AUC indicate about the prediction of increased membership enrollment among current basic members?

(Multiple Choice)
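One way to read an AUC value is as the probability that a randomly chosen target-class case receives a higher predicted score than a randomly chosen non-target case. The sketch below computes AUC by that pairwise definition; it is not how R produced the 0.9453 figure above, and the labels and scores are made up.

```python
def auc_by_pairs(labels, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count half."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one case from each class")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))

# Hypothetical labels (1 = upgraded membership) and predicted probabilities.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.3, 0.2]

auc = auc_by_pairs(labels, scores)
print(auc)
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation, so values near 1 indicate a model that ranks likely upgraders well above the rest.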
Marta is partitioning her data set into 60% for training and 40% for validation. She first specifies 'Member' as her target variable. What must she include in her code to fix the random seed so that the partition is reproducible?
(Multiple Choice)
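Fixing a seed makes a random partition repeatable: the same seed yields the same split every time. In R this is usually done with set.seed(); the sketch below shows the same idea in Python, with a hypothetical `partition` helper and placeholder row IDs standing in for Marta's data.

```python
import random

def partition(rows, train_share=0.6, seed=1):
    """Shuffle rows with a fixed seed and split them into training and validation sets."""
    rng = random.Random(seed)      # a seeded generator gives a reproducible shuffle
    shuffled = list(rows)          # copy so the caller's data is left untouched
    rng.shuffle(shuffled)
    cut = int(round(train_share * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]

# Placeholder row IDs; the real data set is not shown in the question.
rows = list(range(10))

train, validation = partition(rows, train_share=0.6, seed=1)
train_again, validation_again = partition(rows, train_share=0.6, seed=1)
print(len(train), len(validation))
print(train == train_again)
```

Changing the seed changes which rows land in each set, but not the 60/40 proportions.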
If the performance measures from the training data are considerably higher than the values from the validation and test data, what could be the issue?
(Multiple Choice)
To validate the model on the validation set, Mary calibrates the model's output by examining all possible outcomes of the prediction (true positive, true negative, false positive, false negative). One way to do this is to apply a cutoff value with a function such as ifelse(). These statements are called
(Multiple Choice)
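Applying a cutoff turns predicted probabilities into class labels, which can then be tallied into the four outcomes named in the question. R's ifelse(test, yes, no) does the first step; the sketch below is a Python analogue with made-up probabilities and hypothetical helper names.

```python
def classify(probabilities, cutoff=0.5):
    """Label a case 1 when its predicted probability meets the cutoff, else 0."""
    return [1 if p >= cutoff else 0 for p in probabilities]

def confusion_counts(actual, predicted):
    """Tally true/false positives and negatives for binary labels."""
    counts = {"TP": 0, "TN": 0, "FP": 0, "FN": 0}
    for a, p in zip(actual, predicted):
        if a == 1 and p == 1:
            counts["TP"] += 1
        elif a == 0 and p == 0:
            counts["TN"] += 1
        elif a == 0 and p == 1:
            counts["FP"] += 1
        else:
            counts["FN"] += 1
    return counts

# Hypothetical validation-set values, for illustration only.
probabilities = [0.92, 0.35, 0.61, 0.48, 0.10, 0.77]
actual = [1, 0, 0, 1, 0, 1]

predicted = classify(probabilities, cutoff=0.5)
counts = confusion_counts(actual, predicted)
print(predicted)
print(counts)
```

Raising the cutoff generally trades false positives for false negatives, which is why the choice of cutoff matters when validating a classifier.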
Naïve Bayes classifiers are relatively simple, efficient, and assume dependency among predictors.
(True/False)
This chart determines how well the model performs in terms of sensitivity and specificity.
(Multiple Choice)
The chart below is a summary of the main results of a test data set representing the population observed purchasing a virtual digital assistant. What does the accuracy rate indicate? 

(Multiple Choice)
Mark is reviewing a partial summary of results from a test data set on a small health clinic. With an accuracy of 75% (100 count), a sensitivity of 50%, and a specificity of 100%, can Mark correctly predict the true positive rate to identify those with the flu?
(Multiple Choice)
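Sensitivity is the true positive rate and specificity is the true negative rate. The sketch below assumes that "100 count" means 100 test observations; under that assumption, the stated accuracy, sensitivity, and specificity imply the counts used here (25 true positives, 25 false negatives, 50 true negatives, 0 false positives). The function name is hypothetical.

```python
def sensitivity_specificity(tp, tn, fp, fn):
    """Return (sensitivity, specificity) from confusion-matrix counts."""
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("each actual class needs at least one case")
    sensitivity = tp / (tp + fn)   # share of actual positives correctly identified
    specificity = tn / (tn + fp)   # share of actual negatives correctly identified
    return sensitivity, specificity

# Counts implied by the question's figures, assuming 100 test observations.
tp, fn, tn, fp = 25, 25, 50, 0

sensitivity, specificity = sensitivity_specificity(tp, tn, fp, fn)
accuracy = (tp + tn) / (tp + tn + fp + fn)
print(sensitivity, specificity, accuracy)
```

Under these counts, half of the patients who actually have the flu are missed, even though no healthy patient is flagged by mistake.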
Using the following table, what is the estimate of P(Color = Black), and what is the smoothed estimate of P(Color = Black) when k = 1?

(Multiple Choice)
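Smoothing replaces a zero probability estimate with a small nonzero one. The sketch below assumes the standard additive (Laplace) form, in which k is added to each category count and k times the number of categories is added to the total; the textbook's exact formula may differ, and the color counts are hypothetical because the table is not shown here.

```python
def raw_estimate(counts, category):
    """Unsmoothed relative frequency of one category."""
    total = sum(counts.values())
    return counts[category] / total

def smoothed_estimate(counts, category, k=1):
    """Additive (Laplace) smoothing: (count + k) / (total + k * number_of_categories)."""
    total = sum(counts.values())
    return (counts[category] + k) / (total + k * len(counts))

# Hypothetical color counts, for illustration only.
color_counts = {"Black": 0, "White": 6, "Red": 4}

p_raw = raw_estimate(color_counts, "Black")
p_smoothed = smoothed_estimate(color_counts, "Black", k=1)
print(p_raw, p_smoothed)
```

With these made-up counts the raw estimate is 0 and the smoothed estimate is 1/13, so a color never seen in training no longer forces the whole naïve Bayes product to zero.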
Using the following table, which k should be used in the subsequent calculations? 

(Multiple Choice)