Exam 10: Supervised Data Mining: Decision Trees
Exam 1: Introduction to Business Analytics44 Questions
Exam 2: Data Management and Wrangling46 Questions
Exam 3: Data Visualization and Summary Measures52 Questions
Exam 4: Probability and Probability Distributions50 Questions
Exam 5: Statistical Inference53 Questions
Exam 6: Regression Analysis53 Questions
Exam 7: Advanced Regression Analysis52 Questions
Exam 8: Introduction to Data Mining54 Questions
Exam 9: Supervised Data Mining: K-Nearest Neighbors and Naãve Bayes54 Questions
Exam 10: Supervised Data Mining: Decision Trees51 Questions
Exam 11: Unsupervised Data Mining53 Questions
Exam 12: Forecasting With Time Series Data53 Questions
Exam 13: Introduction to Prescriptive Analytics49 Questions
Select questions type
The following table reflects a partial Analytic Solver's Performance measure for a hotel cost during an NFL game night. What is the MAD implying? 

Free
(Multiple Choice)
4.9/5
(36)
Correct Answer:
D
Using the following pruning table, which tree is the best-pruned tree? 

Free
(Multiple Choice)
4.7/5
(43)
Correct Answer:
A
Decision trees produced by the CART algorithm are binary, meaning that there are two branches for each decision node.
Free
(True/False)
4.9/5
(31)
Correct Answer:
True
Using the following pruning table, which tree is the minimum error tree? 

(Multiple Choice)
4.8/5
(35)
The overall MSE split for Age = 25 is $22,987,111.29 and for Age = 23 is $21,983,723.40. Of the two presented, Age = 25 is slightly higher and has a lower level of impurity for constructing a regression tree.
(True/False)
4.9/5
(30)
In a R complexity parameter table, the xerror column represents:
(Multiple Choice)
4.8/5
(35)
A pure subset contains leaf nodes where cases have contradicting values to the target variable, to enhance the variable case outcomes and allow for further splits.
(True/False)
4.8/5
(36)
Robin wanted to know if the age partition chosen for her data was the best fit for her 30 case, 90% Class 1, 10% Class 0 partition. She completed the Gini impurity index with the results of (Age < 32) = 0.2034 and (Age 32) = 0.2786. What is the weighted combination and what did partition at Age 32 produce?
(Multiple Choice)
4.8/5
(37)
Which tree is the least complex and contains the smallest validation error?
(Multiple Choice)
5.0/5
(36)
Based on the following values for income, what are the possible split points?
{12,665, 15,432, 28,763, 34,876, 43,987, 53,677}
(Multiple Choice)
4.9/5
(36)
Based on the following sorted 20 values for age, what are the possible split points?
{20, 22, 24, 26, 28, 31, 32, 34, 35, 40, 42, 43, 45, 47, 49, 51, 52, 53, 55, 57}
(Multiple Choice)
4.8/5
(30)
Before constructing a decision tree, one of the first steps is identifying possible splits of the predictor variable.
(True/False)
4.8/5
(37)
A split at the $32,000 Income point creates a top and bottom partition. Compute the overall (weighted) Gini index given an Income Split of $32,000. 

(Multiple Choice)
4.8/5
(33)
Using the following sample of a regression prune log, the minimum error tree is decision node # 19 with a standard error of 4.689492 (not shown). Using the information provided, which decision node number represents the best-pruned tree? 

(Multiple Choice)
4.7/5
(38)
The overall MSE split for Age = 24 is $21,987,111.29 and for Age = 23 is $20,983,723.40. Of the two presented, Age = 24 is slightly higher and has a lower level of impurity for constructing a regression tree.
(True/False)
4.8/5
(29)
When a target variable is categorical, the CART algorithm produces a __________ tree to predict the class memberships of new cases.
(Multiple Choice)
5.0/5
(41)
If 80% of the cases belong to Class 0 and 20% belong to Class 1, what is the Gini index?
(Multiple Choice)
4.9/5
(34)
Viewing the results in the following scatterplot, for the 11 cases to the left subset (Age < 40), two belong to Class 1 and nine belong to Class 0. In the right subset (Age ? 40) three belong to Class 1 and one belong to Class 0. What is the Index score for the two subsets? 

(Multiple Choice)
4.9/5
(33)
The following table reflects a partial Analytic Solver's Performance measure for a hotel cost during an NFL game night. What is the MAD implying? 

(Multiple Choice)
4.9/5
(38)
Based on the Gini index, 0.10 implies a higher degree of purity because it is closer to 0 than 0.5.
(True/False)
4.7/5
(36)
Showing 1 - 20 of 51
Filters
- Essay(0)
- Multiple Choice(0)
- Short Answer(0)
- True False(0)
- Matching(0)