Exam 5: Predictive Analytics I: Trees, K-Nearest Neighbors, Naive Bayes,

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

For a sufficiently large value of k, the k-nearest neighbors classification approach will always result in a lower misclassification rate than the simple branch splitting approach of the classification tree.

(True/False)
4.9/5
(41)

An internet service provider (ISP) has randomly selected a sample of 223 observations concerning values of the response variable Churn and several predictor variables based on customer activity during the most recently billed month. Here Churn equals Yes if a customer churned-left the internet service provider for another ISP-and equals No otherwise. The predictor variable MinutesOn is the average daily minutes the customer spent online. EmailSent is the average daily number of emails the customer sent from the email address provided by the ISP. ServCalls is the number of times the customer called for service. Below is part of the classification tree they derived from the data collected in the study. An internet service provider (ISP) has randomly selected a sample of 223 observations concerning values of the response variable Churn and several predictor variables based on customer activity during the most recently billed month. Here Churn equals Yes if a customer churned-left the internet service provider for another ISP-and equals No otherwise. The predictor variable MinutesOn is the average daily minutes the customer spent online. EmailSent is the average daily number of emails the customer sent from the email address provided by the ISP. ServCalls is the number of times the customer called for service. Below is part of the classification tree they derived from the data collected in the study.   Of the 223 sampled customers, what is the sample proportion of those who did not churn? Of the 223 sampled customers, what is the sample proportion of those who did not churn?

(Multiple Choice)
4.8/5
(37)

An internet service provider (ISP) has randomly selected a sample of 223 observations concerning values of the response variable Churn and several predictor variables based on customer activity during the most recently billed month. Here Churn equals Yes if a customer churned-left the internet service provider for another ISP-and equals No otherwise. The predictor variable MinutesOn is the average daily minutes the customer spent online. EmailSent is the average daily number of emails the customer sent from the email address provided by the ISP. ServCalls is the number of times the customer called for service. Below is part of the classification tree they derived from the data collected in the study. An internet service provider (ISP) has randomly selected a sample of 223 observations concerning values of the response variable Churn and several predictor variables based on customer activity during the most recently billed month. Here Churn equals Yes if a customer churned-left the internet service provider for another ISP-and equals No otherwise. The predictor variable MinutesOn is the average daily minutes the customer spent online. EmailSent is the average daily number of emails the customer sent from the email address provided by the ISP. ServCalls is the number of times the customer called for service. Below is part of the classification tree they derived from the data collected in the study.   Of the sampled customers who spent an average of at least 511 minutes online per day, how many did not churn? Of the sampled customers who spent an average of at least 511 minutes online per day, how many did not churn?

(Multiple Choice)
4.8/5
(37)

Suppose that a bank wishes to predict whether or not an existing holder of its Silver credit card will upgrade, for an annual fee, to its Platinum credit card. To do this, the bank carries out a pilot study that randomly selects 40 of its existing Silver card holders and offers each Silver card holder an upgrade to its Platinum card. Here, the response variable Upgrade equals 1 if the Silver card holder decided to upgrade and 0 otherwise. Moreover, the predictor variable Purchases is last year's purchases (in thousands of dollars) by the Silver card holder, and the predictor variable PlatProfile equals 1 if the Silver card holder conforms to the bank's Platinum profile and 0 otherwise. Below is the classification tree they derived from the data collected in the study. Suppose that a bank wishes to predict whether or not an existing holder of its Silver credit card will upgrade, for an annual fee, to its Platinum credit card. To do this, the bank carries out a pilot study that randomly selects 40 of its existing Silver card holders and offers each Silver card holder an upgrade to its Platinum card. Here, the response variable Upgrade equals 1 if the Silver card holder decided to upgrade and 0 otherwise. Moreover, the predictor variable Purchases is last year's purchases (in thousands of dollars) by the Silver card holder, and the predictor variable PlatProfile equals 1 if the Silver card holder conforms to the bank's Platinum profile and 0 otherwise. Below is the classification tree they derived from the data collected in the study.   Assume they classify those with an upgrade probability estimate of at least .5 as upgraders. Based on this classification tree, a member of the study sample who made $50,450 in purchases last year, did not conform to the bank's Platinum profile, and upgraded to the Platinum card would be… Assume they classify those with an upgrade probability estimate of at least .5 as upgraders. Based on this classification tree, a member of the study sample who made $50,450 in purchases last year, did not conform to the bank's Platinum profile, and upgraded to the Platinum card would be…

(Multiple Choice)
4.8/5
(27)

The nearest neighbors to an observation are determined by measuring the distance between the set of predictor variables for that observation and the set of predictor variables for every other observation.

(True/False)
4.9/5
(38)

An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters. An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters.   A potential borrower who has been at their current job for 1 year would like to apply for a loan with payments equaling 21% of their income. To be approved for the loan they would need to be classified as a non-Defaulter. Of the following credit scores, which is the lowest this potential borrower could have to be approved for the loan? A potential borrower who has been at their current job for 1 year would like to apply for a loan with payments equaling 21% of their income. To be approved for the loan they would need to be classified as a non-Defaulter. Of the following credit scores, which is the lowest this potential borrower could have to be approved for the loan?

(Multiple Choice)
4.7/5
(26)

An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters. An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters.   A potential borrower who has been at their current job for 16 years would like to apply for a loan. To be approved for the loan they would need to be classified as a non-Defaulter. Of the following credit scores, which is the lowest this potential borrower could have to be approved for the loan? A potential borrower who has been at their current job for 16 years would like to apply for a loan. To be approved for the loan they would need to be classified as a non-Defaulter. Of the following credit scores, which is the lowest this potential borrower could have to be approved for the loan?

(Multiple Choice)
4.7/5
(45)

An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters. An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters.   Based on this classification tree, a member of the study sample who had a credit score of 667, been at their current job for 3 years, took out a loan with payments equaling 13% of their income, and did not default would be Based on this classification tree, a member of the study sample who had a credit score of 667, been at their current job for 3 years, took out a loan with payments equaling 13% of their income, and did not default would be

(Multiple Choice)
4.8/5
(37)

A classification tree is useful for predicting a quantitative response variable.

(True/False)
4.9/5
(34)

The naive Bayes' classification procedure can be used to predict

(Multiple Choice)
4.7/5
(29)

An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters. An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters.   A potential borrower with a credit score of 724 who has been at their current job for 3 years and has a monthly salary of $6,000 would like to apply for a loan. To be approved for the loan they would need to be classified as a non-Defaulter. Of the following monthly payments, which is the highest this loan could have to be approved for this potential borrower? A potential borrower with a credit score of 724 who has been at their current job for 3 years and has a monthly salary of $6,000 would like to apply for a loan. To be approved for the loan they would need to be classified as a non-Defaulter. Of the following monthly payments, which is the highest this loan could have to be approved for this potential borrower?

(Multiple Choice)
4.9/5
(42)

An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters. An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters.   Based on this classification tree, a member of the study sample who had a credit score of 819, been at their current job for 3 years, took out a loan with payments equaling 15% of their income, and did not default would be Based on this classification tree, a member of the study sample who had a credit score of 819, been at their current job for 3 years, took out a loan with payments equaling 15% of their income, and did not default would be

(Multiple Choice)
4.8/5
(30)

The confusion matrix is not a good indicator of a classification tree's accuracy.

(True/False)
4.9/5
(35)

Because different classification techniques will perform better for different data sets, ensemble models consider multiple classification techniques before selecting the best classification technique to use for a particular data set.

(True/False)
4.8/5
(39)

An MBA admissions officer wishes to predict an MBA applicant's grade point average (GPA) for the MBA program on the basis of the applicant's score on the Graduate Management Admission Test (GMAT) and their undergraduate GPA (UGPA). The admissions officer used a random sample of previously admitted applicants to build a regression tree that can be used to predict the MBA GPAs of future MBA students. Below is the final regression tree. An MBA admissions officer wishes to predict an MBA applicant's grade point average (GPA) for the MBA program on the basis of the applicant's score on the Graduate Management Admission Test (GMAT) and their undergraduate GPA (UGPA). The admissions officer used a random sample of previously admitted applicants to build a regression tree that can be used to predict the MBA GPAs of future MBA students. Below is the final regression tree.   The school awards an MBA Scholarship to admitted applicants who it predicts will earn a GPA of 3.7 or higher in the MBA program. An MBA applicant has an undergraduate GPA of 3.21. Based on this regression tree, which of the following GMAT scores is the lowest this applicant can earn to qualify for the Dean's Scholarship? The school awards an MBA Scholarship to admitted applicants who it predicts will earn a GPA of 3.7 or higher in the MBA program. An MBA applicant has an undergraduate GPA of 3.21. Based on this regression tree, which of the following GMAT scores is the lowest this applicant can earn to qualify for the Dean's Scholarship?

(Multiple Choice)
4.8/5
(35)

A cable television company has randomly selected a sample of 222 Basic package customers for a marketing test to see which customers are more likely to upgrade to the Premium package. They monitored two predictor variables based on customer activity during the most recently billed month. Then they included a special upgrade offer for the Premium package along with their bill. The response variable Upgrade equals True if a customer accepted the offer to upgrade to the Premium package and equals False otherwise. The predictor variable ViewTime is the average daily minutes the customer had at least one TV on in their house. Network is 1 if the TV is tuned to a traditional ("over the public airwaves") broadcast network at least 50 percent of the time that it is turned on and 0 otherwise. Let the events U, Uc, HV, and NV denote the events that the randomly selected Basic customer, respectively, upgraded, did not upgrade, had a high ViewTime [a view time greater than the median amount of 507 minutes], and was primarily a traditional broadcast network viewer [i.e., spent at least 50% of time tuned to a traditional network]. The data they collected show: 1) 43 out of 222 Basic customers upgraded, or P(U) = 43/222. 2) 179 out of 222 Basic customers did not upgrade, or P(Uc) = 179/222. 3) 37 out of 43 upgraders had a high ViewTime, or P(HV|U) = 37/43. 4) 63 out of 179 non-upgraders had a high ViewTime, or P(HV|Uc) = 63/179. "5) 7 out of 43 upgraders was primarily a traditional broadcast network viewer, Or P(NV|U) = 7/43." "6) 52 out of 179 non-upgraders was primarily a traditional broadcast network viewer, Or P(NV|Uc) = 52/179. Using naive Bayes' Theorem, what is the approximate probability that a Basic customer will upgrade if they do not have a high ViewTime and primarily view traditional broadcast networks?"

(Multiple Choice)
4.9/5
(29)

An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters. An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters.   Based on this classification tree, a member of the study sample who had a credit score of 537, been at their current job for 12 years, took out a loan with payments equaling 16% of their income, and defaulted would be Based on this classification tree, a member of the study sample who had a credit score of 537, been at their current job for 12 years, took out a loan with payments equaling 16% of their income, and defaulted would be

(Multiple Choice)
4.7/5
(27)

Suppose that a bank wishes to predict whether or not an existing holder of its Silver credit card will upgrade, for an annual fee, to its Platinum credit card. To do this, the bank carries out a pilot study that randomly selects 40 of its existing Silver card holders and offers each Silver card holder an upgrade to its Platinum card. Here, the response variable Upgrade equals 1 if the Silver card holder decided to upgrade and 0 otherwise. Moreover, the predictor variable Purchases is last year's purchases (in thousands of dollars) by the Silver card holder, and the predictor variable PlatProfile equals 1 if the Silver card holder conforms to the bank's Platinum profile and 0 otherwise. Below is the classification tree they derived from the data collected in the study. Suppose that a bank wishes to predict whether or not an existing holder of its Silver credit card will upgrade, for an annual fee, to its Platinum credit card. To do this, the bank carries out a pilot study that randomly selects 40 of its existing Silver card holders and offers each Silver card holder an upgrade to its Platinum card. Here, the response variable Upgrade equals 1 if the Silver card holder decided to upgrade and 0 otherwise. Moreover, the predictor variable Purchases is last year's purchases (in thousands of dollars) by the Silver card holder, and the predictor variable PlatProfile equals 1 if the Silver card holder conforms to the bank's Platinum profile and 0 otherwise. Below is the classification tree they derived from the data collected in the study.   Assume they classify those with an upgrade probability estimate of at least .5 as upgraders. Based on this classification tree, a member of the study sample who made $28,520 in purchases last year and conformed to the bank's Platinum profile but did not upgrade to the Platinum card would be… Assume they classify those with an upgrade probability estimate of at least .5 as upgraders. Based on this classification tree, a member of the study sample who made $28,520 in purchases last year and conformed to the bank's Platinum profile but did not upgrade to the Platinum card would be…

(Multiple Choice)
4.8/5
(34)

An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters. An automobile finance company analyzed a sample of recent automobile loans to try to determine key factors in identifying borrowers who would be likely to default on their auto loan. The response variable Default equals 1 if the borrower defaulted during the term of the loan and 0 otherwise. The predictor variable AutoDebt% was the ratio (expressed as a percent) of the required loan payments to the borrower's take-home income at the time of purchase. JobTime was the number of years the borrower had worked at their current job at the time of purchase. CredScore was the borrower's credit score at the time of purchase. Below is part of the classification tree the finance company derived from the data collected in the study. Assume they classify those with a default probability estimate of at least .5 as Defaulters.   A potential borrower with a credit score of 668 who has just started their current job is applying for a loan with payments equaling 7% of their income. Based on this classification tree, the best estimate of the probability that this loan applicant would default would be ________. A potential borrower with a credit score of 668 who has just started their current job is applying for a loan with payments equaling 7% of their income. Based on this classification tree, the best estimate of the probability that this loan applicant would default would be ________.

(Multiple Choice)
4.8/5
(30)

Which of the following possible response variables is most appropriate to predict using a classification tree?

(Multiple Choice)
4.9/5
(46)
Showing 41 - 60 of 101
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)