Exam 17: Regression Models With Dummy Variables
Exam 1: Statistics and Data102 Questions
Exam 2: Tabular and Graphical Methods123 Questions
Exam 3: Numerical Descriptive Measures152 Questions
Exam 4: Introduction to Probability148 Questions
Exam 5: Discrete Probability Distributions158 Questions
Exam 6: Continuous Probability Distributions143 Questions
Exam 7: Sampling and Sampling Distributions136 Questions
Exam 8: Interval Estimation131 Questions
Exam 9: Hypothesis Testing116 Questions
Exam 10: Statistical Inference Concerning Two Populations131 Questions
Exam 11: Statistical Inference Concerning Variance120 Questions
Exam 12: Chi-Square Tests120 Questions
Exam 13: Analysis of Variance120 Questions
Exam 14: Regression Analysis140 Questions
Exam 15: Inference With Regression Models125 Questions
Exam 16: Regression Models for Nonlinear Relationships118 Questions
Exam 17: Regression Models With Dummy Variables130 Questions
Exam 18: Time Series and Forecasting125 Questions
Exam 19: Returns, Index Numbers, and Inflation120 Questions
Exam 20: Nonparametric Tests120 Questions
Select questions type
A bank manager is interested in assigning a rating to the holders of credit cards issued by her bank. The rating is based on the probability of defaulting on credit cards and is as follows.
To estimate this probability, she decided to use the logit model,
P =
, where
y = a binary response variable which is 1 if the credit card is in default and 0 otherwise
x1 = the ratio of the credit card balance to the credit card limit (in %)
x2 = the ratio of the total debt to the annual income (in %)
The following output is obtained.
Note: The p-values of the corresponding tests are shown in parentheses below the estimated coefficients.
What will be the rating for a person with a balance ratio of 15% and a debt ratio of 30%?



(Short Answer)
4.9/5
(41)
The major advantage of a logit model over a linear probability model is that the predicted values of y are always between ________.
(Multiple Choice)
4.7/5
(29)
Consider the following regression model: Humidity = β0 + β1Temperature + β2Spring + β3Summer + β4Fall + β5Rain + ε, where the dummy variables Spring, Summer, and Fall represent the qualitative variable Season (spring, summer, fall, winter), and the dummy variable Rain is defined as Rain = 1 if rainy day, Rain = 0 otherwise.
What is the regression equation for the summer rainy days?
(Multiple Choice)
4.9/5
(35)
A researcher has developed the following regression equation to predict the prices of luxurious Oceanside condominium units,
= 40 + 0.15Size + 50View, where Price is the price of a unit (in $1,000s), Size is the square footage (in sq. feet), and View is a dummy variable taking on 1 for an ocean view unit and 0 for a bay view unit. Which of the following is the difference in predicted prices of the ocean view and bay view units with the same square footage?

(Multiple Choice)
4.8/5
(39)
For the logit model, the predicted values of the response variables can always be interpreted as probabilities.
(True/False)
4.8/5
(35)
A medical researcher is interested in assessing the probability of some symptoms of coronary heart disease being present at a given age. For this reason, he uses the following data collected on 100 individuals.
The response variable is CHD which is 1 if coronary heart disease is present and 0 otherwise. The following linear probability model and logit model have been estimated: Linear probability:
= - 0.53796 + 0.02181Age
Logit:
=
Using the logit model, what is the estimated probability of coronary heart disease for a 60-year-old individual?




(Multiple Choice)
4.9/5
(38)
According to the Center for Disease Control and Prevention, life expectancy at age 65 in America is about 18.7 years. Medical researchers have argued that while excessive drinking is detrimental to health, drinking a little alcohol every day, especially wine, may be associated with the increase in life expectancy. Others have also linked longevity with income and gender. The data relating to the length of life after 65 (Life) were collected. Data were collected related to the length of life after 65 (Life) including the average income (in $1,000s) (Income), the average number of alcoholic drinks consumed per day (Drinks), and a dummy variable Woman which is 1 when the individual is a woman and 0 otherwise. The following model was created: Life = β0 + β1Income + β2Drinks + β3Woman + ε. Output corresponding to this model is shown below.
Using a p-value approach, which of the following conclusions is correct at 5% significance level?

(Multiple Choice)
4.7/5
(37)
A realtor wants to predict and compare the prices of homes in three neighboring locations. She considers the following linear models:
Model A: Price = β0 + β1 Size + β2 Age + ε
Model B: Price = β0 + β1 Size + β3 Loc1 + β4 Loc2 + ε
Model C: Price = β0 + β1 Size + β2 Age + β3 Loc1 + β4 Loc2 + ε
where,
Price = the price of a home (in $1,000s)
Size = the square footage (in sq. feet)
Loc1 = a dummy variable taking on 1 for Location 1, and 0 otherwise
Loc2 = a dummy variable taking on 1 for Location 2, and 0 otherwise
After collecting data on 52 sales and applying regression, her findings were summarized in the following table.
Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients.
Suppose that at the 10% significance level, you do not reject the null hypothesis, H0: β2 = 0, when testing the significance of Age in Model C. Would you delete Age from Model C?

(Essay)
4.9/5
(35)
An over-the-counter drug manufacturer wants to examine the effectiveness of a new drug in curing an illness most commonly found in older patients. Thirteen patients are given the new drug and 13 patients are given the old drug. To avoid bias in the experiment, they are not told which drug is given to them. To check how the effectiveness depends on the age of patients, the following data have been collected.
The variables are Effectiveness = the response variable measured on a scale from 0 to 100,
Age = the age of a patient (in years),
Drug = a dummy variable that is 1 for the new drug and 0 for the old drug.
The regression model, Effectiveness = β0 + β1Age + β2Drug + β3Age × Drug, is estimated and the following results are obtained.
Which of the following is the estimated regression model for the new drug?





(Multiple Choice)
4.8/5
(33)
For the model y = β0 + β1x + β2xd + ε, which of the following are the hypotheses for testing the individual significance of the interaction variable xd?
(Multiple Choice)
4.9/5
(42)
Which of the following predictions cannot be described by a binary choice model?
(Multiple Choice)
4.9/5
(29)
In the model y = β0 + β1x + β2d + β3xd + ε, for a given x and d = 1, the predicted value of y is given by ________.
(Multiple Choice)
4.8/5
(36)
A researcher wants to examine how the remaining balance on $100,000 loans taken 10 to 20 years ago depends on whether the loan was a prime or subprime loan. He collected a sample of 25 prime loans and 25 subprime loans and recorded the data in the following variables: Balance = the remaining amount of loan to be paid off (in $),
Time = the time elapsed from taking the loan,
Prime = a dummy variable assuming 1 for prime loans, and 0 for subprime loans.
The regression results obtained for the models:
Model A: Balance = β0 + β1Prime + ε
Model B: Balance = β0 + β1Time + β2Prime + β3Time × Prime + ε
Model C: Balance = β0 + β1Prime + β2Time × Prime + ε,
Are summarized in the following table.
Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients.
Which of the following is the p-value for testing the individual significance of Time in Model B?

(Multiple Choice)
4.7/5
(24)
A bank manager is interested in assigning a rating to the holders of credit cards issued by her bank. The rating is based on the probability of defaulting on credit cards and is as follows.
To estimate this probability, she decided to use the logit model,
P =
, where
y = a binary response variable which is 1 if the credit card is in default and 0 otherwise
x1 = the ratio of the credit card balance to the credit card limit (in %)
x2 = the ratio of the total debt to the annual income (in %)
The following output is obtained.
Note: The p-values of the corresponding tests are shown in parentheses below the estimated coefficients.
(Using Excel or R) Suppose that only applicants with excellent and good ratings are qualified for a loan. Assume that the balance ratio, x1, of those who apply is normally distributed with μ1 = 18% and σ2 = 6%, while their debt ratio, x2, is normally distributed with μ2 = 30% and σ2 = 8%. Assume also that x1 and x2 are independent. Simulate 1,000 applications to estimate the percent of those that are qualified for a loan.



(Short Answer)
4.9/5
(35)
To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected and the following variables considered:
Salary = the monthly salary (excluding fringe benefits and bonuses)
Educ = the number of years of education
Exper = the number of months of experience
Gender = the gender of an individual; 1 for males, and 0 for females
The regression results obtained for the models are as follows:
Model A: Salary = β0 + β1 Educ + β2 Exper + β3 Gender + β4 Exper × Gender + ε,
Model B: Salary = β0 + β1 Educ + β2 Exper + ε, are summarized in the following table.
Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients.
Define the alternative hypothesis for testing the joint significance of Exper and Exper × Gender in Model A?

(Essay)
4.9/5
(27)
To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected and the following variables considered:
Salary = the monthly salary (excluding fringe benefits and bonuses)
Educ = the number of years of education
Exper = the number of months of experience
Gender = the gender of an individual; 1 for males, and 0 for females
The regression results obtained for the models are as follows:
Model A: Salary = β0 + β1 Educ + β2 Exper + β3 Gender + β4 Exper × Gender + ε,
Model B: Salary = β0 + β1 Educ + β2 Exper + ε, are summarized in the following table.
Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients.
What is the value of the test statistic for testing the joint significance of Exper and Exper × Gender in Model A?

(Short Answer)
4.7/5
(29)
Like any other university, Seton Hall University uses SAT scores and high school GPAs as primary criteria for admission. Data for 30 students who had recently applied to Seton Hall were collected including admission (1 for admission and 0 otherwise), SAT score, and GPA. Output for the model is shows below.
Which of the following statements is correct about the individual significance of the variables at the 5% significance level?

(Multiple Choice)
4.9/5
(37)
Which of the following regression models does not include an interaction variable?
(Multiple Choice)
4.9/5
(31)
To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses),
Educ = the number of years of education,
Exper = the number of months of experience,
Train = the number of weeks of training,
Gender = the gender of an individual; 1 for males, and 0 for females.
Excel partial outputs corresponding to these models are available and shown below.
Model A: Salary = β0 + β1Educ + β2Exper + β3Train + β4Gender + ε
Model B: Salary = β0 + β1Educ + β2Exper + β3Gender + ε
Using Model A, which of the following is the estimated average difference between the salaries of male and female employees with the same years of education, months of experience, and weeks of training?


(Multiple Choice)
4.8/5
(37)
A realtor wants to predict and compare the prices of homes in three neighboring locations. She considers the following linear models:
Model A: Price = β0 + β1 Size + β2 Age + ε
Model B: Price = β0 + β1 Size + β3 Loc1 + β4 Loc2 + ε
Model C: Price = β0 + β1 Size + β2 Age + β3 Loc1 + β4 Loc2 + ε
where,
Price = the price of a home (in $1,000s)
Size = the square footage (in sq. feet)
Loc1 = a dummy variable taking on 1 for Location 1, and 0 otherwise
Loc2 = a dummy variable taking on 1 for Location 2, and 0 otherwise
After collecting data on 52 sales and applying regression, her findings were summarized in the following table.
Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients.
What is the regression equation for model B?

(Essay)
4.8/5
(33)
Showing 21 - 40 of 130
Filters
- Essay(0)
- Multiple Choice(0)
- Short Answer(0)
- True False(0)
- Matching(0)