Exam 17: Regression Models With Dummy Variables
Exam 1: Statistics and Data102 Questions
Exam 2: Tabular and Graphical Methods123 Questions
Exam 3: Numerical Descriptive Measures152 Questions
Exam 4: Introduction to Probability148 Questions
Exam 5: Discrete Probability Distributions158 Questions
Exam 6: Continuous Probability Distributions143 Questions
Exam 7: Sampling and Sampling Distributions136 Questions
Exam 8: Interval Estimation131 Questions
Exam 9: Hypothesis Testing116 Questions
Exam 10: Statistical Inference Concerning Two Populations131 Questions
Exam 11: Statistical Inference Concerning Variance120 Questions
Exam 12: Chi-Square Tests120 Questions
Exam 13: Analysis of Variance120 Questions
Exam 14: Regression Analysis140 Questions
Exam 15: Inference With Regression Models125 Questions
Exam 16: Regression Models for Nonlinear Relationships118 Questions
Exam 17: Regression Models With Dummy Variables130 Questions
Exam 18: Time Series and Forecasting125 Questions
Exam 19: Returns, Index Numbers, and Inflation120 Questions
Exam 20: Nonparametric Tests120 Questions
Select questions type
In a model y = β0 + β1x + β2d + β3xd + ε, the ________ F test for the joint significance of d and xd can be performed.
(Short Answer)
5.0/5
(38)
An over-the-counter drug manufacturer wants to examine the effectiveness of a new drug in curing an illness most commonly found in older patients. Thirteen patients are given the new drug and 13 patients are given the old drug. To avoid bias in the experiment, they are not told which drug is given to them. To check how the effectiveness depends on the age of patients, the following data have been collected.
The variables are Effectiveness = the response variable measured on a scale from 0 to 100,
Age = the age of a patient (in years),
Drug = a binary variable with 1 for the new drug and 0 for the old drug.
The regression model, Effectiveness = β0 + β1Age + β2Drug + β3Age × Drug, is estimated and the following results are obtained.
Which of the following is the estimated regression model?





(Multiple Choice)
4.9/5
(25)
A researcher wants to examine how the remaining balance on $100,000 loans taken 10 to 20 years ago depends on whether the loan was a prime or subprime loan. He collected a sample of 25 prime loans and 25 subprime loans and recorded the data in the following variables: Balance = the remaining amount of loan to be paid off (in $),
Time = the time elapsed from taking the loan,
Prime = a dummy variable assuming 1 for prime loans, and 0 for subprime loans.
The regression results obtained for the models:
Model A: Balance = β0 + β1Prime + ε
Model B: Balance = β0 + β1Time + β2Prime + β3Time × Prime + ε
Model C: Balance = β0 + β1Prime + β2Time × Prime + ε,
Are summarized in the following table.
Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients.
Suppose that at the 5% significance level you do not reject the null hypothesis, H0: β1 = β3 = 0, when testing the joint significance of Time and Time × Prime in Model B. Should you delete both Time and Time × Prime from Model B?

(Multiple Choice)
4.9/5
(39)
The number of dummy variables representing a qualitative variable should be one less than the number of categories of the variable.
(True/False)
4.9/5
(38)
A medical researcher is interested in assessing the probability of some symptoms of coronary heart disease (CHD) being present at a given age. For this reason, he uses the following data collected on 100 individuals.
The response variable is CHD which is 1 if coronary heart disease is present and 0 otherwise. The following linear probability model and logit model have been estimated: Linear probability:
= - 0.53796 + 0.02181Age
Logit:
=
Using the linear probability model, what is the estimated probability of coronary heart disease for a 60-year-old individual?




(Multiple Choice)
4.7/5
(26)
To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses),
Educ = the number of years of education,
Exper = the number of months of experience,
Train = the number of weeks of training,
Gender = the gender of an individual; 1 for males, and 0 for females.
Excel partial outputs corresponding to these models are available and shown below.
Model A: Salary = β0 + β1Educ + β2Exper + β3Train + β4Gender + ε
Model B: Salary = β0 + β1Educ + β2Exper + β3Gender + ε
Using Model B, what is the regression equation found by Excel for females?


(Multiple Choice)
4.8/5
(31)
A logit model can be estimated with the method of ________.
(Multiple Choice)
4.9/5
(33)
Consider the model y = β0 + β1x + β2d + ε, where x is a quantitative variable and d is a dummy variable. When d = 1, the predicted value of y ________.
(Multiple Choice)
4.9/5
(21)
For the model y = β0 + β1x + β2d + β3xd+ε, in which d is a dummy variable, we cannot perform the F test for the joint significance of the dummy variable d and the interaction variable xd.
(True/False)
4.8/5
(29)
A logit model ensures that the predicted probability of the binary response variable falls between ________.
(Short Answer)
4.9/5
(33)
Suppose that we have a qualitative variable Month with categories: January, February, etc. How many dummy variables are needed to describe Month?
(Multiple Choice)
4.7/5
(30)
Consider the following regression model: Humidity = β0 + β1Temperature + β2Spring + β3Summer + β4Fall + β5Rain + ε, where the dummy variables Spring, Summer, and Fall represent the qualitative variable Season (spring, summer, fall, winter), and the dummy variable Rain is defined as Rain = 1 if rainy day, Rain = 0 otherwise.
What is the regression equation for the summer days?
(Multiple Choice)
4.8/5
(42)
A bank manager is interested in assigning a rating to the holders of credit cards issued by her bank. The rating is based on the probability of defaulting on credit cards and is as follows.
To estimate this probability, she decided to use the logit model,
P =
, where
y = a binary response variable which is 1 if the credit card is in default and 0 otherwise
x1 = the ratio of the credit card balance to the credit card limit (in %)
x2 = the ratio of the total debt to the annual income (in %)
The following output is obtained.
Note: The p-values of the corresponding tests are shown in parentheses below the estimated coefficients.
Assuming the debt ratio is 30%, what is the range of values for the balance ratio that yields the fair rating?



(Short Answer)
4.9/5
(33)
If the number of dummy variables representing a qualitative variable equals the number of categories of this variable, one deals with the problem of perfect multicollinearity.
(True/False)
4.9/5
(40)
To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses),
Educ = the number of years of education,
Exper = the number of months of experience,
Train = the number of weeks of training,
Gender = the gender of an individual; 1 for males, and 0 for females.
Excel partial outputs corresponding to these models are available and shown below.
Model A: Salary = β0 + β1Educ + β2Exper + β3Train + β4Gender + ε
Model B: Salary = β0 + β1Educ + β2Exper + β3Gender + ε
A group of female managers considers a discrimination lawsuit if on average their salaries can be statistically proven to be lower by more than $500 than the salaries of their male peers with the same level of education and experience. Using Model B, what is the alternative hypothesis for testing the lawsuit condition?


(Multiple Choice)
5.0/5
(36)
For the model y = β0 + β1x + β2d + ε, which test is used for testing the significance of a dummy variable d?
(Multiple Choice)
4.9/5
(28)
To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses),
Educ = the number of years of education,
Exper = the number of months of experience,
Train = the number of weeks of training,
Gender = the gender of an individual; 1 for males, and 0 for females.
The Excel partial outputs corresponding to these models are available and shown below.
Model A: Salary = β0 + β1Educ + β2Exper + β3Train + β4Gender + ε
Model B: Salary = β0 + β1Educ + β2Exper + β3Gender + ε
A group of female managers considers a discrimination lawsuit if on average their salaries could be statistically proven to be lower by more than $500 than the salaries of their male peers with the same level of education and experience. Using Model B, what is the conclusion of the appropriate test at 10% significance level?


(Multiple Choice)
4.8/5
(34)
A model y = β0 + β1x + ε, in which y is a binary variable, is called a linear probability model.
(True/False)
4.9/5
(38)
Regression models that use a dummy variable as the response variable are called binary or discrete ________ models.
(Short Answer)
4.8/5
(33)
A medical researcher is interested in assessing the probability of some symptoms of coronary heart disease being present at a given age. For this reason, he uses the following data collected on 100 individuals.
The response variable is CHD which is 1 if coronary heart disease is present and 0 otherwise. The following linear probability model and logit model have been estimated: Linear probability:
= - 0.53796 + 0.02181Age
Logit:
=
Using the linear probability model, find the age of a person for which the estimated chances of having coronary heart disease are 50%?




(Multiple Choice)
4.8/5
(40)
Showing 61 - 80 of 130
Filters
- Essay(0)
- Multiple Choice(0)
- Short Answer(0)
- True False(0)
- Matching(0)