Exam 17: Regression Models With Dummy Variables

arrow
  • Select Tags
search iconSearch Question
  • Select Tags

To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses), Educ = the number of years of education, Exper = the number of months of experience, Train = the number of weeks of training, Gender = the gender of an individual; 1 for males, and 0 for females. Excel partial outputs corresponding to these models are available and shown below. Model A: Salary = β0 + β1Educ + β2Exper + β3Train + β4Gender + ε To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses), Educ = the number of years of education, Exper = the number of months of experience, Train = the number of weeks of training, Gender = the gender of an individual; 1 for males, and 0 for females. Excel partial outputs corresponding to these models are available and shown below. Model A: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Train + β<sub>4</sub>Gender + ε     Model B: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Gender + ε     The variable Train is deleted from Model A which results in Model B. Which of the following justifies this choice? To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses), Educ = the number of years of education, Exper = the number of months of experience, Train = the number of weeks of training, Gender = the gender of an individual; 1 for males, and 0 for females. Excel partial outputs corresponding to these models are available and shown below. Model A: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Train + β<sub>4</sub>Gender + ε     Model B: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Gender + ε     The variable Train is deleted from Model A which results in Model B. Which of the following justifies this choice? Model B: Salary = β0 + β1Educ + β2Exper + β3Gender + ε To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses), Educ = the number of years of education, Exper = the number of months of experience, Train = the number of weeks of training, Gender = the gender of an individual; 1 for males, and 0 for females. Excel partial outputs corresponding to these models are available and shown below. Model A: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Train + β<sub>4</sub>Gender + ε     Model B: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Gender + ε     The variable Train is deleted from Model A which results in Model B. Which of the following justifies this choice? To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses), Educ = the number of years of education, Exper = the number of months of experience, Train = the number of weeks of training, Gender = the gender of an individual; 1 for males, and 0 for females. Excel partial outputs corresponding to these models are available and shown below. Model A: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Train + β<sub>4</sub>Gender + ε     Model B: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Gender + ε     The variable Train is deleted from Model A which results in Model B. Which of the following justifies this choice? The variable Train is deleted from Model A which results in Model B. Which of the following justifies this choice?

Free
(Multiple Choice)
4.8/5
(43)
Correct Answer:
Verified

D

Using a ________ we can determine if a particular dummy variable is statistically significant.

Free
(Short Answer)
4.9/5
(43)
Correct Answer:
Verified

t test

According to the Center for Disease Control and Prevention, life expectancy at age 65 in America is about 18.7 years. Medical researchers have argued that while excessive drinking is detrimental to health, drinking a little alcohol every day, especially wine, may be associated with an increase in life expectancy. Others have also linked longevity with income and gender. Data were collected related to the length of life after 65 (Life) were collected including the average income (in $1,000s) (Income), the average number of alcoholic drinks consumed per day (Drinks), and a dummy variable Woman which is 1 when the individual is a woman and 0 otherwise. The following model was created: Life = β0 + β1Income + β2Drinks + β3Woman + ε. Output corresponding to this model is shown below. According to the Center for Disease Control and Prevention, life expectancy at age 65 in America is about 18.7 years. Medical researchers have argued that while excessive drinking is detrimental to health, drinking a little alcohol every day, especially wine, may be associated with an increase in life expectancy. Others have also linked longevity with income and gender. Data were collected related to the length of life after 65 (Life) were collected including the average income (in $1,000s) (Income), the average number of alcoholic drinks consumed per day (Drinks), and a dummy variable Woman which is 1 when the individual is a woman and 0 otherwise. The following model was created: Life = β<sub>0</sub> + β<sub>1</sub>Income + β<sub>2</sub>Drinks + β<sub>3</sub>Woman + ε. Output corresponding to this model is shown below.   Which of the following is the predicted life expectancy for a 65-year-old man with an income of $50,000 and a consumption of three drinks per day? Which of the following is the predicted life expectancy for a 65-year-old man with an income of $50,000 and a consumption of three drinks per day?

Free
(Multiple Choice)
4.9/5
(31)
Correct Answer:
Verified

B

To encourage performance, loyalty, and continuing education, the human resources department at a large company wants to develop a regression-based compensation model for mid-level managers based on three variables: business unit-profitability in $1,000s (Profit), years with company (Years), and a dummy variable (Grad) which takes on 1 when the individual holds a graduate degree in a relevant field and 0 otherwise. Data have been collected for 36 managers. The sample regression equation is To encourage performance, loyalty, and continuing education, the human resources department at a large company wants to develop a regression-based compensation model for mid-level managers based on three variables: business unit-profitability in $1,000s (Profit), years with company (Years), and a dummy variable (Grad) which takes on 1 when the individual holds a graduate degree in a relevant field and 0 otherwise. Data have been collected for 36 managers. The sample regression equation is   = - 13,264.3 + 14.13Profit + 1,868.53Years + 45,121.94Grad - 0.1539Profit × Grad - 198.34 Years × Grad. Which of the following is the predicted value of Compensation for a manager having 15 years with the company, a graduate degree, and a business-unit profit of $4,800,000? = - 13,264.3 + 14.13Profit + 1,868.53Years + 45,121.94Grad - 0.1539Profit × Grad - 198.34 Years × Grad. Which of the following is the predicted value of Compensation for a manager having 15 years with the company, a graduate degree, and a business-unit profit of $4,800,000?

(Multiple Choice)
4.8/5
(41)

A researcher wants to examine how the remaining balance on $100,000 loans taken 10 to 20 years ago depends on whether the loan was a prime or subprime loan. He collected a sample of 25 prime loans and 25 subprime loans and recorded the data in the following variables: Balance = the remaining amount of loan to be paid off (in $), Time = the time elapsed from taking the loan, Prime = a dummy variable assuming 1 for prime loans, and 0 for subprime loans. The regression results obtained for the models: Model A: Balance = β0 + β1Prime + ε Model B: Balance = β0 + β1Time + β2Prime + β3Time × Prime + ε Model C: Balance = β0 + β1Prime + β2Time × Prime + ε, Are summarized in the following table. A researcher wants to examine how the remaining balance on $100,000 loans taken 10 to 20 years ago depends on whether the loan was a prime or subprime loan. He collected a sample of 25 prime loans and 25 subprime loans and recorded the data in the following variables: Balance = the remaining amount of loan to be paid off (in $), Time = the time elapsed from taking the loan, Prime = a dummy variable assuming 1 for prime loans, and 0 for subprime loans. The regression results obtained for the models: Model A: Balance = β<sub>0</sub> + β<sub>1</sub>Prime + ε Model B: Balance = β<sub>0</sub> + β<sub>1</sub>Time + β<sub>2</sub>Prime + β<sub>3</sub>Time × Prime + ε Model C: Balance = β<sub>0</sub> + β<sub>1</sub>Prime + β<sub>2</sub>Time × Prime + ε, Are summarized in the following table.   Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients. Which of the following three models would you choose to make the predictions of the remaining loan balance? Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients. Which of the following three models would you choose to make the predictions of the remaining loan balance?

(Multiple Choice)
4.9/5
(27)

The method of maximum likelihood estimation (MLE) is used to ________ the parameters of a logit model.

(Short Answer)
4.8/5
(34)

A bank manager is interested in assigning a rating to the holders of credit cards issued by her bank. The rating is based on the probability of defaulting on credit cards and is as follows. A bank manager is interested in assigning a rating to the holders of credit cards issued by her bank. The rating is based on the probability of defaulting on credit cards and is as follows.   To estimate this probability, she decided to use the logit model, P =   , where y = a binary response variable which is 1 if the credit card is in default and 0 otherwise x<sub>1</sub> = the ratio of the credit card balance to the credit card limit (in %) x<sub>2</sub> = the ratio of the total debt to the annual income (in %) The following output is obtained.   Note: The p-values of the corresponding tests are shown in parentheses below the estimated coefficients. If only applicants with excellent and good ratings are qualified for a loan, find a linear relation between their balance ratio and their debt ratio that must be satisfied to be qualified. To estimate this probability, she decided to use the logit model, P = A bank manager is interested in assigning a rating to the holders of credit cards issued by her bank. The rating is based on the probability of defaulting on credit cards and is as follows.   To estimate this probability, she decided to use the logit model, P =   , where y = a binary response variable which is 1 if the credit card is in default and 0 otherwise x<sub>1</sub> = the ratio of the credit card balance to the credit card limit (in %) x<sub>2</sub> = the ratio of the total debt to the annual income (in %) The following output is obtained.   Note: The p-values of the corresponding tests are shown in parentheses below the estimated coefficients. If only applicants with excellent and good ratings are qualified for a loan, find a linear relation between their balance ratio and their debt ratio that must be satisfied to be qualified. , where y = a binary response variable which is 1 if the credit card is in default and 0 otherwise x1 = the ratio of the credit card balance to the credit card limit (in %) x2 = the ratio of the total debt to the annual income (in %) The following output is obtained. A bank manager is interested in assigning a rating to the holders of credit cards issued by her bank. The rating is based on the probability of defaulting on credit cards and is as follows.   To estimate this probability, she decided to use the logit model, P =   , where y = a binary response variable which is 1 if the credit card is in default and 0 otherwise x<sub>1</sub> = the ratio of the credit card balance to the credit card limit (in %) x<sub>2</sub> = the ratio of the total debt to the annual income (in %) The following output is obtained.   Note: The p-values of the corresponding tests are shown in parentheses below the estimated coefficients. If only applicants with excellent and good ratings are qualified for a loan, find a linear relation between their balance ratio and their debt ratio that must be satisfied to be qualified. Note: The p-values of the corresponding tests are shown in parentheses below the estimated coefficients. If only applicants with excellent and good ratings are qualified for a loan, find a linear relation between their balance ratio and their debt ratio that must be satisfied to be qualified.

(Essay)
4.9/5
(34)

To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses), Educ = the number of years of education, Exper = the number of months of experience, Train = the number of weeks of training, Gender = the gender of an individual; 1 for males, and 0 for females. Excel partial outputs corresponding to these models are available and shown below. Model A: Salary = β0 + β1Educ + β2Exper + β3Train + β4Gender + ε To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses), Educ = the number of years of education, Exper = the number of months of experience, Train = the number of weeks of training, Gender = the gender of an individual; 1 for males, and 0 for females. Excel partial outputs corresponding to these models are available and shown below. Model A: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Train + β<sub>4</sub>Gender + ε   Model B: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Gender + ε   Assuming the same years of education and months of experience, what is the p-value for testing whether the mean salary of males is greater than the mean salary of females using Model B? Model B: Salary = β0 + β1Educ + β2Exper + β3Gender + ε To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected, and two models were created with the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses), Educ = the number of years of education, Exper = the number of months of experience, Train = the number of weeks of training, Gender = the gender of an individual; 1 for males, and 0 for females. Excel partial outputs corresponding to these models are available and shown below. Model A: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Train + β<sub>4</sub>Gender + ε   Model B: Salary = β<sub>0</sub> + β<sub>1</sub>Educ + β<sub>2</sub>Exper + β<sub>3</sub>Gender + ε   Assuming the same years of education and months of experience, what is the p-value for testing whether the mean salary of males is greater than the mean salary of females using Model B? Assuming the same years of education and months of experience, what is the p-value for testing whether the mean salary of males is greater than the mean salary of females using Model B?

(Multiple Choice)
4.9/5
(31)

Consider the regression equation Consider the regression equation   = b<sub>0</sub> + b<sub>1</sub>xd with b<sub>1</sub> > 0 and a dummy variable d. If d changes from 0 to 1, which of the following is true? = b0 + b1xd with b1 > 0 and a dummy variable d. If d changes from 0 to 1, which of the following is true?

(Multiple Choice)
4.9/5
(36)

A binary choice model is also referred to as a discrete choice model.

(True/False)
4.8/5
(29)

A dummy variable is also called an interaction variable.

(True/False)
4.9/5
(42)

For the model y = β0 + β1x + β2d1 + β3d2 + ε, which of the following tests is used for testing the joint significance of the dummy variables d1 and d2?

(Multiple Choice)
4.8/5
(35)

A realtor wants to predict and compare the prices of homes in three neighboring locations. She considers the following linear models: Model A: Price = β0 + β1 Size + β2 Age + ε Model B: Price = β0 + β1 Size + β3 Loc1 + β4 Loc2 + ε Model C: Price = β0 + β1 Size + β2 Age + β3 Loc1 + β4 Loc2 + ε where, Price = the price of a home (in $1,000s) Size = the square footage (in sq. feet) Loc1 = a dummy variable taking on 1 for Location 1, and 0 otherwise Loc2 = a dummy variable taking on 1 for Location 2, and 0 otherwise After collecting data on 52 sales and applying regression, her findings were summarized in the following table. A realtor wants to predict and compare the prices of homes in three neighboring locations. She considers the following linear models: Model A: Price = β<sub>0</sub> + β<sub>1</sub> Size + β<sub>2</sub> Age + ε Model B: Price = β<sub>0</sub> + β<sub>1</sub> Size + β<sub>3</sub> Loc1 + β<sub>4</sub> Loc2 + ε Model C: Price = β<sub>0</sub> + β<sub>1</sub> Size + β<sub>2</sub> Age + β<sub>3</sub> Loc1 + β<sub>4</sub> Loc2 + ε where, Price = the price of a home (in $1,000s) Size = the square footage (in sq. feet) Loc1 = a dummy variable taking on 1 for Location 1, and 0 otherwise Loc2 = a dummy variable taking on 1 for Location 2, and 0 otherwise After collecting data on 52 sales and applying regression, her findings were summarized in the following table.   Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients. Using Model C, what is the conclusion for testing the joint significance of the two dummy variables at the 1% significance level? Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients. Using Model C, what is the conclusion for testing the joint significance of the two dummy variables at the 1% significance level?

(Essay)
4.8/5
(36)

A realtor wants to predict and compare the prices of homes in three neighboring locations. She considers the following linear models: Model A: Price = β0 + β1 Size + β2 Age + ε Model B: Price = β0 + β1 Size + β3 Loc1 + β4 Loc2 + ε Model C: Price = β0 + β1 Size + β2 Age + β3 Loc1 + β4 Loc2 + ε where, Price = the price of a home (in $1,000s) Size = the square footage (in sq. feet) Loc1 = a dummy variable taking on 1 for Location 1, and 0 otherwise Loc2 = a dummy variable taking on 1 for Location 2, and 0 otherwise After collecting data on 52 sales and applying regression, her findings were summarized in the following table. A realtor wants to predict and compare the prices of homes in three neighboring locations. She considers the following linear models: Model A: Price = β<sub>0</sub> + β<sub>1</sub> Size + β<sub>2</sub> Age + ε Model B: Price = β<sub>0</sub> + β<sub>1</sub> Size + β<sub>3</sub> Loc1 + β<sub>4</sub> Loc2 + ε Model C: Price = β<sub>0</sub> + β<sub>1</sub> Size + β<sub>2</sub> Age + β<sub>3</sub> Loc1 + β<sub>4</sub> Loc2 + ε where, Price = the price of a home (in $1,000s) Size = the square footage (in sq. feet) Loc1 = a dummy variable taking on 1 for Location 1, and 0 otherwise Loc2 = a dummy variable taking on 1 for Location 2, and 0 otherwise After collecting data on 52 sales and applying regression, her findings were summarized in the following table.   Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients. Using Model B, compute the predicted difference between the price of homes with the same square footage located in Location 2 and Location 3. Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients. Using Model B, compute the predicted difference between the price of homes with the same square footage located in Location 2 and Location 3.

(Short Answer)
4.8/5
(26)

The logit model cannot be estimated with standard ________ least squares procedures.

(Short Answer)
4.9/5
(44)

The model y = β0 + β1x + β2d + β3xd + ε> is an example of a ________.

(Multiple Choice)
4.8/5
(33)

Like any other university, Seton Hall University uses SAT scores and high school GPAs as primary criteria for admission. Data for 30 students who had recently applied to Seton Hall were collected including admission (1 for admission and 0 otherwise), SAT score, and GPA. Output for the model is shows below. Like any other university, Seton Hall University uses SAT scores and high school GPAs as primary criteria for admission. Data for 30 students who had recently applied to Seton Hall were collected including admission (1 for admission and 0 otherwise), SAT score, and GPA. Output for the model is shows below.   Which of the following is the predicted probability of admission for an individual with a GPA of 3.5 and an SAT score of 1,700? Which of the following is the predicted probability of admission for an individual with a GPA of 3.5 and an SAT score of 1,700?

(Multiple Choice)
4.8/5
(39)

To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected and the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses) Educ = the number of years of education Exper = the number of months of experience Gender = the gender of an individual; 1 for males, and 0 for females The regression results obtained for the models are as follows: Model A: Salary = β0 + β1 Educ + β2 Exper + β3 Gender + β4 Exper × Gender + ε, Model B: Salary = β0 + β1 Educ + β2 Exper + ε, are summarized in the following table. To examine the differences between salaries of male and female middle managers of a large bank, 90 individuals were randomly selected and the following variables considered: Salary = the monthly salary (excluding fringe benefits and bonuses) Educ = the number of years of education Exper = the number of months of experience Gender = the gender of an individual; 1 for males, and 0 for females The regression results obtained for the models are as follows: Model A: Salary = β<sub>0</sub> + β<sub>1</sub> Educ + β<sub>2</sub> Exper + β<sub>3</sub> Gender + β<sub>4</sub> Exper × Gender + ε, Model B: Salary = β<sub>0 </sub>+ β<sub>1</sub> Educ + β<sub>2</sub> Exper + ε, are summarized in the following table.   Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients. At 1% significance level, what is the conclusion of testing the joint significance of Exper and Exper × Gender in Model A? Note: The values of relevant test statistics are shown in parentheses below the estimated coefficients. At 1% significance level, what is the conclusion of testing the joint significance of Exper and Exper × Gender in Model A?

(Essay)
4.7/5
(34)

For a linear regression model with a dummy variable d and an interaction variable xd, we ________.

(Multiple Choice)
4.8/5
(35)

The logit model can be estimated with standard ordinary least squares.

(True/False)
4.8/5
(25)
Showing 1 - 20 of 130
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)