Exam 13: Multiple Regression

arrow
  • Select Tags
search iconSearch Question
  • Select Tags

SCENARIO 13-5 A microeconomist wants to determine how corporate sales are influenced by capital and wage spending by companies.She proceeds to randomly select 26 large corporations and record information in millions of dollars.The Microsoft Excel output below shows results of this multiple regression. SUMMARY OUTPUT Regression Statistics Multiple R 0.830 R Square 0.689 Adjusted R Square 0.662 Standard Error 17501.643 Observations 26 ANOVA df SS MS F Signif F Regression 2 15579777040 7789888520 25.432 0.0001 Residual 23 7045072780 306307512 Total 25 22624849820 Coeff StdError t Stat P -value Intercept 15800.0000 6038.2999 2.617 0.0154 Capital 0.1245 0.2045 0.609 0.5485 Wages 7.0762 1.4729 4.804 0.0001 -Referring to SCENARIO 13-5, suppose the microeconomist wants to test whether the coefficient on Capital is significantly different from 0.What is the value of the relevant t-statistic?

(Multiple Choice)
4.7/5
(43)

SCENARIO 13-10 You worked as an intern at We Always Win Car Insurance Company last summer.You notice that individual car insurance premiums depend very much on the age of the individual and the number of traffic tickets received by the individual.You performed a regression analysis in EXCEL and obtained the following partial information: Regression Statistics Multiple R 0.8546 R Square 0.7303 Adjusted R Square 0.6853 Standard Error 226.7502 Observations 15 ANOVA df SS MS F Siqnificonce F Regression 2 835284.6500 16.2457 0.0004 Residual 12 616987.8200 Total 2287557.1200 Coefficients Standard Error t Stat P-value Lower 99\% Upper 99\% Intercept 821.2617 161.9391 5.0714 0.0003 326.6124 1315.9111 Age -1.4061 2.5988 -0.5411 0.5984 -9.3444 6.5321 Tickets 243.4401 43.2470 5.6291 0.0001 111.3406 375.5396 -Referring to SCENARIO 13-10, to test the significance of the multiple regression model, what are the degrees of freedom?

(Essay)
4.8/5
(42)

SCENARIO 13-19 The marketing manager for a nationally franchised lawn service company would like to study the characteristics that differentiate home owners who do and do not have a lawn service.A random sample of 30 home owners located in a suburban area near a large city was selected; 11 did not have a lawn service (code 0) and 19 had a lawn service (code 1).Additional information available concerning these 30 home owners includes family income (Income, in thousands of dollars) and lawn size (Lawn Size, in thousands of square feet). The PHStat output is given below: Binary Logistic Regression Predictor Coefficients SE Coef Z p -Value Intercept -7.8562 3.8224 -2.0553 0.0398 Income 0.0304 0.0133 2.2897 0.0220 Lawn Size 1.2804 0.6971 1.8368 0.0662 Deviance 25.3089 -Referring to SCENARIO 13-19, what should be the decision ('reject' or 'do not reject') on the null hypothesis when testing whether Income makes a significant contribution to the model in the presence of LawnSize at a 0.05 level of significance?

(Essay)
4.7/5
(24)

SCENARIO 13-8 A financial analyst wanted to examine the relationship between salary (in $1,000) and 2 variables: age (X1 = Age) and experience in the field (X2 = Exper).He took a sample of 20 employees and obtained the following Microsoft Excel output: Regression Statistics Multiple R 0.8535 R Square 0.7284 Adjusted R Square 0.6964 Standard Error 10.5630 Observations 20 ANOYA df SS MS F Siqnificonce F Regression 2 5086.5764 2543.2882 22.7941 0.0000 Residual 17 1896.8050 111.5768 Total 19 6983.3814 Coefficients Standard Error t Stat P-value Lower 95\% Upper 95\% Intercept 1.5740 9.2723 0.1698 0.8672 -17.9888 21.1368 Age 1.3045 0.1956 6.6678 0.0000 0.8917 1.7173 Exper -0.1478 0.1944 -0.7604 0.4574 -0.5580 0.2624 -Referring to SCENARIO 13-8, the F test for the significance of the entire regression performed at a level of significance of 0.01 leads to a rejection of the null hypothesis.

(True/False)
5.0/5
(41)

SCENARIO 13-4 A real estate builder wishes to determine how house size (House) is influenced by family income (Income) and family size (Size).House size is measured in hundreds of square feet and income is measured in thousands of dollars.The builder randomly selected 50 families and ran the multiple regression.Partial Microsoft Excel output is provided below: Regression Statistics Multiple R 0.8479 R Square 0.7189 Adjusted R Square 0.7069 Standard Error 17.5571 Observations 50 ANOVA df SS MS F Signif F Regression 37043.3236 18521.6618 0.0000 Residual 14487.7627 308.2503 Total 49 51531.0863 Coefficients Standard Error t Stat -value Intercept -5.5146 7.2273 -0.7630 0.4493 Income 0.4262 0.0392 10.8668 0.0000 Size 5.5437 1.6949 3.2708 0.0020  Also SSR(X1X2)=36400.6326 and SSR(X2X1)=3297.7917\text { Also } \operatorname{SSR}\left(X_{1} \mid X_{2}\right)=36400.6326 \text { and } \operatorname{SSR}\left(X_{2} \mid X_{1}\right)=3297.7917 -Referring to SCENARIO 13-4, which of the following values for the level of significance is the smallest for which each explanatory variable is significant individually?

(Multiple Choice)
4.9/5
(38)

SCENARIO 13-11 A weight-loss clinic wants to use regression analysis to build a model for weight loss of a client (measured in pounds).Two variables thought to affect weight loss are client's length of time on the weight-loss program and time of session.These variables are described below: Y = Weight loss (in pounds) X1 = Length of time in weight-loss program (in months) X2 = 1 if morning session, 0 if not Data for 25 clients on a weight-loss program at the clinic were collected and used to fit the interaction model: Y=β0+β1X1+β2X2+β3X1X2+εY = \beta _ { 0 } + \beta _ { 1 } X _ { 1 } + \beta _ { 2 } X _ { 2 } + \beta _ { 3 } X _ { 1 } X _ { 2 } + \varepsilon Output from Microsoft Excel follows: Multiple R 0.7308 R Square 0.5341 Adjusted R Square 0.4675 Standard Error 43.3275 Observations 25 ANOVA df SS MS Significance F Regression 3 45194.0661 15064.6887 8.0248 0.0009 Residual 21 39422.6542 1877.2692 Total 24 84616.7203 Coefficients Standard Error t Stat P-value Lower 99\% Upper 99\% Intercept -20.7298 22.3710 -0.9266 0.3646 -84.0702 42.6106 Length 7.2472 1.4992 4.8340 0.0001 3.0024 11.4919 Morn 90.1981 40.2336 2.2419 0.0359 -23.7176 204.1138 Length x Morn -5.1024 3.3511 -1.5226 0.1428 -14.5905 4.3857 -Referring to SCENARIO 13-11, which of the following statements is supported by the analysis shown?

(Multiple Choice)
4.8/5
(31)

SCENARIO 13-19 The marketing manager for a nationally franchised lawn service company would like to study the characteristics that differentiate home owners who do and do not have a lawn service.A random sample of 30 home owners located in a suburban area near a large city was selected; 11 did not have a lawn service (code 0) and 19 had a lawn service (code 1).Additional information available concerning these 30 home owners includes family income (Income, in thousands of dollars) and lawn size (Lawn Size, in thousands of square feet). The PHStat output is given below: Binary Logistic Regression Predictor Coefficients SE Coef Z p -Value Intercept -7.8562 3.8224 -2.0553 0.0398 Income 0.0304 0.0133 2.2897 0.0220 Lawn Size 1.2804 0.6971 1.8368 0.0662 Deviance 25.3089 -Referring to SCENARIO 13-19, there is not enough evidence to conclude that the model is not a good-fitting model at a 0.05 level of significance.

(True/False)
4.8/5
(32)

SCENARIO 13-4 A real estate builder wishes to determine how house size (House) is influenced by family income (Income) and family size (Size).House size is measured in hundreds of square feet and income is measured in thousands of dollars.The builder randomly selected 50 families and ran the multiple regression.Partial Microsoft Excel output is provided below: Regression Statistics Multiple R 0.8479 R Square 0.7189 Adjusted R Square 0.7069 Standard Error 17.5571 Observations 50 ANOVA df SS MS F Signif F Regression 37043.3236 18521.6618 0.0000 Residual 14487.7627 308.2503 Total 49 51531.0863 Coefficients Standard Error t Stat -value Intercept -5.5146 7.2273 -0.7630 0.4493 Income 0.4262 0.0392 10.8668 0.0000 Size 5.5437 1.6949 3.2708 0.0020  Also SSR(X1X2)=36400.6326 and SSR(X2X1)=3297.7917\text { Also } \operatorname{SSR}\left(X_{1} \mid X_{2}\right)=36400.6326 \text { and } \operatorname{SSR}\left(X_{2} \mid X_{1}\right)=3297.7917 -Referring to SCENARIO 13-4, what are the residual degrees of freedom that are missing from the output?

(Multiple Choice)
4.7/5
(28)

SCENARIO 13-4 A real estate builder wishes to determine how house size (House) is influenced by family income (Income) and family size (Size).House size is measured in hundreds of square feet and income is measured in thousands of dollars.The builder randomly selected 50 families and ran the multiple regression.Partial Microsoft Excel output is provided below: Regression Statistics Multiple R 0.8479 R Square 0.7189 Adjusted R Square 0.7069 Standard Error 17.5571 Observations 50 ANOVA df SS MS F Signif F Regression 37043.3236 18521.6618 0.0000 Residual 14487.7627 308.2503 Total 49 51531.0863 Coefficients Standard Error t Stat -value Intercept -5.5146 7.2273 -0.7630 0.4493 Income 0.4262 0.0392 10.8668 0.0000 Size 5.5437 1.6949 3.2708 0.0020  Also SSR(X1X2)=36400.6326 and SSR(X2X1)=3297.7917\text { Also } \operatorname{SSR}\left(X_{1} \mid X_{2}\right)=36400.6326 \text { and } \operatorname{SSR}\left(X_{2} \mid X_{1}\right)=3297.7917 -Referring to SCENARIO 13-4, when the builder used a simple linear regression model with house size (House) as the dependent variable and family size (Size) as the independent variable, he obtained an r2 value of 1.25%.What additional percentage of the total variation in house size has been explained by including income in the multiple regression?

(Multiple Choice)
5.0/5
(30)

SCENARIO 13-7 The department head of the accounting department wanted to see if she could predict the GPA of students using the number of course units and total SAT scores of each.She takes a sample of 6 students and generates the following Microsoft Excel output: SUMMARY OUTPUT Regression Statistics Multiple R 0.916 R Square 0.839 Adjusted R Square 0.732 Standard Error 0.24685 Observations 6 ANOVA df SS MS F Signif F Regression 2 0.95219 0.47610 7.813 0.0646 Residual 3 0.18281 0.06094 Total 5 1.13500 Coeff StdError t Stat P -value Intercept 4.593897 1.13374542 4.052 0.0271 Units -0.247270 0.06268485 -3.945 0.0290 Total SAT 0.001443 0.00101241 1.425 0.2494 -Referring to SCENARIO 13-7, the department head wants to use a t test to test for the significance of the coefficient of X1.At a level of significance of 0.05, the departmenthead would decide that β\beta 1 = 0 .

(True/False)
4.8/5
(39)

SCENARIO 13-4 A real estate builder wishes to determine how house size (House) is influenced by family income (Income) and family size (Size).House size is measured in hundreds of square feet and income is measured in thousands of dollars.The builder randomly selected 50 families and ran the multiple regression.Partial Microsoft Excel output is provided below: Regression Statistics Multiple R 0.8479 R Square 0.7189 Adjusted R Square 0.7069 Standard Error 17.5571 Observations 50 ANOVA df SS MS F Signif F Regression 37043.3236 18521.6618 0.0000 Residual 14487.7627 308.2503 Total 49 51531.0863 Coefficients Standard Error t Stat -value Intercept -5.5146 7.2273 -0.7630 0.4493 Income 0.4262 0.0392 10.8668 0.0000 Size 5.5437 1.6949 3.2708 0.0020  Also SSR(X1X2)=36400.6326 and SSR(X2X1)=3297.7917\text { Also } \operatorname{SSR}\left(X_{1} \mid X_{2}\right)=36400.6326 \text { and } \operatorname{SSR}\left(X_{2} \mid X_{1}\right)=3297.7917 -Referring to SCENARIO 13-4, the observed value of the F-statistic is missing from the printout.What are the degrees of freedom for this F-statistic?

(Multiple Choice)
4.9/5
(37)

A multiple regression is called "multiple" because it has several data points.

(True/False)
4.8/5
(31)

SCENARIO 13-8 A financial analyst wanted to examine the relationship between salary (in $1,000) and 2 variables: age (X1 = Age) and experience in the field (X2 = Exper).He took a sample of 20 employees and obtained the following Microsoft Excel output: Regression Statistics Multiple R 0.8535 R Square 0.7284 Adjusted R Square 0.6964 Standard Error 10.5630 Observations 20 ANOYA df SS MS F Siqnificonce F Regression 2 5086.5764 2543.2882 22.7941 0.0000 Residual 17 1896.8050 111.5768 Total 19 6983.3814 Coefficients Standard Error t Stat P-value Lower 95\% Upper 95\% Intercept 1.5740 9.2723 0.1698 0.8672 -17.9888 21.1368 Age 1.3045 0.1956 6.6678 0.0000 0.8917 1.7173 Exper -0.1478 0.1944 -0.7604 0.4574 -0.5580 0.2624 -Referring to SCENARIO 13-8, the value of the partial F test statistic is forH0: Variable X1 does not significantly improve the model after variable X2 has been includedH1: Variable X1 significantly improves the model after variable X2 has been included

(Essay)
4.8/5
(33)

SCENARIO 13-4 A real estate builder wishes to determine how house size (House) is influenced by family income (Income) and family size (Size).House size is measured in hundreds of square feet and income is measured in thousands of dollars.The builder randomly selected 50 families and ran the multiple regression.Partial Microsoft Excel output is provided below: Regression Statistics Multiple R 0.8479 R Square 0.7189 Adjusted R Square 0.7069 Standard Error 17.5571 Observations 50 ANOVA df SS MS F Signif F Regression 37043.3236 18521.6618 0.0000 Residual 14487.7627 308.2503 Total 49 51531.0863 Coefficients Standard Error t Stat -value Intercept -5.5146 7.2273 -0.7630 0.4493 Income 0.4262 0.0392 10.8668 0.0000 Size 5.5437 1.6949 3.2708 0.0020  Also SSR(X1X2)=36400.6326 and SSR(X2X1)=3297.7917\text { Also } \operatorname{SSR}\left(X_{1} \mid X_{2}\right)=36400.6326 \text { and } \operatorname{SSR}\left(X_{2} \mid X_{1}\right)=3297.7917 -Referring to SCENARIO 13-4, which of the independent variables in the model are significant at the 5% level?

(Multiple Choice)
4.8/5
(32)

SCENARIO 13-15 The superintendent of a school district wanted to predict the percentage of students passing a sixth- grade proficiency test.She obtained the data on percentage of students passing the proficiency test (% Passing), mean teacher salary in thousands of dollars (Salaries), and instructional spending per pupil in thousands of dollars (Spending) of 47 schools in the state. Following is the multiple regression output with Y = % Passing as the dependent variable, X1 = Salaries and X 2 = Spending: Regression Statistics Multiple R 0.4276 R Square 0.1828 Adjusted R Square 0.1457 Standard Error 5.7351 Observations 47 ANOVA df SS MS F Significance F Regression 2 323.8284 161.9142 4.9227 0.0118 Residual 44 1447.2094 32.8911 Total 46 1771.0378 Coefficients Standard Error t Stat P-value Lower 95\% Upper 95\% Intercept -72.9916 45.9106 -1.5899 0.1190 -165.5184 19.5352 Salary 2.7939 0.8974 3.1133 0.0032 0.9853 4.6025 Spending 0.3742 0.9782 0.3825 0.7039 -1.5972 2.3455 -Referring to SCENARIO 13-15, the null hypothesis should be rejected at a 5% level of significance when testing whether instructional spending per pupil has any effect on percentage of students passing the proficiency test, considering the effect of mean teacher salary.

(True/False)
4.8/5
(36)

SCENARIO 13-11 A weight-loss clinic wants to use regression analysis to build a model for weight loss of a client (measured in pounds).Two variables thought to affect weight loss are client's length of time on the weight-loss program and time of session.These variables are described below: Y = Weight loss (in pounds) X1 = Length of time in weight-loss program (in months) X2 = 1 if morning session, 0 if not Data for 25 clients on a weight-loss program at the clinic were collected and used to fit the interaction model: Y=β0+β1X1+β2X2+β3X1X2+εY = \beta _ { 0 } + \beta _ { 1 } X _ { 1 } + \beta _ { 2 } X _ { 2 } + \beta _ { 3 } X _ { 1 } X _ { 2 } + \varepsilon Output from Microsoft Excel follows: Multiple R 0.7308 R Square 0.5341 Adjusted R Square 0.4675 Standard Error 43.3275 Observations 25 ANOVA df SS MS Significance F Regression 3 45194.0661 15064.6887 8.0248 0.0009 Residual 21 39422.6542 1877.2692 Total 24 84616.7203 Coefficients Standard Error t Stat P-value Lower 99\% Upper 99\% Intercept -20.7298 22.3710 -0.9266 0.3646 -84.0702 42.6106 Length 7.2472 1.4992 4.8340 0.0001 3.0024 11.4919 Morn 90.1981 40.2336 2.2419 0.0359 -23.7176 204.1138 Length x Morn -5.1024 3.3511 -1.5226 0.1428 -14.5905 4.3857 -Referring to SCENARIO 13-11, what null hypothesis would you test to determine whether the slope of the linear relationship between weight loss (Y) and time on the program (X1) varies according to time of session? a) H0:β1=0H _ { 0 } : \beta _ { 1 } = 0 b) H0:β2=0H _ { 0 } : \beta _ { 2 } = 0 c) H0:β3=0H _ { 0 } : \beta _ { 3 } = 0 d) H0:β1=β2=0H _ { 0 } : \beta _ { 1 } = \beta _ { 2 } = 0

(Short Answer)
4.9/5
(42)

From the coefficient of multiple determination, you cannot detect the strength of the relationship between Y and any individual independent variable.

(True/False)
4.9/5
(47)

SCENARIO 13-10 You worked as an intern at We Always Win Car Insurance Company last summer.You notice that individual car insurance premiums depend very much on the age of the individual and the number of traffic tickets received by the individual.You performed a regression analysis in EXCEL and obtained the following partial information: Regression Statistics Multiple R 0.8546 R Square 0.7303 Adjusted R Square 0.6853 Standard Error 226.7502 Observations 15 ANOVA df SS MS F Siqnificonce F Regression 2 835284.6500 16.2457 0.0004 Residual 12 616987.8200 Total 2287557.1200 Coefficients Standard Error t Stat P-value Lower 99\% Upper 99\% Intercept 821.2617 161.9391 5.0714 0.0003 326.6124 1315.9111 Age -1.4061 2.5988 -0.5411 0.5984 -9.3444 6.5321 Tickets 243.4401 43.2470 5.6291 0.0001 111.3406 375.5396 -Referring to SCENARIO 13-10, the multiple regression model is significant at a10% level of significance.

(True/False)
4.9/5
(42)

SCENARIO 13-17 Given below are results from the regression analysis where the dependent variable is the number of weeks a worker is unemployed due to a layoff (Unemploy) and the independent variables are the age of the worker (Age) and a dummy variable for management position (Manager: 1 = yes, 0 = no). The results of the regression analysis are given below: Regression Statistics Multiple R 0.6391 R Square 0.4085 Adjusted R Square 0.3765 Standard Error 18.8929 Observations 40 ANOVA df SS MS F Significance F Regression 2 9119.0897 4559.5448 12.7740 0.0000 Residual 37 13206.8103 356.9408 Total 39 22325.9 Coefficients Standard Error t Stat P -value Intercept -0.2143 11.5796 -0.0185 0.9853 Age 1.4448 0.3160 4.5717 0.0000 Manager -22.5761 11.3488 -1.9893 0.0541 -Referring to SCENARIO 13-17, the alternative hypothesis H 1: At least one of β\beta j \neq 0 for j =1, 2 implies that the number of weeks a worker is unemployed due to a layoff is affected by all of the explanatory variables.

(True/False)
4.8/5
(24)

SCENARIO 13-3 An economist is interested to see how consumption for an economy (in $ billions) is influenced by gross domestic product ($ billions) and aggregate price (consumer price index).The Microsoft Excel output of this regression is partially reproduced below.  SUMMARY OUTPUT \text { SUMMARY OUTPUT } Regression Statistics Multiple R 0.991 R Square 0.982 Adjusted R Square 0.976 Standard Error 0.299 Observations 10 ANOVA df SS MS F Signif F Regression 2 33.4163 16.7082 186.325 0.0001 Residual 7 0.6277 0.0897 Total 9 34.0440 Coeff StdError t Stat P -value Intercept -0.0861 0.5674 -0.152 0.8837 GDP 0.7654 0.0574 13.340 0.0001 Price -0.0006 0.0028 -0.219 0.8330 -Referring to SCENARIO 13-3, to test whether aggregate price index has a positive impact on consumption, the p-value is

(Multiple Choice)
4.8/5
(33)
Showing 21 - 40 of 307
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)