Exam 13: Multiple Regression

arrow
  • Select Tags
search iconSearch Question
  • Select Tags

SCENARIO 13-15 The superintendent of a school district wanted to predict the percentage of students passing a sixth- grade proficiency test.She obtained the data on percentage of students passing the proficiency test (% Passing), mean teacher salary in thousands of dollars (Salaries), and instructional spending per pupil in thousands of dollars (Spending) of 47 schools in the state. Following is the multiple regression output with Y = % Passing as the dependent variable, X1 = Salaries and X 2 = Spending: Regression Statistics Multiple R 0.4276 R Square 0.1828 Adjusted R Square 0.1457 Standard Error 5.7351 Observations 47 ANOVA df SS MS F Significance F Regression 2 323.8284 161.9142 4.9227 0.0118 Residual 44 1447.2094 32.8911 Total 46 1771.0378 Coefficients Standard Error t Stat P-value Lower 95\% Upper 95\% Intercept -72.9916 45.9106 -1.5899 0.1190 -165.5184 19.5352 Salary 2.7939 0.8974 3.1133 0.0032 0.9853 4.6025 Spending 0.3742 0.9782 0.3825 0.7039 -1.5972 2.3455 -Referring to SCENARIO 13-15, the null hypothesis H0 : β\beta 1 = β\beta 2 = 0 implies that percentageof students passing the proficiency test is not related to either of the explanatory variables.

(True/False)
4.8/5
(41)

SCENARIO 13-18 A logistic regression model was estimated in order to predict the probability that a randomly chosen university or college would be a private university using information on mean total Scholastic Aptitude Test score (SAT) at the university or college and whether the TOEFL criterion is at least 90 (Toefl90 = 1 if yes, 0 otherwise.) The dependent variable, Y, is school type (Type = 1 if private and 0 otherwise).There are 80 universities in the sample. The PHStat output is given below: Binary Logistic Regression Predictor Coefficients SE Coef Z p -Value Intercept -3.9594 1.6741 -2.3650 0.0180 SAT 0.0028 0.0011 2.5459 0.0109 Toefl90:1 0.1928 0.5827 0.3309 0.7407 Deviance 101.9826 -Referring to SCENARIO 13-18, what should be the decision ('reject' or 'do not reject') on the null hypothesis when testing whether Toefl90 makes a significant contribution to the model in the presence of SAT at a 0.05 level of significance?

(Essay)
4.8/5
(30)

SCENARIO 13-17 Given below are results from the regression analysis where the dependent variable is the number of weeks a worker is unemployed due to a layoff (Unemploy) and the independent variables are the age of the worker (Age) and a dummy variable for management position (Manager: 1 = yes, 0 = no). The results of the regression analysis are given below: Regression Statistics Multiple R 0.6391 R Square 0.4085 Adjusted R Square 0.3765 Standard Error 18.8929 Observations 40 ANOVA df SS MS F Significance F Regression 2 9119.0897 4559.5448 12.7740 0.0000 Residual 37 13206.8103 356.9408 Total 39 22325.9 Coefficients Standard Error t Stat P -value Intercept -0.2143 11.5796 -0.0185 0.9853 Age 1.4448 0.3160 4.5717 0.0000 Manager -22.5761 11.3488 -1.9893 0.0541 -Referring to SCENARIO 13-17, there is sufficient evidence that the number of weeks a worker is unemployed due to a layoff depends on at least one of the explanatory variables at a 10% level of significance.

(True/False)
4.8/5
(34)

SCENARIO 13-4 A real estate builder wishes to determine how house size (House) is influenced by family income (Income) and family size (Size).House size is measured in hundreds of square feet and income is measured in thousands of dollars.The builder randomly selected 50 families and ran the multiple regression.Partial Microsoft Excel output is provided below: Regression Statistics Multiple R 0.8479 R Square 0.7189 Adjusted R Square 0.7069 Standard Error 17.5571 Observations 50 ANOVA df SS MS F Signif F Regression 37043.3236 18521.6618 0.0000 Residual 14487.7627 308.2503 Total 49 51531.0863 Coefficients Standard Error t Stat -value Intercept -5.5146 7.2273 -0.7630 0.4493 Income 0.4262 0.0392 10.8668 0.0000 Size 5.5437 1.6949 3.2708 0.0020  Also SSR(X1X2)=36400.6326 and SSR(X2X1)=3297.7917\text { Also } \operatorname{SSR}\left(X_{1} \mid X_{2}\right)=36400.6326 \text { and } \operatorname{SSR}\left(X_{2} \mid X_{1}\right)=3297.7917 -Referring to SCENARIO 13-4, which of the following values for the level of significance is the smallest for which at most one explanatory variable is significant individually?

(Multiple Choice)
4.8/5
(39)

SCENARIO 13-3 An economist is interested to see how consumption for an economy (in $ billions) is influenced by gross domestic product ($ billions) and aggregate price (consumer price index).The Microsoft Excel output of this regression is partially reproduced below.  SUMMARY OUTPUT \text { SUMMARY OUTPUT } Regression Statistics Multiple R 0.991 R Square 0.982 Adjusted R Square 0.976 Standard Error 0.299 Observations 10 ANOVA df SS MS F Signif F Regression 2 33.4163 16.7082 186.325 0.0001 Residual 7 0.6277 0.0897 Total 9 34.0440 Coeff StdError t Stat P -value Intercept -0.0861 0.5674 -0.152 0.8837 GDP 0.7654 0.0574 13.340 0.0001 Price -0.0006 0.0028 -0.219 0.8330 -Referring to SCENARIO 13-3, to test for the significance of the coefficient on gross domestic product, the p-value is

(Multiple Choice)
4.8/5
(33)

SCENARIO 13-8 A financial analyst wanted to examine the relationship between salary (in $1,000) and 2 variables: age (X1 = Age) and experience in the field (X2 = Exper).He took a sample of 20 employees and obtained the following Microsoft Excel output: Regression Statistics Multiple R 0.8535 R Square 0.7284 Adjusted R Square 0.6964 Standard Error 10.5630 Observations 20 ANOYA df SS MS F Siqnificonce F Regression 2 5086.5764 2543.2882 22.7941 0.0000 Residual 17 1896.8050 111.5768 Total 19 6983.3814 Coefficients Standard Error t Stat P-value Lower 95\% Upper 95\% Intercept 1.5740 9.2723 0.1698 0.8672 -17.9888 21.1368 Age 1.3045 0.1956 6.6678 0.0000 0.8917 1.7173 Exper -0.1478 0.1944 -0.7604 0.4574 -0.5580 0.2624 -Referring to SCENARIO 13-8, the partial F test forH0: Variable X2 does not significantly improve the model after variable X1 has been includedH1: Variable X2 significantly improves the model after variable X1 has been included has and degrees of freedom.

(Essay)
4.8/5
(36)

SCENARIO 13-11 A weight-loss clinic wants to use regression analysis to build a model for weight loss of a client (measured in pounds).Two variables thought to affect weight loss are client's length of time on the weight-loss program and time of session.These variables are described below: Y = Weight loss (in pounds) X1 = Length of time in weight-loss program (in months) X2 = 1 if morning session, 0 if not Data for 25 clients on a weight-loss program at the clinic were collected and used to fit the interaction model: Y=β0+β1X1+β2X2+β3X1X2+εY = \beta _ { 0 } + \beta _ { 1 } X _ { 1 } + \beta _ { 2 } X _ { 2 } + \beta _ { 3 } X _ { 1 } X _ { 2 } + \varepsilon Output from Microsoft Excel follows: Multiple R 0.7308 R Square 0.5341 Adjusted R Square 0.4675 Standard Error 43.3275 Observations 25 ANOVA df SS MS Significance F Regression 3 45194.0661 15064.6887 8.0248 0.0009 Residual 21 39422.6542 1877.2692 Total 24 84616.7203 Coefficients Standard Error t Stat P-value Lower 99\% Upper 99\% Intercept -20.7298 22.3710 -0.9266 0.3646 -84.0702 42.6106 Length 7.2472 1.4992 4.8340 0.0001 3.0024 11.4919 Morn 90.1981 40.2336 2.2419 0.0359 -23.7176 204.1138 Length x Morn -5.1024 3.3511 -1.5226 0.1428 -14.5905 4.3857 -Referring to SCENARIO 13-11, in terms of the β\beta s in the model, give the mean change in weight loss (Y) for every 1 month increase in time on the program (X1) when attending the morning session.

(Multiple Choice)
4.7/5
(32)

SCENARIO 13-13 An econometrician is interested in evaluating the relationship of demand for building materials to mortgage rates in Los Angeles and San Francisco.He believes that the appropriate model is Y=10+5X1+8X2Y = 10 + 5 X _ { 1 } + 8 X _ { 2 } where = mortgage rate in \% =1 if SF, 0 if LA Y= demand in \ 100 per capita -Referring to SCENARIO 13-13, the fitted model for predicting demand in Los Angeles is_.

(Multiple Choice)
4.8/5
(38)

SCENARIO 13-15 The superintendent of a school district wanted to predict the percentage of students passing a sixth- grade proficiency test.She obtained the data on percentage of students passing the proficiency test (% Passing), mean teacher salary in thousands of dollars (Salaries), and instructional spending per pupil in thousands of dollars (Spending) of 47 schools in the state. Following is the multiple regression output with Y = % Passing as the dependent variable, X1 = Salaries and X 2 = Spending: Regression Statistics Multiple R 0.4276 R Square 0.1828 Adjusted R Square 0.1457 Standard Error 5.7351 Observations 47 ANOVA df SS MS F Significance F Regression 2 323.8284 161.9142 4.9227 0.0118 Residual 44 1447.2094 32.8911 Total 46 1771.0378 Coefficients Standard Error t Stat P-value Lower 95\% Upper 95\% Intercept -72.9916 45.9106 -1.5899 0.1190 -165.5184 19.5352 Salary 2.7939 0.8974 3.1133 0.0032 0.9853 4.6025 Spending 0.3742 0.9782 0.3825 0.7039 -1.5972 2.3455 -Referring to SCENARIO 13-15, the alternative hypothesis H 1: At least one of β\beta j \neq 0 for j =1, 2 implies that percentage of students passing the proficiency test is affected by both of the explanatory variables.

(True/False)
4.9/5
(37)

SCENARIO 13-4 A real estate builder wishes to determine how house size (House) is influenced by family income (Income) and family size (Size).House size is measured in hundreds of square feet and income is measured in thousands of dollars.The builder randomly selected 50 families and ran the multiple regression.Partial Microsoft Excel output is provided below: Regression Statistics Multiple R 0.8479 R Square 0.7189 Adjusted R Square 0.7069 Standard Error 17.5571 Observations 50 ANOVA df SS MS F Signif F Regression 37043.3236 18521.6618 0.0000 Residual 14487.7627 308.2503 Total 49 51531.0863 Coefficients Standard Error t Stat -value Intercept -5.5146 7.2273 -0.7630 0.4493 Income 0.4262 0.0392 10.8668 0.0000 Size 5.5437 1.6949 3.2708 0.0020  Also SSR(X1X2)=36400.6326 and SSR(X2X1)=3297.7917\text { Also } \operatorname{SSR}\left(X_{1} \mid X_{2}\right)=36400.6326 \text { and } \operatorname{SSR}\left(X_{2} \mid X_{1}\right)=3297.7917 -Referring to SCENARIO 13-4, one individual in the sample had an annual income of $40,000 and a family size of 1.This individual owned a home with an area of 1,000 square feet (House =10.00).What is the residual (in hundreds of square feet) for this data point?

(Essay)
4.8/5
(44)

SCENARIO 13-5 A microeconomist wants to determine how corporate sales are influenced by capital and wage spending by companies.She proceeds to randomly select 26 large corporations and record information in millions of dollars.The Microsoft Excel output below shows results of this multiple regression. SUMMARY OUTPUT Regression Statistics Multiple R 0.830 R Square 0.689 Adjusted R Square 0.662 Standard Error 17501.643 Observations 26 ANOVA df SS MS F Signif F Regression 2 15579777040 7789888520 25.432 0.0001 Residual 23 7045072780 306307512 Total 25 22624849820 Coeff StdError t Stat P -value Intercept 15800.0000 6038.2999 2.617 0.0154 Capital 0.1245 0.2045 0.609 0.5485 Wages 7.0762 1.4729 4.804 0.0001 -Referring to SCENARIO 13-5, what is the p-value for Wages?

(Multiple Choice)
4.9/5
(36)

SCENARIO 13-18 A logistic regression model was estimated in order to predict the probability that a randomly chosen university or college would be a private university using information on mean total Scholastic Aptitude Test score (SAT) at the university or college and whether the TOEFL criterion is at least 90 (Toefl90 = 1 if yes, 0 otherwise.) The dependent variable, Y, is school type (Type = 1 if private and 0 otherwise).There are 80 universities in the sample. The PHStat output is given below: Binary Logistic Regression Predictor Coefficients SE Coef Z p -Value Intercept -3.9594 1.6741 -2.3650 0.0180 SAT 0.0028 0.0011 2.5459 0.0109 Toefl90:1 0.1928 0.5827 0.3309 0.7407 Deviance 101.9826 -Referring to SCENARIO 13-18, what is the p-value of the test statistic when testing whetherToefl90 makes a significant contribution to the model in the presence of SAT?

(Essay)
4.9/5
(37)

SCENARIO 13-6 One of the most common questions of prospective house buyers pertains to the cost of heating in dollars (Y).To provide its customers with information on that matter, a large real estate firm used the following 2 variables to predict heating costs: the daily minimum outside temperature in degrees of Fahrenheit ( X1 ) and the amount of insulation in inches ( X 2 ).Given below is EXCEL output of the regression model. Regression Statistics Multiple R 0.5270 R Square 0.2778 Adjusted R Square 0.1928 Standard Error 40.9107 Observations 20 ANOVA df SS MS F Signif F Regression 2 10943.0190 5471.5095 3.2691 0.0629 Residual 17 28452.6027 1673.6825 Total 19 39395.6218 13-22 Multiple Regression Coefficients Standard Error t Stat P-volue Lower 95\% Upper 95\% Intercept 448.2925 90.7853 4.9379 0.0001 256.7522 639.8328 Temperature -2.7621 1.2371 -2.2327 0.0393 -5.3721 -0.1520 Insulation -15.9408 10.0638 -1.5840 0.1316 -37.1736 5.2919 Also SSR \mid =8343.3572 and SSR \mid =4199.2672 -Referring to SCENARIO 13-6, the value of the partial F test statistic is forH0: Variable X1 does not significantly improve the model after variable X2 has been includedH1: Variable X1 significantly improves the model after variable X2 has been included

(Essay)
5.0/5
(29)

SCENARIO 13-17 Given below are results from the regression analysis where the dependent variable is the number of weeks a worker is unemployed due to a layoff (Unemploy) and the independent variables are the age of the worker (Age) and a dummy variable for management position (Manager: 1 = yes, 0 = no). The results of the regression analysis are given below: Regression Statistics Multiple R 0.6391 R Square 0.4085 Adjusted R Square 0.3765 Standard Error 18.8929 Observations 40 ANOVA df SS MS F Significance F Regression 2 9119.0897 4559.5448 12.7740 0.0000 Residual 37 13206.8103 356.9408 Total 39 22325.9 Coefficients Standard Error t Stat P -value Intercept -0.2143 11.5796 -0.0185 0.9853 Age 1.4448 0.3160 4.5717 0.0000 Manager -22.5761 11.3488 -1.9893 0.0541 -Referring to SCENARIO 13-17, which of the following is the correct null hypothesis to determine whether there is a significant relationship between the number of weeks a worker is unemployed due to a layoff and the entire set of explanatory variables? a) H0:β0=β1=β2=0H _ { 0 } : \beta _ { 0 } = \beta _ { 1 } = \beta _ { 2 } = 0 b) H0:β1=β2=0H _ { 0 } : \beta _ { 1 } = \beta _ { 2 } = 0 c) H0:β0=β1=β2H _ { 0 } : \beta _ { 0 } = \beta _ { 1 } = \beta _ { 2 } d) H0:β1=β2H _ { 0 } : \beta _ { 1 } = \beta _ { 2 }

(Short Answer)
4.8/5
(33)

SCENARIO 13-18 A logistic regression model was estimated in order to predict the probability that a randomly chosen university or college would be a private university using information on mean total Scholastic Aptitude Test score (SAT) at the university or college and whether the TOEFL criterion is at least 90 (Toefl90 = 1 if yes, 0 otherwise.) The dependent variable, Y, is school type (Type = 1 if private and 0 otherwise).There are 80 universities in the sample. The PHStat output is given below: Binary Logistic Regression Predictor Coefficients SE Coef Z p -Value Intercept -3.9594 1.6741 -2.3650 0.0180 SAT 0.0028 0.0011 2.5459 0.0109 Toefl90:1 0.1928 0.5827 0.3309 0.7407 Deviance 101.9826 -Referring to SCENARIO 13-18, which of the following is the correct expression for the estimated model?

(Multiple Choice)
4.8/5
(35)

SCENARIO 13-10 You worked as an intern at We Always Win Car Insurance Company last summer.You notice that individual car insurance premiums depend very much on the age of the individual and the number of traffic tickets received by the individual.You performed a regression analysis in EXCEL and obtained the following partial information: Regression Statistics Multiple R 0.8546 R Square 0.7303 Adjusted R Square 0.6853 Standard Error 226.7502 Observations 15 ANOVA df SS MS F Siqnificonce F Regression 2 835284.6500 16.2457 0.0004 Residual 12 616987.8200 Total 2287557.1200 Coefficients Standard Error t Stat P-value Lower 99\% Upper 99\% Intercept 821.2617 161.9391 5.0714 0.0003 326.6124 1315.9111 Age -1.4061 2.5988 -0.5411 0.5984 -9.3444 6.5321 Tickets 243.4401 43.2470 5.6291 0.0001 111.3406 375.5396 -Referring to SCENARIO 13-10, the total degrees of freedom that are missing in the ANOVAtable should be .

(Essay)
4.7/5
(36)

SCENARIO 13-8 A financial analyst wanted to examine the relationship between salary (in $1,000) and 2 variables: age (X1 = Age) and experience in the field (X2 = Exper).He took a sample of 20 employees and obtained the following Microsoft Excel output: Regression Statistics Multiple R 0.8535 R Square 0.7284 Adjusted R Square 0.6964 Standard Error 10.5630 Observations 20 ANOYA df SS MS F Siqnificonce F Regression 2 5086.5764 2543.2882 22.7941 0.0000 Residual 17 1896.8050 111.5768 Total 19 6983.3814 Coefficients Standard Error t Stat P-value Lower 95\% Upper 95\% Intercept 1.5740 9.2723 0.1698 0.8672 -17.9888 21.1368 Age 1.3045 0.1956 6.6678 0.0000 0.8917 1.7173 Exper -0.1478 0.1944 -0.7604 0.4574 -0.5580 0.2624 -Referring to SCENARIO 13-8, _% of the variation in salary can be explained by the variation in experience while holding age constant.

(Essay)
4.8/5
(29)

SCENARIO 13-6 One of the most common questions of prospective house buyers pertains to the cost of heating in dollars (Y).To provide its customers with information on that matter, a large real estate firm used the following 2 variables to predict heating costs: the daily minimum outside temperature in degrees of Fahrenheit ( X1 ) and the amount of insulation in inches ( X 2 ).Given below is EXCEL output of the regression model. Regression Statistics Multiple R 0.5270 R Square 0.2778 Adjusted R Square 0.1928 Standard Error 40.9107 Observations 20 ANOVA df SS MS F Signif F Regression 2 10943.0190 5471.5095 3.2691 0.0629 Residual 17 28452.6027 1673.6825 Total 19 39395.6218 13-22 Multiple Regression Coefficients Standard Error t Stat P-volue Lower 95\% Upper 95\% Intercept 448.2925 90.7853 4.9379 0.0001 256.7522 639.8328 Temperature -2.7621 1.2371 -2.2327 0.0393 -5.3721 -0.1520 Insulation -15.9408 10.0638 -1.5840 0.1316 -37.1736 5.2919 Also SSR \mid =8343.3572 and SSR \mid =4199.2672 -Referring to SCENARIO 13-6, _% of the variation in heating cost can be explained by the variation in minimum outside temperature while holding the amount of insulation constant.

(Essay)
4.8/5
(39)

A regression had the following results: SST = 102.55, SSE = 82.04.It can be said that 20.0% of the variation in the dependent variable is explained by the independent variables in the regression.

(True/False)
4.8/5
(38)

SCENARIO 13-10 You worked as an intern at We Always Win Car Insurance Company last summer.You notice that individual car insurance premiums depend very much on the age of the individual and the number of traffic tickets received by the individual.You performed a regression analysis in EXCEL and obtained the following partial information: Regression Statistics Multiple R 0.8546 R Square 0.7303 Adjusted R Square 0.6853 Standard Error 226.7502 Observations 15 ANOVA df SS MS F Siqnificonce F Regression 2 835284.6500 16.2457 0.0004 Residual 12 616987.8200 Total 2287557.1200 Coefficients Standard Error t Stat P-value Lower 99\% Upper 99\% Intercept 821.2617 161.9391 5.0714 0.0003 326.6124 1315.9111 Age -1.4061 2.5988 -0.5411 0.5984 -9.3444 6.5321 Tickets 243.4401 43.2470 5.6291 0.0001 111.3406 375.5396 -Referring to SCENARIO 13-10, the regression sum of squares that is missing in the ANOVAtable should be .

(Essay)
4.9/5
(40)
Showing 261 - 280 of 307
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)