Exam 19: Multiple Regression

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

A statistician wanted to determine whether the demographic variables of age, education and income influence the number of hours of television watched per week. A random sample of 25 adults was selected to estimate the multiple regression model y=β0+β1x1+β2x2+β3x3+εy = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 3 } + \varepsilon . Where: y = number of hours of television watched last week. x1x _ { 1 } = age. x2x _ { 2 } = number of years of education. X3X _ { 3 } = income (in $1000s). The computer output is shown below. THE REGRESSION EQUATION IS y=y = 22.3+0.41x10.29x20.12x322.3 + 0.41 x _ { 1 } - 0.29 x _ { 2 } - 0.12 x _ { 3 } Predictor Coef StDev Constant 22.3 10.7 2.084 0.41 0.19 2.158 -0.29 0.13 -2.231 -0.12 0.03 -4.00 S = 4.51 R-Sq = 34.8%. ANALYSIS OF VARIANCE Source of Variation df SS MS F Regression 3 227 75.667 3.730 Error 21 426 20.286 Total 24 653 What is the coefficient of determination? What does this statistic tell you?

(Essay)
4.7/5
(30)

In a multiple regression problem involving 24 observations and three independent variables, the estimated regression equation is  In a multiple regression problem involving 24 observations and three independent variables, the estimated regression equation is    = 72 + 3.2 x _ { 1 } + 1.5 x _ { 2 } - x _ { 3 }   . For this model, SST = 800 and SSE = 245. The value of the F-statistic for testing the significance of this model is 15.102. =72+3.2x1+1.5x2x3 = 72 + 3.2 x _ { 1 } + 1.5 x _ { 2 } - x _ { 3 } . For this model, SST = 800 and SSE = 245. The value of the F-statistic for testing the significance of this model is 15.102.

(True/False)
4.8/5
(31)

Test the hypotheses: H0:H _ { 0 } : There is no first-order autocorrelation H1:H _ { 1 } : There is positive first-order autocorrelation, given that: the Durbin-Watson statistic d = 0.686, n = 16, k = 1 and α=\alpha = 0.05.

(Essay)
4.7/5
(38)

Which of the following is not true when we add an independent variable to a multiple regression model?

(Multiple Choice)
5.0/5
(35)

In a multiple regression analysis, there are 20 data points and 4 independent variables, and the sum of the squared differences between observed and predicted values of y is 180. The multiple standard error of estimate will be:

(Multiple Choice)
4.9/5
(27)

Excel and Minitab both provide the p-value for testing each coefficient in the multiple regression model. In the case of b2b _ { 2 } , this represents the probability that:

(Multiple Choice)
4.8/5
(36)

Which of the following best describes the ratio MSR/MSE in a multiple linear regression model?

(Multiple Choice)
4.9/5
(35)

A multiple regression the coefficient of determination is 0.81. The percentage of the variation in yy that is explained by the regression equation is 81%.

(True/False)
4.9/5
(37)

For a multiple regression model, the following statistics are given: Total variation in y = SSY = 250, SSE = 50, k = 4, n = 20. The coefficient of determination adjusted for degrees of freedom is:

(Multiple Choice)
4.9/5
(38)

In a multiple regression analysis, if the model provides a poor fit, this indicates that:

(Multiple Choice)
4.8/5
(39)

In multiple regression, the descriptor 'multiple' refers to more than one independent variable.

(True/False)
4.8/5
(42)

Which of the following best explains a small F-statistic when testing the validity of a multiple regression model?

(Multiple Choice)
4.7/5
(42)

The most commonly used method to remedy non-normality or heteroscedasticity in regression analysis is to transform the dependent variable. The most commonly used transformations are y=logy( provided y0)y ^ { \prime } = \log y ( \text { provided } y \geq 0 ) , y=y2y ^ { \prime } = y ^ { 2 } , y=y( provided y0)y ^ { \prime } = \sqrt { y } ( \text { provided } y \geq 0 ) , and y=1/yy ^ { \prime } = 1 / y .

(True/False)
4.8/5
(33)

A statistician wanted to determine whether the demographic variables of age, education and income influence the number of hours of television watched per week. A random sample of 25 adults was selected to estimate the multiple regression model y=β0+β1x1+β2x2+β3x3+εy = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 3 } + \varepsilon . Where: y = number of hours of television watched last week. x1x _ { 1 } = age. x2x _ { 2 } = number of years of education. X3X _ { 3 } = income (in $1000s). The computer output is shown below. THE REGRESSION EQUATION IS y=y = 22.3+0.41x10.29x20.12x322.3 + 0.41 x _ { 1 } - 0.29 x _ { 2 } - 0.12 x _ { 3 } Predictor Coef StDev Constant 22.3 10.7 2.084 0.41 0.19 2.158 -0.29 0.13 -2.231 -0.12 0.03 -4.00 S = 4.51 R-Sq = 34.8%. ANALYSIS OF VARIANCE Source of Variation df SS MS F Regression 3 227 75.667 3.730 Error 21 426 20.286 Total 24 653 Test the overall validity of the model at the 5% significance level.

(Essay)
4.8/5
(33)

A statistics professor investigated some of the factors that affect an individual student's final grade in his or her course. He proposed the multiple regression model: y=β0+β1x1+β2x2+β3x3+εy = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 3 } + \varepsilon . Where: y = final mark (out of 100). x1x _ { 1 } = number of lectures skipped. x2x _ { 2 } = number of late assignments. X3X _ { 3 } = mid-term test mark (out of 100). The professor recorded the data for 50 randomly selected students. The computer output is shown below. THE REGRESSION EQUATION IS  A statistics professor investigated some of the factors that affect an individual student's final grade in his or her course. He proposed the multiple regression model:  y = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 3 } + \varepsilon  . Where: y = final mark (out of 100).  x _ { 1 }  = number of lectures skipped.  x _ { 2 }  = number of late assignments.  X _ { 3 }  = mid-term test mark (out of 100). The professor recorded the data for 50 randomly selected students. The computer output is shown below. THE REGRESSION EQUATION IS    =  41.6 - 3.18 x _ { 1 } - 1.17 x _ { 2 } + .63 x _ { 3 }   \begin{array} { | c | c c c | }  \hline \text { Predictor } & \text { Coef } & \text { StDev } & \text { T } \\ \hline \text { Constant } & 41.6 & 17.8 & 2.337 \\ x _ { 1 } & - 3.18 & 1.66 & - 1.916 \\ x _ { 2 } & - 1.17 & 1.13 & - 1.035 \\ x _ { 3 } & 0.63 & 0.13 & 4.846 \\ \hline \end{array}  S = 13.74 R-Sq = 30.0%.  \begin{array}{l} \text { ANALYSIS OF VARIANCE }\\ \begin{array} { | l | c c c c | }  \hline \text { Source of Variation } & \text { df } & \text { SS } & \text { MS } & \text { F } \\ \hline \text { Regression } & 3 & 3716 & 1238.667 & 6.558 \\ \text { Error } & 46 & 8688 & 188.870 & \\ \hline \text { Total } & 49 & 12404 & & \\ \hline \end{array} \end{array}  Do these data provide enough evidence at the 5% significance level to conclude that the final mark and the number of late assignments are negatively linearly related? = 41.63.18x11.17x2+.63x341.6 - 3.18 x _ { 1 } - 1.17 x _ { 2 } + .63 x _ { 3 } Predictor Coef StDev T Constant 41.6 17.8 2.337 -3.18 1.66 -1.916 -1.17 1.13 -1.035 0.63 0.13 4.846 S = 13.74 R-Sq = 30.0%. ANALYSIS OF VARIANCE Source of Variation df SS MS F Regression 3 3716 1238.667 6.558 Error 46 8688 188.870 Total 49 12404 Do these data provide enough evidence at the 5% significance level to conclude that the final mark and the number of late assignments are negatively linearly related?

(Essay)
4.8/5
(41)

In a multiple regression analysis involving 25 data points and 5 independent variables, the sum of squares terms are calculated as: total variation in y = SSY = 500, SSR = 300, and SSE = 200. In testing the validity of the regression model, the F-value of the test statistic will be:

(Multiple Choice)
4.9/5
(36)

Which of the following best describes first-order autocorrelation?

(Multiple Choice)
4.9/5
(22)

In multiple regression models, the values of the error variable ε\varepsilon are assumed to be:

(Multiple Choice)
4.8/5
(28)

Given the following statistics of a multiple regression model, can we conclude at the 5% significance level that x1x _ { 1 } and y are linearly related? n = 42 k = 6 b1=b _ { 1 } = -5.30 sb1=s _ { b _ { 1 } } = 1.5

(Essay)
4.8/5
(30)

In a multiple regression analysis involving 40 observations and 5 independent variables, total variation in y = SSY = 350 and SSE = 50. The multiple coefficient of determination is:

(Multiple Choice)
4.7/5
(33)
Showing 21 - 40 of 121
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)