Exam 15: Multiple Regression Model Building

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below. Regression Statistics Multiple R 0.5487 R Square 0.3011 Adjusted R Square 0.2538 Standard Error 6442.4456 Observations 80 ANOVA SS MS F Significance Regression 5 1322911703.0671 264582340.6134 6.3747 0.0001 Residual 74 3071377751.1204 41505104.7449 Total 79 4394289454.1875 Coefficients Standard Error t Stat p-value Intercept -3862.4808 6180.9452 -0.6249 0.5340 Temp 51.7031 62.9439 0.8214 0.4140 Win\% 21.1085 16.2338 1.3003 0.1975 OpWin\% 11.3453 6.4617 1.7558 0.0833 Weekend 367.5377 2786.2639 0.1319 0.8954 Promotion 6927.8820 2784.3442 2.4882 0.0151  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.    \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                  The coefficient of multiple determination ( R <sup>2</sup><sub> j </sub>) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308  -Referring to Table 15-9, what is the p-value of the test statistic to determine whether TEMP makes a significant contribution to the regression model in the presence of the other independent variables at a 5% level of significance?  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.    \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                  The coefficient of multiple determination ( R <sup>2</sup><sub> j </sub>) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308  -Referring to Table 15-9, what is the p-value of the test statistic to determine whether TEMP makes a significant contribution to the regression model in the presence of the other independent variables at a 5% level of significance?  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.    \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                  The coefficient of multiple determination ( R <sup>2</sup><sub> j </sub>) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308  -Referring to Table 15-9, what is the p-value of the test statistic to determine whether TEMP makes a significant contribution to the regression model in the presence of the other independent variables at a 5% level of significance?  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.    \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                  The coefficient of multiple determination ( R <sup>2</sup><sub> j </sub>) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308  -Referring to Table 15-9, what is the p-value of the test statistic to determine whether TEMP makes a significant contribution to the regression model in the presence of the other independent variables at a 5% level of significance? The coefficient of multiple determination ( R 2 j ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308 -Referring to Table 15-9, what is the p-value of the test statistic to determine whether TEMP makes a significant contribution to the regression model in the presence of the other independent variables at a 5% level of significance?

Free
(Short Answer)
4.9/5
(32)
Correct Answer:
Verified

0.0151

TABLE 15- 8 The superintendent of a school district wanted to predict the percentage of students passing a sixth- grade proficiency test. She obtained the data on percentage of students passing the proficiency test (% Passing), daily average of the percentage of students attending class (% Attendance), average teacher salary in dollars (Salaries), and instructional spending per pupil in dollars (Spending) of 47 schools in the state. Let Y = % Passing as the dependent variable, X1 = % Attendance, X2 = Salaries and X3 = Spending. The coefficient of multiple determination (R 2 j) of each of the 3 predictors with all the other remaining predictors are, respectively, 0.0338, 0.4669, and 0.4743. The output from the best- subset regressions is given below: Adjusted Following is the residual plot for % Attendance: Adjusted Model Variables R Square R Square Std. Error 1 X1 3.05 2 0.6024 0.5936 10.5787 2 X1X2 3.66 3 0.6145 0.5970 10.5350 3 X1X2X3 4.00 4 0.6288 0.6029 10.4570 4 X1X3 2.00 3 0.6288 0.6119 10.3375 5 X2 67.35 2 0.0474 0.0262 16.3755 6 X2X3 64.30 3 0.0910 0.0497 16.1768 7 X3 62.33 2 0.0907 0.0705 15.9984 Following is the residual plot for % Attendance:  TABLE 15- 8 The superintendent of a school district wanted to predict the percentage of students passing a sixth- grade proficiency test. She obtained the data on percentage of students passing the proficiency test (% Passing), daily average of the percentage of students attending class (% Attendance), average teacher salary in dollars (Salaries), and instructional spending per pupil in dollars (Spending) of 47 schools in the state. Let Y = % Passing as the dependent variable, X<sub>1 </sub>= % Attendance, X<sub>2 </sub>= Salaries and X<sub>3 </sub>= Spending. The coefficient of multiple determination (R <sup>2 </sup><sub>j</sub>) of each of the 3 predictors with all the other remaining predictors are,  respectively, 0.0338, 0.4669, and 0.4743. The output from the best- subset regressions is given below: Adjusted Following is the residual plot for % Attendance:   \begin{array}{llcclcc}  & & & &&  \text {Adjusted} \\ \text {Model }&\text { Variables} &  \mathrm{Cp}  & \mathrm{k} &\text {R Square} & \text {R Square} & \text {Std. Error }\\ \hline 1 & X1 & 3.05 & 2 & 0.6024 & 0.5936 & 10.5787 \\ 2 & X1X2 & 3.66 & 3 & 0.6145 & 0.5970 & 10.5350 \\ 3 & X1X2X3 & 4.00 & 4 & 0.6288 & 0.6029 & 10.4570 \\ 4 & X1X3 & 2.00 & 3 & 0.6288 & 0.6119 & 10.3375 \\ 5 & X2 & 67.35 & 2 & 0.0474 & 0.0262 & 16.3755 \\ 6 & X2X3 & 64.30 & 3 & 0.0910 & 0.0497 & 16.1768 \\ 7 & X3 & 62.33 & 2 & 0.0907 & 0.0705 & 15.9984 \\ \hline \end{array}   Following is the residual plot for % Attendance:     Following is the output of several multiple regression models:   \text {Model (I):}   \begin{array}{lcrclcr} \hline &  \text {Coefficients }&  \text {Std Error} &  \text {Stat } &  \text {p-value} & \text { Lower 95\% }& \text { Upper 95\%} \\ \hline \text { Intercept} & -753.4225 & 101.1149 & -7.4511 & 2.88 \mathrm{E}-09  & -957.3401 & -549.5050 \\ \%  \text {Attend }& 8.5014 & 1.0771 & 7.8929 &6.73 \mathrm{E}-10 & 6.3292 & 10.6735 \\  \text {Salary }& 6.85 \mathrm{E}-07  & 0.0006 & 0.0011 & 0.9991 & -0.0013 & 0.0013 \\  \text {Spending} & 0.0060 & 0.0046 & 1.2879 & 0.2047 & -0.0034 & 0.0153 \\ \hline \end{array}     \text {Model (II):}   \begin{array}{lcccc} \hline &  \text {Coefficients} & \text {Standard Error }& \text { t  Stat} &  \text { p -value } \\ \hline  \text {Intercept }& -753.4086 & 99.1451 & -7.5991 &  1.5291 \mathrm{E}-09 \\ \%  \text {Attendance} & 8.5014 & 1.0645 & 7.9862 & 4.223 \mathrm{E}-10  \\  \text {Spending} & 0.0060 & 0.0034 & 1.7676 & 0.0840 \\ \hline \end{array}     \text {Model (III):}   \begin{array}{lrrrrl} \hline & \text {  d f } & \text { SS } &  \text {  MS } & \text { F } &  \text { Significance F } \\ \hline  \text { Regression} & 2 & 8162.9429 & 4081.4714 & 39.8708 &1.3201 \mathrm{E}-10 \\  \text { Residual} & 44 & 4504.1635 & 102.3674 & & \\  \text { Total} & 46 & 12667.1064 & & & \\ \hline \end{array}     \begin{array}{lrcrr} \hline &  \text {Coefficients }&  \text {Standard Error} & \text { t Stat }&  \text {p -value} \\ \hline  \text {Intercept }& 6672.8367 & 3267.7349 & 2.0420 & 0.0472 \\ \% \text { Attendance} & -150.5694 & 69.9519 & -2.1525 & 0.0369 \\ \%  \text {Attendance Squared}& 0.8532 & 0.3743 & 2.2792 & 0.0276 \\ \hline \end{array}      -Referring to Table 15-8, the null hypothesis should be rejected when testing whether the quadratic effect of daily average of the percentage of students attending class on percentage of students passing the proficiency test is significant at a 5% level of significance. Following is the output of several multiple regression models: Model (I):\text {Model (I):} Coefficients Std Error Stat p-value Lower 95\% Upper 95\% Intercept -753.4225 101.1149 -7.4511 2.88-09 -957.3401 -549.5050 \% Attend 8.5014 1.0771 7.8929 6.73-10 6.3292 10.6735 Salary 6.85-07 0.0006 0.0011 0.9991 -0.0013 0.0013 Spending 0.0060 0.0046 1.2879 0.2047 -0.0034 0.0153 Model (II):\text {Model (II):} Coefficients Standard Error t Stat p -value Intercept -753.4086 99.1451 -7.5991 1.5291-09 \% Attendance 8.5014 1.0645 7.9862 4.223-10 Spending 0.0060 0.0034 1.7676 0.0840 Model (III):\text {Model (III):} d f SS MS F Significance F Regression 2 8162.9429 4081.4714 39.8708 1.3201-10 Residual 44 4504.1635 102.3674 Total 46 12667.1064 Coefficients Standard Error t Stat p -value Intercept 6672.8367 3267.7349 2.0420 0.0472 \% Attendance -150.5694 69.9519 -2.1525 0.0369 \% Attendance Squared 0.8532 0.3743 2.2792 0.0276 -Referring to Table 15-8, the null hypothesis should be rejected when testing whether the quadratic effect of daily average of the percentage of students attending class on percentage of students passing the proficiency test is significant at a 5% level of significance.

Free
(True/False)
4.9/5
(28)
Correct Answer:
Verified

True

TABLE 15-7 A chemist employed by a pharmaceutical firm has developed a muscle relaxant. She took a sample of 14 people suffering from extreme muscle constriction. She gave each a vial containing a dose (X) of the drug and recorded the time to relief (Y) measured in seconds for each. She fit a "centered" curvilinear model to this data. The results obtained by Microsoft Excel follow, where the dose (X) given has been "centered." SUMMARY OUTPUT Regression Statistics Multiple R 0.747 RSquare 0.558 Adjusted R Square 0.478 Standard Error 863.1 Observations 14 ANOVA df SS MS F Significance F Regression 2 10344797 5172399 6.94 0.0110 Residual 11 8193929 744903 Total 13 18538726 Coeff Std Error t Stut p -value Intercept 1283.0 352.0 3.65 0.0040 CenDose 25.228 8.631 2.92 0.0140 CenDoseSq 0.8604 0.3722 2.31 0.0410 -Referring to Table 15-7, suppose the chemist decides to use a t test to determine if there is a significant difference between a curvilinear model without a linear term and a curvilinear model that includes a linear term. Using a level of significance of 0.05, she would decide that the curvilinear model should include a linear term.

Free
(True/False)
4.7/5
(29)
Correct Answer:
Verified

True

TABLE 15-3 A certain type of rare gem serves as a status symbol for many of its owners. In theory, for low prices, the demand increases and it decreases as the price of the gem increases. However, experts hypothesize that when the gem is valued at very high prices, the demand increases with price due to the status owners believe they gain in obtaining the gem. Thus, the model proposed to best explain the demand for the gem by its price is the quadratic model: Y=β0+β1X+β2X2+ε Y=\beta_{0}+\beta_{1} X+\beta_{2} X^{2}+\varepsilon where Y = demand (in thousands) and X = retail price per carat. This model was fit to data collected for a sample of 12 rare gems of this type. A portion of the computer analysis obtained from Microsoft Excel is shown below: SUMMARY OUTPUT Regression Statistics Multiple R 0.994 R Square 0.988 Standard Error 12.42 Observations 12 ANOVA\text {ANOVA} df SS MS F Significance Regression 2 115145 57573 373 0.0001 Residual 9 1388 154 Total 11 116533 Coeff Std Error t Stat p -value Intercept 286.42 9.66 29.64 0.0001 Price -0.31 0.06 -5.14 0.0006 Price Sq 0.000067 0.00007 0.95 0.3647 -Referring to Table 15-3, does there appear to be significant upward curvature in the response curve relating the demand (Y) and the price (X) at 10% level of significance?

(Multiple Choice)
4.9/5
(37)

Collinearity is present when there is a high degree of correlation between independent variables.

(True/False)
4.9/5
(35)

TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below. Regression Statistics Multiple R 0.5487 R Square 0.3011 Adjusted R Square 0.2538 Standard Error 6442.4456 Observations 80 ANOVA SS MS F Significance Regression 5 1322911703.0671 264582340.6134 6.3747 0.0001 Residual 74 3071377751.1204 41505104.7449 Total 79 4394289454.1875 Coefficients Standard Error t Stat p-value Intercept -3862.4808 6180.9452 -0.6249 0.5340 Temp 51.7031 62.9439 0.8214 0.4140 Win\% 21.1085 16.2338 1.3003 0.1975 OpWin\% 11.3453 6.4617 1.7558 0.0833 Weekend 367.5377 2786.2639 0.1319 0.8954 Promotion 6927.8820 2784.3442 2.4882 0.0151  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.   \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                 The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308.   -Referring to Table 15-9,_____ of the variation in ATTENDANCE can be explained by the five independent variables after taking into consideration the number of independent variables and the number of observations.  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.   \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                 The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308.   -Referring to Table 15-9,_____ of the variation in ATTENDANCE can be explained by the five independent variables after taking into consideration the number of independent variables and the number of observations.  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.   \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                 The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308.   -Referring to Table 15-9,_____ of the variation in ATTENDANCE can be explained by the five independent variables after taking into consideration the number of independent variables and the number of observations.  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.   \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                 The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308.   -Referring to Table 15-9,_____ of the variation in ATTENDANCE can be explained by the five independent variables after taking into consideration the number of independent variables and the number of observations. The coefficient of multiple determination ( R 2 j ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308. -Referring to Table 15-9,_____ of the variation in ATTENDANCE can be explained by the five independent variables after taking into consideration the number of independent variables and the number of observations.

(Short Answer)
4.9/5
(36)

TABLE 15-7 A chemist employed by a pharmaceutical firm has developed a muscle relaxant. She took a sample of 14 people suffering from extreme muscle constriction. She gave each a vial containing a dose (X) of the drug and recorded the time to relief (Y) measured in seconds for each. She fit a "centered" curvilinear model to this data. The results obtained by Microsoft Excel follow, where the dose (X) given has been "centered." SUMMARY OUTPUT Regression Statistics Multiple R 0.747 RSquare 0.558 Adjusted R Square 0.478 Standard Error 863.1 Observations 14 ANOVA df SS MS F Significance F Regression 2 10344797 5172399 6.94 0.0110 Residual 11 8193929 744903 Total 13 18538726 Coeff Std Error t Stut p -value Intercept 1283.0 352.0 3.65 0.0040 CenDose 25.228 8.631 2.92 0.0140 CenDoseSq 0.8604 0.3722 2.31 0.0410 -Referring to Table 15-7, suppose the chemist decides to use an F test to determine if there is a significant curvilinear relationship between time and dose. The value of the test statistic is______ .

(Short Answer)
4.7/5
(33)

TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below. Regression Statistics Multiple R 0.5487 R Square 0.3011 Adjusted R Square 0.2538 Standard Error 6442.4456 Observations 80 ANOVA SS MS F Significance Regression 5 1322911703.0671 264582340.6134 6.3747 0.0001 Residual 74 3071377751.1204 41505104.7449 Total 79 4394289454.1875 Coefficients Standard Error t Stat p-value Intercept -3862.4808 6180.9452 -0.6249 0.5340 Temp 51.7031 62.9439 0.8214 0.4140 Win\% 21.1085 16.2338 1.3003 0.1975 OpWin\% 11.3453 6.4617 1.7558 0.0833 Weekend 367.5377 2786.2639 0.1319 0.8954 Promotion 6927.8820 2784.3442 2.4882 0.0151  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held  The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.  \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                   The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308  -Referring to Table 15-9, which of the following assumptions is most likely violated based on the normal probability plot?  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held  The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.  \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                   The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308  -Referring to Table 15-9, which of the following assumptions is most likely violated based on the normal probability plot?  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held  The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.  \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                   The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308  -Referring to Table 15-9, which of the following assumptions is most likely violated based on the normal probability plot?  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held  The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.  \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                   The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308  -Referring to Table 15-9, which of the following assumptions is most likely violated based on the normal probability plot? The coefficient of multiple determination ( R 2 j ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308 -Referring to Table 15-9, which of the following assumptions is most likely violated based on the normal probability plot?

(Multiple Choice)
4.8/5
(37)

A regression diagnostic tool used to study the possible effects of collinearity is ______.

(Short Answer)
4.8/5
(36)

Collinearity will result in excessively low standard errors of the parameter estimates reported in the regression output.

(True/False)
5.0/5
(37)

TABLE 15-4 In Hawaii, condemnation proceedings are under way to enable private citizens to own the property that their homes are built on. Until recently, only estates were permitted to own land, and homeowners leased the land from the estate. In order to comply with the new law, a large Hawaiian estate wants to use regression analysis to estimate the fair market value of the land. The following model was fit to data collected for n = 20 properties, 10 of which are located near a cove. where Y = Sale price of property in thousands of dollars X1 = Size of property in thousands of square feet X2 = 1 if property located near cove, 0 if not Model 1: Y=β0+β1X1+β2X2+β3X1X2+β4X12+β5X12X2+ε Y=\beta_{0}+\beta_{1} X_{1}+\beta_{2} X_{2}+\beta_{3} X_{1} X_{2}+\beta_{4} X_{1}^{2}+\beta_{5} X_{1}^{2} X_{2}+\varepsilon Using the data collected for the 20 properties, the following partial output obtained from Microsoft Excel is shown: SUMMARY OUTPUT Regression Statistics Multiple R 0.985 R Square 0.970 Standard Error 9.5 Observations 20 ANOVA df SS MS F Significance F Regression 5 28324 5664 622 0.0001 Residual 14 1279 91 Total 19 29063 Coeff STd Error t Stut p -value Intercept -32.1 35.7 -0.90 0.3834 Size 122 5.9 2.05 0.0594 Cove -104.3 53.5 -1.95 0.0715 Size Cove 17.0 8.5 1.99 0.0661 SizeSq -0.3 0.2 -1.28 0.2204 SizeSq Cove -0.3 0.3 -1.13 0.2749 -Referring to Table 15-4, given a quadratic relationship between sale price (Y) and property size (X1), what null hypothesis would you test to determine whether the curves differ from cove and non-cove properties?

(Multiple Choice)
4.8/5
(31)

Using the Studentized residuals ti to determine influential points in a multiple regression model with k independent variable and n observations and letting tn-k-2 denote the upper critical value of a two-tail t test with a 0.10 level of significance, Xi is an influential point if

(Multiple Choice)
4.9/5
(31)

TABLE 15- 8 The superintendent of a school district wanted to predict the percentage of students passing a sixth- grade proficiency test. She obtained the data on percentage of students passing the proficiency test (% Passing), daily average of the percentage of students attending class (% Attendance), average teacher salary in dollars (Salaries), and instructional spending per pupil in dollars (Spending) of 47 schools in the state. Let Y = % Passing as the dependent variable, X1 = % Attendance, X2 = Salaries and X3 = Spending. The coefficient of multiple determination (R 2 j) of each of the 3 predictors with all the other remaining predictors are, respectively, 0.0338, 0.4669, and 0.4743. The output from the best- subset regressions is given below: Adjusted Model Variables R Square R Square Std. Error 1 X1 3.05 2 0.6024 0.5936 10.5787 2 X1X2 3.66 3 0.6145 0.5970 10.5350 3 X1X2X3 4.00 4 0.6288 0.6029 10.4570 4 X1X3 2.00 3 0.6288 0.6119 10.3375 5 X2 67.35 2 0.0474 0.0262 16.3755 6 X2X3 64.30 3 0.0910 0.0497 16.1768 7 X3 62.33 2 0.0907 0.0705 15.9984 Following is the residual plot for % Attendance:  TABLE 15- 8 The superintendent of a school district wanted to predict the percentage of students passing a sixth- grade proficiency test. She obtained the data on percentage of students passing the proficiency test (% Passing), daily average of the percentage of students attending class (% Attendance), average teacher salary in dollars (Salaries), and instructional spending per pupil in dollars (Spending) of 47 schools in the state. Let Y = % Passing as the dependent variable, X<sub>1 </sub>= % Attendance, X<sub>2 </sub>= Salaries and X<sub>3 </sub>= Spending. The coefficient of multiple determination (R <sup>2 </sup><sub>j</sub>) of each of the 3 predictors with all the other remaining predictors are,  respectively, 0.0338, 0.4669, and 0.4743. The output from the best- subset regressions is given below:  \begin{array}{llcclcc}  & & & &&  \text {Adjusted} \\ \text {Model }&\text { Variables} &  \mathrm{Cp}  & \mathrm{k} &\text {R Square} & \text {R Square} & \text {Std. Error }\\ \hline 1 & X1 & 3.05 & 2 & 0.6024 & 0.5936 & 10.5787 \\ 2 & X1X2 & 3.66 & 3 & 0.6145 & 0.5970 & 10.5350 \\ 3 & X1X2X3 & 4.00 & 4 & 0.6288 & 0.6029 & 10.4570 \\ 4 & X1X3 & 2.00 & 3 & 0.6288 & 0.6119 & 10.3375 \\ 5 & X2 & 67.35 & 2 & 0.0474 & 0.0262 & 16.3755 \\ 6 & X2X3 & 64.30 & 3 & 0.0910 & 0.0497 & 16.1768 \\ 7 & X3 & 62.33 & 2 & 0.0907 & 0.0705 & 15.9984 \\ \hline \end{array}   Following is the residual plot for % Attendance:     Following is the output of several multiple regression models:   \text {Model (I):}   \begin{array}{lcrclcr} \hline &  \text {Coefficients }&  \text {Std Error} &  \text {Stat } &  \text {p-value} & \text { Lower 95\% }& \text { Upper 95\%} \\ \hline \text { Intercept} & -753.4225 & 101.1149 & -7.4511 & 2.88 \mathrm{E}-09  & -957.3401 & -549.5050 \\ \%  \text {Attend }& 8.5014 & 1.0771 & 7.8929 &6.73 \mathrm{E}-10 & 6.3292 & 10.6735 \\  \text {Salary }& 6.85 \mathrm{E}-07  & 0.0006 & 0.0011 & 0.9991 & -0.0013 & 0.0013 \\  \text {Spending} & 0.0060 & 0.0046 & 1.2879 & 0.2047 & -0.0034 & 0.0153 \\ \hline \end{array}     \text {Model (II):}   \begin{array}{lcccc} \hline &  \text {Coefficients} & \text {Standard Error }& \text { t  Stat} &  \text { p -value } \\ \hline  \text {Intercept }& -753.4086 & 99.1451 & -7.5991 &  1.5291 \mathrm{E}-09 \\ \%  \text {Attendance} & 8.5014 & 1.0645 & 7.9862 & 4.223 \mathrm{E}-10  \\  \text {Spending} & 0.0060 & 0.0034 & 1.7676 & 0.0840 \\ \hline \end{array}     \text {Model (III):}   \begin{array}{lrrrrl} \hline & \text {  d f } & \text { SS } &  \text {  MS } & \text { F } &  \text { Significance F } \\ \hline  \text { Regression} & 2 & 8162.9429 & 4081.4714 & 39.8708 &1.3201 \mathrm{E}-10 \\  \text { Residual} & 44 & 4504.1635 & 102.3674 & & \\  \text { Total} & 46 & 12667.1064 & & & \\ \hline \end{array}     \begin{array}{lrcrr} \hline &  \text {Coefficients }&  \text {Standard Error} & \text { t Stat }&  \text {p -value} \\ \hline  \text {Intercept }& 6672.8367 & 3267.7349 & 2.0420 & 0.0472 \\ \% \text { Attendance} & -150.5694 & 69.9519 & -2.1525 & 0.0369 \\ \%  \text {Attendance Squared}& 0.8532 & 0.3743 & 2.2792 & 0.0276 \\ \hline \end{array}      -Referring to Table 15-8, the quadratic effect of daily average of the percentage of students attending class on percentage of students passing the proficiency test is not significant at a 5% level of significance. Following is the output of several multiple regression models: Model (I):\text {Model (I):} Coefficients Std Error Stat p-value Lower 95\% Upper 95\% Intercept -753.4225 101.1149 -7.4511 2.88-09 -957.3401 -549.5050 \% Attend 8.5014 1.0771 7.8929 6.73-10 6.3292 10.6735 Salary 6.85-07 0.0006 0.0011 0.9991 -0.0013 0.0013 Spending 0.0060 0.0046 1.2879 0.2047 -0.0034 0.0153 Model (II):\text {Model (II):} Coefficients Standard Error t Stat p -value Intercept -753.4086 99.1451 -7.5991 1.5291-09 \% Attendance 8.5014 1.0645 7.9862 4.223-10 Spending 0.0060 0.0034 1.7676 0.0840 Model (III):\text {Model (III):} d f SS MS F Significance F Regression 2 8162.9429 4081.4714 39.8708 1.3201-10 Residual 44 4504.1635 102.3674 Total 46 12667.1064 Coefficients Standard Error t Stat p -value Intercept 6672.8367 3267.7349 2.0420 0.0472 \% Attendance -150.5694 69.9519 -2.1525 0.0369 \% Attendance Squared 0.8532 0.3743 2.2792 0.0276 -Referring to Table 15-8, the quadratic effect of daily average of the percentage of students attending class on percentage of students passing the proficiency test is not significant at a 5% level of significance.

(True/False)
4.8/5
(28)

TABLE 15-3 A certain type of rare gem serves as a status symbol for many of its owners. In theory, for low prices, the demand increases and it decreases as the price of the gem increases. However, experts hypothesize that when the gem is valued at very high prices, the demand increases with price due to the status owners believe they gain in obtaining the gem. Thus, the model proposed to best explain the demand for the gem by its price is the quadratic model: Y=β0+β1X+β2X2+ε Y=\beta_{0}+\beta_{1} X+\beta_{2} X^{2}+\varepsilon where Y = demand (in thousands) and X = retail price per carat. This model was fit to data collected for a sample of 12 rare gems of this type. A portion of the computer analysis obtained from Microsoft Excel is shown below: SUMMARY OUTPUT Regression Statistics Multiple R 0.994 R Square 0.988 Standard Error 12.42 Observations 12 ANOVAANOVA df SS MS F Signifcance Regression 2 115145 57573 373 0.0001 Residual 9 1388 154 Total 11 116533 Coeff Std Error t Stat p-value Intercept 286.42 9.66 29.64 0.0001 Price -0.31 0.06 -5.14 0.0006 Price Sq 0.000067 0.00007 0.95 0.3647 -Referring to Table 15-3, a more parsimonious simple linear model is likely to be statistically superior to the fitted curvilinear for predicting sale price (Y).

(True/False)
4.8/5
(33)

So that we can fit curves as well as lines by regression, we often use mathematical manipulations for converting one variable into a different form. These manipulations are called dummy variables.

(True/False)
4.8/5
(43)

TABLE 15- 8 The superintendent of a school district wanted to predict the percentage of students passing a sixth- grade proficiency test. She obtained the data on percentage of students passing the proficiency test (% Passing), daily average of the percentage of students attending class (% Attendance), average teacher salary in dollars (Salaries), and instructional spending per pupil in dollars (Spending) of 47 schools in the state. Let Y = % Passing as the dependent variable, X1 = % Attendance, X2 = Salaries and X3 = Spending. The coefficient of multiple determination (R 2 j) of each of the 3 predictors with all the other remaining predictors are, respectively, 0.0338, 0.4669, and 0.4743. The output from the best- subset regressions is given below: Adjusted Adjusted Model Variables R Square R Square Std. Error 1 X1 3.05 2 0.6024 0.5936 10.5787 2 X1X2 3.66 3 0.6145 0.5970 10.5350 3 X1X2X3 4.00 4 0.6288 0.6029 10.4570 4 X1X3 2.00 3 0.6288 0.6119 10.3375 5 X2 67.35 2 0.0474 0.0262 16.3755 6 X2X3 64.30 3 0.0910 0.0497 16.1768 7 X3 62.33 2 0.0907 0.0705 15.9984 Following is the residual plot for % Attendance:  TABLE 15- 8 The superintendent of a school district wanted to predict the percentage of students passing a sixth- grade proficiency test. She obtained the data on percentage of students passing the proficiency test (% Passing), daily average of the percentage of students attending class (% Attendance), average teacher salary in dollars (Salaries), and instructional spending per pupil in dollars (Spending) of 47 schools in the state. Let Y = % Passing as the dependent variable, X<sub>1 </sub>= % Attendance, X<sub>2 </sub>= Salaries and X<sub>3 </sub>= Spending. The coefficient of multiple determination (R <sup>2 </sup><sub>j</sub>) of each of the 3 predictors with all the other remaining predictors are,  respectively, 0.0338, 0.4669, and 0.4743. The output from the best- subset regressions is given below: Adjusted  \begin{array}{llcclcc}  & & & &&  \text {Adjusted} \\ \text {Model }&\text { Variables} &  \mathrm{Cp}  & \mathrm{k} &\text {R Square} & \text {R Square} & \text {Std. Error }\\ \hline 1 & X1 & 3.05 & 2 & 0.6024 & 0.5936 & 10.5787 \\ 2 & X1X2 & 3.66 & 3 & 0.6145 & 0.5970 & 10.5350 \\ 3 & X1X2X3 & 4.00 & 4 & 0.6288 & 0.6029 & 10.4570 \\ 4 & X1X3 & 2.00 & 3 & 0.6288 & 0.6119 & 10.3375 \\ 5 & X2 & 67.35 & 2 & 0.0474 & 0.0262 & 16.3755 \\ 6 & X2X3 & 64.30 & 3 & 0.0910 & 0.0497 & 16.1768 \\ 7 & X3 & 62.33 & 2 & 0.0907 & 0.0705 & 15.9984 \\ \hline \end{array}   Following is the residual plot for % Attendance:     Following is the output of several multiple regression models:   \text {Model (I):}   \begin{array}{lcrclcr} \hline &  \text {Coefficients }&  \text {Std Error} &  \text {Stat } &  \text {p-value} & \text { Lower 95\% }& \text { Upper 95\%} \\ \hline \text { Intercept} & -753.4225 & 101.1149 & -7.4511 & 2.88 \mathrm{E}-09  & -957.3401 & -549.5050 \\ \%  \text {Attend }& 8.5014 & 1.0771 & 7.8929 &6.73 \mathrm{E}-10 & 6.3292 & 10.6735 \\  \text {Salary }& 6.85 \mathrm{E}-07  & 0.0006 & 0.0011 & 0.9991 & -0.0013 & 0.0013 \\  \text {Spending} & 0.0060 & 0.0046 & 1.2879 & 0.2047 & -0.0034 & 0.0153 \\ \hline \end{array}     \text {Model (II):}   \begin{array}{lcccc} \hline &  \text {Coefficients} & \text {Standard Error }& \text { t  Stat} &  \text { p -value } \\ \hline  \text {Intercept }& -753.4086 & 99.1451 & -7.5991 &  1.5291 \mathrm{E}-09 \\ \%  \text {Attendance} & 8.5014 & 1.0645 & 7.9862 & 4.223 \mathrm{E}-10  \\  \text {Spending} & 0.0060 & 0.0034 & 1.7676 & 0.0840 \\ \hline \end{array}     \text {Model (III):}   \begin{array}{lrrrrl} \hline & \text {  d f } & \text { SS } &  \text {  MS } & \text { F } &  \text { Significance F } \\ \hline  \text { Regression} & 2 & 8162.9429 & 4081.4714 & 39.8708 &1.3201 \mathrm{E}-10 \\  \text { Residual} & 44 & 4504.1635 & 102.3674 & & \\  \text { Total} & 46 & 12667.1064 & & & \\ \hline \end{array}     \begin{array}{lrcrr} \hline &  \text {Coefficients }&  \text {Standard Error} & \text { t Stat }&  \text {p -value} \\ \hline  \text {Intercept }& 6672.8367 & 3267.7349 & 2.0420 & 0.0472 \\ \% \text { Attendance} & -150.5694 & 69.9519 & -2.1525 & 0.0369 \\ \%  \text {Attendance Squared}& 0.8532 & 0.3743 & 2.2792 & 0.0276 \\ \hline \end{array}    -Referring to Table 15-8, the better model using a 5% level of significance derived from the best model above is Following is the output of several multiple regression models: Model (I):\text {Model (I):} Coefficients Std Error Stat p-value Lower 95\% Upper 95\% Intercept -753.4225 101.1149 -7.4511 2.88-09 -957.3401 -549.5050 \% Attend 8.5014 1.0771 7.8929 6.73-10 6.3292 10.6735 Salary 6.85-07 0.0006 0.0011 0.9991 -0.0013 0.0013 Spending 0.0060 0.0046 1.2879 0.2047 -0.0034 0.0153 Model (II):\text {Model (II):} Coefficients Standard Error t Stat p -value Intercept -753.4086 99.1451 -7.5991 1.5291-09 \% Attendance 8.5014 1.0645 7.9862 4.223-10 Spending 0.0060 0.0034 1.7676 0.0840 Model (III):\text {Model (III):} d f SS MS F Significance F Regression 2 8162.9429 4081.4714 39.8708 1.3201-10 Residual 44 4504.1635 102.3674 Total 46 12667.1064 Coefficients Standard Error t Stat p -value Intercept 6672.8367 3267.7349 2.0420 0.0472 \% Attendance -150.5694 69.9519 -2.1525 0.0369 \% Attendance Squared 0.8532 0.3743 2.2792 0.0276 -Referring to Table 15-8, the better model using a 5% level of significance derived from the "best" model above is

(Multiple Choice)
4.9/5
(33)

TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below. Regression Statistics Multiple R 0.5487 R Square 0.3011 Adjusted R Square 0.2538 Standard Error 6442.4456 Observations 80 ANOVA SS MS F Significance Regression 5 1322911703.0671 264582340.6134 6.3747 0.0001 Residual 74 3071377751.1204 41505104.7449 Total 79 4394289454.1875 Coefficients Standard Error t Stat p-value Intercept -3862.4808 6180.9452 -0.6249 0.5340 Temp 51.7031 62.9439 0.8214 0.4140 Win\% 21.1085 16.2338 1.3003 0.1975 OpWin\% 11.3453 6.4617 1.7558 0.0833 Weekend 367.5377 2786.2639 0.1319 0.8954 Promotion 6927.8820 2784.3442 2.4882 0.0151  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.   \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                 The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308.   -Referring to Table 15-9,_______ of the variation in ATTENDANCE can be explained by the five independent variables.  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.   \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                 The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308.   -Referring to Table 15-9,_______ of the variation in ATTENDANCE can be explained by the five independent variables.  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.   \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                 The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308.   -Referring to Table 15-9,_______ of the variation in ATTENDANCE can be explained by the five independent variables.  TABLE 15-9 Many factors determine the attendance at Major League Baseball games. These factors can include when the game is played, the weather, the opponent, whether or not the team is having a good season, and whether or not a marketing promotion is held. Data from 80 games of the Kansas City Royals for the following variables are collected. ATTENDANCE = Paid attendance for the game  TEMP = High temperature for the day WIN% = Team's winning percentage at the time of the game OPWIN% = Opponent team's winning percentage at the time of the game WEEKEND - 1 if game played on Friday, Saturday or Sunday; 0 otherwise PROMOTION - 1 = if a promotion was held; 0 = if no promotion was held The regression results using attendance as the dependent variable and the remaining five variables as the independent variables are presented below.   \begin{array}{l} \text { Regression Statistics }\\ \begin{array} { l r }  \hline \text { Multiple R } & 0.5487 \\ \text { R Square } & 0.3011 \\ \text { Adjusted R Square } & 0.2538 \\ \text { Standard Error } & 6442.4456 \\ \text { Observations } & 80 \\ \hline \end{array} \end{array}      \begin{array}{l} \text { ANOVA }\\ \begin{array} { l c c c c c }  \hline & \mathrm { df } & \text { SS } & \text { MS } & \text { F } & \text { Significance } \mathrm { F } \\ \hline \text { Regression } & 5 & 1322911703.0671 & 264582340.6134 & 6.3747 & 0.0001 \\ \text { Residual } & 74 & 3071377751.1204 & 41505104.7449 & & \\ \text { Total } & 79 & 4394289454.1875 & & & \\ \hline \end{array} \end{array}     \begin{array}{lrrrr} \hline&\text{Coefficients}&\text{ Standard Error}&\text{ t Stat}&\text{p-value}\\ \hline\text{Intercept}&-3862.4808&6180.9452&-0.6249&0.5340\\ \text { Temp } & 51.7031 & 62.9439 & 0.8214 & 0.4140 \\ \text { Win\% } & 21.1085 & 16.2338 & 1.3003 & 0.1975 \\ \text { OpWin\% } & 11.3453 & 6.4617 & 1.7558 & 0.0833 \\ \text { Weekend } & 367.5377 & 2786.2639 & 0.1319 & 0.8954 \\ \text { Promotion } & 6927.8820 & 2784.3442 & 2.4882 & 0.0151 \\ \hline \end{array}                 The coefficient of multiple determination ( R <sup>2</sup> <sub>j</sub> ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308.   -Referring to Table 15-9,_______ of the variation in ATTENDANCE can be explained by the five independent variables. The coefficient of multiple determination ( R 2 j ) of each of the 5 predictors with all the other remaining predictors are, respectively, 0.2675, 0.3101, 0.1038, 0.7325, and 0.7308. -Referring to Table 15-9,_______ of the variation in ATTENDANCE can be explained by the five independent variables.

(Short Answer)
4.9/5
(37)

TABLE 15-7 A chemist employed by a pharmaceutical firm has developed a muscle relaxant. She took a sample of 14 people suffering from extreme muscle constriction. She gave each a vial containing a dose (X) of the drug and recorded the time to relief (Y) measured in seconds for each. She fit a "centered" curvilinear model to this data. The results obtained by Microsoft Excel follow, where the dose (X) given has been "centered." SUMMARY OUTPUT Regression Statistics Multiple R 0.747 RSquare 0.558 Adjusted R Square 0.478 Standard Error 863.1 Observations 14 ANOVA df SS MS F Significance F Regression 2 10344797 5172399 6.94 0.0110 Residual 11 8193929 744903 Total 13 18538726 Coeff Std Error t Stut p -value Intercept 1283.0 352.0 3.65 0.0040 CenDose 25.228 8.631 2.92 0.0140 CenDoseSq 0.8604 0.3722 2.31 0.0410 -Referring to Table 15-7, suppose the chemist decides to use an F test to determine if there is a significant curvilinear relationship between time and dose. The p-value of the test is_______

(Short Answer)
4.8/5
(29)

In stepwise regression, an independent variable is not allowed to be removed from the model once it has entered into the model.

(True/False)
4.8/5
(33)

The goals of model building are to find a good model with the fewest independent variables that is easier to interpret and has lower probability of collinearity.

(True/False)
4.9/5
(32)
Showing 1 - 20 of 88
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)