Exam 12: Multiple Regression and Model Building

arrow
  • Select Tags
search iconSearch Question
  • Select Tags

A collector of grandfather clocks believes that the price received for the clocks at an auction increases with the number of bidders, but at an increasing (rather than a constant)rate. Thus, the Model proposed to best explain auction price (y, in dollars)by number of bidders (x)is the Quadratic model E(y)=β0+β1x+β2x2E ( y ) = \beta _ { 0 } + \beta _ { 1 } x + \beta _ { 2 } x ^ { 2 } This model was fit to data collected for a sample of 32 clocks sold at auction; the resulting estimate of β1\beta _ { 1 } was .31- .31 . Interpret this estimate of β1\beta _ { 1 } .

(Multiple Choice)
4.8/5
(39)

Consider the partial printout for an interaction regression analysis of the relationship between a dependent variable yy and two independent variables x1x _ { 1 } and x2x _ { 2 } . ANOVA df SS MS F Significance F Regression 3 3393.677324 1131.225775 9391.974782 2.11084-11 Residual 6 0.722675987 0.120445998 Total 9 3394.4 Coefficients Standard Error t Stat P-value Lower 95\% Upper 95\% Intercept 16.72197014 8.283997219 2.018587126 0.09007654 -3.548255659 36.99219593 1 -3.037317759 2.678748705 -1.133856921 0.300116382 -9.591984506 3.517348987 2 -1.046522754 1.547132645 -0.676427297 0.523973988 -4.832222727 2.73917722 12 4.071685147 0.444059933 9.169224345 9.47663-05 2.98510884 5.158261454 a. Write the prediction equation for the interaction model. b. Test the overall utility of the interaction model using the global FF -test at α=.05\alpha = .05 . c. Test the hypothesis (at α=.05)\alpha = .05 ) that x1x _ { 1 } and x2x _ { 2 } interact positively. d. Estimate the change in yy for each additional 1 -unit increase in x1x _ { 1 } when x2=6x _ { 2 } = 6 .

(Essay)
4.7/5
(32)

When testing the utility of the quadratic model E(y)=β0+β1x+β2x2E ( y ) = \beta _ { 0 } + \beta _ { 1 } x + \beta _ { 2 } x ^ { 2 } , the most important tests involve the null hypotheses H0:β0=0H _ { 0 } : \beta _ { 0 } = 0 and H0:β1=0H _ { 0 } : \beta _ { 1 } = 0 .

(True/False)
4.9/5
(31)

A study of the top MBA programs attempted to predict the average starting salary (in $1000ʹs)of graduates of the program based on the amount of tuition (in $1000ʹs)charged by the program and The average GMAT score of the programʹs students. The results of a regression analysis based on a Sample of 75 MBA programs is shown below: Least Squares Linear Regression of Salary Predictor Variables Coefficient Std Error T P VIF Constant -203.402 51.6573 -3.94 0.0002 0.0 Gmat 0.39412 0.09039 4.36 0.0000 2.0 Tuition 0.92012 0.17875 5.15 0.0000 2.0 R-Squared 0.6857 Resid. Mean Square (MSE) 427.511 Adjusted R-Squared 0.6769 Standard Deviation 20.6763 Identify the test statistic that should be used to test to determine if the amount of tuition charged by a program is a useful predictor of the average starting salary of the graduates of the program.

(Multiple Choice)
4.9/5
(39)

As part of a study at a large university, data were collected on n = 224 freshmen computer science (CS)majors in a particular year. The researchers were interested in modeling y, a studentʹs grade Point average (GPA)after three semesters, as a function of the following independent variables (recorded at the time the students enrolled in the university): x1=x _ { 1 } = average high school grade in mathematics (HSM) x2=x _ { 2 } = average high school grade in science (HSS) x3=x _ { 3 } = average high school grade in English (HSE) x4=x _ { 4 } = SAT mathematics score (SATM) x5=x _ { 5 } = SAT verbal score (SATV) A first-order model was fit to data. A 95%95 \% confidence interval for β1\beta 1 is (.06,.22)( .06 , .22 ) . Interpret this result.

(Multiple Choice)
5.0/5
(31)

As part of a study at a large university, data were collected on n = 224 freshmen computer science (CS)majors in a particular year. The researchers were interested in modeling y, a studentʹs grade Point average (GPA)after three semesters, as a function of the following independent variables (recorded at the time the students enrolled in the university): = average high school grade in mathematics (HSM) = average high school grade in science (HSS) = average high school grade in English (HSE) = SAT mathematics score (SATM) = SAT verbal score (SATV) A first-order model was fit to data with R2=0.211R ^ { 2 } = 0.211 . What is the correct interpretation of R2R ^ { 2 } , the coefficient of determination for the model?

(Multiple Choice)
4.7/5
(34)

It is dangerous to predict outside the range of the data collected in a regression analysis. For instance, we shouldnʹt predict the price of a 5000 square foot home if all our sample homes were Smaller than 4500 square feet. Which of the following multiple regression pitfalls does this example Describe?

(Multiple Choice)
4.8/5
(34)

The staff of a test kitchen is attempting to determine the baking time, y y , of a roast, i.e., the time it takes the internal temperature of the roast to reach 165F 165^{\circ} \mathrm{F} , using two variables, the temperature setting of the oven, x1 x_{1} , and the weight of the roast, x2 x_{2} , in pounds. The data for 24 roasts are shown below.  Baking Times of Roasts \text { Baking Times of Roasts }  The staff of a test kitchen is attempting to determine the baking time,   y  , of a roast, i.e., the time it takes the internal temperature of the roast to reach   165^{\circ} \mathrm{F}  , using two variables, the temperature setting of the oven,   x_{1}  , and the weight of the roast,   x_{2}  , in pounds. The data for 24 roasts are shown below.   \text { Baking Times of Roasts }     a.  \text { Fit a complete second-order model to the data. }  b. Do the data provide sufficient evidence to indicate that the second-order terms contribute information for the prediction of   y   ? State the null and alternative hypothes     a.  Fit a complete second-order model to the data. \text { Fit a complete second-order model to the data. } b. Do the data provide sufficient evidence to indicate that the second-order terms contribute information for the prediction of y y ? State the null and alternative hypothes

(Essay)
4.7/5
(33)

The concessions manager at a beachside park recorded the high temperature, the number of people at the park, and the number of bottles of water sold for each of 12 consecutive Saturdays. The data are shown below.  The concessions manager at a beachside park recorded the high temperature, the number of people at the park, and the number of bottles of water sold for each of 12 consecutive Saturdays. The data are shown below.     a. Fit the model  E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 }  to the data, letting  y  represent the number of bottles of water sold,  x _ { 1 }  the temperature, and  x _ { 2 }  the number of people at the park. b. Find the  95 \%  confidence interval for the mean number of bottles of water sold when the temperature is  84 ^ { \circ } \mathrm { F }  and there are 2700 people at the park. c. Find the  95 \%  prediction interval for the number of bottles of water sold when the temperature is  84 ^ { \circ } \mathrm { F }  and there are 2700 people at the park. a. Fit the model E(y)=β0+β1x1+β2x2E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } to the data, letting yy represent the number of bottles of water sold, x1x _ { 1 } the temperature, and x2x _ { 2 } the number of people at the park. b. Find the 95%95 \% confidence interval for the mean number of bottles of water sold when the temperature is 84F84 ^ { \circ } \mathrm { F } and there are 2700 people at the park. c. Find the 95%95 \% prediction interval for the number of bottles of water sold when the temperature is 84F84 ^ { \circ } \mathrm { F } and there are 2700 people at the park.

(Essay)
4.8/5
(31)

Retail price data for n = 60 hard disk drives were recently reported in a computer magazine. Three variables were recorded for each hard disk drive: y=y = Retail PRICE (measured in dollars) x1=x _ { 1 } = Microprocessor SPEED (measured in megahertz) (Values in sample range from 10 to 40 ) x2=x _ { 2 } = CHIP size (measured in computer processing units) (Values in sample range from 286 to 486 ) A first-order regression model. was fit to the data. Part of the printout follows:  Retail price data for n = 60 hard disk drives were recently reported in a computer magazine. Three variables were recorded for each hard disk drive:  y =  Retail PRICE (measured in dollars)  x _ { 1 } =  Microprocessor SPEED (measured in megahertz) (Values in sample range from 10 to 40 )  x _ { 2 } =  CHIP size (measured in computer processing units) (Values in sample range from 286 to 486 ) A first-order regression model. was fit to the data. Part of the printout follows:    Identify and interpret the estimate of  \beta _ { 2 } . Identify and interpret the estimate of β2\beta _ { 2 } .

(Essay)
4.8/5
(38)

A first-order model may include terms for both quantitative and qualitative independent variables.

(True/False)
4.8/5
(37)

A regression residual is the difference between an observed y value and its corresponding predicted value.

(True/False)
4.7/5
(32)

The model E(y)=β0+β1x1+β2x2E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } was fit to a set of data. A partial printout for the analysis follows:  The model  E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 }  was fit to a set of data. A partial printout for the analysis follows:    Interpret the value of the residual when  x _ { 1 } = 7,781  and  x _ { 2 } = 644 . Interpret the value of the residual when x1=7,781x _ { 1 } = 7,781 and x2=644x _ { 2 } = 644 .

(Multiple Choice)
4.8/5
(35)

In situations where two competing models have essentially the same predictive power (as determined by an F-test), it is standard procedure to use the model with the greater number of parameters.

(True/False)
4.9/5
(29)

The model E(y)=β0+β1xE ( y ) = \beta _ { 0 } + \beta _ { 1 } ^ { x } was fit to a set of data, and the following plot of residuals against xx values was obtained.  The model  E ( y ) = \beta _ { 0 } + \beta _ { 1 } ^ { x }  was fit to a set of data, and the following plot of residuals against  x  values was obtained.    Interpret the residual plot. Interpret the residual plot.

(Essay)
4.9/5
(33)

The model E(y)=β0+β1x1+β2x2+β3x3+β4x4E ( y ) = \beta _ { 0 } + \beta _ { 1 } x _ { 1 } + \beta _ { 2 } x _ { 2 } + \beta _ { 3 } x _ { 3 } + \beta _ { 4 } x _ { 4 } was used to relate E(y)E ( y ) to a single qualitative variable, where x1"{1 if level 2 0 if not x2"{1 if level 3 0 if not x _ { 1 } " \left\{ \begin{array} { l l } 1 & \text { if level 2 } \\ 0 & \text { if not } \end{array} \quad x _ { 2 } " \left\{ \begin{array} { l l } 1 & \text { if level 3 } \\ 0 & \text { if not } \end{array} \right. \right. x3"{1 if level 40 if not x4"{1 if level 50 if not x _ { 3 } " \left\{ \begin{array} { l l } 1 & \text { if level } 4 \\ 0 & \text { if not } \end{array} \quad x _ { 4 } " \left\{ \begin{array} { l l } 1 & \text { if level } 5 \\ 0 & \text { if not } \end{array} \right. \right. This model was fit to n=40n = 40 data points and the following result was obtained: y^=14.5+3x14x2+10x3+8x4\hat { y } = 14.5 + 3 x _ { 1 } - 4 x _ { 2 } + 10 x _ { 3 } + 8 x _ { 4 } a. Use the least squares prediction equation to find the estimate of E(y)for each level of the qualitative variable. b. Specify the null and alternative hypothesis you would use to test whether E(y)is the same for all levels of the independent variable.

(Essay)
4.9/5
(36)

Retail price data for n = 60 hard disk drives were recently reported in a computer magazine. Three variables were recorded for each hard disk drive: y=y = Retail PRICE (measured in dollars) x1=x _ { 1 } = Microprocessor SPEED (measured in megahertz) (Values in sample range from 10 to 40 ) x2=x _ { 2 } = CHIP size (measured in computer processing units) (Values in sample range from 286 to 486 ) A first-order regression model was fit to the data. Part of the printout follows:  Retail price data for n = 60 hard disk drives were recently reported in a computer magazine. Three variables were recorded for each hard disk drive:  y =  Retail PRICE (measured in dollars)  x _ { 1 } =  Microprocessor SPEED (measured in megahertz) (Values in sample range from 10 to 40 )  x _ { 2 } =  CHIP size (measured in computer processing units) (Values in sample range from 286 to 486 ) A first-order regression model was fit to the data. Part of the printout follows:    Test to determine if the model is adequate for predicting the price of a computer. Use  \alpha =   .01 . Test to determine if the model is adequate for predicting the price of a computer. Use α=\alpha = .01.01 .

(Essay)
4.9/5
(40)

A college admissions officer proposes to use regression to model a studentʹs college GPA at graduation in terms of the following two variables: = high school GPA = SAT score The admissions officer believes the relationship between college GPA and high school GPA is linear and the relationship between SAT score and college GPA is linear. She also believes that the relationship between college GPA and high school GPA depends on the studentʹs SAT score. Write the regression model she should fit.

(Essay)
4.8/5
(52)

A collector of grandfather clocks believes that the price received for the clocks at an auction increases with the number of bidders, but at an increasing (rather than a constant)rate. Thus, the Model proposed to best explain auction price (y, in dollars)by number of bidders (x)is the Quadratic model E(y)=β0+β1x+β2x2E ( y ) = \beta _ { 0 } + \beta _ { 1 } x + \beta _ { 2 } x ^ { 2 } This model was fit to data collected for a sample of 32 clocks sold at auction; a portion of the printout follows: PARAMETER STANDARD T FOR 0; VARIABLES ESTIMATE ERROR PARAMETER =0 PROB >\mid INTERCEPT 286.42 9.66 .31 .06 29.64 .0001 \cdot -.000067 .00007 5.14 .0016 Give the pp -value for testing H0:β2=0H _ { 0 } : \beta _ { 2 } = 0 against Ha:β20H _ { a } : \beta _ { 2 } \neq 0 .

(Multiple Choice)
4.8/5
(41)

A qualitative variable whose outcomes are assigned numerical values is called a coded variable.

(True/False)
4.8/5
(41)
Showing 61 - 80 of 131
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)