Exam 12: Multiple Regression and Model Building
A certain type of rare gem serves as a status symbol for many of its owners. In theory, for low prices, the demand decreases as the price of the gem increases. However, experts hypothesize that when the gem is valued at very high prices, the demand increases with price due to the status the owners believe they gain by obtaining the gem. Thus, the model proposed to best explain the demand for the gem by its price is the quadratic model
\( E(y) = \beta_0 + \beta_1 x + \beta_2 x^2 \)
where y = Demand (in thousands) and x = Retail price per carat (dollars).
This model was fit to data collected for a sample of 12 rare gems. A portion of the printout is given below:
SOURCE    DF        SS       MS      F    PR > F
Model      2    115145    57573    373     .0001
Error      9      1388      154
TOTAL     11    116533

Root MSE    R-Square

\begin{array}{lrrrr}
& \text{PARAMETER} & & \text{T for } H_0: & \\
\text{VARIABLES} & \text{ESTIMATES} & \text{STD. ERROR} & \text{PARAMETER = 0} & \text{PR} > |T| \\
\text{INTERCEPT} & 286.42 & 9.66 & 29.64 & .0001 \\
X & -.31 & .06 & -5.14 & .0006 \\
X \cdot X & .000067 & .00007 & .95 & .3647
\end{array}
Is there sufficient evidence to indicate the model is useful for predicting the demand for the gem? Use .
(Essay)
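The global F statistic behind this question can be recomputed from the ANOVA rows of the printout. The sketch below is only an illustration: the significance level is missing from the question text, so `alpha = 0.05` is an assumption, and the variable names are made up for this example.

```python
# Values copied from the ANOVA portion of the printout.
ss_model, df_model = 115145, 2
ss_error, df_error = 1388, 9
ss_total = ss_model + ss_error  # should agree with the TOTAL row (116533)

# Mean squares and the global F statistic, F = MS(Model) / MS(Error).
ms_model = ss_model / df_model
ms_error = ss_error / df_error
f_stat = ms_model / ms_error

p_value = 0.0001  # PR > F as printed
alpha = 0.05      # assumed: the level is not shown in the question

print(f"F = {f_stat:.1f}")
print("reject H0" if p_value < alpha else "fail to reject H0")
```

The recomputed F differs slightly from the printed 373 only because the printout rounds the mean squares.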
The value of R² is only useful when the number of data points is substantially larger than the number of β parameters in the model.
(True/False)
Residual analysis can be used to check for violations of the assumption that the random error component is normally distributed with mean 0.
(True/False)
As part of a study at a large university, data were collected on n = 224 freshmen computer science (CS) majors in a particular year. The researchers were interested in modeling y, a student's grade point average (GPA) after three semesters, as a function of the following independent variables (recorded at the time the students enrolled in the university):
average high school grade in mathematics (HSM)
average high school grade in science (HSS)
average high school grade in English (HSE)
SAT mathematics score (SATM)
SAT verbal score (SATV)
A first-order model was fit to data with the following results:
SOURCE    DF        SS      MS    F VALUE    PROB > F
MODEL      5     28.64    5.73      11.69       .0001
ERROR    218    106.82    0.49
TOTAL    223    135.46

ROOT MSE    0.700    R-SQUARE    0.211
DEP MEAN    4.635    ADJ R-SQ    0.193

VARIABLE     PARAMETER ESTIMATE    STANDARD ERROR    T FOR H0: PARAMETER = 0    PROB > |T|
INTERCEPT     2.327                0.039              5.817                     0.0001
X1 (HSM)      0.146                0.037              3.718                     0.0003
X2 (HSS)      0.036                0.038              0.950                     0.3432
X3 (HSE)      0.055                0.040              1.397                     0.1637
X4 (SATM)     0.00094              0.00068            1.376                     0.1702
X5 (SATV)    -0.00041              0.00059           -0.689                     0.4915
Interpret the value under the column heading PROB > F.
A) There is sufficient evidence (at ) to conclude that the first-order model is statistically useful for predicting GPA.
B) There is insufficient evidence (at ) to conclude that the first-order model is statistically useful for predicting GPA.
C) Over of the variation in GPAs can be explained by the model.
D) Accept H0 (at ); at least one of the β-coefficients in the first-order model is equal to 0.
(Short Answer)
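The summary statistics on this printout follow from its ANOVA rows, which makes them easy to sanity-check. The sketch below recomputes R-square, adjusted R-square, and the F value from the sums of squares; the variable names are illustrative and not part of the original printout.

```python
# Values copied from the printout: n = 224 students, k = 5 predictors.
n, k = 224, 5
ss_model, ss_error, ss_total = 28.64, 106.82, 135.46

# R-square is the share of total variation explained by the model.
r_squared = ss_model / ss_total

# Adjusted R-square penalizes for the number of beta parameters.
adj_r_squared = 1 - ((n - 1) / (n - k - 1)) * (ss_error / ss_total)

# Global F statistic: mean square for the model over mean square for error.
f_value = (ss_model / k) / (ss_error / (n - k - 1))

print(f"R-square = {r_squared:.3f}")
print(f"Adj R-sq = {adj_r_squared:.3f}")
print(f"F value  = {f_value:.2f}")
```

All three values agree with the printout to the digits shown, which supports reading the garbled labels as R-SQUARE and ADJ R-SQ.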
The concessions manager at a beachside park recorded the high temperature, the number of people at the park, and the number of bottles of water sold for each of 12 consecutive Saturdays. The data are shown below.
Bottles Sold    Temperature    People
341             73             1625
425             79             2100
457             80             2125
485             80             2800
469             81             2550
395             82             1975
511             83             2675
549             83             2800
543             85             2850
537             88             2775
621             89             2800
897             91             3100
a. Fit the model \( E(y) = \beta_0 + \beta_1 x_1 + \beta_2 x_2 \) to the data, letting y represent the number of bottles of water sold, x1 the temperature, and x2 the number of people at the park.
b. Find the confidence interval for the mean number of bottles of water sold when the temperature is and there are 2700 people at the park.
c. Find the prediction interval for the number of bottles of water sold when the temperature is and there are 2700 people at the park.
12.5 Interaction Models
1 Write Interaction Model
(Essay)
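Part a can be worked by least squares. The sketch below assumes the intended model is the first-order model in temperature and people (the model equation is missing from the question text) and solves the normal equations with a small hand-written Gauss-Jordan routine; `solve`, `rows`, and the other names are illustrative helpers, not part of the original exercise.

```python
# Data from the table: bottles sold, high temperature, people at the park.
bottles = [341, 425, 457, 485, 469, 395, 511, 549, 543, 537, 621, 897]
temp = [73, 79, 80, 80, 81, 82, 83, 83, 85, 88, 89, 91]
people = [1625, 2100, 2125, 2800, 2550, 1975, 2675, 2800, 2850, 2775, 2800, 3100]


def solve(a, b):
    """Solve the square system a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


# Design matrix rows (1, x1, x2) for the assumed first-order model.
rows = [[1.0, t, p] for t, p in zip(temp, people)]

# Normal equations: (X'X) b = X'y.
xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
xty = [sum(r[i] * y for r, y in zip(rows, bottles)) for i in range(3)]
b0, b1, b2 = solve(xtx, xty)

fitted = [b0 + b1 * t + b2 * p for t, p in zip(temp, people)]
residuals = [y - f for y, f in zip(bottles, fitted)]
sse = sum(e * e for e in residuals)
y_bar = sum(bottles) / len(bottles)
sst = sum((y - y_bar) ** 2 for y in bottles)

print(f"fitted equation: y-hat = {b0:.3f} + {b1:.3f} x1 + {b2:.4f} x2")
print(f"SSE = {sse:.1f}, R-square = {1 - sse / sst:.3f}")
```

Parts b and c additionally need a t critical value and the standard error of the estimate at the chosen settings, which statistical software normally reports alongside the fit.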
A regression residual is the difference between an observed y value and its corresponding predicted value.
(True/False)
During its manufacture, a product is subjected to four different tests in sequential order. An efficiency expert claims that the fourth (and last) test is unnecessary since its results can be predicted based on the first three tests. To test this claim, multiple regression will be used to model Test 4 score (y) as a function of Test 1 score (x1), Test 2 score (x2), and Test 3 score (x3). [Note: All test scores range from 200 to 800, with higher scores indicative of a higher quality product.] Consider the model:
\( E(y) = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 \)
The global F statistic is used to test the null hypothesis H0: β1 = β2 = β3 = 0. Describe this hypothesis in words.
(Multiple Choice)
The model x was fit to a set of data, and the following plot of residuals against x values was obtained.
Interpret the residual plot.

(Essay)
A college admissions officer proposes to use regression to model a student's college GPA at graduation in terms of the following two variables:
x1 = high school GPA
x2 = SAT score
The admissions officer believes the relationship between college GPA and high school GPA is linear and the relationship between SAT score and college GPA is linear. She also believes that the relationship between college GPA and high school GPA depends on the student's SAT score. Write the regression model she should fit.
2 Test if Model is Useful for Predicting y
(Essay)
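For reference, a model with two linear effects whose slopes depend on each other is usually written with a cross-product term. The form below is a sketch of that standard interaction model, assuming y is college GPA, x1 is high school GPA, and x2 is SAT score; the symbol names are assumptions because they are missing from the question text.

```latex
E(y) = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_1 x_2
```

With this form the slope of E(y) against x1 is \beta_1 + \beta_3 x_2, so it changes with the SAT score, which is what "depends on" means here.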
Retail price data for n = 60 hard disk drives were recently reported in a computer magazine. Three variables were recorded for each hard disk drive:
Retail PRICE (measured in dollars)
Microprocessor SPEED (measured in megahertz; values in sample range from 10 to 40)
CHIP size (measured in computer processing units; values in sample range from 286 to 486)
A first-order regression model was fit to the data. Part of the printout follows:
Parameter Estimates
VARIABLE     DF    PARAMETER ESTIMATE    STANDARD ERROR    T FOR H0: PARAMETER = 0    PROB > |T|
INTERCEPT     1    -373.526392           1258.1243396      -0.297                     0.7676
SPEED         1     104.838940             22.36298195      4.688                     0.0001
CHIP          1       3.571850              3.89422935      0.917                     0.3629
Identify and interpret the estimate for the SPEED β-coefficient.
A) ; For every 1-megahertz increase in SPEED, we estimate PRICE to increase , holding CHIP fixed.
B) ; For every increase in PRICE, we estimate SPEED to increase 105 megahertz, holding CHIP fixed.
C) For every 1-megahertz increase in SPEED, we estimate PRICE to increase , holding CHIP fixed.
D) ; For every increase in PRICE, we estimate SPEED to increase by about 4 megahertz, holding CHIP fixed.
(Short Answer)
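One way to see what "holding CHIP fixed" means is to evaluate the fitted equation at two SPEED values one megahertz apart. The sketch below uses the estimates from the printout; the CHIP value of 386 and the two SPEED values are hypothetical choices inside the stated sample ranges, and `predicted_price` is an illustrative helper name.

```python
# Parameter estimates copied from the printout.
b0, b_speed, b_chip = -373.526392, 104.838940, 3.571850


def predicted_price(speed, chip):
    """Least squares prediction of PRICE (dollars) from SPEED and CHIP."""
    return b0 + b_speed * speed + b_chip * chip


chip = 386  # hypothetical value inside the sample range 286 to 486

# Raise SPEED by 1 megahertz while CHIP stays at the same value.
change = predicted_price(26, chip) - predicted_price(25, chip)
print(f"estimated change in PRICE per 1 MHz: {change:.2f} dollars")
```

The difference equals the SPEED estimate whatever CHIP value is held fixed, because the first-order model has no SPEED-by-CHIP interaction term.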
A collector of grandfather clocks believes that the price received for the clocks at an auction increases with the number of bidders, but at an increasing (rather than a constant) rate. Thus, the model proposed to best explain auction price (y, in dollars) by number of bidders (x) is the quadratic model
\( E(y) = \beta_0 + \beta_1 x + \beta_2 x^2 \)
This model was fit to data collected for a sample of 32 clocks sold at auction; a portion of the printout follows:
SOURCE    DF         SS         MS    F VALUE    PROB > F
MODEL      2    4277160    2138579        120       .0005
ERROR     29     514034      17725
TOTAL     31    4791194

ROOT MSE    133     R-SQUARE    .893
DEP MEAN    1327    ADJ R-SQ    .885

VARIABLES    ESTIMATE     STANDARD ERROR    T FOR H0: PARAMETER = 0    PROB > |T|
INTERCEPT    286.42       9.66              29.64                      .0001
X            .31          .06               5.14                       .0016
X·X          -.000067     .00007            -0.95                      .3600
An outlier for the model is a clock with a residual that _____ in absolute value. (Fill in the blank.)
(Multiple Choice)
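A common textbook convention flags an observation as an outlier when its residual is more than 3 standard deviations (3s) from 0, where s is the root MSE. The sketch below applies that convention to the clock printout; the 3s rule is assumed here rather than stated in the question, and the two sample residuals are hypothetical.

```python
import math

# Error row of the ANOVA table from the printout.
ss_error, df_error = 514034, 29

# s = root MSE, the estimated standard deviation of the random error.
s = math.sqrt(ss_error / df_error)
threshold = 3 * s  # assumed 3s outlier convention


def is_outlier(residual):
    """Return True when the residual exceeds 3s in absolute value."""
    return abs(residual) > threshold


print(f"s = {s:.1f}, 3s = {threshold:.1f}")
print(is_outlier(-450.0), is_outlier(250.0))  # hypothetical residuals
```

The computed s rounds to the ROOT MSE of 133 shown on the printout.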
A certain type of rare gem serves as a status symbol for many of its owners. In theory, for low prices, the demand decreases as the price of the gem increases. However, experts hypothesize that when the gem is valued at very high prices, the demand increases with price due to the status the owners believe they gain by obtaining the gem. Thus, the model proposed to best explain the demand for the gem by its price is the quadratic model
\( E(y) = \beta_0 + \beta_1 x + \beta_2 x^2 \)
where y = Demand (in thousands) and x = Retail price per carat (dollars).
This model was fit to data collected for a sample of 12 rare gems. A portion of the printout is given below:
SOURCE    DF        SS       MS      F    PR > F
Model      2    115145    57573    373     .0001
Error      9      1388      154
TOTAL     11    116533

VARIABLES    ESTIMATES    STD. ERROR    T FOR H0: PARAMETER = 0    PR > |T|
INTERCEPT    286.42       9.66          29.64                      .0001
X            -.31         .06           -5.14                      .0006
X·X          .000067      .00007        .95                        .3647
Does the quadratic term contribute useful information for predicting the demand for the gem? Use .
(Essay)
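The test for the quadratic term rests on the t ratio for the X·X row, which is the estimate divided by its standard error. The sketch below recomputes that ratio and compares the printed p-value with a significance level; `alpha = 0.05` is an assumption because the level is missing from the question text.

```python
# X·X row of the printout.
estimate, std_error = 0.000067, 0.00007
p_value = 0.3647  # PR > |T| as printed

# t statistic for H0: beta_2 = 0.
t_stat = estimate / std_error

alpha = 0.05  # assumed: the level is not shown in the question
contributes = p_value < alpha

print(f"t = {t_stat:.2f}")
print("quadratic term is significant" if contributes else "insufficient evidence that beta_2 differs from 0")
```

The recomputed ratio is close to the printed .95; the small gap is consistent with the printout rounding the estimate and standard error.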
It is desired to build a regression model to predict the sales price of a single family home, based on the size of the house and the neighborhood the home is located in. The goal is to compare the prices of homes that are located in two different neighborhoods. A complete 2nd-order model is proposed. Which regression model proposes the complete 2nd-order model?
A)
B)
C)
D)
(Short Answer)
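The answer choices did not survive on this page. As a point of reference, a complete second-order model with one quantitative variable and one two-level qualitative variable is commonly written as below. This is a sketch under assumed notation: x1 is the size of the house and x2 is a dummy variable equal to 1 for one neighborhood and 0 for the other.

```latex
E(y) = \beta_0 + \beta_1 x_1 + \beta_2 x_1^2 + \beta_3 x_2 + \beta_4 x_1 x_2 + \beta_5 x_1^2 x_2
```

Setting x2 = 0 or x2 = 1 gives a separate quadratic curve in x1 for each neighborhood, so intercept, slope, and curvature are all allowed to differ.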
The model \( E(y) = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 + \beta_4 x_4 \) was used to relate E(y) to a single qualitative variable, where
x1 = 1 if level 2, 0 if not
x2 = 1 if level 3, 0 if not
x3 = 1 if level 4, 0 if not
x4 = 1 if level 5, 0 if not
This model was fit to data points and the following result was obtained:
a. Use the least squares prediction equation to find the estimate of E(y) for each level of the qualitative variable.
b. Specify the null and alternative hypotheses you would use to test whether E(y) is the same for all levels of the independent variable.
3 Test if Model is Useful for Predicting y
(Essay)
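With this dummy coding, level 1 is the base level, so its estimated mean is the intercept and every other level adds its own coefficient. The sketch below shows the arithmetic for part a; the coefficient values are hypothetical because the fitted equation is missing from the page, and `estimated_mean` is an illustrative helper name.

```python
# Hypothetical estimates b0..b4: the fitted equation is not shown in the source.
b = [10.0, 2.0, -1.5, 4.0, 0.5]


def estimated_mean(level):
    """Estimated E(y) for a level of the qualitative variable (1 through 5)."""
    if level == 1:
        return b[0]             # base level: all dummies are 0
    return b[0] + b[level - 1]  # level k switches on the dummy with coefficient b[k-1]


means = {level: estimated_mean(level) for level in range(1, 6)}
print(means)
```

Under this coding, equal means across all five levels corresponds to all four dummy coefficients being 0, which is the hypothesis the global F test examines.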
Once interaction has been established between x1 and x2, the first-order terms for x1 and x2 may be deleted from the regression model leaving the higher-order term containing the product of x1 and x2.
(True/False)
Consider the model
where x1 is a quantitative variable and x2 and x3 are dummy variables describing a qualitative variable at three levels using the coding scheme
The resulting least squares prediction equation is
What is the equation of the response curve for when and ?
A)
B)
C)
D)
(Short Answer)
The method of fitting first-order models is the same as that of fitting the simple straight-line model, i.e. the method of least squares.
(True/False)
The model was used to relate E(y) to a single qualitative variable. How many levels does the qualitative variable have?
(Essay)
The sum of squared errors (SSE) of a least squares regression model decreases when new terms are added to the model.
(True/False)
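The behavior of SSE under added terms can be checked numerically by fitting two nested models to the same data. The sketch below reuses the bottles and temperature values from the beachside park question on this page and compares an intercept-only model with a straight-line model; the closed-form slope and intercept formulas are the usual simple linear regression ones, and the variable names are illustrative.

```python
# Bottles sold (y) and high temperature (x) from the beachside park data.
y = [341, 425, 457, 485, 469, 395, 511, 549, 543, 537, 621, 897]
x = [73, 79, 80, 80, 81, 82, 83, 83, 85, 88, 89, 91]

n = len(y)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Model 1: intercept only. The least squares fit is the sample mean.
sse_mean = sum((yi - y_bar) ** 2 for yi in y)

# Model 2: add the term beta_1 * x and refit by least squares.
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
sxx = sum((xi - x_bar) ** 2 for xi in x)
b1 = sxy / sxx
b0 = y_bar - b1 * x_bar
sse_line = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))

print(f"SSE, intercept only: {sse_mean:.1f}")
print(f"SSE, with x added:   {sse_line:.1f}")
```

Because the larger model contains the smaller one as a special case (b1 = 0), its least squares SSE can never be larger.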
The model was fit to a set of data, and the following plot of residuals against x values was obtained.
Interpret the residual plot.

(Essay)
Showing 101 - 120 of 131