Exam 12: Multiple Regression and Model Building
Exam 1: Statistics, Data, and Statistical Thinking74 Questions
Exam 2: Methods for Describing Sets of Data188 Questions
Exam 3: Probability237 Questions
Exam 4: Random Variables and Probability Distributions273 Questions
Exam 5: Sampling Distributions52 Questions
Exam 6: Inferences Based on a Single Sample: Estimation With Confidence Intervals135 Questions
Exam 7: Inferences Based on a Single Sample: 355 Tests of Hypotheses144 Questions
Exam 8: Inferences Based on Two Samples: Confidence Intervals and Tests of Hypotheses102 Questions
Exam 9: Design of Experiments and Analysis of Variance87 Questions
Exam 10: Categorical Data Analysis59 Questions
Exam 11: Simple Linear Regression113 Questions
Exam 12: Multiple Regression and Model Building131 Questions
Exam 13: Methods for Quality Improvement: Statistical Process Control Available on CD89 Questions
Exam 14: Time Series: Descriptive Analyses, Models, and Forecasting Available on CD73 Questions
Exam 15: Nonparametric Statistics Available on CD49 Questions
Select questions type
Retail price data for hard disk drives were recently reported in a computer magazine. Three variables were recorded for each hard disk drive:
Retail PRICE (measured in dollars)
Microprocessor SPEED (measured in megahertz)
(Values in sample range from 10 to 40 )
CHIP size (measured in computer processing units)
(Values in sample range from 286 to 486 )
A first-order regression model was fit to the data. Part of the printout follows:
Analysis of Variance SOURCE DF SS MS F VALUE PROB > F MODEL 2 34593103.008 17296051.504 19.018 0.0001 ERROR 57 51840202.926 909477.24431 CTOTAL 59 86432305.933
ROOT MSE 953.66516 R-SQUARE 0.4002 DEP MEAN 3197.96667 ADJ R-SQ 0.3792 C.V. 29.82099
Test to determine if the model is adequate for predicting the price of a computer. Use .
(Essay)
4.8/5
(38)
A qualitative variable whose outcomes are assigned numerical values is called a coded variable.
(True/False)
4.9/5
(33)
The stepwise regression procedure may not be used when the inclusion of one or more dummy variables is under consideration.
(True/False)
4.9/5
(34)
Consider the interaction model . Determine the change in when is changed from 6 to 7 and is held fixed at 3 .
(Multiple Choice)
4.8/5
(35)
As part of a study at a large university, data were collected on freshmen computer science (CS) majors in a particular year. The researchers were interested in modeling , a student's grade point average (GPA) after three semesters, as a function of the following independent variables (recorded at the time the students enrolled in the university):
average high school grade in mathematics (HSM)
average high school grade in science (HSS)
average high school grade in English (HSE)
SAT mathematics score (SATM)
SAT verbal score (SATV)
A first-order model was fit to data.
A confidence interval for is . Interpret this result.
(Multiple Choice)
4.9/5
(47)
A graphing calculator was used to fit the model to a set of data. The resulting screen is shown below.
Which number on the screen represents the estimator of ?

(Multiple Choice)
4.7/5
(46)
Consider the model
where is a quantitative variable and and are dummy variables describing a qualitative variable at three levels using the coding scheme
The resulting least squares prediction equation is
What is the equation of the response curve for when and ?
(Multiple Choice)
4.8/5
(25)
For any given model fit to a data set, the sum of the residuals is 0.
(True/False)
4.8/5
(31)
The method of fitting first-order models is the same as that of fitting the simple straight-line model, i.e. the method of least squares.
(True/False)
4.8/5
(35)
During its manufacture, a product is subjected to four different tests in sequential order. An efficiency expert claims that the fourth (and last) test is unnecessary since its results can be predicted based on the first three tests. To test this claim, multiple regression will be used to model Test4 score , as a function of Test1 score , Test 2 score , and Test3 score . [Note: All test scores range from 200 to 800 , with higher scores indicative of a higher quality product.] Consider the model:
The first-order model was fit to the data for each of 12 units sampled from the production line.
A prediction interval for Test4 score of a product with Test1 , Test2 , and Test is . Interpret this result.
(Multiple Choice)
4.9/5
(33)
The staff of a test kitchen is attempting to determine the baking time, , of a roast, i.e., the time it takes the internal temperature of the roast to reach , using two variables, the temperature setting of the oven, , and the weight of the roast, , in pounds. The data for 24 roasts are shown below.
Baking Times of Roasts
1 2() () 1 2() () 1 2() () 1 2() () 300 2.2 2.6 325 2.1 2.3 350 2.3 2.2 375 2.2 1.9 300 2.7 2.8 325 2.4 2.4 350 2.5 2.3 375 2.6 2.2 300 2.9 3.1 325 2.9 2.6 350 2.8 2.5 375 2.9 2.4 300 3.1 3.2 325 3.0 2.7 350 3.2 2.7 375 3.1 2.6 300 3.2 3.2 325 3.2 2.9 350 3.4 2.8 375 3.3 2.7 300 3.5 3.3 325 3.6 3.1 350 3.5 2.8 375 3.4 2.7
a. Fit a complete second-order model to the data.
b. Do the data provide sufficient evidence to indicate that the second-order terms contribute information for the prediction of ? State the null and alternative hypotheses and the test statistic. Use .
(Essay)
4.9/5
(39)
One of three surfaces is produced by a complete second-order model with two quantitative independent variables: a paraboloid that opens upward, a paraboloid that opens downward, or a saddle-shaped surface.
(True/False)
4.9/5
(39)
There are four independent variables, , and , that might be useful in predicting a response . A total of observations is available, and it is decided to employ stepwise regression to help in selecting the independent variables that appear to useful. The computer fits all possible one-variable models of the form . The information in the table is provided from the computer printout.
Variable \beta s 1 2.4 0.52 2 -0.2 0.03 3 3.6 2.11 4 0.8 0.44
Which independent variable is declared the best one-variable predictor of ?
(Multiple Choice)
4.8/5
(31)
A public health researcher wants to use regression to predict the sun safety knowledge of pre-school children. The researcher randomly sampled 35 preschoolers, assigned them to one of Two groups, and then measured the following three variables:
SUNSCORE: Score on sun-safety comprehension test
READING: Reading comprehension score
GROUP: if child received a Be Sun Safe demonstration, 0 if not
The following two models were hypothesized:
Model 1:
Model 2:
A partial f-test was conducted to compare the two models and the resulting p-value was found to be 0.0023. Fill in the blank. The results lead us to conclude that there is _____
(Multiple Choice)
4.9/5
(34)
We decide to conduct a multiple regression analysis to predict the attendance at a major league baseball game. We use the size of the stadium as a quantitative independent variable and the type Of game as a qualitative variable (with two levels - day game or night game). We hypothesize the
Following model:
Where size of the stadium
if a day game, 0 if a night game
A plot of the relationship would show:
(Multiple Choice)
4.8/5
(43)
The model was fit to a set of data.
A partial printout for the analysis follows:
Actual Predict Lower 95\% CL Upper 95\% CL OBS X1 X2 Value Value Residual Predict Predict 1 7781 644 74.707 83.175 -8.468 47.224 119.126
Interpret the value of the residual when and .
(Multiple Choice)
4.9/5
(43)
An elections officer wants to model voter turnout in a precinct as a function of the type of precinct.
Consider the model relating mean voter turnout, , to precinct type:
E(y)=++, where =1 if urban, 0 if not =1 if suburban, 0 if not (Base level = rural)
The -value for the test is . 14 . Interpret the result.
(Multiple Choice)
5.0/5
(32)
A study of the top MBA programs attempted to predict the average starting salary (in $1000's) of graduates of the program based on the amount of tuition (in $1000's) charged by the program and the average GMAT score of the program's students. The results of a regression analysis based on a sample of 75 MBA programs is shown below: Least Squares Linear Regression of Salary
Predictor Variables Coefficient Std Error T P VIF Constant -203.402 51.6573 -3.94 0.0002 0.0 Gmat 0.39412 0.09039 4.36 0.0000 2.0
R-Squared 0.6857 Resid. Mean Square (MSE) 427.511 Adjusted R-Squared 0.6769 Standard Deviation 20.6763
Identify the test statistic that should be used to test to determine if the amount of tuition charged by a program is a useful predictor of the average starting salary of the graduates of the program.
(Multiple Choice)
4.8/5
(30)
As part of a study at a large university, data were collected on freshmen computer science (CS) majors in a particular year. The researchers were interested in modeling , a student's grade point average (GPA) after three semesters, as a function of the following independent variables (recorded at the time the students enrolled in the university):
average high school grade in mathematics (HSM)
average high school grade in science (HSS)
average high school grade in English (HSE)
SAT mathematics score (SATM)
SAT verbal score (SATV)
A first-order model was fit to data with the following results:
SOURCE DF SS MS F VALUE PROB > F MODEL 5 28.64 5.73 11.69 .0001 ERROR 218 106.82 0.49 TOTAL 223 135.46
ROOT MSE 0.700 R-SQUARE 0.211 DEP MEAN 4.635 ADJR-SQ 0.193
PARAMETER STANDARD T FOR 0: VARIABLES ESTIMATE ERROR PARAMETER =0 PROB >|T| INTERCEPT 2.327 0.039 5.817 0.0001 X1 (HSM) 0.146 0.037 3.718 0.0003 X2 (HSS) 0.036 0.038 0.950 0.3432 X3 (HSE) 0.055 0.040 1.397 0.1637 X4 (SATM) 0.00094 0.00068 1.376 0.1702 X5 (SATV) -0.00041 0.00059 -0.689 0.4915
Interpret the value under the column heading .
(Multiple Choice)
4.8/5
(41)
Showing 81 - 100 of 131
Filters
- Essay(0)
- Multiple Choice(0)
- Short Answer(0)
- True False(0)
- Matching(0)