Exam 14: Introduction to Multiple Regression
Exam 1: Introduction and Data Collection137 Questions
Exam 2: Presenting Data in Tables and Charts181 Questions
Exam 3: Numerical Descriptive Measures138 Questions
Exam 4: Basic Probability152 Questions
Exam 5: Some Important Discrete Probability Distributions174 Questions
Exam 6: The Normal Distribution and Other Continuous Distributions180 Questions
Exam 7: Sampling Distributions and Sampling180 Questions
Exam 8: Confidence Interval Estimation185 Questions
Exam 9: Fundamentals of Hypothesis Testing: One-Sample Tests180 Questions
Exam 10: Two-Sample Tests184 Questions
Exam 11: Analysis of Variance179 Questions
Exam 12: Chi-Square Tests and Nonparametric Tests206 Questions
Exam 13: Simple Linear Regression196 Questions
Exam 14: Introduction to Multiple Regression258 Questions
Exam 15: Multiple Regression Model Building88 Questions
Exam 16: Time-Series Forecasting and Index Numbers193 Questions
Exam 17: Decision Making127 Questions
Exam 18: Statistical Applications in Quality Management113 Questions
Exam 19: Statistical Analysis Scenarios and Distributions82 Questions
Select questions type
When an additional explanatory variable is introduced into a multiple regression model, the coefficient of multiple determination will never decrease.
(True/False)
4.9/5
(35)
TABLE 14-4
A real estate builder wishes to determine how house size (House) is influenced by family income (Income), family size (Size), and education of the head of household (School). House size is measured in hundreds of square feet, income is measured in thousands of dollars, and education is in years. The builder randomly selected 50 families and ran the multiple regression.
Microsoft Excel output is provided below:
Regression Stuistics Multiple R 0.865 R Square 0.748 Adjusted R Square 0.726 Standard Error 5.195 Observations 50
ANOVA
d f S S M S F Significance F Regression 3605.7736 1201.9245 0.0000 Residual 1214.2264 26.3962 Total 49 4820.0000
Coefficients Standard Error t Stat p -value Intercept -1.6335 5.8078 -0.281 0.7798 Income 0.4485 0.1137 3.9545 0.0003 Size 4.2615 0.8062 5.286 0.0001 School -0.6517 0.4319 -1.509 0.1383
-Referring to Table 14-4, which of the following values for the level of significance is the smallest for which at least two explanatory variables are significant individually?
(Multiple Choice)
4.8/5
(34)
TABLE 14-14
An econometrician is interested in evaluating the relation of demand for building materials to mortgage rates in Los Angeles and San Francisco. He believes that the appropriate model is
Y = 10 + 5X1 + 8X2
where X1 = mortgage rate in %
X2 = 1 if SF, 0 if LA
Y = demand in $100 per capita
-Referring to Table 14-14, holding constant the effect of city, each additional increase of 1% in the mortgage rate would lead to an estimated increase of _____in the mean demand.
(Short Answer)
4.9/5
(34)
TABLE 14-11
A logistic regression model was estimated in order to predict the probability that a randomly chosen university or college would be a private university using information on average total Scholastic Aptitude Test score (SAT) at the university or college, the room and board expense measured in thousands of dollars (Room/Brd), and whether the TOEFL criterion is at least 550 (Toefl550 = 1 if yes, 0 otherwise.) The dependent variable, Y, is school type (Type = 1 if private and 0 otherwise).
The Minitab output is given below:
Logistic Regression Table
Odds 95: CI Predictor Coef SE Coef Z P Ratio Lower Upper Constant -27.118 6.696 -4.05 0.000 SAT 0.015 0.004666 3.17 0.002 1.01 1.01 1.02 Toefl550 -0.390 0.9538 -0.41 0.682 0.68 0.10 4.39 Room/Brd 2.078 0.5076 4.09 0.000 7.99 2.95 21.60
Log-Likelihood = -21.883
Test that all slopes are zero: G = 62.083, DF = 3, P-Value = 0.000
Goodness-of-Fit Tests
Method Chi-Square DF P Pearson 143.551 76 0.000 Deviance 43.767 76 0.999 Hosmer-Lemeshow 15.731 8 0.046
-Referring to Table 14-11, the null hypothesis that the model is a good-fitting model cannot be rejected when allowing for a 5% probability of making a type I error.
(True/False)
4.8/5
(32)
TABLE 14-17
The marketing manager for a nationally franchised lawn service company would like to study the characteristics that differentiate home owners who do and do not have a lawn service. A random sample of 30 home owners located in a suburban area near a large city was selected; 15 did not have a lawn service (code 0) and 15 had a lawn service (code 1). Additional information available concerning these 30 home owners includes family income (Income, in thousands of dollars), lawn size (Lawn Size, in thousands of square feet), attitude toward outdoor recreational activities (Attitude 0 = unfavorable, 1
= favorable), number of teenagers in the household (Teenager), and age of the head of the household (Age).
The Minitab output is given below:
Odds 95 \% CI Predictor Coef SE Coef Z P Ratio Lower Upper Constant -70.49 47.22 -1.49 0.135 Income 0.2868 0.1523 1.88 0.060 1.33 0.99 1.80 Lawn Size 1.0647 0.7472 1.42 0.154 2.90 0.67 12.54 Attitude -12.744 9.455 -1.35 0.178 0.00 0.00 326.06 Teenager -0.200 1.061 -0.19 0.850 0.82 0.10 6.56 Age 1.0792 0.8783 1.23 0.219 2.94 0.53 16.45
Log-Likelihood = -4.890
Test that all slopes are zero: G = 31.808, DF = 5, P-Value = 0.000
Goodness-of-Fit Tests
Method Chi-Square DF P Pearson 9.313 24 0.997 Deviance 9.780 24 0.995 Hosmer-Lemeshow 0.571 8 1.000
-Referring to Table 14-17, there is not enough evidence to conclude that Age makes a significant contribution to the model in the presence of the other independent variables at a 0.05 level of significance.
(True/False)
4.8/5
(30)
In trying to obtain a model to estimate grades on a statistics test, a professor wanted to include, among other factors, whether the person had taken the course previously. To do this, the professor included a dummy variable in her regression that was equal to 1 if the person had previously taken the course, and 0 otherwise. The interpretation of the coefficient associated with this dummy variable would be the average amount the repeat students tended to be above or below
non-repeaters, with all other factors the same.
(True/False)
4.9/5
(45)
A multiple regression is called "multiple" because it has several explanatory variables.
(True/False)
4.9/5
(38)
TABLE 14-16
The superintendent of a school district wanted to predict the percentage of students passing a sixth-grade proficiency test. She obtained the data on percentage of students passing the proficiency test (% Passing), daily average of the percentage of students attending class (% Attendance), average teacher salary in dollars (Salaries), and instructional spending per pupil in dollars (Spending) of 47 schools in the state.
Following is the multiple regression output with Y = % Passing as the dependent variable, X1 = % Attendance, X2 = Salaries and
X3 = Spending:
Regression Statistics Multiple R 0.7930 R Square 0.6288 Adjusted R Square 0.6029 Standard Error 10.4570 Observations 47
ANOVA
d f SS MS F Significance F Regression 3 7965.08 2655.03 24.2802 2.3853-09 Residual 43 4702.02 109.35 Total 46 12667.11
Coeffs Stnd Err t Stat p -value Lower 95\% Upper 95\% Intercept -753.4225 101.1149 -7.4511 2.88-09 -957.3401 -549.5050 \% Attend 8.5014 1.0771 7.8929 6.73-10 6.3292 10.6735 Salary 6.85-07 0.0006 0.0011 0.9991 -0.0013 0.0013 Spending 0.0060 0.0046 1.2879 0.2047 -0.0034 0.0153
-Referring to Table 14-16, what is the standard error of estimate?
(Short Answer)
4.8/5
(34)
A regression had the following results: SST = 102.55, SSE = 82.04. It can be said that 20.0% of the variation in the dependent variable is explained by the independent variables in the regression.
(True/False)
4.9/5
(30)
TABLE 14-7
The department head of the accounting department wanted to see if she could predict the GPA of students using the number of course units (credits) and total SAT scores of each. She takes a sample of students and generates the following Microsoft Excel output:
Regression Statistics Multiple R 0.916 R Square 0.839 Adjusted R Square 0.732 Standard Error 0.24685 Observations 6
ANOVA
d f SS M S F Significance F Regression 2 0.95219 0.47610 7.813 0.0646 Residual 3 0.18281 0.06094 Total 5 1.13500
Coefficients Standard Error t Stat p -value Intercept 4.593897 1.13374542 4.052 0.0271 Units -0.247270 0.06268485 -3.945 0.0290 SAT Total 0.001443 0.00101241 1.425 0.2494
-Referring to Table 14-7, the department head wants to test H0 : ?1 = ?2 = 0. The p-value of the test is____ .
(Short Answer)
4.9/5
(28)
TABLE 14-5
A microeconomist wants to determine how corporate sales are influenced by capital and wage spending by companies. She proceeds to randomly select 26 large corporations and record information in millions of dollars. The Microsoft Excel output below shows results of this multiple regression.
Regression Statistics Multiple R 0.830 R Square 0.689 Adjusted R Square 0.662 Standard Error 17501.643 Observations 26
ANOVA
d f S S M S F Significance F Regression 2 15579777040 7789888520 25.432 0.0001 Residual 23 7045072780 306307512 Total 25 22624849820
Coefficients Standard Error t Stat p-value Intercept 15800.0000 6038.2999 2.617 0.0154 C apital 0.1245 0.2045 0.609 0.5485 W ages 7.0762 1.4729 4.804 0.0001
-Referring to Table 14-5, what is the p-value for testing whether Wages have a positive impact on corporate sales?
(Multiple Choice)
4.8/5
(39)
In calculating the standard error of the estimate, ,, there are n - k - 1 degrees of freedom, where n is the sample size and k represents the number of independent variables in the model.
(True/False)
4.8/5
(39)
From the coefficient of multiple determination, we cannot detect the strength of the relationship between Y and any individual independent variable.
(True/False)
4.9/5
(44)
TABLE 14-16
The superintendent of a school district wanted to predict the percentage of students passing a sixth-grade proficiency test. She obtained the data on percentage of students passing the proficiency test (% Passing), daily average of the percentage of students attending class (% Attendance), average teacher salary in dollars (Salaries), and instructional spending per pupil in dollars (Spending) of 47 schools in the state.
Following is the multiple regression output with Y = % Passing as the dependent variable, X1 = % Attendance, X2 = Salaries and
X3 = Spending:
Regression Statistics Multiple R 0.7930 R Square 0.6288 Adjusted R Square 0.6029 Standard Error 10.4570 Observations 47
ANOVA
d f SS MS F Significance F Regression 3 7965.08 2655.03 24.2802 2.3853-09 Residual 43 4702.02 109.35 Total 46 12667.11
Coeffs Stnd Err t Stat p -value Lower 95\% Upper 95\% Intercept -753.4225 101.1149 -7.4511 2.88-09 -957.3401 -549.5050 \% Attend 8.5014 1.0771 7.8929 6.73-10 6.3292 10.6735 Salary 6.85-07 0.0006 0.0011 0.9991 -0.0013 0.0013 Spending 0.0060 0.0046 1.2879 0.2047 -0.0034 0.0153
-Referring to Table 14-16, the null hypothesis should be rejected at a 5% level of significance when testing whether instructional spending per pupil has any effect on percentage of students passing the proficiency test.
(True/False)
4.9/5
(35)
TABLE 14-16
The superintendent of a school district wanted to predict the percentage of students passing a sixth-grade proficiency test. She obtained the data on percentage of students passing the proficiency test (% Passing), daily average of the percentage of students attending class (% Attendance), average teacher salary in dollars (Salaries), and instructional spending per pupil in dollars (Spending) of 47 schools in the state.
Following is the multiple regression output with Y = % Passing as the dependent variable, X1 = % Attendance, X2 = Salaries and
X3 = Spending:
Regression Statistics Multiple R 0.7930 R Square 0.6288 Adjusted R Square 0.6029 Standard Error 10.4570 Observations 47
ANOVA
d f SS MS F Significance F Regression 3 7965.08 2655.03 24.2802 2.3853-09 Residual 43 4702.02 109.35 Total 46 12667.11
Coeffs Stnd Err t Stat p -value Lower 95\% Upper 95\% Intercept -753.4225 101.1149 -7.4511 2.88-09 -957.3401 -549.5050 \% Attend 8.5014 1.0771 7.8929 6.73-10 6.3292 10.6735 Salary 6.85-07 0.0006 0.0011 0.9991 -0.0013 0.0013 Spending 0.0060 0.0046 1.2879 0.2047 -0.0034 0.0153
-Referring to Table 14-16, estimate the average percentage of students passing the proficiency test for all the schools that have a daily average of 95% of students attending class, an average teacher salary of 40,000 dollars, and an instructional spending per pupil of 2000 dollars.
(Short Answer)
4.8/5
(34)
TABLE 14-9
You decide to predict gasoline prices in different cities and towns in the United States for your term project. Your dependent variable is price of gasoline per gallon and your explanatory variables are per capita income, the number of firms that manufacture automobile parts in and around the city, the number of new business starts in the last year, population density of the city, percentage of local taxes on gasoline, and the number of people using public transportation. You collected data of 32 cities and obtained a regression sum of squares SSR= 122.8821. Your computed value of standard error of the estimate is 1.9549.
-Referring to Table 14-9, if variables that measure the number of new business starts in the last year and population density of the city were removed from the multiple regression model, which of the following would be true?
(Multiple Choice)
4.7/5
(33)
TABLE 14-17
The marketing manager for a nationally franchised lawn service company would like to study the characteristics that differentiate home owners who do and do not have a lawn service. A random sample of 30 home owners located in a suburban area near a large city was selected; 15 did not have a lawn service (code 0) and 15 had a lawn service (code 1). Additional information available concerning these 30 home owners includes family income (Income, in thousands of dollars), lawn size (Lawn Size, in thousands of square feet), attitude toward outdoor recreational activities (Attitude 0 = unfavorable, 1
= favorable), number of teenagers in the household (Teenager), and age of the head of the household (Age).
The Minitab output is given below:
Odds 95 \% CI Predictor Coef SE Coef Z P Ratio Lower Upper Constant -70.49 47.22 -1.49 0.135 Income 0.2868 0.1523 1.88 0.060 1.33 0.99 1.80 Lawn Size 1.0647 0.7472 1.42 0.154 2.90 0.67 12.54 Attitude -12.744 9.455 -1.35 0.178 0.00 0.00 326.06 Teenager -0.200 1.061 -0.19 0.850 0.82 0.10 6.56 Age 1.0792 0.8783 1.23 0.219 2.94 0.53 16.45
Log-Likelihood = -4.890
Test that all slopes are zero: G = 31.808, DF = 5, P-Value = 0.000
Goodness-of-Fit Tests
Method Chi-Square DF P Pearson 9.313 24 0.997 Deviance 9.780 24 0.995 Hosmer-Lemeshow 0.571 8 1.000
-Referring to Table 14-17, what should be the decision ('reject' or 'do not reject') on the null hypothesis when testing whether Age makes a significant contribution to the model in the presence of the other independent variables at a 0.05 level of significance?
(Short Answer)
4.8/5
(36)
TABLE 14-11
A logistic regression model was estimated in order to predict the probability that a randomly chosen university or college would be a private university using information on average total Scholastic Aptitude Test score (SAT) at the university or college, the room and board expense measured in thousands of dollars (Room/Brd), and whether the TOEFL criterion is at least 550 (Toefl550 = 1 if yes, 0 otherwise.) The dependent variable, Y, is school type (Type = 1 if private and 0 otherwise).
The Minitab output is given below:
Logistic Regression Table
Odds 95: CI Predictor Coef SE Coef Z P Ratio Lower Upper Constant -27.118 6.696 -4.05 0.000 SAT 0.015 0.004666 3.17 0.002 1.01 1.01 1.02 Toefl550 -0.390 0.9538 -0.41 0.682 0.68 0.10 4.39 Room/Brd 2.078 0.5076 4.09 0.000 7.99 2.95 21.60
Log-Likelihood = -21.883
Test that all slopes are zero: G = 62.083, DF = 3, P-Value = 0.000
Goodness-of-Fit Tests
Method Chi-Square DF P Pearson 143.551 76 0.000 Deviance 43.767 76 0.999 Hosmer-Lemeshow 15.731 8 0.046
-Referring to Table 14-11, what is the p-value of the test statistic when testing whether SAT makes a significant contribution to the model in the presence of the other independent variables?
(Short Answer)
4.9/5
(38)
TABLE 14-2
A professor of industrial relations believes that an individual's wage rate at a factory (Y) depends on his performance rating (X1) and the number of economics courses the employee successfully completed in college (X2). The professor randomly selects 6 workers and collects the following information:
Employee Y(\ ) 1 10 3 0 2 12 1 5 3 15 8 1 4 17 5 8 5 20 7 12 6 25 10 9
-Referring to Table 14-2, an employee who took 12 economics courses scores 10 on the performance rating. What is her estimated expected wage rate?
(Multiple Choice)
4.8/5
(38)
TABLE 14-11
A logistic regression model was estimated in order to predict the probability that a randomly chosen university or college would be a private university using information on average total Scholastic Aptitude Test score (SAT) at the university or college, the room and board expense measured in thousands of dollars (Room/Brd), and whether the TOEFL criterion is at least 550 (Toefl550 = 1 if yes, 0 otherwise.) The dependent variable, Y, is school type (Type = 1 if private and 0 otherwise).
The Minitab output is given below:
Logistic Regression Table
Odds 95: CI Predictor Coef SE Coef Z P Ratio Lower Upper Constant -27.118 6.696 -4.05 0.000 SAT 0.015 0.004666 3.17 0.002 1.01 1.01 1.02 Toefl550 -0.390 0.9538 -0.41 0.682 0.68 0.10 4.39 Room/Brd 2.078 0.5076 4.09 0.000 7.99 2.95 21.60
Log-Likelihood = -21.883
Test that all slopes are zero: G = 62.083, DF = 3, P-Value = 0.000
Goodness-of-Fit Tests
Method Chi-Square DF P Pearson 143.551 76 0.000 Deviance 43.767 76 0.999 Hosmer-Lemeshow 15.731 8 0.046
-Referring to Table 14-11, what should be the decision ('reject' or 'do not reject') on the null hypothesis when testing whether Toefl500 makes a significant contribution to the model in the presence of the other independent variables at a 0.05 level of significance?
(Short Answer)
5.0/5
(38)
Showing 221 - 240 of 258
Filters
- Essay(0)
- Multiple Choice(0)
- Short Answer(0)
- True False(0)
- Matching(0)