Exam 16: Multiple Regression Model Building
Exam 1: Introduction and Data Collection131 Questions
Exam 2: Presenting Data in Tables and Charts178 Questions
Exam 3: Numerical Descriptive Measures148 Questions
Exam 4: Basic Probability146 Questions
Exam 5: Some Important Discrete Probability Distributions169 Questions
Exam 6: The Normal Distribution and Other Continuous Distributions187 Questions
Exam 7: Sampling Distributions183 Questions
Exam 8: Confidence Interval Estimation176 Questions
Exam 9: Fundamentals of Hypothesis Testing: One-Sample Tests167 Questions
Exam 10: Hypothesis Testing: Two Sample Tests160 Questions
Exam 11: Analysis of Variance141 Questions
Exam 12: Simple Linear Regression196 Questions
Exam 13: Introduction to Multiple Regression256 Questions
Exam 14: Time-Series Forecasting and Index Numbers203 Questions
Exam 15: Chi-Square Tests135 Questions
Exam 16: Multiple Regression Model Building92 Questions
Exam 17: Decision Making111 Questions
Exam 18: Statistical Applications in Quality and Productivity Management127 Questions
Exam 19: Further Non-Parametric Tests51 Questions
Select questions type
Instruction 16-6
Given below are results from the regression analysis on 40 observations where the dependent variable is the number of weeks a worker is unemployed due to a layoff (Y)and the independent variables are the age of the worker (X1),the number of years of education received (X2),the number of years at the previous job (X3),a dummy variable for marital status (X4: 1 = married,0 = otherwise),a dummy variable for head of household (X5: 1 = yes,0 = no)and a dummy variable for management position (X6: 1 = yes,0 = no).
The coefficient of multiple determination (R2j)the regression model using each of the 6 variables Xj as the dependent variable and all other X variables as independent variables are,respectively,0.2628,0.1240,0.2404,0.3510,0.3342 and 0.0993.
The partial results from best-subset regression are given below:
Model R Square Adj. R Square Std. Error X1X5Х6 0.4568 0.4116 18.3534 \times1\times2\times5\times6 0.4697 0.4091 18.3919 \times1\times3\times5\times6 0.4691 0.4084 18.4023 \times1\times2\times3\times5\times6 0.4877 0.4123 18.3416 \times1\times2\times3\times4\times5\times6 0.4949 0.4030 18.4861
-Referring to Instruction 16-6,the model that includes X1,X2,X3,X5 and X6 should be among the appropriate models using the Mallow's Cp statistic.
Free
(True/False)
4.7/5
(37)
Correct Answer:
True
The logarithm transformation can be used
Free
(Multiple Choice)
4.8/5
(41)
Correct Answer:
C
Instruction 16-6
Given below are results from the regression analysis on 40 observations where the dependent variable is the number of weeks a worker is unemployed due to a layoff (Y)and the independent variables are the age of the worker (X1),the number of years of education received (X2),the number of years at the previous job (X3),a dummy variable for marital status (X4: 1 = married,0 = otherwise),a dummy variable for head of household (X5: 1 = yes,0 = no)and a dummy variable for management position (X6: 1 = yes,0 = no).
The coefficient of multiple determination (R2j)the regression model using each of the 6 variables Xj as the dependent variable and all other X variables as independent variables are,respectively,0.2628,0.1240,0.2404,0.3510,0.3342 and 0.0993.
The partial results from best-subset regression are given below:
Model R Square Adj. R Square Std. Error X1X5Х6 0.4568 0.4116 18.3534 \times1\times2\times5\times6 0.4697 0.4091 18.3919 \times1\times3\times5\times6 0.4691 0.4084 18.4023 \times1\times2\times3\times5\times6 0.4877 0.4123 18.3416 \times1\times2\times3\times4\times5\times6 0.4949 0.4030 18.4861
-Referring to Instruction 16-6,the model that includes all six independent variables should be selected using the adjusted r2 statistic.
Free
(True/False)
4.7/5
(36)
Correct Answer:
False
A real estate builder wishes to determine how house size (House)is influenced by family income (Income),family size (Size),and education of the head of household (School).House size is measured in hundreds of square metres,income is measured in thousands of dollars,and education is in years.The builder randomly selected 50 families and developed a multiple regression model.The business literature involving human capital shows that education influences an individual's annual income.Combined,these may influence family size.With this in mind,what should the real estate builder be particularly concerned with when analysing the multiple regression model?
(Multiple Choice)
4.8/5
(37)
As a project for his business statistics class,a student examined the factors that determined parking metre rates throughout the campus area.Data were collected for the price per hour of parking,blocks to the quadrangle,and one of the three jurisdictions: on campus,in downtown and off campus,or outside of downtown and off campus.The population regression model hypothesised is Yi = α + β1x1i + β2x2i + β3x3i + εi
Where
Y is the metre price
X1 is the number of blocks to the quad
X2 is a dummy variable that takes the value 1 if the metre is located in downtown
And off campus and the value 0 otherwise
X3 is a dummy variable that takes the value 1 if the metre is located outside of
Downtown and off campus,and the value 0 otherwise
Suppose that whether the metre is located on campus is an important explanatory factor.Why should the variable that depicts this attribute not be included in the model?
(Multiple Choice)
4.9/5
(31)
Instruction 16-5
A chemist employed by a pharmaceutical firm has developed a muscle relaxant.She took a sample of 14 people suffering from extreme muscle constriction.She gave each a vial containing a dose (X)of the drug and recorded the time to relief (Y)measured in seconds for each.She fit a quadratic model to this data.The results obtained by Microsoft Excel follow.
SUMMARY output Regression Statistics Multiple R 0.747 R Square 0.558 Adj. R Square 0.478 Std. Error 363.1 Observations 14 ANOVA df SS MS F Signưf F Regression 2 10344797 5172399 6.94 0.0110 Residual 11 8193929 744903 Total 13 18538726 Coeff StdErior Stat P -value Intercept 1283.0 352.0 3.65 0.0040 CenDose 25.228 3.631 2.92 0.0140 CenDoseSq 0.8604 0.3722 2.31 0.0410 Note: Adj.R Square = Adjusted R Square;Std.Error = Standard Error
-Referring to Instruction 16-5,suppose the chemist decides to use an F test to determine if there is a significant quadratic curvilinear relationship between time and dose.The value of the test statistic is ________.
(Short Answer)
4.9/5
(38)
So that we can fit curves as well as lines by regression,we often use mathematical manipulations for converting one variable into a different form.These manipulations are called dummy variables.
(True/False)
4.8/5
(40)
If a person uses a multiple regression model when it is clear that model assumptions have not been met,this is a violation of ethics.
(True/False)
4.9/5
(39)
One transformation that may help overcome violations to the assumption of equal variance is the square-root transformation.
(True/False)
5.0/5
(35)
Instruction 16-2
A certain type of rare gem serves as a status symbol for many of its owners.In theory,for low prices,the demand decreases as the price of the gem increases.However,experts hypothesise that when the gem is valued at very high prices,the demand increases with price due to the status owners believe they gain in obtaining the gem.Thus,the model proposed to best explain the demand for the gem by its price is the quadratic model:
Y = β0 + β1X + β2X2 + ε
where Y = demand (in thousands)and X = retail price per carat.
This model was fit to data collected for a sample of 12 rare gems of this type.A portion of the computer analysis obtained from Microsoft Excel is shown below:
SUMMARY Regression Statistics Multiple R 0.994 R Square 0.988 Std. Error 12.42 Observations 12 ANOVA If SS MS F Sigưf F Regression 2 115145 57573 373 0.0001 Residual 9 1388 154 Total 11 116533 Coeff StdError Stat P -Value Intercept 286.42 9.66 29.64 0.0001 Price 0.31 0.06 -5.14 0.0006 Price Sq 0.000067 0.00007 0.95 0.3647 Note: Std.Error = Standard Error
-Referring to Instruction 16-2,and noting that this model includes both a linear and a quadratic term,what is the correct interpretation of the coefficient of multiple determination?
(Multiple Choice)
4.8/5
(32)
Instruction 16-5
A chemist employed by a pharmaceutical firm has developed a muscle relaxant.She took a sample of 14 people suffering from extreme muscle constriction.She gave each a vial containing a dose (X)of the drug and recorded the time to relief (Y)measured in seconds for each.She fit a quadratic model to this data.The results obtained by Microsoft Excel follow.
SUMMARY output Regression Statistics Multiple R 0.747 R Square 0.558 Adj. R Square 0.478 Std. Error 363.1 Observations 14 ANOVA df SS MS F Signưf F Regression 2 10344797 5172399 6.94 0.0110 Residual 11 8193929 744903 Total 13 18538726 Coeff StdErior Stat P -value Intercept 1283.0 352.0 3.65 0.0040 CenDose 25.228 3.631 2.92 0.0140 CenDoseSq 0.8604 0.3722 2.31 0.0410 Note: Adj.R Square = Adjusted R Square;Std.Error = Standard Error
-Referring to Instruction 16-5,suppose the chemist decides to use a t test to determine if there is a significant difference between a quadratic model without a linear term and a quadratic model that includes a linear term.The value of the test statistic is ________.
(Short Answer)
4.9/5
(26)
Instruction 16-1
To explain personal consumption (CONS)measured in dollars,data is collected for
INC personal income in dollars \ 1 plus the credit limit in dollars available to the CRDTLIM individual average annualised percentage interest rate for APR: borrowing for the individual per person advertising expenditure in dollars by manufacturers in the city where the ADVT: individual lives GENDER: gender of the individual; 1 if female, 0 if male A regression analysis was performed with CONS as the dependent variable and log(CRDTLIM),log(APR),log(ADVT),and GENDER as the independent variables.The estimated model was
= 2.28 - 0.29 log(CRDTLIM)+ 5.77 log(APR)+ 2.35 log(ADVT)+ 0.39 GENDER
-Referring to Instruction 16-1,and noting that ADVT has been transformed using the log transformation,what is the correct interpretation for the estimated coefficient for ADVT?
(Multiple Choice)
4.7/5
(36)
One of the consequences of collinearity in multiple regression is inflated standard errors in some or all of the estimated slope coefficients.
(True/False)
4.8/5
(30)
Instruction 16-1
To explain personal consumption (CONS)measured in dollars,data is collected for
INC personal income in dollars \ 1 plus the credit limit in dollars available to the CRDTLIM individual average annualised percentage interest rate for APR: borrowing for the individual per person advertising expenditure in dollars by manufacturers in the city where the ADVT: individual lives GENDER: gender of the individual; 1 if female, 0 if male A regression analysis was performed with CONS as the dependent variable and log(CRDTLIM),log(APR),log(ADVT),and GENDER as the independent variables.The estimated model was
= 2.28 - 0.29 log(CRDTLIM)+ 5.77 log(APR)+ 2.35 log(ADVT)+ 0.39 GENDER
-Referring to Instruction 16-1,and noting that ADVT has been transformed using the log transformation,what is the correct interpretation for the estimated coefficient for APR?
(Multiple Choice)
4.8/5
(37)
Instruction 16-2
A certain type of rare gem serves as a status symbol for many of its owners.In theory,for low prices,the demand decreases as the price of the gem increases.However,experts hypothesise that when the gem is valued at very high prices,the demand increases with price due to the status owners believe they gain in obtaining the gem.Thus,the model proposed to best explain the demand for the gem by its price is the quadratic model:
Y = β0 + β1X + β2X2 + ε
where Y = demand (in thousands)and X = retail price per carat.
This model was fit to data collected for a sample of 12 rare gems of this type.A portion of the computer analysis obtained from Microsoft Excel is shown below:
SUMMARY output Regression Statistics Multiple R 0.994 R Square 0.988 Std. Error 12.42 Observations 12 ANOVA dff SS MS F Siguif F Regression 2 115145 57573 373 0.0001 Residual 9 1388 154 Total 11 116533 Coeff StdError t Stat P-Value Intercept 286.42 9.66 29.64 0.0001 Price -0.31 0.06 -5.14 0.0006 Price Sq p.000067 p.00007 p.95 p.3647 Note: Std.Error = Standard Error
-Referring to Instruction 16-2,what is the value of the test statistic for testing whether the quadratic term is necessary in fitting in the response curve relating the demand (Y)and the price (X)?
(Multiple Choice)
4.7/5
(32)
Instruction 16-5
A chemist employed by a pharmaceutical firm has developed a muscle relaxant.She took a sample of 14 people suffering from extreme muscle constriction.She gave each a vial containing a dose (X)of the drug and recorded the time to relief (Y)measured in seconds for each.She fit a quadratic model to this data.The results obtained by Microsoft Excel follow.
SUMMARY output Regression Statistics Multiple R 0.747 R Square 0.558 Adj. R Square 0.478 Std. Error 363.1 Observations 14 ANOVA df SS MS F Signưf F Regression 2 10344797 5172399 6.94 0.0110 Residual 11 8193929 744903 Total 13 18538726 Coeff StdErior Stat P -value Intercept 1283.0 352.0 3.65 0.0040 CenDose 25.228 3.631 2.92 0.0140 CenDoseSq 0.8604 0.3722 2.31 0.0410 Note: Adj.R Square = Adjusted R Square;Std.Error = Standard Error
-Referring to Instruction 16-5,suppose the chemist decides to use an F test to determine if there is a significant quadratic relationship between time and dose.The p-value of the test is ________.
(Short Answer)
4.7/5
(32)
Instruction 16-6
Given below are results from the regression analysis on 40 observations where the dependent variable is the number of weeks a worker is unemployed due to a layoff (Y)and the independent variables are the age of the worker (X1),the number of years of education received (X2),the number of years at the previous job (X3),a dummy variable for marital status (X4: 1 = married,0 = otherwise),a dummy variable for head of household (X5: 1 = yes,0 = no)and a dummy variable for management position (X6: 1 = yes,0 = no).
The coefficient of multiple determination (R2j)the regression model using each of the 6 variables Xj as the dependent variable and all other X variables as independent variables are,respectively,0.2628,0.1240,0.2404,0.3510,0.3342 and 0.0993.
The partial results from best-subset regression are given below:
Model R Square Adj. R Square Std. Error X1X5Х6 0.4568 0.4116 18.3534 \times1\times2\times5\times6 0.4697 0.4091 18.3919 \times1\times3\times5\times6 0.4691 0.4084 18.4023 \times1\times2\times3\times5\times6 0.4877 0.4123 18.3416 \times1\times2\times3\times4\times5\times6 0.4949 0.4030 18.4861
-Referring to Instruction 16-6,the model that includes X1,X5 and X6 should be among the appropriate models using the Mallow's Cp statistic.
(True/False)
4.9/5
(32)
Instruction 16-5
A chemist employed by a pharmaceutical firm has developed a muscle relaxant.She took a sample of 14 people suffering from extreme muscle constriction.She gave each a vial containing a dose (X)of the drug and recorded the time to relief (Y)measured in seconds for each.She fit a quadratic model to this data.The results obtained by Microsoft Excel follow.
SUMMARY output Regression Statistics Multiple R 0.747 R Square 0.558 Adj. R Square 0.478 Std. Error 363.1 Observations 14 ANOVA df SS MS F Signưf F Regression 2 10344797 5172399 6.94 0.0110 Residual 11 8193929 744903 Total 13 18538726 Coeff StdErior Stat P -value Intercept 1283.0 352.0 3.65 0.0040 CenDose 25.228 3.631 2.92 0.0140 CenDoseSq 0.8604 0.3722 2.31 0.0410 Note: Adj.R Square = Adjusted R Square;Std.Error = Standard Error
-Referring to Instruction 16-5,suppose the chemist decides to use a t test to determine if there is a significant difference between a linear model and a quadratic model that includes a linear term.If she used a level of significance of 0.01,she would decide that the linear model is sufficient.
(True/False)
4.7/5
(26)
An independent variable Xj is considered highly correlated with the other independent variables if
(Multiple Choice)
4.9/5
(35)
Instruction 16-6
Given below are results from the regression analysis on 40 observations where the dependent variable is the number of weeks a worker is unemployed due to a layoff (Y)and the independent variables are the age of the worker (X1),the number of years of education received (X2),the number of years at the previous job (X3),a dummy variable for marital status (X4: 1 = married,0 = otherwise),a dummy variable for head of household (X5: 1 = yes,0 = no)and a dummy variable for management position (X6: 1 = yes,0 = no).
The coefficient of multiple determination (R2j)the regression model using each of the 6 variables Xj as the dependent variable and all other X variables as independent variables are,respectively,0.2628,0.1240,0.2404,0.3510,0.3342 and 0.0993.
The partial results from best-subset regression are given below:
Model R Square Adj. R Square Std. Error X1X5Х6 0.4568 0.4116 18.3534 \times1\times2\times5\times6 0.4697 0.4091 18.3919 \times1\times3\times5\times6 0.4691 0.4084 18.4023 \times1\times2\times3\times5\times6 0.4877 0.4123 18.3416 \times1\times2\times3\times4\times5\times6 0.4949 0.4030 18.4861
-Referring to Instruction 16-6,the variable X1 should be dropped to remove collinearity.
(True/False)
4.9/5
(43)
Showing 1 - 20 of 92
Filters
- Essay(0)
- Multiple Choice(0)
- Short Answer(0)
- True False(0)
- Matching(0)