Deck 4: Regression Models
1
In regression, a dependent variable is sometimes called a predictor variable.
False
2
The dependent variable is also called the response variable.
True
3
In any regression model, there is an implicit assumption that a relationship exists between the variables.
True
4
Summing the error values in a regression model is misleading because negative errors cancel out positive errors.
5
One purpose of regression is to predict the value of one variable based on the other variable.
6
Error is the difference between the actual value and the predicted value.
7
Estimates of the slope, intercept, and error of a regression model are found from sample data.
8
In a scatter diagram, the dependent variable is typically plotted on the horizontal axis.
9
The coefficient of determination takes on values between −1 and +1.
10
One purpose of regression is to understand the relationship between variables.
11
There is no relationship between variables unless the data points lie in a straight line.
12
The SST measures the total variability in the dependent variable about the regression line.
13
The coefficient of determination gives the proportion of the variability in the dependent variable that is explained by the regression equation.
14
The SSE measures the total variability in the independent variable about the regression line.
15
The variable to be predicted is the dependent variable.
16
The regression line minimizes the sum of the squared errors.
17
In regression, there is random error that can be predicted.
18
The SSR indicates how much of the total variability in the dependent variable is explained by the regression model.
19
In regression, an independent variable is sometimes called a response variable.
20
A scatter diagram is a graphical depiction of the relationship between the dependent and independent variables.
21
An F-test is used to determine if there is a relationship between the dependent and independent variables.
22
The errors in a regression model are assumed to have an increasing mean.
23
The correlation coefficient has values between −1 and +1.
24
When the significance level is small enough in the F-test, we can reject the null hypothesis that there is no linear relationship.
25
The error standard deviation is estimated by MSE.
26
The multiple regression model includes multiple slope coefficients.
27
Often, a plot of the residuals will highlight any glaring violations of the assumptions.
28
If the significance level for the F-test is high enough, there is a relationship between the dependent and independent variables.
29
The coefficients of each independent variable in a multiple regression model represent slopes.
30
For statistical tests of significance about the coefficients, the null hypothesis is that the slope is 1.
31
The errors in a regression model are assumed to have zero variance.
32
The null hypothesis in the F-test is that there is a linear relationship between the X and Y variables.
33
The multiple regression model includes several intercept terms.
34
The regression model assumes the error terms are dependent.
35
The regression model assumes the errors are normally distributed.
36
Errors are also called residuals.
37
The multiple regression model includes several dependent variables.
38
Both the p-value for the F-test and r² can be interpreted the same way in multiple regression models as in simple linear models.
39
If the assumptions of regression have been met, errors plotted against the independent variable will typically show patterns.
40
The standard error of the estimate is also called the variance of the regression.
41
Dummy variables for regression analysis can take on a value of either -1 or +1.
42
A dummy variable can be assigned up to three values.
43
The best model is a statistically significant model with a high r-square and few variables.
44
The criterion used to select the regression line, which minimizes the squared distances between the estimated straight line and the observed values, is called
A)Mean square error.
B)Sum of Squares.
C)Maximum likelihood.
D)R-square.
E)Least Squares.
45
The number of dummy variables must equal 1 less than the number of categories of the qualitative variable.
46
Which of the following statements is true regarding a scatter diagram?
A)It provides very little information about the relationship between the regression variables.
B)It is a plot of the independent and dependent variables.
C)It is a line chart of the independent and dependent variables.
D)It has a value between -1 and +1.
E)It gives the percent of variation in the dependent variable that is explained by the independent variable.
47
Which of the following equalities is correct?
A)SST = SSR + SSE
B)SSR = SST + SSE
C)SSE = SSR + SST
D)SST = SSC + SSR
E)SSE = Actual Value - Predicted Value
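The sums of squares named in these options can be checked numerically. The following is a minimal sketch, assuming a simple least-squares line fitted to a small made-up data set; the data values and the helper name `fit_line` are illustrative assumptions and do not come from the deck. Printing the three quantities lets a reader test each candidate equality directly.

```python
def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line for paired data."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


# Made-up illustrative data (an assumption, not taken from the deck).
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]

b0, b1 = fit_line(xs, ys)
y_bar = sum(ys) / len(ys)
preds = [b0 + b1 * x for x in xs]

# Total variability of Y about its mean.
sst = sum((y - y_bar) ** 2 for y in ys)
# Variability of Y about the fitted line (squared errors).
sse = sum((y - p) ** 2 for y, p in zip(ys, preds))
# Variability of the fitted values about the mean of Y.
ssr = sum((p - y_bar) ** 2 for p in preds)
# Coefficient of determination.
r_squared = ssr / sst

print("SST:", round(sst, 6))
print("SSR:", round(ssr, 6))
print("SSE:", round(sse, 6))
print("r^2:", round(r_squared, 6))
```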
48
If a causal linear regression model of Y = a + bX is computed and the resulting r² is very near zero, then one can conclude that
A)Y = a + bX is a good forecasting method.
B)Y = a + bX is not a good forecasting method.
C)a multiple linear regression model is a good forecasting method for the data.
D)a multiple linear regression model is not a good forecasting method for the data.
E)None of the above
49
Transformations may be used when nonlinear relationships exist between variables.
50
A variable should be added to the model regardless of the impact (increase or decrease) on the adjusted r² value.
51
In regression, a binary variable is also called an indicator variable.
52
Another name for a dummy variable is a binary variable.
53
If multicollinearity exists, then individual interpretation of the variables is questionable, but the overall model is still good for prediction purposes.
54
The adjusted r² will always increase as additional variables are added to the model.
55
Multicollinearity exists when a variable is correlated to other variables.
56
The sum of squared errors (SSE) is
A)a measure of the total variation in Y about the mean.
B)a measure of the total variation in X about the mean.
C)a measure of the variation of Y about the regression line.
D)a measure of the variation of X about the regression line.
E)None of the above
57
Which of the following statements is/are not true about regression models?
A)Estimates of the slope are found from sample data.
B)The regression line minimizes the sum of the squared errors.
C)The error is found by subtracting the actual data value from the predicted data value.
D)The dependent variable is the explanatory variable.
E)The intercept coefficient is not typically interpreted.
58
The random error in a regression equation
A)is the predicted error.
B)includes both positive and negative terms.
C)will sum to a large positive number.
D)is used to estimate the accuracy of the slope.
E)is maximized in a least squares regression model.
59
The value of r² can never decrease when more variables are added to the model.
60
A high correlation always implies that one variable is causing a change in the other variable.
61
A prediction equation for sales and payroll was performed using simple linear regression. In the regression printout shown below, which of the following statements is/are not true?
A)Payroll is a good predictor of Sales based on α = 0.05.
B)There is evidence of a positive linear relationship between Sales and Payroll based on α = 0.05.
C)Payroll is not a good predictor of Sales based on α = 0.01.
D)The coefficient of determination is equal to 0.833333.
E)Payroll is the independent variable.

62
Which of the following conditions can be detected from residual analysis?
A)Nonlinearity
B)Nonconstant variance
C)Multicollinearity
D)A and B
E)A, B, and C
63
Which of the following statements is false concerning the hypothesis testing procedure for a regression model?
A)The F-test statistic is used.
B)The null hypothesis is that the true slope coefficient is equal to zero.
C)The null hypothesis is rejected if the adjusted r² is above the critical value.
D)An α level must be selected.
E)The alternative hypothesis is that the true slope coefficient is not equal to zero.
64
If a qualitative variable has three categories,how many dummy variables are needed?
A)0
B)1
C)2
D)3
E)4
65
A dummy variable is also called a(n)
A)indicator variable.
B)dependent variable.
C)continuous variable.
D)response variable.
E)None of the above
66
A prediction equation for starting salaries (in $1,000s) and SAT scores was performed using simple linear regression. In the regression printout shown below, what can be said about the level of significance for the overall model?
A)SAT is not a good predictor for starting salary.
B)The significance level for the intercept indicates the model is not valid.
C)The significance level for SAT indicates the slope is equal to zero.
D)The significance level for SAT indicates the slope is not equal to zero.
E)None of the above

67
Which of the following is not an assumption of the regression model?
A)The errors are independent.
B)The errors are normally distributed.
C)The errors have constant variance.
D)The mean of the errors is zero.
E)The errors should have a standard deviation equal to one.
68
The problem of nonconstant error variance is detected in residual analysis by which of the following?
A)a cone pattern
B)an arched pattern
C)a random pattern
D)an increasing pattern
E)a decreasing pattern
69
A healthcare executive is using regression to predict total revenues. She has decided to include both patient length of stay and insurance type in her model. Insurance type can be grouped into the following categories: Medicare, Medicaid, Managed Care, Self-Pay, and Charity. Which of the following is true?
A)Insurance type will be represented in the regression model by five binary variables.
B)Insurance type will be represented in the regression model by six dummy variables.
C)Insurance type will be represented in the regression model by five dummy variables.
D)Insurance type will be represented in the regression model by four binary variables.
E)Neither binary nor dummy variables are necessary for the regression model.
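The encoding of a qualitative variable discussed in this question can be sketched in a few lines. This is a minimal illustration, assuming the common convention that one category serves as the baseline and receives no column of its own; the function name `dummy_encode`, the choice of the first category as baseline, and the three sample observations are assumptions made for the example.

```python
def dummy_encode(values, categories):
    """Encode a qualitative variable as 0/1 indicator columns.

    The first category is treated as the baseline and gets no column,
    so k categories produce k - 1 indicator (dummy) variables.
    """
    non_baseline = categories[1:]
    return [[1 if v == c else 0 for c in non_baseline] for v in values]


# Category names come from the question; the observations are made up.
categories = ["Medicare", "Medicaid", "Managed Care", "Self-Pay", "Charity"]
observations = ["Medicaid", "Charity", "Medicare"]

rows = dummy_encode(observations, categories)
for obs, row in zip(observations, rows):
    # A row of all zeros identifies the baseline category.
    print(obs, row)
```

Which category is used as the baseline is arbitrary; it changes how the coefficients are interpreted but not the number of columns.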
70
Which of the following represents the underlying linear model for hypothesis testing?
A)Y = b0 + b1 X + ε
B)Y = b0 + b1 X
C)Y = β0 + β1 X + ε
D)Y = β0 + β1 X
E)None of the above
71
The problem of a nonlinear relationship is detected in residual analysis by which of the following?
A)a cone pattern
B)an arched pattern
C)a random pattern
D)an increasing pattern
E)a decreasing pattern
72
The coefficient of determination resulting from a particular regression analysis was 0.85. What was the slope of the regression line?
A)0.85
B)-0.85
C)0.922
D)There is insufficient information to answer the question.
E)None of the above
73
Which of the following statements is true about r²?
A)It is also called the coefficient of correlation.
B)It is also called the coefficient of determination.
C)It represents the percent of variation in X that is explained by Y.
D)It represents the percent of variation in the error that is explained by Y.
E)It ranges in value from −1 to +1.
74
The correlation coefficient resulting from a particular regression analysis was 0.25. What was the coefficient of determination?
A)0.5
B)-0.5
C)0.0625
D)There is insufficient information to answer the question.
E)None of the above
75
Which of the following is an assumption of the regression model?
A)The errors are independent.
B)The errors are not normally distributed.
C)The errors have a standard deviation of zero.
D)The errors have an irregular variance.
E)The errors follow a cone pattern.
76
The coefficient of determination resulting from a particular regression analysis was 0.85. What was the correlation coefficient, assuming a positive linear relationship?
A)0.5
B)-0.5
C)0.922
D)There is insufficient information to answer the question.
E)None of the above
77
The diagram below illustrates data with a 
A)negative correlation coefficient.
B)zero correlation coefficient.
C)positive correlation coefficient.
D)correlation coefficient equal to +1.
E)None of the above

78
The mean square error (MSE) is
A)denoted by s.
B)denoted by k.
C)the SSE divided by the number of observations.
D)the SSE divided by the degrees of freedom.
E)None of the above
79
In a good regression model the residual plot shows
A)a cone pattern.
B)an arched pattern.
C)a random pattern.
D)an increasing pattern.
E)a decreasing pattern.
80
Suppose that you believe that a cubic relationship exists between the independent variable (of time) and the dependent variable Y. Which of the following would represent a valid linear regression model?
A)Y = b0 + b1 X, where X = time³
B)Y = b0 + b1 X³, where X = time
C)Y = b0 + 3b1 X, where X = time³
D)Y = b0 + 3b1 X, where X = time
E)Y = b0 + b1 X, where X = time^(1/3)
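The idea of transforming a variable so that a curved relationship can be fitted with ordinary linear regression can be sketched as follows. This is a minimal illustration under stated assumptions: the data are generated from a made-up exact cubic (Y = 2 + 0.5 · time³), the transformed predictor is the cube of time, and the helper name `fit_line` is hypothetical. The fitted model stays linear in its coefficients even though the relationship with time is not a straight line.

```python
def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line for paired data."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope


# Made-up data following an exact cubic in time (an assumption for illustration).
times = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0 + 0.5 * t ** 3 for t in times]

# Transform the predictor first, then fit an ordinary straight line to (X, Y).
xs = [t ** 3 for t in times]
b0, b1 = fit_line(xs, ys)

print("intercept:", round(b0, 6))
print("slope:", round(b1, 6))
```

Fitting the same helper to the untransformed `times` would still return a line, but its residuals would show a systematic curved pattern rather than random scatter.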