Deck 13: Multiple Regression and Correlation Analysis

Full screen (f)
exit full mode
Question
i. Violating the need for successive observations of the dependent variable to be uncorrelated is called autocorrelation. ii. If an inverse relationship exists between the dependent variable and independent variables, the regression coefficients for the independent variables are negative.
iii. Given a multiple linear equation Y' = 5.1 + 2.2X1 - 3.5X2, assuming other things are held constant, an increase in one unit of the second independent variable will cause a -3.5 unit change in Y.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Use Space or
up arrow
down arrow
to flip the card.
Question
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, and percent of population in the community holding a bachelor's degree as their highest level of education attained. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, and percent of population in the community holding a bachelor's degree as their highest level of education attained.     Determine the regression equation.</strong> A) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) B) Poor (%) = -3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) C) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) D) Poor (%) = -3.88 - 0.798 Single-Families (%) - 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) E) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) <div style=padding-top: 35px> <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, and percent of population in the community holding a bachelor's degree as their highest level of education attained.     Determine the regression equation.</strong> A) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) B) Poor (%) = -3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) C) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) D) Poor (%) = -3.88 - 0.798 Single-Families (%) - 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) E) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) <div style=padding-top: 35px> Determine the regression equation.

A) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%)
B) Poor (%) = -3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%)
C) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%)
D) Poor (%) = -3.88 - 0.798 Single-Families (%) - 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%)
E) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) + 0.170 Bachelor's Degree (%)
Question
i. Multiple regression is used when two or more independent variables are used to predict a value of a single dependent variable. ii. The values ofb1, b2andb3in a multiple regression equation are called the net regression coefficients. They indicate the change in the predicted value for a unit change in one X when the other X variables are held constant.
iii. Autocorrelation often happens when data has been collected over periods of time.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Given the regression equation Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%)
Interpret the numbers 3.81 and 0.798.

A) As the % of Single Families in an Ontario community increases, the % of poor families increase as well, with a maximum of 3.81% of the population being poor.
B) As the % of Single Families in an Ontario community increases, the % of poor families decrease.
C) There are 3.81% poor families and 0.798 Single-Families in Ontario communities.
D) 3.81 is the y-intercept. When all of the dependent variables have a value of zero, we can expect that 3.81% of the community to be poor. 0.789 indicates that for an extra 0.789% of poor families in a community, we can expect that the % of single families will increase by 1%.
E) When all of the independent variables have a value of zero, we can expect that -3.81% of the community to be poor, i.e. 0%. The 0.798 indicates that for each extra % of single families in a community, we can expect that the % poor will increase by almost 0.8%.
Question
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Given the regression equation Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%).
How many dependent variables are there in this regression?

A) 1
B) 2
C) 3
D) 4
E) 5
Question
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       Predict the number of wins for a team with: BATAVG = 0.260 HOMERUNS = 150 ERA = 3 STOLENBASE = 100 ERROR = 100 PAYROLL = 25(million) ATTENDANCE = 3(million)</strong> A) 77 B) 102 C) 187 D) 210 E) 186 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       Predict the number of wins for a team with: BATAVG = 0.260 HOMERUNS = 150 ERA = 3 STOLENBASE = 100 ERROR = 100 PAYROLL = 25(million) ATTENDANCE = 3(million)</strong> A) 77 B) 102 C) 187 D) 210 E) 186 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       Predict the number of wins for a team with: BATAVG = 0.260 HOMERUNS = 150 ERA = 3 STOLENBASE = 100 ERROR = 100 PAYROLL = 25(million) ATTENDANCE = 3(million)</strong> A) 77 B) 102 C) 187 D) 210 E) 186 <div style=padding-top: 35px> Predict the number of wins for a team with:
BATAVG = 0.260 HOMERUNS = 150 ERA = 3
STOLENBASE = 100 ERROR = 100
PAYROLL = 25(million)
ATTENDANCE = 3(million)

A) 77
B) 102
C) 187
D) 210
E) 186
Question
How is the Y intercept in the multiple regression equation represented?

A) b1
B) x1
C) b2
D) x2
E) a
Question
A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results: <strong>A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results:   In the regression model, which of the following are dummy variables?</strong> A) Intercept B) Service C) Service and gender D) Gender and job E) Service, gender, and job <div style=padding-top: 35px> In the regression model, which of the following are dummy variables?

A) Intercept
B) Service
C) Service and gender
D) Gender and job
E) Service, gender, and job
Question
i. Multiple regression analysis examines the relationship of several dependent variables on the independent variable. ii. A multiple regression equation defines the relationship between the dependent variable and the independent variables in the form of an equation.
iii. Autocorrelation often happens when data has been collected over periods of time.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
i. The values ofb1, b2 and b3in a multiple regression equation are called the net regression coefficients. They indicate the change in the predicted value for a unit change in one X when the other X variables are held constant. ii. Multiple regression analysis examines the relationship of several dependent variables on the independent variable.
iii. A multiple regression equation defines the relationship between the dependent variable and the independent variables in the form of an equation.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
i. A multiple regression equation defines the relationship between the dependent variable and the independent variables in the form of an equation. ii. Autocorrelation often happens when data has been collected over periods of time.
iii. Homoscedasticity occurs when the variance of the residuals (Y - Y') is different for different values of Y'.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
Multiple regression analysis is applied when analyzing the relationship between

A) an independent variable and several dependent variables.
B) a dependent variable and several independent variables.
C) several dependent variables and several independent variables.
D) several regression equations and a single sample.
Question
i. The values of b1, b2 and b3in a multiple regression equation are called the net regression coefficients. They indicate the change in the predicted value for a unit change in one X when the other X variables are held constant. ii. A multiple regression equation defines the relationship between the dependent variable and the independent variables in the form of an equation.
iii. If an inverse relationship exists between the dependent variable and independent variables, the regression coefficients for the independent variables are positive.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
i. Autocorrelation often happens when data has been collected over periods of time. ii. Homoscedasticity occurs when the variance of the residuals (Y - Y') is different for different values of Y'.
iii. Violating the need for successive observations of the dependent variable to be uncorrelated is called autocorrelation.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
i. If an inverse relationship exists between the dependent variable and independent variables, the regression coefficients for the independent variables are positive. ii. Given a multiple linear equation Y' = 5.1 + 2.2X1 - 3.5X2, assuming other things are held constant, an increase of one unit in the second independent variable will cause a -3.5 unit change in Y.
iii. When the variance of the differences between the actual and the predicted values of the dependent variable are approximately the same, the variables are said to exhibit homoscedasticity.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Given the regression equation Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%).
What is the estimated percentage of poor persons living below the poverty line in a community with 5% of the community as single-families, a 5% unemployment rate, only 5% holding a Bachelor's Degree and 25% having High School as their highest attained educational level?

A) 2.375
B) -2.375
C) 11.845
D) -11.845
Question
i. Multiple regression is used when two or more independent variables are used to predict a value of a single dependent variable. ii. The values of b1, b2, and b3 in a multiple regression equation are called the net regression coefficients. They indicate the change in the predicted value for a unit change in one X when the other X variables are held constant.
iii. Multiple regression analysis examines the relationship of several dependent variables on the independent variable.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Given the regression equation Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%)
Which single event would have the strongest effect in reducing the % poor in Ontario?

A) Decreasing the % of single families by 5%.
B) Decreasing the Unemployment rate by 5%.
C) Increasing the % of persons with a Bachelor's Degree by 10%.
D) Decreasing the % of persons with a High School Diploma by 40%.
E) Increasing the % of persons with a Bachelor's Degree by 15%.
Question
The following correlations were computed as part of a multiple regression analysis that used education, job, and age to predict income. <strong>The following correlations were computed as part of a multiple regression analysis that used education, job, and age to predict income.   Which is the dependent variable?</strong> A) Income B) Age C) Education D) Job <div style=padding-top: 35px> Which is the dependent variable?

A) Income
B) Age
C) Education
D) Job
Question
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained.     Determine the regression equation.</strong> A) Poor (%) = -3.81 - 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) B) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) + 0.003 High School (%) C) Poor (%) = 3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) D) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) E) Poor (%) = 3.81 + 0.798 Single-Families (%) - 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) - 0.003 High School (%) <div style=padding-top: 35px> <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained.     Determine the regression equation.</strong> A) Poor (%) = -3.81 - 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) B) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) + 0.003 High School (%) C) Poor (%) = 3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) D) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) E) Poor (%) = 3.81 + 0.798 Single-Families (%) - 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) - 0.003 High School (%) <div style=padding-top: 35px> Determine the regression equation.

A) Poor (%) = -3.81 - 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%)
B) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) + 0.003 High School (%)
C) Poor (%) = 3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%)
D) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%)
E) Poor (%) = 3.81 + 0.798 Single-Families (%) - 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) - 0.003 High School (%)
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 + 42X3 + 0.0012X4 + 0.09X5 + 26.8X6.
Explain the meaning of b2.

A) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 0.28 points.
B) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 2.8 points.
C) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 28 points.
D) For each additional living child, their satisfaction index is expected to increase by 42 points.
E) For each additional living child, their satisfaction index is expected to increase by 4.2 points.
Question
In regression analysis, the dferr = ________.

A) the sample size -1
B) the sample size -k-1
C) the number of dependent variables
D) the number of independent variables
E) the sample size-k
Question
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The multiple correlation coefficient is:</strong> A) 0.900 B) 0.930 C) 0.656 D) 0.867 E) 0.949 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The multiple correlation coefficient is:</strong> A) 0.900 B) 0.930 C) 0.656 D) 0.867 E) 0.949 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The multiple correlation coefficient is:</strong> A) 0.900 B) 0.930 C) 0.656 D) 0.867 E) 0.949 <div style=padding-top: 35px> The multiple correlation coefficient is:

A) 0.900
B) 0.930
C) 0.656
D) 0.867
E) 0.949
Question
What are the degrees of freedom associated with the regression sum of squares?

A) Number of independent variables
B) 1
C) F-ratio
D) (n - 2)
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $60,000, has two children, has assets of $350,000, has in index of health status of 141, and has 2 social activities per week?

A) 777.7
B) 796.6
C) 617.9
D) 601.6
E) 769.8
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $26,500, has two children, has assets of $156,000, has in index of health status of 141, and has 2.5 social activities per week?

A) 389.1
B) 421.6
C) 366.0
D) 601.6
E) 769.8
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.09X5 + 26.8X6.
Explain the meaning of b4.

A) For each additional living child, their satisfaction index is expected to increase by 42 points.
B) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 0.028 points.
C) For each additional$10,000 in assets, their satisfaction index is expected to increase by 0.12 points.
D) For each additional$10,000 in assets, their satisfaction index is expected to increase by 1.2 points.
E) For each additional$10,000 in assets, their satisfaction index is expected to increase by 12 points.
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.09X5 + 26.8X6.
Explain the meaning of b3.

A) For each additional living child, their satisfaction index is expected to increase by 4.2 points.
B) For each additional living child, their satisfaction index is expected to increase by 42 points.
C) For each additional living child, their satisfaction index is expected to increase by 0.42 points.
D) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 0.028 points.
E) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 2.8 points.
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 + 42X3 + 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $100,000, has two children, has assets of $500,000, has in index of health status of 141, and has 2 social activities per week?

A) 777.7
B) 796.6
C) 588.6
D) 601.6
E) 809.1
Question
If the correlation between two variables X and Y, is +0.67, what is the regression coefficient for these two variables?

A) +0.67
B) > 0
C) < 0
D) = 0
Question
In a regression analysis, three independent variables are used in the equation based on a sample of forty observations. What are the degrees of freedom associated with the F-statistic?

A) 3 and 39
B) 4 and 40
C) 3 and 36
D) 2 and 39
Question
In regression analysis, the dfreg = ________.

A) the sample size - 1
B) the sample size -k-1
C) the number of dependent variables
D) the number of independent variables
E) the sample size-k
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $100,000, has two children, has assets of $500,000, has in index of health status of 141, and has 3 social activities per week?

A) 777.7
B) 835.9
C) 588.6
D) 601.6
E) 769.8
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.09X5 + 26.8X6.
Explain the meaning of b5.

A) For each additional10 points on the health status index, their satisfaction index is expected to increase by 9 points.
B) For each additional10 points on the health status index, their satisfaction index is expected to increase by 0.09 points.
C) For each additional10 points on the health status index, their satisfaction index is expected to increase by 0.9 points.
D) For each additional10 points on the health status index, their satisfaction index is expected to drop by 0.09 points.
E) For each additional10 points on the health status index, their satisfaction index is expected to decrease by 9 points.
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 + 42X3+ 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $100,000, has two children, has assets of $500,000, has in index of health status of 141, and has 3.5 social activities per week?

A) 777.7
B) 796.6
C) 849.3
D) 601.6
E) 769.8
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.09X5 + 26.8X6.
Explain the meaning of b6.

A) For each additional social activity per week, their satisfaction index is expected to increase by 26.8 points.
B) For each additional social activity per week, their satisfaction index is expected to increase by 2.68 points.
C) For each additional10 points on the health status index, their satisfaction index is expected to increase by 0.9 points.
D) For each additional$10,000 in assets, their satisfaction index is expected to increase by 12 points.
E) For each additional living child, their satisfaction index is expected to increase by 42 points.
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.19X5 + 26.8X6
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $46,000, has two children, has assets of $200,000, has in index of health status of 141, and has 2.5 social activities per week?

A) 368.3
B) 447.3
C) 366.0
D) 601.6
E) 769.8
Question
For a unit change in the first independent variable with other things being held constant, what change can be expected in the dependent variable in the multiple regression equation Y' = 5.2 + 6.3X1- 7.1X2?

A) -7.1
B) +6.3
C) +5.2
D) +4.4
Question
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $46,000, has two children, has assets of $350,000, has in index of health status of 141, and has 2.5 social activities per week?

A) 368.3
B) 421.6
C) 366.0
D) 627.3
E) 769.8
Question
If there are four independent variables in a multiple regression equation, there are also four

A) Y-intercepts.
B) regression coefficients.
C) dependent variables.
D) constant terms.
Question
i. The coefficient of multiple determination reports the strength of the association between the dependent variable and the set of independent variables. ii. The multiple standard error of estimate for two independent variables measures the variation about a regression plane.
iii. A multiple coefficient of determination equaling -0.76 is definitely possible.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
If the coefficient of multiple determinations is 0.81, what percent of variation is not explained?

A) 19%
B) 90%
C) 66%
D) 81%
Question
i. The multiple coefficient of determination, R2, reports the proportion of the variation in Y that is not explained by the variation in the set of independent variables. ii. The coefficient of multiple determination reports the strength of the association between the dependent variable and the set of independent variables.
iii. The multiple standard error of estimate for two independent variables measures the variation about a regression plane.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
What happens as the scatter of data values about the regression plane increases?

A) Standard error of estimate increases.
B) R2decreases.
C) (1 -R2) decreases.
D) Residual sum of squares decreases.
E) (1 -R2), residual sum of squares and standard error of estimate all increase, andR2decreases.
Question
The coefficient of determination measures the proportion of

A) explained variation relative to total variation.
B) variation due to the relationship among variables.
C) error variation relative to total variation.
D) variation due to regression.
Question
i. A coefficient of multiple determination equaling -0.76 is definitely possible. ii. Multiple R2 measures the proportion of explained variation relative to total variation.
iii. The number of degrees of freedom associated with the regression sum of squares in the regression equation model equals the number of independent variables.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
What is the measurement of explained variation?

A) Coefficient of multiple determination
B) Coefficient of multiple nondetermination
C) Regression coefficient
D) Correlation matrix
Question
What is the range of values for multiple R?

A) -100% to - 100% inclusive
B) -100% to 0% inclusive
C) 0% to + 100% inclusive
D) Unlimited range
Question
If a multiple regression analysis is based on ten independent variables collected from a sample of 125 observations, what will be the value of the denominator in the calculation of the multiple standard error of estimate?

A) 125
B) 10
C) 114
D) 115
Question
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The standard error of estimate is:</strong> A) 4.467 B) 5.698 C) 5.864 D) 19.977 E) 15.917 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The standard error of estimate is:</strong> A) 4.467 B) 5.698 C) 5.864 D) 19.977 E) 15.917 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The standard error of estimate is:</strong> A) 4.467 B) 5.698 C) 5.864 D) 19.977 E) 15.917 <div style=padding-top: 35px> The standard error of estimate is:

A) 4.467
B) 5.698
C) 5.864
D) 19.977
E) 15.917
Question
i. The multiple coefficient of determination, R2, reports the proportion of the variation in Y that is explained by the variation in the set of independent variables. ii. The coefficient of multiple determination reports the strength of the association between the dependent variable and the set of independent variables.
iii. A coefficient of multiple determination equaling -0.76 is definitely possible.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
i. The multiple standard error of estimate measures the variation about the regression plane when two independent variables are considered. ii. The multiple coefficient of determination, R2, reports the proportion of the variation in Y that is not explained by the variation in the set of independent variables.
iii. The coefficient of multiple determination reports the strength of the association between the dependent variable and the set of independent variables.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
i. Multiple R2 measures the proportion of explained variation. ii. 90% of total variation in the dependent variable is explained by the independent variable for a multiple R2 = 0.90.
iii. The multiple standard error of estimate measures the variation about the regression plane when two independent variables are considered.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
i. 90% of total variation in the dependent variable is explained by the independent variable for a multiple R2 = 0.90. ii. The multiple standard error of estimate measures the variation about the regression plane when two independent variables are considered.
iii. The multiple coefficient of determination, R2, reports the proportion of the variation in Y that is not explained by the variation in the set of independent variables.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results: <strong>A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results:   Based on the ANOVA, the multiple coefficient of determination is</strong> A) 5.957%. B) 59.3%. C) 40.7%. D) cannot be computed. <div style=padding-top: 35px> Based on the ANOVA, the multiple coefficient of determination is

A) 5.957%.
B) 59.3%.
C) 40.7%.
D) cannot be computed.
Question
Which test statistic do we apply to test the null hypothesis that the multiple regression coefficients are all zero?

A) z
B) t
C) F
D) SPSS-X
Question
i. The multiple standard error of estimate for two independent variables measures the variation about a regression plane. ii. A multiple coefficient of determination equaling -0.76 is definitely possible.
iii. Multiple R2 measures the proportion of explained variation relative to total variation.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
What does the multiple standard error of estimate measure?

A) Change in Y' for a change in X1.
B) Variation of the data points between Y and Y'.
C) Variation due to the relationship between the dependent and independent variables.
D) Amount of explained variation.
Question
How is the degree of association between the set of independent variables and the dependent variable is measured?

A) Confidence intervals
B) Autocorrelation
C) Coefficient of multiple determination
D) Standard error of estimate
Question
i. The multiple standard error of estimate measures the variation about the regression plane when two independent variables are considered. ii. A coefficient of multiple determination equaling -0.76 is definitely possible.
iii. The number of degrees of freedom associated with the regression sum of squares in the regression equation model equals the number of independent variables.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The critical value of F to be used in the global test of the model is: (5% level of significance)</strong> A) 2.51 B) 2.58 C) 3.70 D) 5.57 E) 3.39 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The critical value of F to be used in the global test of the model is: (5% level of significance)</strong> A) 2.51 B) 2.58 C) 3.70 D) 5.57 E) 3.39 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The critical value of F to be used in the global test of the model is: (5% level of significance)</strong> A) 2.51 B) 2.58 C) 3.70 D) 5.57 E) 3.39 <div style=padding-top: 35px> The critical value of F to be used in the global test of the model is: (5% level of significance)

A) 2.51
B) 2.58
C) 3.70
D) 5.57
E) 3.39
Question
What test investigates whether all the independent variables have zero net regression coefficients?

A) Multicollinearity
B) Autocorrelation
C) Global
D) Pearson
Question
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, and designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the output below, determine which variable Angela should consider deleting. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, and designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the output below, determine which variable Angela should consider deleting.    </strong> A) It doesn't matter which she uses, the results are virtually the same in any case. B) Angela should delete the high school information, because the P-value is over 0.05. C) Angela should delete the bachelor'sdegree information, because the P-value is close to 0.05. D) Angela should delete the unemployment rate information because the P-value is 0.00. E) Angela should exclude the single-family and unemployment information because the P-value values are 0. <div style=padding-top: 35px> <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, and designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the output below, determine which variable Angela should consider deleting.    </strong> A) It doesn't matter which she uses, the results are virtually the same in any case. B) Angela should delete the high school information, because the P-value is over 0.05. C) Angela should delete the bachelor'sdegree information, because the P-value is close to 0.05. D) Angela should delete the unemployment rate information because the P-value is 0.00. E) Angela should exclude the single-family and unemployment information because the P-value values are 0. <div style=padding-top: 35px>

A) It doesn't matter which she uses, the results are virtually the same in any case.
B) Angela should delete the high school information, because the P-value is over 0.05.
C) Angela should delete the bachelor'sdegree information, because the P-value is close to 0.05.
D) Angela should delete the unemployment rate information because the P-value is 0.00.
E) Angela should exclude the single-family and unemployment information because the P-value values are 0.
Question
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the outputs below for this data set, determine whether Angela should use the model with the high school data included, or the data without the high school data, and why. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the outputs below for this data set, determine whether Angela should use the model with the high school data included, or the data without the high school data, and why.        </strong> A) It doesn't matter which she uses, the results are virtually the same in either case. B) Angela should use the data that includes the high school information, because more information is always better than less information. C) Angela should use the data that includes the high school information, because the calculated F value is lower. D) Angela should include the high school data because the P-value is 0.977, close to 1. E) Angela should exclude the high school data because the adjusted R-squared value is a little higher than in the data that includes the high school information. <div style=padding-top: 35px> <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the outputs below for this data set, determine whether Angela should use the model with the high school data included, or the data without the high school data, and why.        </strong> A) It doesn't matter which she uses, the results are virtually the same in either case. B) Angela should use the data that includes the high school information, because more information is always better than less information. C) Angela should use the data that includes the high school information, because the calculated F value is lower. D) Angela should include the high school data because the P-value is 0.977, close to 1. E) Angela should exclude the high school data because the adjusted R-squared value is a little higher than in the data that includes the high school information. <div style=padding-top: 35px> <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the outputs below for this data set, determine whether Angela should use the model with the high school data included, or the data without the high school data, and why.        </strong> A) It doesn't matter which she uses, the results are virtually the same in either case. B) Angela should use the data that includes the high school information, because more information is always better than less information. C) Angela should use the data that includes the high school information, because the calculated F value is lower. D) Angela should include the high school data because the P-value is 0.977, close to 1. E) Angela should exclude the high school data because the adjusted R-squared value is a little higher than in the data that includes the high school information. <div style=padding-top: 35px> <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the outputs below for this data set, determine whether Angela should use the model with the high school data included, or the data without the high school data, and why.        </strong> A) It doesn't matter which she uses, the results are virtually the same in either case. B) Angela should use the data that includes the high school information, because more information is always better than less information. C) Angela should use the data that includes the high school information, because the calculated F value is lower. D) Angela should include the high school data because the P-value is 0.977, close to 1. E) Angela should exclude the high school data because the adjusted R-squared value is a little higher than in the data that includes the high school information. <div style=padding-top: 35px>

A) It doesn't matter which she uses, the results are virtually the same in either case.
B) Angela should use the data that includes the high school information, because more information is always better than less information.
C) Angela should use the data that includes the high school information, because the calculated F value is lower.
D) Angela should include the high school data because the P-value is 0.977, close to 1.
E) Angela should exclude the high school data because the adjusted R-squared value is a little higher than in the data that includes the high school information.
Question
The best example of a null hypothesis for a global test of a multiple regression model is:

A) H0: β\beta 1 = β\beta 2 = β\beta 3 = β\beta 4.
B) H0: μ\mu 1 = μ\mu 2 = μ\mu 3 = μ\mu 4.
C) H0: β\beta 1 = 0.
D) if F is greater than 20.00 then reject.
Question
The best example of an alternate hypothesis for a global test of a multiple regression model is:

A) H1: β\beta 1 = β\beta 2 = β\beta 3 = β\beta 4.
B) H1: β\beta 1 ≠\neqβ\beta 2 ≠\neqβ\beta 3 ≠\neqβ\beta 4.
C) H1: Not all the β\beta 's are 0.
D) if F is less than 20.00 then fail to reject.
Question
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The number of degrees of freedom to be used in determining the critical value of t to be used in a hypothesis test of the regression coefficients is:</strong> A) 27 B) 26 C) 22 D) 21 E) 20 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The number of degrees of freedom to be used in determining the critical value of t to be used in a hypothesis test of the regression coefficients is:</strong> A) 27 B) 26 C) 22 D) 21 E) 20 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The number of degrees of freedom to be used in determining the critical value of t to be used in a hypothesis test of the regression coefficients is:</strong> A) 27 B) 26 C) 22 D) 21 E) 20 <div style=padding-top: 35px> The number of degrees of freedom to be used in determining the critical value of t to be used in a hypothesis test of the regression coefficients is:

A) 27
B) 26
C) 22
D) 21
E) 20
Question
The best example of a null hypothesis for testing an individual regression coefficient is:

A) H0: β\beta 1 = β\beta 2 = β\beta 3 = β\beta 4.
B) H0: μ\mu 1 = μ\mu 2 = μ\mu 3 = μ\mu 4.
C) H0: β\beta 1 = 0.
D) if F is greater than 20.00 then reject.
Question
A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results: <strong>A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results:   Based on the ANOVA and a 0.05 significance level, the global null hypothesis test of the multiple regression model.</strong> A) will be rejected and conclude that monthly salary is related to all of the independent variables. B) will be rejected and conclude that monthly salary is related to at least one of the independent variables. C) will not be rejected. D) will show a high multiple coefficient of determination. <div style=padding-top: 35px> Based on the ANOVA and a 0.05 significance level, the global null hypothesis test of the multiple regression model.

A) will be rejected and conclude that monthly salary is related to all of the independent variables.
B) will be rejected and conclude that monthly salary is related to at least one of the independent variables.
C) will not be rejected.
D) will show a high multiple coefficient of determination.
Question
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The computed F for the global test is:</strong> A) 7.802 B) 25.695 C) 15.790 D) 26.981 E) 114.779 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The computed F for the global test is:</strong> A) 7.802 B) 25.695 C) 15.790 D) 26.981 E) 114.779 <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The computed F for the global test is:</strong> A) 7.802 B) 25.695 C) 15.790 D) 26.981 E) 114.779 <div style=padding-top: 35px> The computed F for the global test is:

A) 7.802
B) 25.695
C) 15.790
D) 26.981
E) 114.779
Question
Hypotheses concerning individual regression coefficients are tested using which statistic?

A) t-statistic
B) z-statistic
C) χ\chi 2(chi-square statistic)
D) H
Question
i. A variable whose possible outcomes are coded as a "1" or a "0" is called a dummy variable. ii. A dummy variable is added to the regression equation to control for error.
iii. If the null hypothesis β\beta 4 = 0 is not rejected, then the independent variable X4 has no effect in predicting the dependent variable.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
i. If the null hypothesis β\beta 4 = 0 is not rejected, then the independent variable X4 has a strong effect in predicting the dependent variable. ii. A dummy variable is added to the regression equation to control for error.
iii. A variable whose possible outcomes are coded as a "1" or a "0" is called a strong independent variable.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results: <strong>A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results:   Based on the hypothesis tests for the individual regression coefficients,</strong> A) all the regression coefficients are not equal to zero. B) job is the only significant variable in the model. C) only months of service and gender are significantly related to monthly salary. D) service is the only significant variable in the model. <div style=padding-top: 35px> Based on the hypothesis tests for the individual regression coefficients,

A) all the regression coefficients are not equal to zero.
B) "job" is the only significant variable in the model.
C) only months of service and gender are significantly related to monthly salary.
D) "service" is the only significant variable in the model.
Question
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The t-value computed for testing the coefficient Batg. Avg. is:</strong> A) 112.991. B) 2.086. C) 7.438. D) 2.832. E) -1.593. <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The t-value computed for testing the coefficient Batg. Avg. is:</strong> A) 112.991. B) 2.086. C) 7.438. D) 2.832. E) -1.593. <div style=padding-top: 35px> <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The t-value computed for testing the coefficient Batg. Avg. is:</strong> A) 112.991. B) 2.086. C) 7.438. D) 2.832. E) -1.593. <div style=padding-top: 35px> The t-value computed for testing the coefficient "Batg. Avg." is:

A) 112.991.
B) 2.086.
C) 7.438.
D) 2.832.
E) -1.593.
Question
i. A variable whose possible outcomes are coded as a "1" or a "0" is called a dummy variable. ii. If the null hypothesis β\beta 4 = 0 is not rejected, then the independent variable X4 has no effect in predicting the dependent variable.
iii. A dummy variable is added to the regression equation to control for error.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Question
Which of the following is a characteristic of the F-distribution?

A) Normally distributed
B) Positively skewed
C) Negatively skewed
D) Equal to the t-distribution
Question
A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results: <strong>A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results:   The results for the variable gender show that,</strong> A) males average $222.78 more than females in monthly salary. B) females average $222.78 more than males in monthly salary. C) gender is not related to monthly salary. D) gender and months of service are correlated. <div style=padding-top: 35px> The results for the variable gender show that,

A) males average $222.78 more than females in monthly salary.
B) females average $222.78 more than males in monthly salary.
C) gender is not related to monthly salary.
D) gender and months of service are correlated.
Question
In multiple regression, a dummy variable can be included in a multiple regression model as

A) an additional quantitative variable.
B) a nominal variable with three or more values.
C) a nominal variable with only two values.
D) a new regression coefficient.
Question
What can we conclude if the net regression coefficients in the population are not significantly different from zero?

A) Strong relationship exists among the variables.
B) No relationship exists between the dependent variable and the independent variables.
C) Independent variables are good predictors.
D) Good forecasts are possible.
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/128
auto play flashcards
Play
simple tutorial
Full screen (f)
exit full mode
Deck 13: Multiple Regression and Correlation Analysis
1
i. Violating the need for successive observations of the dependent variable to be uncorrelated is called autocorrelation. ii. If an inverse relationship exists between the dependent variable and independent variables, the regression coefficients for the independent variables are negative.
iii. Given a multiple linear equation Y' = 5.1 + 2.2X1 - 3.5X2, assuming other things are held constant, an increase in one unit of the second independent variable will cause a -3.5 unit change in Y.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
(i), (ii), and (iii) are all correct statements.
2
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, and percent of population in the community holding a bachelor's degree as their highest level of education attained. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, and percent of population in the community holding a bachelor's degree as their highest level of education attained.     Determine the regression equation.</strong> A) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) B) Poor (%) = -3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) C) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) D) Poor (%) = -3.88 - 0.798 Single-Families (%) - 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) E) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, and percent of population in the community holding a bachelor's degree as their highest level of education attained.     Determine the regression equation.</strong> A) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) B) Poor (%) = -3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) C) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) D) Poor (%) = -3.88 - 0.798 Single-Families (%) - 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) E) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) Determine the regression equation.

A) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%)
B) Poor (%) = -3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%)
C) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%)
D) Poor (%) = -3.88 - 0.798 Single-Families (%) - 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%)
E) Poor (%) = 3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) + 0.170 Bachelor's Degree (%)
Poor (%) = -3.88 + 0.798 Single-Families (%) + 0.625 Unemployment Rate (%) - 0.170 Bachelor's Degree (%)
3
i. Multiple regression is used when two or more independent variables are used to predict a value of a single dependent variable. ii. The values ofb1, b2andb3in a multiple regression equation are called the net regression coefficients. They indicate the change in the predicted value for a unit change in one X when the other X variables are held constant.
iii. Autocorrelation often happens when data has been collected over periods of time.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
(i), (ii), and (iii) are all correct statements.
4
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Given the regression equation Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%)
Interpret the numbers 3.81 and 0.798.

A) As the % of Single Families in an Ontario community increases, the % of poor families increase as well, with a maximum of 3.81% of the population being poor.
B) As the % of Single Families in an Ontario community increases, the % of poor families decrease.
C) There are 3.81% poor families and 0.798 Single-Families in Ontario communities.
D) 3.81 is the y-intercept. When all of the dependent variables have a value of zero, we can expect that 3.81% of the community to be poor. 0.789 indicates that for an extra 0.789% of poor families in a community, we can expect that the % of single families will increase by 1%.
E) When all of the independent variables have a value of zero, we can expect that -3.81% of the community to be poor, i.e. 0%. The 0.798 indicates that for each extra % of single families in a community, we can expect that the % poor will increase by almost 0.8%.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
5
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Given the regression equation Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%).
How many dependent variables are there in this regression?

A) 1
B) 2
C) 3
D) 4
E) 5
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
6
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       Predict the number of wins for a team with: BATAVG = 0.260 HOMERUNS = 150 ERA = 3 STOLENBASE = 100 ERROR = 100 PAYROLL = 25(million) ATTENDANCE = 3(million)</strong> A) 77 B) 102 C) 187 D) 210 E) 186 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       Predict the number of wins for a team with: BATAVG = 0.260 HOMERUNS = 150 ERA = 3 STOLENBASE = 100 ERROR = 100 PAYROLL = 25(million) ATTENDANCE = 3(million)</strong> A) 77 B) 102 C) 187 D) 210 E) 186 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       Predict the number of wins for a team with: BATAVG = 0.260 HOMERUNS = 150 ERA = 3 STOLENBASE = 100 ERROR = 100 PAYROLL = 25(million) ATTENDANCE = 3(million)</strong> A) 77 B) 102 C) 187 D) 210 E) 186 Predict the number of wins for a team with:
BATAVG = 0.260 HOMERUNS = 150 ERA = 3
STOLENBASE = 100 ERROR = 100
PAYROLL = 25(million)
ATTENDANCE = 3(million)

A) 77
B) 102
C) 187
D) 210
E) 186
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
7
How is the Y intercept in the multiple regression equation represented?

A) b1
B) x1
C) b2
D) x2
E) a
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
8
A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results: <strong>A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results:   In the regression model, which of the following are dummy variables?</strong> A) Intercept B) Service C) Service and gender D) Gender and job E) Service, gender, and job In the regression model, which of the following are dummy variables?

A) Intercept
B) Service
C) Service and gender
D) Gender and job
E) Service, gender, and job
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
9
i. Multiple regression analysis examines the relationship of several dependent variables on the independent variable. ii. A multiple regression equation defines the relationship between the dependent variable and the independent variables in the form of an equation.
iii. Autocorrelation often happens when data has been collected over periods of time.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
10
i. The values ofb1, b2 and b3in a multiple regression equation are called the net regression coefficients. They indicate the change in the predicted value for a unit change in one X when the other X variables are held constant. ii. Multiple regression analysis examines the relationship of several dependent variables on the independent variable.
iii. A multiple regression equation defines the relationship between the dependent variable and the independent variables in the form of an equation.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
11
i. A multiple regression equation defines the relationship between the dependent variable and the independent variables in the form of an equation. ii. Autocorrelation often happens when data has been collected over periods of time.
iii. Homoscedasticity occurs when the variance of the residuals (Y - Y') is different for different values of Y'.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
12
Multiple regression analysis is applied when analyzing the relationship between

A) an independent variable and several dependent variables.
B) a dependent variable and several independent variables.
C) several dependent variables and several independent variables.
D) several regression equations and a single sample.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
13
i. The values of b1, b2 and b3in a multiple regression equation are called the net regression coefficients. They indicate the change in the predicted value for a unit change in one X when the other X variables are held constant. ii. A multiple regression equation defines the relationship between the dependent variable and the independent variables in the form of an equation.
iii. If an inverse relationship exists between the dependent variable and independent variables, the regression coefficients for the independent variables are positive.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
14
i. Autocorrelation often happens when data has been collected over periods of time. ii. Homoscedasticity occurs when the variance of the residuals (Y - Y') is different for different values of Y'.
iii. Violating the need for successive observations of the dependent variable to be uncorrelated is called autocorrelation.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
15
i. If an inverse relationship exists between the dependent variable and independent variables, the regression coefficients for the independent variables are positive. ii. Given a multiple linear equation Y' = 5.1 + 2.2X1 - 3.5X2, assuming other things are held constant, an increase of one unit in the second independent variable will cause a -3.5 unit change in Y.
iii. When the variance of the differences between the actual and the predicted values of the dependent variable are approximately the same, the variables are said to exhibit homoscedasticity.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
16
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Given the regression equation Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%).
What is the estimated percentage of poor persons living below the poverty line in a community with 5% of the community as single-families, a 5% unemployment rate, only 5% holding a Bachelor's Degree and 25% having High School as their highest attained educational level?

A) 2.375
B) -2.375
C) 11.845
D) -11.845
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
17
i. Multiple regression is used when two or more independent variables are used to predict a value of a single dependent variable. ii. The values of b1, b2, and b3 in a multiple regression equation are called the net regression coefficients. They indicate the change in the predicted value for a unit change in one X when the other X variables are held constant.
iii. Multiple regression analysis examines the relationship of several dependent variables on the independent variable.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
18
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Given the regression equation Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%)
Which single event would have the strongest effect in reducing the % poor in Ontario?

A) Decreasing the % of single families by 5%.
B) Decreasing the Unemployment rate by 5%.
C) Increasing the % of persons with a Bachelor's Degree by 10%.
D) Decreasing the % of persons with a High School Diploma by 40%.
E) Increasing the % of persons with a Bachelor's Degree by 15%.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
19
The following correlations were computed as part of a multiple regression analysis that used education, job, and age to predict income. <strong>The following correlations were computed as part of a multiple regression analysis that used education, job, and age to predict income.   Which is the dependent variable?</strong> A) Income B) Age C) Education D) Job Which is the dependent variable?

A) Income
B) Age
C) Education
D) Job
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
20
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained.     Determine the regression equation.</strong> A) Poor (%) = -3.81 - 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) B) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) + 0.003 High School (%) C) Poor (%) = 3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) D) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) E) Poor (%) = 3.81 + 0.798 Single-Families (%) - 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) - 0.003 High School (%) <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained.     Determine the regression equation.</strong> A) Poor (%) = -3.81 - 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) B) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) + 0.003 High School (%) C) Poor (%) = 3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) D) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%) E) Poor (%) = 3.81 + 0.798 Single-Families (%) - 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) - 0.003 High School (%) Determine the regression equation.

A) Poor (%) = -3.81 - 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%)
B) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) + 0.003 High School (%)
C) Poor (%) = 3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%)
D) Poor (%) = -3.81 + 0.798 Single-Families (%) + 0.624 Unemployment Rate (%) - 0.170 Bachelor's Degree (%) - 0.003 High School (%)
E) Poor (%) = 3.81 + 0.798 Single-Families (%) - 0.624 Unemployment Rate (%) + 0.170 Bachelor's Degree (%) - 0.003 High School (%)
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
21
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 + 42X3 + 0.0012X4 + 0.09X5 + 26.8X6.
Explain the meaning of b2.

A) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 0.28 points.
B) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 2.8 points.
C) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 28 points.
D) For each additional living child, their satisfaction index is expected to increase by 42 points.
E) For each additional living child, their satisfaction index is expected to increase by 4.2 points.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
22
In regression analysis, the dferr = ________.

A) the sample size -1
B) the sample size -k-1
C) the number of dependent variables
D) the number of independent variables
E) the sample size-k
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
23
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The multiple correlation coefficient is:</strong> A) 0.900 B) 0.930 C) 0.656 D) 0.867 E) 0.949 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The multiple correlation coefficient is:</strong> A) 0.900 B) 0.930 C) 0.656 D) 0.867 E) 0.949 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The multiple correlation coefficient is:</strong> A) 0.900 B) 0.930 C) 0.656 D) 0.867 E) 0.949 The multiple correlation coefficient is:

A) 0.900
B) 0.930
C) 0.656
D) 0.867
E) 0.949
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
24
What are the degrees of freedom associated with the regression sum of squares?

A) Number of independent variables
B) 1
C) F-ratio
D) (n - 2)
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
25
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $60,000, has two children, has assets of $350,000, has in index of health status of 141, and has 2 social activities per week?

A) 777.7
B) 796.6
C) 617.9
D) 601.6
E) 769.8
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
26
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $26,500, has two children, has assets of $156,000, has in index of health status of 141, and has 2.5 social activities per week?

A) 389.1
B) 421.6
C) 366.0
D) 601.6
E) 769.8
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
27
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.09X5 + 26.8X6.
Explain the meaning of b4.

A) For each additional living child, their satisfaction index is expected to increase by 42 points.
B) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 0.028 points.
C) For each additional$10,000 in assets, their satisfaction index is expected to increase by 0.12 points.
D) For each additional$10,000 in assets, their satisfaction index is expected to increase by 1.2 points.
E) For each additional$10,000 in assets, their satisfaction index is expected to increase by 12 points.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
28
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.09X5 + 26.8X6.
Explain the meaning of b3.

A) For each additional living child, their satisfaction index is expected to increase by 4.2 points.
B) For each additional living child, their satisfaction index is expected to increase by 42 points.
C) For each additional living child, their satisfaction index is expected to increase by 0.42 points.
D) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 0.028 points.
E) For each additional$1,000 of annual income, their satisfaction index is expected to increase by 2.8 points.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
29
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 + 42X3 + 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $100,000, has two children, has assets of $500,000, has in index of health status of 141, and has 2 social activities per week?

A) 777.7
B) 796.6
C) 588.6
D) 601.6
E) 809.1
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
30
If the correlation between two variables X and Y, is +0.67, what is the regression coefficient for these two variables?

A) +0.67
B) > 0
C) < 0
D) = 0
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
31
In a regression analysis, three independent variables are used in the equation based on a sample of forty observations. What are the degrees of freedom associated with the F-statistic?

A) 3 and 39
B) 4 and 40
C) 3 and 36
D) 2 and 39
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
32
In regression analysis, the dfreg = ________.

A) the sample size - 1
B) the sample size -k-1
C) the number of dependent variables
D) the number of independent variables
E) the sample size-k
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
33
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $100,000, has two children, has assets of $500,000, has in index of health status of 141, and has 3 social activities per week?

A) 777.7
B) 835.9
C) 588.6
D) 601.6
E) 769.8
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
34
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.09X5 + 26.8X6.
Explain the meaning of b5.

A) For each additional10 points on the health status index, their satisfaction index is expected to increase by 9 points.
B) For each additional10 points on the health status index, their satisfaction index is expected to increase by 0.09 points.
C) For each additional10 points on the health status index, their satisfaction index is expected to increase by 0.9 points.
D) For each additional10 points on the health status index, their satisfaction index is expected to drop by 0.09 points.
E) For each additional10 points on the health status index, their satisfaction index is expected to decrease by 9 points.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
35
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 + 42X3+ 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $100,000, has two children, has assets of $500,000, has in index of health status of 141, and has 3.5 social activities per week?

A) 777.7
B) 796.6
C) 849.3
D) 601.6
E) 769.8
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
36
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.09X5 + 26.8X6.
Explain the meaning of b6.

A) For each additional social activity per week, their satisfaction index is expected to increase by 26.8 points.
B) For each additional social activity per week, their satisfaction index is expected to increase by 2.68 points.
C) For each additional10 points on the health status index, their satisfaction index is expected to increase by 0.9 points.
D) For each additional$10,000 in assets, their satisfaction index is expected to increase by 12 points.
E) For each additional living child, their satisfaction index is expected to increase by 42 points.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
37
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.19X5 + 26.8X6
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $46,000, has two children, has assets of $200,000, has in index of health status of 141, and has 2.5 social activities per week?

A) 368.3
B) 447.3
C) 366.0
D) 601.6
E) 769.8
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
38
For a unit change in the first independent variable with other things being held constant, what change can be expected in the dependent variable in the multiple regression equation Y' = 5.2 + 6.3X1- 7.1X2?

A) -7.1
B) +6.3
C) +5.2
D) +4.4
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
39
A sample of General Mills employees was studied to determine their degree of satisfaction with their present life. A special index, called the index of satisfaction, was used to measure satisfaction. Six factors were studied: age at the time of first marriage (X1), annual income (X2), number of children living (X3), value of all assets (X4), status of health in the form of an index (X5), and the average number of social activities per week (X6). Suppose the multiple regression equation is: Y' = 16.24 + 0.017X1 + 0.00028X2 +42X3 + 0.0012X4 + 0.19X5 + 26.8X6.
What is the estimated index of satisfaction for a person who first married at 25, has an annual income of $46,000, has two children, has assets of $350,000, has in index of health status of 141, and has 2.5 social activities per week?

A) 368.3
B) 421.6
C) 366.0
D) 627.3
E) 769.8
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
40
If there are four independent variables in a multiple regression equation, there are also four

A) Y-intercepts.
B) regression coefficients.
C) dependent variables.
D) constant terms.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
41
i. The coefficient of multiple determination reports the strength of the association between the dependent variable and the set of independent variables. ii. The multiple standard error of estimate for two independent variables measures the variation about a regression plane.
iii. A multiple coefficient of determination equaling -0.76 is definitely possible.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
42
If the coefficient of multiple determinations is 0.81, what percent of variation is not explained?

A) 19%
B) 90%
C) 66%
D) 81%
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
43
i. The multiple coefficient of determination, R2, reports the proportion of the variation in Y that is not explained by the variation in the set of independent variables. ii. The coefficient of multiple determination reports the strength of the association between the dependent variable and the set of independent variables.
iii. The multiple standard error of estimate for two independent variables measures the variation about a regression plane.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
44
What happens as the scatter of data values about the regression plane increases?

A) Standard error of estimate increases.
B) R2decreases.
C) (1 -R2) decreases.
D) Residual sum of squares decreases.
E) (1 -R2), residual sum of squares and standard error of estimate all increase, andR2decreases.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
45
The coefficient of determination measures the proportion of

A) explained variation relative to total variation.
B) variation due to the relationship among variables.
C) error variation relative to total variation.
D) variation due to regression.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
46
i. A coefficient of multiple determination equaling -0.76 is definitely possible. ii. Multiple R2 measures the proportion of explained variation relative to total variation.
iii. The number of degrees of freedom associated with the regression sum of squares in the regression equation model equals the number of independent variables.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
47
What is the measurement of explained variation?

A) Coefficient of multiple determination
B) Coefficient of multiple nondetermination
C) Regression coefficient
D) Correlation matrix
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
48
What is the range of values for multiple R?

A) -100% to - 100% inclusive
B) -100% to 0% inclusive
C) 0% to + 100% inclusive
D) Unlimited range
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
49
If a multiple regression analysis is based on ten independent variables collected from a sample of 125 observations, what will be the value of the denominator in the calculation of the multiple standard error of estimate?

A) 125
B) 10
C) 114
D) 115
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
50
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The standard error of estimate is:</strong> A) 4.467 B) 5.698 C) 5.864 D) 19.977 E) 15.917 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The standard error of estimate is:</strong> A) 4.467 B) 5.698 C) 5.864 D) 19.977 E) 15.917 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The standard error of estimate is:</strong> A) 4.467 B) 5.698 C) 5.864 D) 19.977 E) 15.917 The standard error of estimate is:

A) 4.467
B) 5.698
C) 5.864
D) 19.977
E) 15.917
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
51
i. The multiple coefficient of determination, R2, reports the proportion of the variation in Y that is explained by the variation in the set of independent variables. ii. The coefficient of multiple determination reports the strength of the association between the dependent variable and the set of independent variables.
iii. A coefficient of multiple determination equaling -0.76 is definitely possible.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
52
i. The multiple standard error of estimate measures the variation about the regression plane when two independent variables are considered. ii. The multiple coefficient of determination, R2, reports the proportion of the variation in Y that is not explained by the variation in the set of independent variables.
iii. The coefficient of multiple determination reports the strength of the association between the dependent variable and the set of independent variables.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
53
i. Multiple R2 measures the proportion of explained variation. ii. 90% of total variation in the dependent variable is explained by the independent variable for a multiple R2 = 0.90.
iii. The multiple standard error of estimate measures the variation about the regression plane when two independent variables are considered.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
54
i. 90% of total variation in the dependent variable is explained by the independent variable for a multiple R2 = 0.90. ii. The multiple standard error of estimate measures the variation about the regression plane when two independent variables are considered.
iii. The multiple coefficient of determination, R2, reports the proportion of the variation in Y that is not explained by the variation in the set of independent variables.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
55
A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results: <strong>A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results:   Based on the ANOVA, the multiple coefficient of determination is</strong> A) 5.957%. B) 59.3%. C) 40.7%. D) cannot be computed. Based on the ANOVA, the multiple coefficient of determination is

A) 5.957%.
B) 59.3%.
C) 40.7%.
D) cannot be computed.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
56
Which test statistic do we apply to test the null hypothesis that the multiple regression coefficients are all zero?

A) z
B) t
C) F
D) SPSS-X
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
57
i. The multiple standard error of estimate for two independent variables measures the variation about a regression plane. ii. A multiple coefficient of determination equaling -0.76 is definitely possible.
iii. Multiple R2 measures the proportion of explained variation relative to total variation.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
58
What does the multiple standard error of estimate measure?

A) Change in Y' for a change in X1.
B) Variation of the data points between Y and Y'.
C) Variation due to the relationship between the dependent and independent variables.
D) Amount of explained variation.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
59
How is the degree of association between the set of independent variables and the dependent variable is measured?

A) Confidence intervals
B) Autocorrelation
C) Coefficient of multiple determination
D) Standard error of estimate
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
60
i. The multiple standard error of estimate measures the variation about the regression plane when two independent variables are considered. ii. A coefficient of multiple determination equaling -0.76 is definitely possible.
iii. The number of degrees of freedom associated with the regression sum of squares in the regression equation model equals the number of independent variables.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
61
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The critical value of F to be used in the global test of the model is: (5% level of significance)</strong> A) 2.51 B) 2.58 C) 3.70 D) 5.57 E) 3.39 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The critical value of F to be used in the global test of the model is: (5% level of significance)</strong> A) 2.51 B) 2.58 C) 3.70 D) 5.57 E) 3.39 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The critical value of F to be used in the global test of the model is: (5% level of significance)</strong> A) 2.51 B) 2.58 C) 3.70 D) 5.57 E) 3.39 The critical value of F to be used in the global test of the model is: (5% level of significance)

A) 2.51
B) 2.58
C) 3.70
D) 5.57
E) 3.39
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
62
What test investigates whether all the independent variables have zero net regression coefficients?

A) Multicollinearity
B) Autocorrelation
C) Global
D) Pearson
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
63
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, and designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the output below, determine which variable Angela should consider deleting. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, and designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the output below, determine which variable Angela should consider deleting.    </strong> A) It doesn't matter which she uses, the results are virtually the same in any case. B) Angela should delete the high school information, because the P-value is over 0.05. C) Angela should delete the bachelor'sdegree information, because the P-value is close to 0.05. D) Angela should delete the unemployment rate information because the P-value is 0.00. E) Angela should exclude the single-family and unemployment information because the P-value values are 0. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, and designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the output below, determine which variable Angela should consider deleting.    </strong> A) It doesn't matter which she uses, the results are virtually the same in any case. B) Angela should delete the high school information, because the P-value is over 0.05. C) Angela should delete the bachelor'sdegree information, because the P-value is close to 0.05. D) Angela should delete the unemployment rate information because the P-value is 0.00. E) Angela should exclude the single-family and unemployment information because the P-value values are 0.

A) It doesn't matter which she uses, the results are virtually the same in any case.
B) Angela should delete the high school information, because the P-value is over 0.05.
C) Angela should delete the bachelor'sdegree information, because the P-value is close to 0.05.
D) Angela should delete the unemployment rate information because the P-value is 0.00.
E) Angela should exclude the single-family and unemployment information because the P-value values are 0.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
64
Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the outputs below for this data set, determine whether Angela should use the model with the high school data included, or the data without the high school data, and why. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the outputs below for this data set, determine whether Angela should use the model with the high school data included, or the data without the high school data, and why.        </strong> A) It doesn't matter which she uses, the results are virtually the same in either case. B) Angela should use the data that includes the high school information, because more information is always better than less information. C) Angela should use the data that includes the high school information, because the calculated F value is lower. D) Angela should include the high school data because the P-value is 0.977, close to 1. E) Angela should exclude the high school data because the adjusted R-squared value is a little higher than in the data that includes the high school information. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the outputs below for this data set, determine whether Angela should use the model with the high school data included, or the data without the high school data, and why.        </strong> A) It doesn't matter which she uses, the results are virtually the same in either case. B) Angela should use the data that includes the high school information, because more information is always better than less information. C) Angela should use the data that includes the high school information, because the calculated F value is lower. D) Angela should include the high school data because the P-value is 0.977, close to 1. E) Angela should exclude the high school data because the adjusted R-squared value is a little higher than in the data that includes the high school information. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the outputs below for this data set, determine whether Angela should use the model with the high school data included, or the data without the high school data, and why.        </strong> A) It doesn't matter which she uses, the results are virtually the same in either case. B) Angela should use the data that includes the high school information, because more information is always better than less information. C) Angela should use the data that includes the high school information, because the calculated F value is lower. D) Angela should include the high school data because the P-value is 0.977, close to 1. E) Angela should exclude the high school data because the adjusted R-squared value is a little higher than in the data that includes the high school information. <strong>Angela Chou has been asked to investigate the determinants of poverty in Ontario communities. She collected data on 60 communities from Statistics Canada. She selected the percentage of poor persons living under the poverty line [Poor (%)], measured by Low Income Cut-Off, designed by Statistics Canada as a measure of poverty for a community, as the dependent variable. The independent variables selected are percent of single families in each community, the unemployment rate in each community, percent of population in the community holding a bachelor's degree as their highest level of education attained, and percent of population holding a High School Diploma as their highest level of education attained. Using the outputs below for this data set, determine whether Angela should use the model with the high school data included, or the data without the high school data, and why.        </strong> A) It doesn't matter which she uses, the results are virtually the same in either case. B) Angela should use the data that includes the high school information, because more information is always better than less information. C) Angela should use the data that includes the high school information, because the calculated F value is lower. D) Angela should include the high school data because the P-value is 0.977, close to 1. E) Angela should exclude the high school data because the adjusted R-squared value is a little higher than in the data that includes the high school information.

A) It doesn't matter which she uses, the results are virtually the same in either case.
B) Angela should use the data that includes the high school information, because more information is always better than less information.
C) Angela should use the data that includes the high school information, because the calculated F value is lower.
D) Angela should include the high school data because the P-value is 0.977, close to 1.
E) Angela should exclude the high school data because the adjusted R-squared value is a little higher than in the data that includes the high school information.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
65
The best example of a null hypothesis for a global test of a multiple regression model is:

A) H0: β\beta 1 = β\beta 2 = β\beta 3 = β\beta 4.
B) H0: μ\mu 1 = μ\mu 2 = μ\mu 3 = μ\mu 4.
C) H0: β\beta 1 = 0.
D) if F is greater than 20.00 then reject.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
66
The best example of an alternate hypothesis for a global test of a multiple regression model is:

A) H1: β\beta 1 = β\beta 2 = β\beta 3 = β\beta 4.
B) H1: β\beta 1 ≠\neqβ\beta 2 ≠\neqβ\beta 3 ≠\neqβ\beta 4.
C) H1: Not all the β\beta 's are 0.
D) if F is less than 20.00 then fail to reject.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
67
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The number of degrees of freedom to be used in determining the critical value of t to be used in a hypothesis test of the regression coefficients is:</strong> A) 27 B) 26 C) 22 D) 21 E) 20 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The number of degrees of freedom to be used in determining the critical value of t to be used in a hypothesis test of the regression coefficients is:</strong> A) 27 B) 26 C) 22 D) 21 E) 20 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The number of degrees of freedom to be used in determining the critical value of t to be used in a hypothesis test of the regression coefficients is:</strong> A) 27 B) 26 C) 22 D) 21 E) 20 The number of degrees of freedom to be used in determining the critical value of t to be used in a hypothesis test of the regression coefficients is:

A) 27
B) 26
C) 22
D) 21
E) 20
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
68
The best example of a null hypothesis for testing an individual regression coefficient is:

A) H0: β\beta 1 = β\beta 2 = β\beta 3 = β\beta 4.
B) H0: μ\mu 1 = μ\mu 2 = μ\mu 3 = μ\mu 4.
C) H0: β\beta 1 = 0.
D) if F is greater than 20.00 then reject.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
69
A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results: <strong>A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results:   Based on the ANOVA and a 0.05 significance level, the global null hypothesis test of the multiple regression model.</strong> A) will be rejected and conclude that monthly salary is related to all of the independent variables. B) will be rejected and conclude that monthly salary is related to at least one of the independent variables. C) will not be rejected. D) will show a high multiple coefficient of determination. Based on the ANOVA and a 0.05 significance level, the global null hypothesis test of the multiple regression model.

A) will be rejected and conclude that monthly salary is related to all of the independent variables.
B) will be rejected and conclude that monthly salary is related to at least one of the independent variables.
C) will not be rejected.
D) will show a high multiple coefficient of determination.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
70
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The computed F for the global test is:</strong> A) 7.802 B) 25.695 C) 15.790 D) 26.981 E) 114.779 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The computed F for the global test is:</strong> A) 7.802 B) 25.695 C) 15.790 D) 26.981 E) 114.779 <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The computed F for the global test is:</strong> A) 7.802 B) 25.695 C) 15.790 D) 26.981 E) 114.779 The computed F for the global test is:

A) 7.802
B) 25.695
C) 15.790
D) 26.981
E) 114.779
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
71
Hypotheses concerning individual regression coefficients are tested using which statistic?

A) t-statistic
B) z-statistic
C) χ\chi 2(chi-square statistic)
D) H
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
72
i. A variable whose possible outcomes are coded as a "1" or a "0" is called a dummy variable. ii. A dummy variable is added to the regression equation to control for error.
iii. If the null hypothesis β\beta 4 = 0 is not rejected, then the independent variable X4 has no effect in predicting the dependent variable.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
73
i. If the null hypothesis β\beta 4 = 0 is not rejected, then the independent variable X4 has a strong effect in predicting the dependent variable. ii. A dummy variable is added to the regression equation to control for error.
iii. A variable whose possible outcomes are coded as a "1" or a "0" is called a strong independent variable.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
74
A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results: <strong>A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results:   Based on the hypothesis tests for the individual regression coefficients,</strong> A) all the regression coefficients are not equal to zero. B) job is the only significant variable in the model. C) only months of service and gender are significantly related to monthly salary. D) service is the only significant variable in the model. Based on the hypothesis tests for the individual regression coefficients,

A) all the regression coefficients are not equal to zero.
B) "job" is the only significant variable in the model.
C) only months of service and gender are significantly related to monthly salary.
D) "service" is the only significant variable in the model.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
75
The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The t-value computed for testing the coefficient Batg. Avg. is:</strong> A) 112.991. B) 2.086. C) 7.438. D) 2.832. E) -1.593. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The t-value computed for testing the coefficient Batg. Avg. is:</strong> A) 112.991. B) 2.086. C) 7.438. D) 2.832. E) -1.593. <strong>The information below is from the multiple regression analysis computer output for 28 teams in Major League Baseball. The model is designed to predict wins using attendance, payroll, batting average, home runs, stolen bases, errors, and team ERA.       The t-value computed for testing the coefficient Batg. Avg. is:</strong> A) 112.991. B) 2.086. C) 7.438. D) 2.832. E) -1.593. The t-value computed for testing the coefficient "Batg. Avg." is:

A) 112.991.
B) 2.086.
C) 7.438.
D) 2.832.
E) -1.593.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
76
i. A variable whose possible outcomes are coded as a "1" or a "0" is called a dummy variable. ii. If the null hypothesis β\beta 4 = 0 is not rejected, then the independent variable X4 has no effect in predicting the dependent variable.
iii. A dummy variable is added to the regression equation to control for error.

A) (i), (ii), and (iii) are all correct statements.
B) (i) and (ii) are correct statements but not (iii).
C) (i) and (iii) are correct statements but not (ii).
D) (ii) and (iii) are correct statements but not (i).
E) (i), (ii), and (iii) are all false statements.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
77
Which of the following is a characteristic of the F-distribution?

A) Normally distributed
B) Positively skewed
C) Negatively skewed
D) Equal to the t-distribution
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
78
A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results: <strong>A manager at a local bank analyzed the relationship between monthly salary and three independent variables: length of service (measured in months), gender (0 = female, 1 = male) and job type (0 = clerical, 1 = technical). The following ANOVA summarizes the regression results:   The results for the variable gender show that,</strong> A) males average $222.78 more than females in monthly salary. B) females average $222.78 more than males in monthly salary. C) gender is not related to monthly salary. D) gender and months of service are correlated. The results for the variable gender show that,

A) males average $222.78 more than females in monthly salary.
B) females average $222.78 more than males in monthly salary.
C) gender is not related to monthly salary.
D) gender and months of service are correlated.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
79
In multiple regression, a dummy variable can be included in a multiple regression model as

A) an additional quantitative variable.
B) a nominal variable with three or more values.
C) a nominal variable with only two values.
D) a new regression coefficient.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
80
What can we conclude if the net regression coefficients in the population are not significantly different from zero?

A) Strong relationship exists among the variables.
B) No relationship exists between the dependent variable and the independent variables.
C) Independent variables are good predictors.
D) Good forecasts are possible.
Unlock Deck
Unlock for access to all 128 flashcards in this deck.
Unlock Deck
k this deck
locked card icon
Unlock Deck
Unlock for access to all 128 flashcards in this deck.