Question 1

The p-value for a one-sided left-tail test is given by
A)Pr(Z - t^act) = Φ(t^act).
B)Pr(Z < t^act) = Φ(t^act).
C)Pr(Z < t^act) < 1.645.
D)cannot be calculated, since probabilities must always be positive.

Accepted Answer

B
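As a minimal sketch of the accepted answer to Question 1, the left-tail p-value is the standard normal CDF evaluated at the actual t-statistic. The function name `left_tail_p_value` is hypothetical, and the large-sample normal approximation is assumed.

```python
from statistics import NormalDist


def left_tail_p_value(t_act: float) -> float:
    """Left-tail p-value Pr(Z < t_act) = Phi(t_act) under a standard normal Z."""
    return NormalDist().cdf(t_act)


# A t-statistic at the one-sided 5% critical value gives a p-value near 0.05.
p_at_critical_value = left_tail_p_value(-1.645)
print(round(p_at_critical_value, 4))
```

A positive t-statistic gives a left-tail p-value above 0.5, so the p-value is always a valid probability.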

Question 2

In the presence of heteroskedasticity, and assuming that the usual least squares assumptions hold, the OLS estimator is
A)efficient.
B)BLUE.
C)unbiased and consistent.
D)unbiased but not consistent.

Accepted Answer

C

Question 3

If the absolute value of your calculated t-statistic exceeds the critical value from the standard normal distribution, you can
A)reject the null hypothesis.
B)safely assume that your regression results are significant.
C)reject the assumption that the error terms are homoskedastic.
D)conclude that most of the actual values are very close to the regression line.

Accepted Answer

A

Question 4

The homoskedasticity-only estimator of the variance of $\hat{\beta}_1$ is
A)$\frac{s_{\hat{u}}^{2}}{\sum_{i=1}^{n} \left( X_i - \bar{X} \right)^{2}}$
B)$\frac{s_{\hat{u}}^{2}}{\sqrt{\sum_{i=1}^{n} \left( X_i - \bar{X} \right)^{2}}}$
C)$\frac{s_{\hat{u}}^{2}}{\sum_{i=1}^{n} X_i^{2} - \bar{X}}$
D)$\frac{1}{n} \times \frac{\frac{1}{n-2} \sum_{i=1}^{n} \left( X_i - \bar{X} \right)^{2} \hat{u}_i^{2}}{\left[ \frac{1}{n} \sum_{i=1}^{n} \left( X_i - \bar{X} \right)^{2} \right]^{2}}$

Accepted Answer

The answer of The homoskedasticity-only estimator of the variance of...

Question 5

Finding a small value of the p-value (e.g. less than 5%)
A)indicates evidence in favor of the null hypothesis.
B)implies that the t-statistic is less than 1.96.
C)indicates evidence against the null hypothesis.
D)will only happen roughly one in twenty samples.

Accepted Answer

The answer of Finding a small value of the p-value...

Question 6

Under the least squares assumptions (zero conditional mean for the error term, X_i and Y_i being i.i.d., and X_i and u_i having finite fourth moments), the OLS estimator for the slope and intercept
A)has an exact normal distribution for n > 15.
B)is BLUE.
C)has a normal distribution even in small samples.
D)is unbiased.

Accepted Answer

The answer of Under the least squares assumptions (zero conditional...

Question 7

Consider the following regression line: $\widehat {\text { TestScore }}$ = 698.9 - 2.28 × STR. You are told that the t-statistic on the slope coefficient is 4.38. What is the standard error of the slope coefficient?

A)0.52
B)1.96
C)-1.96
D)4.38

Accepted Answer

The answer of Consider the following regression line: \(\widehat {\text...
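The arithmetic behind Question 7 can be sketched as follows: since the t-statistic is the estimate minus its hypothesized value, divided by the standard error, the standard error is recovered by dividing back. The helper name `standard_error_from_t` is hypothetical. A hypothesized value of zero is assumed, and the reported t-statistic is treated as an absolute value.

```python
def standard_error_from_t(estimate: float, t_stat: float, hypothesized: float = 0.0) -> float:
    """Back out the standard error from t = (estimate - hypothesized) / SE."""
    return abs(estimate - hypothesized) / abs(t_stat)


# Slope of -2.28 on STR with a reported t-statistic of 4.38 (in absolute value).
se_slope = standard_error_from_t(-2.28, 4.38)
print(round(se_slope, 2))
```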

Question 8

The construction of the t-statistic for a one- and a two-sided hypothesis
A)depends on the critical value from the appropriate distribution.
B)is the same.
C)is different since the critical value must be 1.645 for the one-sided hypothesis, but 1.96 for the two-sided hypothesis (using a 5% probability for the Type I error).
D)uses ±1.96 for the two-sided test, but only +1.96 for the one-sided test.

Accepted Answer

The answer of The construction of the t-statistic for a...

Question 9

In general, the t-statistic has the following form:
A)$\frac{\text{estimate} - \text{hypothesized value}}{\text{standard error of estimate}}$
B)$\frac{\text{estimator}}{\text{standard error of estimate}}$
C)$\frac{\text{estimator} - \text{hypothesized value}}{\text{standard error of estimate}}$
D)$\frac{\frac{\text{estimator} - \text{hypothesized value}}{\text{standard error of estimator}}}{\sqrt{n}}$

Accepted Answer

The answer of In general, the t-statistic has the following...

Question 10

When estimating a demand function for a good where quantity demanded is a linear function of the price, you should
A)not include an intercept because the price of the good is never zero.
B)use a one-sided alternative hypothesis to check the influence of price on quantity.
C)use a two-sided alternative hypothesis to check the influence of price on quantity.
D)reject the idea that price determines demand unless the coefficient is at least 1.96.

Accepted Answer

The answer of When estimating a demand function for a...

Question 11

The only difference between a one- and two-sided hypothesis test is
A)the null hypothesis.
B)dependent on the sample size n.
C)the sign of the slope coefficient.
D)how you interpret the t-statistic.

Accepted Answer

The answer of The only difference between a one- and...

Question 12

With heteroskedastic errors, the weighted least squares estimator is BLUE. You should use OLS with heteroskedasticity-robust standard errors because
A)this method is simpler.
B)the exact form of the conditional variance is rarely known.
C)the Gauss-Markov theorem holds.
D)your spreadsheet program does not have a command for weighted least squares.

Accepted Answer

The answer of With heteroskedastic errors, the weighted least squares...

Question 13

The proof that OLS is BLUE requires all of the following assumptions with the exception of:
A)the errors are homoskedastic.
B)the errors are normally distributed.
C)$E(u_i \mid X_i) = 0$.
D)large outliers are unlikely.

Accepted Answer

The answer of The proof that OLS is BLUE requires...

Question 14

Imagine that you were told that the t-statistic for the slope coefficient of the regression line $\widehat {\text { TestScore }}$ = 698.9 - 2.28 × STR was 4.38. What are the units of measurement for the t-statistic?

A)points of the test score
B)number of students per teacher
C)$\frac{\text{TestScore}}{\text{STR}}$

D)standard deviations

Accepted Answer

The answer of Imagine that you were told that the...

Question 15

Heteroskedasticity means that
A)homogeneity cannot be assumed automatically for the model.
B)the variance of the error term is not constant.
C)the observed units have different preferences.
D)agents are not all rational.

Accepted Answer

The answer of Heteroskedasticity means that A)homogeneity cannot be assumed automatically...

Question 16

One of the following steps is not required as a step to test for the null hypothesis:
A)compute the standard error of $\hat{\beta}_1$.
B)test for the errors to be normally distributed.
C)compute the t-statistic.
D)compute the p-value.

Accepted Answer

The answer of One of the following steps is not...

Question 17

The error term is homoskedastic if
A)$\mathrm{var}(u_i \mid X_i = x)$ is constant for i = 1,…, n.
B)$\mathrm{var}(u_i \mid X_i = x)$ depends on x.
C)X_i is normally distributed.
D)there are no outliers.

Accepted Answer

The answer of The error term is homoskedastic if A)var(u_i _...

Question 18

A binary variable is often called a
A)dummy variable.
B)dependent variable.
C)residual.
D)power of a test.

Accepted Answer

The answer of A binary variable is often called a A)dummy...

Question 19

The confidence interval for the sample regression function slope
A)can be used to conduct a test about a hypothesized population regression function slope.
B)can be used to compare the value of the slope relative to that of the intercept.
C)adds and subtracts 1.96 from the slope.
D)allows you to make statements about the economic importance of your estimate.

Accepted Answer

The answer of The confidence interval for the sample regression...

Question 20

The t-statistic is calculated by dividing
A)the OLS estimator by its standard error.
B)the slope by the standard deviation of the explanatory variable.
C)the estimator minus its hypothesized value by the standard error of the estimator.
D)the slope by 1.96.

Accepted Answer

The answer of The t-statistic is calculated by dividing A)the OLS...

Question 21

You extract approximately 5,000 observations from the Current Population Survey (CPS) and estimate the following regression function:
$\widehat{\text{ahe}}$ = 3.32 - 0.45 × Age, R² = 0.02, SER = 8.66
(1.00)(0.04)
where ahe is average hourly earnings, and Age is the individual's age. Given the specification, your 95% confidence interval for the effect of changing age by 5 years is approximately

A)[$1.96, $2.54]
B)[$2.32, $4.32]
C)[$1.35, $5.30]
D)cannot be determined given the information provided

Accepted Answer

The answer of You extract approximately 5,000 observations from the...

Question 22

You have collected 14,925 observations from the Current Population Survey. There are 6,285 females in the sample, and 8,640 males. The females report a mean of average hourly earnings of $16.50 with a standard deviation of $9.06. The males have an average of $20.09 and a standard deviation of $10.85. The overall mean average hourly earnings is $18.58.
a. Using the t-statistic for testing differences between two means (section 3.4 of your textbook), decide whether or not there is sufficient evidence to reject the null hypothesis that females and males have identical average hourly earnings.
b. You decide to run two regressions: first, you simply regress average hourly earnings on an intercept only. Next, you repeat this regression, but only for the 6,285 females in the sample. What will the regression coefficients be in each of the two regressions?
c. Finally you run a regression over the entire sample of average hourly earnings on an intercept and a binary variable DFemme, where this variable takes on a value of 1 if the individual is a female, and is 0 otherwise. What will be the value of the intercept? What will be the value of the coefficient of the binary variable?
d. What is the standard error on the slope coefficient? What is the t-statistic?
e. Had you used the homoskedasticity-only standard error in (d) and calculated the t-statistic, how would you have had to change the test statistic in (a) to get the identical result?

Accepted Answer

The answer of You have collected 14,925 observations from the...
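The calculations that Question 22 asks for in parts (a) to (c) can be sketched as below. This is an illustration under stated assumptions, not the textbook's worked solution. The helper `two_sample_t` is hypothetical and uses the unequal-variance standard error for a difference in means. The regression coefficients rely on the standard result that a regression on an intercept and a binary variable reproduces the group means.

```python
import math


def two_sample_t(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """t-statistic and standard error for H0: mean_a - mean_b = 0 (unequal variances)."""
    se = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_a - mean_b) / se, se


# (a) males minus females
t_stat, se_diff = two_sample_t(20.09, 10.85, 8640, 16.50, 9.06, 6285)

# (b) an intercept-only regression returns the sample mean of the dependent variable
overall_mean = (8640 * 20.09 + 6285 * 16.50) / (8640 + 6285)

# (c) with DFemme = 1 for females: intercept = male mean, slope = female mean - male mean
intercept = 20.09
slope_dfemme = 16.50 - 20.09

print(round(t_stat, 2), round(se_diff, 3), round(overall_mean, 2), round(slope_dfemme, 2))
```

The weighted average of the two group means matches the overall mean of $18.58 reported in the question.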

Question 23

If the errors are heteroskedastic, then
A)OLS is BLUE.
B)WLS is BLUE if the conditional variance of the errors is known up to a constant factor of proportionality.
C)LAD is BLUE if the conditional variance of the errors is known up to a constant factor of proportionality.
D)OLS is efficient.

Accepted Answer

The answer of If the errors are heteroskedastic, then A)OLS is...

Question 24

In order to formulate whether or not the alternative hypothesis is one-sided or two-sided, you need some guidance from economic theory. Choose at least three examples from economics or other fields where you have a clear idea what the null hypothesis and the alternative hypothesis for the slope coefficient should be. Write a brief justification for your answer.

Accepted Answer

The answer of In order to formulate whether or not...

Question 25

Explain carefully the relationship between a confidence interval, a one-sided hypothesis test, and a two-sided hypothesis test. What is the unit of measurement of the t-statistic?

Accepted Answer

The answer of Explain carefully the relationship between a confidence...

Question 26

Carefully discuss the advantages of using heteroskedasticity-robust standard errors over standard errors calculated under the assumption of homoskedasticity. Give at least five examples where it is very plausible to assume that the errors display heteroskedasticity.

Accepted Answer

The answer of Carefully discuss the advantages of using heteroskedasticity-robust...

Question 27

(Requires Appendix material from Chapters 4 and 5) Shortly before making a group presentation on the testscore/student-teacher ratio results, you realize that one of your peers forgot to type all the relevant information on one of your slides. Here is what you see:

$\widehat{\text{TestScore}}$ = 698.9 - STR, R² = 0.051, SER = 18.6
(9.47)(0.48)
In addition, your group member explains that he ran the regression in a standard spreadsheet program, and that, as a result, the standard errors in parentheses are homoskedasticity-only standard errors.
(a)Find the value for the slope coefficient.
(b)Calculate the t-statistic for the slope and the intercept. Test the hypothesis that the intercept and the slope are different from zero.
(c)Should you be concerned that your group member only gave you the result for the homoskedasticity-only standard error formula, instead of using the heteroskedasticity-robust standard errors?

Accepted Answer

The answer of (Requires Appendix material from Chapters 4 and...

Question 28

Consider the estimated equation from your textbook
$\widehat{\text{TestScore}}$ = 698.9 - 2.28 × STR, R² = 0.051, SER = 18.6
(10.4)(0.52)
The t-statistic for the slope is approximately

A)4.38
B)67.20
C)0.52
D)1.76

Accepted Answer

The answer of Consider the estimated equation from your textbook...

Question 29

(Continuation from Chapter 4, number 6) The neoclassical growth model predicts that for identical savings rates and population growth rates, countries should converge to the per capita income level. This is referred to as the convergence hypothesis. One way to test for the presence of convergence is to compare the growth rates over time to the initial starting level.
(a)The results of the regression for 104 countries were as follows:
$\widehat{g6090}$ = 0.019 - 0.0006 × RelProd₆₀, R² = 0.00007, SER = 0.016
(0.004)(0.0073)
where g6090 is the average annual growth rate of GDP per worker for the 1960-1990 sample period, and RelProd₆₀ is GDP per worker relative to the United States in 1960. Numbers in parentheses are heteroskedasticity-robust standard errors. Using the OLS estimator with homoskedasticity-only standard errors, the results changed as follows:
$\widehat{g6090}$ = 0.019 - 0.0006 × RelProd₆₀, R² = 0.00007, SER = 0.016
(0.002)(0.0068)
Why didn't the estimated coefficients change? Given that the standard error of the slope is now smaller, can you reject the null hypothesis of no beta convergence? Are the results in the second equation more reliable than the results in the first equation? Explain.
(b)You decide to restrict yourself to the 24 OECD countries in the sample. This changes your regression output as follows (numbers in parentheses are heteroskedasticity-robust standard errors):
$\widehat{g6090}$ = 0.048 - 0.0404 × RelProd₆₀, R² = 0.82, SER = 0.0046
(0.004)(0.0063)
Test for evidence of convergence now. If your conclusion is different than in (a), speculate why this is the case.
(c)The authors of your textbook have informed you that unless you have more than 100 observations, it may not be plausible to assume that the distribution of your OLS estimators is normal. What are the implications here for testing the significance of your theory?

Accepted Answer

The answer of (Continuation from Chapter 4, number 6)The neoclassical...

Question 30

(Continuation from Chapter 4) At a recent county fair, you observed that at one stand people's weight was forecasted, and were surprised by the accuracy (within a range). Thinking about how the person could have predicted your weight fairly accurately (despite the fact that she did not know about your "heavy bones"), you think about how this could have been accomplished. You remember that medical charts for children contain 5%, 25%, 50%, 75% and 95% lines for a weight/height relationship and decide to conduct an experiment with 110 of your peers. You collect the data and calculate the following sums:

$\sum_{i=1}^{n} Y_i = 17{,}375, \quad \sum_{i=1}^{n} X_i = 7{,}665.5,$
$\sum_{i=1}^{n} y_i^{2} = 94{,}228.8, \quad \sum_{i=1}^{n} x_i^{2} = 1{,}248.9, \quad \sum_{i=1}^{n} x_i y_i = 7{,}625.9$

where the height is measured in inches and weight in pounds. (Small letters refer to deviations from means as in $z_i = Z_i - \bar{Z}$.)
(a)Calculate the homoskedasticity-only standard errors and, using the resulting t-statistic, perform a test on the null hypothesis that there is no relationship between height and weight in the population of college students.
(b)What is the alternative hypothesis in the above test, and what level of significance did you choose?
(c)Statistics and econometrics textbooks often ask you to calculate critical values based on some level of significance, say 1%, 5%, or 10%. What sort of criteria do you think should play a role in determining which level of significance to choose?
(d)What do you think the relationship is between testing for the significance of the slope and whether or not the regression R² is zero?

Accepted Answer

The answer of (Continuation from Chapter 4)At a recent county...
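For part (a) of Question 30, a rough sketch of the computation from the reported sums is given below. It assumes Y is weight and X is height, as the units in the question suggest, and uses the homoskedasticity-only formulas with n - 2 degrees of freedom. Variable names are illustrative.

```python
import math

n = 110
sum_Y, sum_X = 17_375.0, 7_665.5        # raw sums of weight (Y) and height (X)
syy, sxx, sxy = 94_228.8, 1_248.9, 7_625.9  # sums of squares/products in deviations from means

# OLS estimates from deviation sums
slope = sxy / sxx
intercept = sum_Y / n - slope * sum_X / n

# Sum of squared residuals and squared standard error of the regression
ssr = syy - slope * sxy
s2_u = ssr / (n - 2)

# Homoskedasticity-only standard errors
se_slope = math.sqrt(s2_u / sxx)
sum_X_squared = sxx + n * (sum_X / n) ** 2   # sum of X_i^2 recovered from deviations
se_intercept = math.sqrt(s2_u * sum_X_squared / (n * sxx))

t_slope = slope / se_slope   # t-statistic for H0: slope = 0
r_squared = 1 - ssr / syy

print(round(slope, 2), round(se_slope, 2), round(t_slope, 2), round(r_squared, 3))
```

The t-statistic can then be compared with the critical value for whatever significance level is chosen in part (b).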

Question 31

Using the textbook example of 420 California school districts and the regression of testscores on the student-teacher ratio, you find that the standard error on the slope coefficient is 0.51 when using the heteroskedasticity robust formula, while it is 0.48 when employing the homoskedasticity only formula. When calculating the t-statistic, the recommended procedure is to

A)use the homoskedasticity only formula because the t-statistic becomes larger
B)first test for homoskedasticity of the errors and then make a decision
C)use the heteroskedasticity robust formula
D)make a decision depending on how much different the estimate of the slope is under the two procedures

Accepted Answer

The answer of Using the textbook example of 420 California...

Question 32

The homoskedastic normal regression assumptions are all of the following with the exception of:
A)the errors are homoskedastic.
B)the errors are normally distributed.
C)there are no outliers.
D)there are at least 10 observations.

Accepted Answer

The answer of The homoskedastic normal regression assumptions are all...

Question 33

Using 143 observations, assume that you had estimated a simple regression function and that your estimate for the slope was 0.04, with a standard error of 0.01. You want to test whether or not the estimate is statistically significant. Which of the following possible decisions is the only correct one:

A)you decide that the coefficient is small and hence most likely is zero in the population
B)the slope is statistically significant since it is four standard errors away from zero
C)the response of Y given a change in X must be economically important since it is statistically significant
D)since the slope is very small, so must be the regression R².

Accepted Answer

The answer of Using 143 observations, assume that you had...

Question 34

(Continuation from Chapter 4, number 5) You have learned in one of your economics courses that one of the determinants of per capita income (the "Wealth of Nations") is the population growth rate. Furthermore you also found out that the Penn World Tables contain income and population data for 104 countries of the world. To test this theory, you regress the GDP per worker (relative to the United States) in 1990 (RelPersInc) on the difference between the average population growth rate of that country (n) and the U.S. average population growth rate (n_us) for the years 1980 to 1990. This results in the following regression output:

$\widehat{\text{RelPersInc}}$ = 0.518 - 18.831×(n - n_us), R²=0.522, SER = 0.197
(0.056)(3.177)
(a)Is there any reason to believe that the variance of the error terms is homoskedastic?
(b)Is the relationship statistically significant?

Accepted Answer

The answer of (Continuation from Chapter 4, number 5)You have...

Question 35

(Continuation from Chapter 4) Sir Francis Galton, a cousin of Charles Darwin, examined the relationship between the height of children and their parents towards the end of the 19th century. It is from this study that the name "regression" originated. You decide to update his findings by collecting data from 110 college students, and estimate the following relationship:

$\widehat{\text{Studenth}}$ = 19.6 + 0.73 × Midparh, R² = 0.45, SER = 2.0
(7.2)(0.10)
where Studenth is the height of students in inches, and Midparh is the average of the parental heights. Values in parentheses are heteroskedasticity robust standard errors. (Following Galton's methodology, both variables were adjusted so that the average female height was equal to the average male height.)
(a)Test for the statistical significance of the slope coefficient.
(b)If children, on average, were expected to be of the same height as their parents, then this would imply two hypotheses, one for the slope and one for the intercept.
(i)What should the null hypothesis be for the intercept? Calculate the relevant t-statistic and carry out the hypothesis test at the 1% level.
(ii)What should the null hypothesis be for the slope? Calculate the relevant t-statistic and carry out the hypothesis test at the 5% level.
(c)Can you reject the null hypothesis that the regression R² is zero?
(d)Construct a 95% confidence interval for a one inch increase in the average of parental height.

Accepted Answer

The answer of (Continuation from Chapter 4)Sir Francis Galton, a...
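The t-statistics and the confidence interval that Question 35 asks for can be sketched as follows. The helper `t_stat` is hypothetical. The null values of 0 for the intercept and 1 for the slope in part (b) are an assumption about what "same height as their parents" implies, and 1.96 is the large-sample 95% critical value.

```python
def t_stat(estimate: float, se: float, hypothesized: float = 0.0) -> float:
    """t = (estimate - hypothesized value) / standard error."""
    return (estimate - hypothesized) / se


b0, se_b0 = 19.6, 7.2    # intercept and its robust standard error
b1, se_b1 = 0.73, 0.10   # slope on Midparh and its robust standard error

t_slope_zero = t_stat(b1, se_b1)            # (a) significance of the slope
t_intercept_zero = t_stat(b0, se_b0, 0.0)   # (b)(i) assumed null: intercept = 0
t_slope_one = t_stat(b1, se_b1, 1.0)        # (b)(ii) assumed null: slope = 1

# (d) 95% confidence interval for the effect of a one inch increase in Midparh
ci_95 = (b1 - 1.96 * se_b1, b1 + 1.96 * se_b1)

print(round(t_slope_zero, 2), round(t_intercept_zero, 2), round(t_slope_one, 2))
print(tuple(round(x, 3) for x in ci_95))
```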

Question 36

You have collected data for the 50 U.S. states and estimated the following relationship between the change in the unemployment rate from the previous year ($\widehat{\Delta ur}$) and the growth rate of the respective state real GDP (g_y). The results are as follows:
$\widehat{\Delta ur}$ = 2.81 - 0.23 × g_y, R² = 0.36, SER = 0.78
(0.12)(0.04)
Assuming that the estimator has a normal distribution, the 95% confidence interval for the slope is approximately the interval

A)[2.57, 3.05]
B)[-0.31,0.15]
C)[-0.31, -0.15]
D)[-0.33, -0.13]

Accepted Answer

The answer of You have collected data for the 50...

Question 37

You recall from one of your earlier lectures in macroeconomics that the per capita income depends on the savings rate of the country: those who save more end up with a higher standard of living. To test this theory, you collect data from the Penn World Tables on GDP per worker relative to the United States (RelProd) in 1990 and the average investment share of GDP from 1980-1990 (S_K), remembering that investment equals saving. The regression results in the following output:

$\widehat{\text{RelProd}}$ = -0.08 + 2.44×S_K, R²=0.46, SER = 0.21
(0.04)(0.38)
(a)Interpret the regression results carefully.
(b)Calculate the t-statistics to determine whether the two coefficients are significantly different from zero. Justify the use of a one-sided or two-sided test.
(c)You accidentally forget to use the heteroskedasticity-robust standard errors option in your regression package and estimate the equation using homoskedasticity-only standard errors. This changes the results as follows:

$\widehat{\text{RelProd}}$ = -0.08 + 2.44×S_K, R²=0.46, SER = 0.21
(0.04)(0.26)
You are delighted to find that the coefficients have not changed at all and that your results have become even more significant. Why haven't the coefficients changed? Are the results really more significant? Explain.
(d)Upon reflection you think about the advantages of OLS with and without homoskedasticity-only standard errors. What are these advantages? Is it likely that the error terms would be heteroskedastic in this situation?

Accepted Answer

The answer of You recall from one of your earlier...

Question 38

(Continuation from Chapter 4, number 3) You have obtained a sub-sample of 1744 individuals from the Current Population Survey (CPS) and are interested in the relationship between weekly earnings and age. The regression, using heteroskedasticity-robust standard errors, yielded the following result:

$\widehat{\text{Earn}}$ = 239.16 + 5.20×Age, R² = 0.05, SER = 287.21
(20.24)(0.57)
where Earn and Age are measured in dollars and years respectively.
(a)Is the relationship between Age and Earn statistically significant?
(b)The variance of the error term and the variance of the dependent variable are related. Given the distribution of earnings, do you think it is plausible that the distribution of errors is normal?
(c)Construct a 95% confidence interval for both the slope and the intercept.

Accepted Answer

The answer of (continuation from Chapter 4, number 3)You have...

Question 39

(Continuation of the Purchasing Power Parity question from Chapter 4) The news-magazine The Economist regularly publishes data on the so-called Big Mac index and exchange rates between countries. The data for 30 countries from the April 29, 2000 issue is listed below:
The concept of purchasing power parity or PPP ("the idea that similar foreign and domestic goods … should have the same price in terms of the same currency," Abel, A. and B. Bernanke, Macroeconomics, 4th edition, Boston: Addison Wesley, 476) suggests that the ratio of the Big Mac priced in the local currency to the U.S. dollar price should equal the exchange rate between the two countries. After entering the data into your spreadsheet program, you calculate the predicted exchange rate per U.S. dollar by dividing the price of a Big Mac in local currency by the U.S. price of a Big Mac ($2.51). To test for PPP, you regress the actual exchange rate on the predicted exchange rate. The estimated regression is as follows:
$\widehat{\text{ActualExRate}}$ = -27.05 + 1.35 × PredExRate, R² = 0.994, n = 29, SER = 122.15
(23.74)(0.02)
(a)Your spreadsheet program does not allow you to calculate heteroskedasticity-robust standard errors. Instead, the numbers in parentheses are homoskedasticity-only standard errors. State the two null hypotheses under which PPP holds. Should you use a one-tailed or two-tailed alternative hypothesis?
(b)Calculate the two t-statistics.
(c)Using a 5% significance level, what is your decision regarding the null hypothesis given the two t-statistics? What critical values did you use? Are you concerned with the fact that you are testing the two hypotheses sequentially when they are supposed to hold simultaneously?
(d)What assumptions had to be made for you to use Student's t-distribution?

Accepted Answer

The answer of (Continuation of the Purchasing Power Parity question...

Question 40

You have obtained measurements of height in inches of 29 female and 81 male students (Studenth) at your university. A regression of the height on a constant and a binary variable (BFemme), which takes a value of one for females and is zero otherwise, yields the following result:

$\widehat{\text{Studenth}}$ = 71.0 - 4.84×BFemme, R² = 0.40, SER = 2.0
(0.3)(0.57)
(a)What is the interpretation of the intercept? What is the interpretation of the slope? How tall are females, on average?
(b)Test the hypothesis that females, on average, are shorter than males, at the 1% level.
(c)Is it likely that the error term is homoskedastic here?

Accepted Answer

The answer of You have obtained measurements of height in...

Question 41

The effect of decreasing the student-teacher ratio by one is estimated to result in an improvement of the districtwide score by 2.28 with a standard error of 0.52. Construct a 90% and 99% confidence interval for the size of the slope coefficient and the corresponding predicted effect of changing the student-teacher ratio by one. What is the intuition on why the 99% confidence interval is wider than the 90% confidence interval?

Accepted Answer

The answer of The effect of decreasing the student-teacher ratio...
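The two intervals requested in Question 41 can be sketched with a small helper. The function name `confidence_interval` is hypothetical and the large-sample normal approximation is assumed. The slope is entered as -2.28, meaning an increase in the student-teacher ratio by one lowers the predicted score.

```python
from statistics import NormalDist


def confidence_interval(estimate: float, se: float, level: float) -> tuple[float, float]:
    """Two-sided normal-approximation confidence interval at the given level (e.g. 0.90)."""
    z = NormalDist().inv_cdf(0.5 + level / 2)   # critical value, about 1.645 for 90%
    return estimate - z * se, estimate + z * se


slope, se_slope = -2.28, 0.52
ci_90 = confidence_interval(slope, se_slope, 0.90)
ci_99 = confidence_interval(slope, se_slope, 0.99)

print(tuple(round(x, 2) for x in ci_90))
print(tuple(round(x, 2) for x in ci_99))
```

The 99% interval uses a larger critical value than the 90% interval, which is why it is wider: covering the true slope more often requires including more candidate values.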

Question 42

Using the California School data set from your textbook, you run the following regression:

$\widehat{\text{TestScore}}$ = 698.9 - 2.28 × STR
n = 420, SER = 9.4
where TestScore is the average test score in the district and STR is the student-teacher ratio. The sample standard deviation of test scores is 19.05, and the sample standard deviation of the student-teacher ratio is 1.89.
a. Find the regression R² and the correlation coefficient between test scores and the student-teacher ratio.
b. Find the homoskedasticity-only standard error of the slope.

Accepted Answer

The answer of Using the California School data set from...

Question 43

In many of the cases discussed in your textbook, you test for the significance of the slope at the 5% level. What is the size of the test? What is the power of the test? Why is the probability of committing a Type II error so large here?

Accepted Answer

The answer of In many of the cases discussed in...

Question 44

Using data from the Current Population Survey, you estimate the following relationship between average hourly earnings (ahe) and the number of years of education (educ):

$\widehat{\text{ahe}}$ = -4.58 + 1.71 × educ
The heteroskedasticity-robust standard error on the slope is (0.03). Calculate the 95% confidence interval for the slope. Repeat the exercise using the 90% and then the 99% confidence interval. Can you reject the null hypothesis that the slope coefficient is zero in the population?

Accepted Answer

The answer of Using data from the Current Population Survey,...

Question 45

Assume that the homoskedastic normal regression assumptions hold. Using the Student t-distribution, find the critical value for each of the following situations:
(a)n = 28, 5% significance level, one-sided test.
(b)n = 40, 1% significance level, two-sided test.
(c)n = 10, 10% significance level, one-sided test.
(d)n = ∞, 5% significance level, two-sided test.

Accepted Answer

The answer of Assume that the homoskedastic normal regression assumption...

Question 46

Using the California School data set from your textbook, you run the following regression:

$\widehat{\text{TestScore}}$ = 698.9 - 2.28 × STR
n = 420, SER = 9.4
where TestScore is the average test score in the district and STR is the student-teacher ratio. The sample standard deviation of test scores is 19.05, and the sample standard deviation of the student-teacher ratio is 1.89.
a. Find the regression R² and the correlation coefficient between test scores and the student-teacher ratio.
b. Find the homoskedasticity-only standard error of the slope.

Accepted Answer

The answer of Using the California School data set from...

Question 47

In a Monte Carlo study, econometricians generate multiple sample regression functions from a known population regression function. For example, the population regression function could be Y_i = β₀ + β₁X_i = 100 - 0.5 X_i. The Xs could be generated randomly or, for simplicity, be nonrandom ("fixed over repeated samples"). If we had ten of these Xs, say, and generated twenty Ys, we would obviously always have all observations on a straight line, and the least squares formulae would always return values of 100 and -0.5 numerically. However, if we added an error term, where the errors would be drawn randomly from a normal distribution, say, then the OLS formulae would give us estimates that differed from the population regression function values. Assume you did just that and recorded the values for the slope and the intercept. Then you did the same experiment again (each one of these is called a "replication"). And so forth. After 1,000 replications, you plot the 1,000 intercepts and slopes, and list their summary statistics.

Here are the corresponding graphs:

Using the means listed next to the graphs, you see that the averages are not exactly 100 and -0.5. However, they are "close." Test whether the differences between these averages and the population values are statistically significant.

Accepted Answer

The answer of In a Monte Carlo study, econometricians generate...
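
The experiment described in the question can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the study behind the deck's graphs: the error standard deviation (`SIGMA_U`), the ten fixed X values, and the random seed are all made up here, since the source does not give them. The t-statistic compares the average of the 1,000 estimates with the population value, using the standard error of a sample mean.

```python
import math
import random
import statistics

random.seed(0)                          # assumed seed, for reproducibility only
BETA0, BETA1 = 100.0, -0.5              # population regression function from the question
SIGMA_U = 5.0                           # assumed error standard deviation (not in the source)
xs = [float(x) for x in range(1, 11)]   # assumed fixed regressors, ten values
REPLICATIONS = 1000

def ols(x, y):
    """Return (intercept, slope) from the OLS formulae for a single regressor."""
    x_bar, y_bar = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

intercepts, slopes = [], []
for _ in range(REPLICATIONS):
    # One replication: draw new normal errors, re-estimate, record the estimates.
    ys = [BETA0 + BETA1 * xi + random.gauss(0.0, SIGMA_U) for xi in xs]
    b0, b1 = ols(xs, ys)
    intercepts.append(b0)
    slopes.append(b1)

def t_stat(draws, population_value):
    """t-statistic for H0: the mean of the draws equals the population value."""
    se_mean = statistics.stdev(draws) / math.sqrt(len(draws))
    return (statistics.fmean(draws) - population_value) / se_mean

t_intercept = t_stat(intercepts, BETA0)
t_slope = t_stat(slopes, BETA1)
print(f"mean intercept = {statistics.fmean(intercepts):.3f}, t = {t_intercept:.2f}")
print(f"mean slope     = {statistics.fmean(slopes):.4f}, t = {t_slope:.2f}")
```

Comparing each t-statistic with the 5% two-sided critical value of 1.96 gives the test the question asks for; with the means and standard deviations listed next to the graphs, the same `t_stat` arithmetic applies directly.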

Question 48

The neoclassical growth model predicts that for identical savings rates and population growth rates, countries should converge to the same per capita income level. This is referred to as the convergence hypothesis. One way to test for the presence of convergence is to compare the growth rates over time to the initial starting level, i.e., to run the regression

$\widehat{g6090} = \hat{\beta}_0 + \hat{\beta}_1 \times \text{RelProd}_{60}$,

where g6090 is the average annual growth rate of GDP per worker for the 1960-1990 sample period, and RelProd₆₀ is GDP per worker relative to the United States in 1960. Under the null hypothesis of no convergence, β₁ = 0; H₁: β₁ < 0, implying ("beta") convergence. Using a standard regression package, you get the following output:
Dependent Variable: G6090
Method: Least Squares
Date: 07/11/06 Time: 05:46
Sample: 1 104
Included observations: 104
White Heteroskedasticity-Consistent Standard Errors & Covariance

$$\begin{array}{c c c c l}
\text{Variable} & \text{Coefficient} & \text{Std. Error} & \text{t-Statistic} & \text{Prob.} \\ \hline
\text{C} & 0.018989 & 0.002392 & 7.939864 & 0.0000 \\
\text{YL60} & -0.000566 & 0.005056 & -0.111948 & 0.9111 \\ \hline
\end{array}$$

$$\begin{array}{l l l l}
\text{R-squared} & 0.000068 & \text{Mean dependent var} & 0.018846 \\
\text{Adjusted R-squared} & -0.009735 & \text{S.D. dependent var} & 0.015915 \\
\text{S.E. of regression} & 0.015992 & \text{Akaike info criterion} & -5.414418 \\
\text{Sum squared resid} & 0.026086 & \text{Schwarz criterion} & -5.363565 \\
\text{Log likelihood} & 283.5498 & \text{F-statistic} & 0.006986 \\
\text{Durbin-Watson stat} & 1.367534 & \text{Prob(F-statistic)} & 0.933550
\end{array}$$

You are delighted to see that this program has already calculated p-values for you. However, a peer of yours points out that the correct p-value should be 0.4562. Who is right?

Accepted Answer

The answer of The neoclassical growth model predicts that for...
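
The two candidate p-values can be compared with a short calculation. This is a hedged sketch that assumes the large-sample standard normal approximation for the t-statistic; the regression package may use a slightly different reference distribution, and the variable names are illustrative.

```python
from statistics import NormalDist

t_act = -0.111948           # t-statistic on YL60 from the regression output
reported_p = 0.9111         # "Prob." column reported by the package

std_normal = NormalDist()   # large-sample (standard normal) approximation

# Two-sided p-value: probability of a |t| at least this large in either tail.
p_two_sided = 2 * std_normal.cdf(-abs(t_act))

# Left-tail p-value for H1: beta_1 < 0, i.e. Pr(Z < t_act).
p_left_tail = std_normal.cdf(t_act)

print(f"two-sided p = {p_two_sided:.4f}, left-tail p = {p_left_tail:.4f}")
```

Under these assumptions the left-tail value comes out near 0.455, in line with the peer's 0.4562 (the standard normal table value when t is rounded to -0.11), while the package's 0.9111 corresponds to the two-sided figure.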

Question 49

Your textbook discussed the regression model when X is a binary variable
Y_i = β₀ + β₁D_i + u_i, i = 1, ..., n
Let Y represent wages, and let D be one for females, and 0 for males. Using the OLS formula for the slope coefficient, prove that $\hat{\beta}_1$ is the difference between the average wage for females and the average wage for males.

Accepted Answer

The answer of Your textbook discussed the regression model when...
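
The question asks for a proof; the following is only a numerical sanity check of the claim, not a substitute for it. The wage figures are hypothetical and made up for illustration. The check applies the single-regressor OLS formulae to a binary D and compares the results with the group means.

```python
from statistics import fmean

# Hypothetical wages, made up for illustration only; D = 1 for females, 0 for males.
wages = [20.0, 24.0, 22.0, 18.0, 17.0, 19.0, 21.0]
d     = [0,    0,    0,    1,    1,    1,    1]

y_bar, d_bar = fmean(wages), fmean(d)

# OLS slope and intercept for a single regressor.
slope = (sum((di - d_bar) * (yi - y_bar) for di, yi in zip(d, wages))
         / sum((di - d_bar) ** 2 for di in d))
intercept = y_bar - slope * d_bar

mean_female = fmean(y for y, di in zip(wages, d) if di == 1)
mean_male = fmean(y for y, di in zip(wages, d) if di == 0)

print(f"slope = {slope:.4f}, female mean - male mean = {mean_female - mean_male:.4f}")
print(f"intercept = {intercept:.4f}, male mean = {mean_male:.4f}")
```

In this made-up sample the slope matches the female-minus-male difference in average wages and the intercept matches the male average, which is the pattern the proof establishes in general.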

Question 50

(Requires Appendix material and Calculus) Equation (5.36) in your textbook derives the conditional variance for any linear conditionally unbiased estimator $\tilde{\beta}_1$ to be var($\tilde{\beta}_1$ | X₁, ..., X_n) = $\sigma_u^2 \sum_{i=1}^{n} a_i^2$, where the conditions for conditional unbiasedness are $\sum_{i=1}^{n} a_i = 0$ and $\sum_{i=1}^{n} a_i X_i = 1$. As an alternative to the BLUE proof presented in your textbook, you recall from one of your calculus courses that you could minimize the variance subject to the two constraints, thereby making the variance as small as possible while the constraints are holding. Show that in doing so you get the OLS weights $\hat{a}_i$. (You may assume that X₁, ..., X_n are nonrandom (fixed over repeated samples).)

Accepted Answer

The answer of (Requires Appendix material and Calculus)Equation (5.36)in your...
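
One way to set up the constrained minimization is sketched below. This is a hedged outline, not the deck's accepted answer: the multipliers λ₁ and λ₂ are notation introduced here, and the constant factor σ_u² is dropped because it does not affect the minimizer. The fourth line uses the identity Σ(X_i - X̄)X_i = Σ(X_i - X̄)².

```latex
\begin{aligned}
\mathcal{L} &= \sum_{i=1}^{n} a_i^2 - \lambda_1 \sum_{i=1}^{n} a_i
  - \lambda_2\left(\sum_{i=1}^{n} a_i X_i - 1\right) \\
\frac{\partial \mathcal{L}}{\partial a_i} &= 2a_i - \lambda_1 - \lambda_2 X_i = 0
  \;\Rightarrow\; a_i = \tfrac{1}{2}\left(\lambda_1 + \lambda_2 X_i\right) \\
\sum_{i=1}^{n} a_i = 0 &\;\Rightarrow\; \lambda_1 = -\lambda_2 \bar{X}
  \;\Rightarrow\; a_i = \tfrac{\lambda_2}{2}\left(X_i - \bar{X}\right) \\
\sum_{i=1}^{n} a_i X_i = 1 &\;\Rightarrow\;
  \tfrac{\lambda_2}{2}\sum_{i=1}^{n}\left(X_i - \bar{X}\right)^2 = 1 \\
a_i &= \frac{X_i - \bar{X}}{\sum_{j=1}^{n}\left(X_j - \bar{X}\right)^2} = \hat{a}_i
\end{aligned}
```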

Question 51

Consider the following two models involving binary variables as explanatory variables:

$\widehat{\text{Wage}} = \hat{\beta}_0 + \hat{\beta}_1 \text{DFemme}$ and $\widehat{\text{Wage}} = \hat{\phi}_1 \text{DFemme} + \hat{\phi}_2 \text{Male}$

where Wage is the hourly wage rate, DFemme is a binary variable that is equal to 1 if the person is a female, and 0 if the person is a male. Male = 1 - DFemme. Even though you have not learned about regression functions with two explanatory variables (or regressions without an intercept), assume that you had estimated both models, i.e., you obtained the estimates for the regression coefficients.
What is the predicted wage for a male in the two models? What is the predicted wage for a female in the two models? What is the relationship between the βs and the φs? Why would you prefer one model over the other?

Accepted Answer

The answer of Consider the following two models involving binary...

Question 52

Assume that your population regression function is
Y_i = β₁X_i + u_i
i.e., a regression through the origin (no intercept). Under the homoskedastic normal regression assumptions, the t-statistic will have a Student t distribution with n-1 degrees of freedom, not n-2 degrees of freedom, as was the case in Chapter 5 of your textbook. Explain. Do you think that the residuals will still sum to zero for this case?

Accepted Answer

The answer of Assume that your population regression function is Y_i...
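
The second part of the question can be explored numerically. The sketch below uses hypothetical data made up for illustration, and relies on the standard result that the OLS slope for a regression through the origin is ΣX_iY_i / ΣX_i². It is a check on one sample, not a general argument.

```python
# Hypothetical data, made up for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.5, 3.9, 6.2, 8.1, 9.6]

# OLS slope for a regression through the origin: sum(X*Y) / sum(X^2).
slope = sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)
residuals = [y - slope * x for x, y in zip(xs, ys)]

# The single first-order condition forces sum(X_i * u_hat_i) = 0 ...
sum_x_resid = sum(x * u for x, u in zip(xs, residuals))
# ... but nothing forces sum(u_hat_i) = 0, because there is no intercept.
sum_resid = sum(residuals)

print(f"slope = {slope:.4f}")
print(f"sum of X_i * residual_i = {sum_x_resid:.2e}")
print(f"sum of residuals        = {sum_resid:.4f}")
```

Only one restriction is imposed on the residuals in this setup, which is consistent with losing one degree of freedom rather than two.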

Question 53

(Requires Appendix material) Your textbook shows that OLS is a linear estimator $\hat{\beta}_1 = \sum_{i=1}^{n} \hat{a}_i Y_i$, where $$\hat{a}_i = \frac{X_i - \bar{X}}{\sum_{i=1}^{n} \left( X_i - \bar{X} \right)^2}$$ For OLS to be conditionally unbiased, the following two conditions must hold: $\sum_{i=1}^{n} \hat{a}_i = 0$ and $\sum_{i=1}^{n} \hat{a}_i X_i = 1$. Show that this is the case.

Accepted Answer

The answer of (Requires Appendix material)Your textbook shows that OLS...
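
Before working through the algebra, the two conditions can be checked on a small example. The regressor values below are hypothetical and made up for illustration; the check does not replace the general proof the question asks for.

```python
from statistics import fmean

# Hypothetical regressor values, made up for illustration only.
xs = [2.0, 3.5, 5.0, 7.5, 11.0]

x_bar = fmean(xs)
sum_sq_dev = sum((x - x_bar) ** 2 for x in xs)

# OLS weights: a_hat_i = (X_i - X_bar) / sum_j (X_j - X_bar)^2.
weights = [(x - x_bar) / sum_sq_dev for x in xs]

sum_weights = sum(weights)                                # first condition: should be 0
sum_weights_x = sum(a * x for a, x in zip(weights, xs))   # second condition: should be 1

print(f"sum a_i = {sum_weights:.2e}, sum a_i X_i = {sum_weights_x:.6f}")
```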

Question 54

Let $u_i$ be distributed N(0, $\sigma_u^2$), i.e., the errors are distributed normally with a constant variance (homoskedasticity). This results in $\hat{\beta}_1$ being distributed N(β₁, $\sigma_{\hat{\beta}_1}^2$), where $\sigma_{\hat{\beta}_1}^2 = \frac{\sigma_u^2}{\sum_{i=1}^{n} \left( X_i - \bar{X} \right)^2}$. Statistical inference would be straightforward if $\sigma_u^2$ were known. One way to deal with this problem is to replace $\sigma_u^2$ with an estimator $S_{\hat{u}}^2$. Clearly, since this introduces more uncertainty, you cannot expect the standardized $\hat{\beta}_1$ to still be normally distributed. Indeed, the t-statistic now follows Student's t distribution. Look at the table for the Student t-distribution and focus on the 5% two-sided significance level. List the critical values for 10 degrees of freedom, 30 degrees of freedom, 60 degrees of freedom, and finally ∞ degrees of freedom. Describe how the notion of uncertainty about $\sigma_u^2$ is reflected in the tails of the t-distribution as the degrees of freedom increase.

Accepted Answer

The answer of Let $u _ { i }$ be...

Question 55

Changing the units of measurement obviously will have an effect on the slope of your regression function. For example, let Y^*= aY and X^* = bX. Then it is easy but tedious to show that $$\hat { \beta } _ { 1 } ^ {* } = \frac { \sum _ { i = 1 } ^ { n } x _ { i } ^ { * } y _ { i } ^ { * } } { \sum _ { i = 1 } ^ { n } x _ { i } ^ { * 2 } } = \frac { a } { b } \hat { \beta } _ { 1 }$$ Given this result, how do you think the standard errors and the regression R² will change?

Accepted Answer

The answer of Changing the units of measurement obviously will...
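
The effect of rescaling can be checked numerically. The sketch below uses hypothetical data and scale factors made up for illustration, and computes the homoskedasticity-only standard error of the slope; `simple_ols` is a helper defined here, not a library function.

```python
import math
from statistics import fmean

def simple_ols(x, y):
    """Return slope, homoskedasticity-only SE of the slope, and R^2."""
    n = len(x)
    x_bar, y_bar = fmean(x), fmean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / sxx
    intercept = y_bar - slope * x_bar
    ssr = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    tss = sum((yi - y_bar) ** 2 for yi in y)
    se_slope = math.sqrt(ssr / (n - 2) / sxx)
    return slope, se_slope, 1 - ssr / tss

# Hypothetical data and scale factors, made up for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [3.1, 4.9, 7.2, 8.8, 11.1, 12.7]
a, b = 100.0, 0.5

slope, se, r2 = simple_ols(xs, ys)
slope_s, se_s, r2_s = simple_ols([b * x for x in xs], [a * y for y in ys])

print(f"slope ratio = {slope_s / slope:.4f}, SE ratio = {se_s / se:.4f}, a/b = {a / b:.4f}")
print(f"R^2 original = {r2:.6f}, R^2 rescaled = {r2_s:.6f}")
```

With positive a and b, the slope and its standard error both scale by a/b in this example, so the t-statistic is unchanged, and the R² is unaffected by the change of units.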

Question 56

Consider the sample regression function $\hat{Y}_i = \hat{\beta}_0 + \hat{\beta}_1 X_i$. The table below lists estimates for the slope ($\hat{\beta}_1$) and the variance of the slope estimator ($\hat{\sigma}^2_{\hat{\beta}_1}$). In each case calculate the p-value for the null hypothesis of β₁ = 0 and a two-tailed alternative hypothesis. Indicate in which case you would reject the null hypothesis at the 5% significance level.

$$\begin{array}{|c|c|c|c|c|} \hline \hat{\beta}_1 & -1.76 & 0.0025 & 2.85 & -0.00014 \\ \hline \hat{\sigma}^2_{\hat{\beta}_1} & 0.37 & 0.000003 & 117.5 & 0.0000013 \\ \hline \end{array}$$

Accepted Answer

The answer of Consider the sample regression function \(\hat {...
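
The calculation for each column of the table can be sketched as follows. This is a hedged illustration that assumes the large-sample standard normal approximation for the t-statistic; the variable names are illustrative.

```python
import math
from statistics import NormalDist

# Slope estimates and estimated variances of the slope estimator, from the table.
slopes = [-1.76, 0.0025, 2.85, -0.00014]
variances = [0.37, 0.000003, 117.5, 0.0000013]

std_normal = NormalDist()  # large-sample (standard normal) approximation
results = []
for beta_hat, var_hat in zip(slopes, variances):
    t_stat = beta_hat / math.sqrt(var_hat)             # t = (beta_hat - 0) / SE(beta_hat)
    p_value = 2 * std_normal.cdf(-abs(t_stat))         # two-tailed p-value
    results.append((t_stat, p_value, p_value < 0.05))  # reject H0 at the 5% level?

for t_stat, p_value, reject in results:
    print(f"t = {t_stat:8.4f}  p = {p_value:.4f}  reject at 5%: {reject}")
```

Note that the table reports variances, so the square root is needed to obtain the standard error before forming the t-statistic.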

Question 57

Below you are asked to decide whether to use a one-sided or a two-sided alternative hypothesis for the slope coefficient. Briefly justify your decision.
(a) $\hat{q}_i^d = \hat{\beta}_0 + \hat{\beta}_1 p_i$, where q^d is the quantity demanded for a good, and p is its price.
(b) $\hat{p}_i^{actual} = \hat{\beta}_0 + \hat{\beta}_1 p_i^{assess}$, where $p_i^{actual}$ is the actual house price, and $p_i^{assess}$ is the assessed house price. You want to test whether or not the assessment is correct, on average.
(c) $\hat{C}_i = \hat{\beta}_0 + \hat{\beta}_1 Y_i^d$, where C is household consumption, and Y^d is personal disposable income.

Accepted Answer

The answer of Below you are asked to decide on...

Question 58

Your textbook discussed the regression model when X is a binary variable
Y_i = β₀ + β₁D_i + u_i, i = 1, ..., n
Let Y represent wages, and let D be one for females, and 0 for males. Using the OLS formula for the intercept coefficient, prove that $\hat{\beta}_0$ is the average wage for males.

Accepted Answer

The answer of Your textbook discussed the regression model when...

Question 59

Your textbook states that under certain restrictive conditions, the t-statistic has a Student t-distribution with n-2 degrees of freedom. The loss of two degrees of freedom is the result of OLS forcing two restrictions onto the data. What are these two restrictions, and when did you impose them onto the data set in your derivation of the OLS estimator?

Accepted Answer

The answer of Your textbook states that under certain restrictive...

Deck 5: Regression With a Single Regressor: Hypothesis Tests and Confidence Intervals