Question 1

With Panel Data, regression software typically uses an "entity-demeaned" algorithm because

Accepted Answer

A) the OLS formula for the slope in the linear regression model contains deviations from means already.
B) there are typically too many time periods for the regression package to handle.
C) the number of estimates to calculate can become extremely large when there are a large number of entities.
D) deviations from means sum up to zero.

Answer: C
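The point behind answer C can be illustrated with a minimal sketch. The simulated panel below (50 entities, 4 periods, a true slope of 2.0) is entirely hypothetical and not taken from the question; it only shows that estimating one binary variable per entity and running OLS on entity-demeaned data give the same slope, while the demeaned route estimates a single coefficient.

```python
import numpy as np

rng = np.random.default_rng(0)
n, T = 50, 4                                  # hypothetical panel dimensions
entity = np.repeat(np.arange(n), T)           # entity index for each observation
alpha = rng.normal(size=n)                    # simulated entity fixed effects
x = rng.normal(size=n * T) + alpha[entity]    # regressor correlated with the effects
y = 2.0 * x + alpha[entity] + rng.normal(scale=0.1, size=n * T)

# Route 1: one binary variable per entity, so n + 1 coefficients are estimated.
D = (entity[:, None] == np.arange(n)[None, :]).astype(float)
X_dummies = np.column_stack([x, D])
beta_dummies = np.linalg.lstsq(X_dummies, y, rcond=None)[0][0]

# Route 2: subtract entity means, then run a one-regressor OLS without intercept.
def demean(v):
    means = np.bincount(entity, weights=v) / np.bincount(entity)
    return v - means[entity]

x_d, y_d = demean(x), demean(y)
beta_demeaned = (x_d @ y_d) / (x_d @ x_d)

print(beta_dummies, beta_demeaned)            # the two slope estimates coincide
```

With thousands of entities the first route would require inverting a very large matrix, which is why the demeaned form is attractive in practice.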

Question 2

The notation for panel data is (Xit, Yit), i = 1, ..., n and t = 1, ..., T because

Accepted Answer

A) we take into account that the entities included in the panel change over time and are replaced by others.
B) the X's represent the observed effects and the Y the omitted fixed effects.
C) there are n entities and T time periods.
D) n has to be larger than T for the OLS estimator to exist.

Answer: C

Question 3

A researcher investigating the determinants of crime in the United Kingdom has data for 42 police regions over 22 years. She estimates by OLS the following regression
$\ln(cmrt)_{it} = \alpha_i + \phi_t + \beta_1 unrtm_{it} + \beta_2 proyth_{it} + \beta_3 \ln(pp)_{it} + u_{it}$; i = 1, ..., 42, t = 1, ..., 22
where cmrt is the crime rate per head of population, unrtm is the unemployment rate of males, proyth is the proportion of youths, pp is the probability of punishment measured as (number of convictions)/(number of crimes reported). α and φ are area and year fixed effects, where α_i equals one for area i and is zero otherwise for all i, and φ_t is one in year t and zero for all other years for t = 2, …, 22. φ_1 is not included.
(a) What is the purpose of excluding φ_1? What are the terms α and φ likely to pick up? Discuss the advantages of using panel data for this type of investigation.
(b) Estimation by OLS using heteroskedasticity and autocorrelation-consistent standard errors results in the following output, where the coefficients of the fixed effects are not reported: $\widehat{\ln(cmrt)_{it}} = 0.063 \times unrtm_{it} + 3.739 \times proyth_{it} - 0.588 \times \ln(pp)_{it}; R^{2} = 0.904$
Standard errors, in the same order as the coefficients: (0.109), (0.179), (0.024).
Comment on the results. In particular, what is the effect of a ten percent increase in the probability of punishment?
(c) To test for the relevance of the area fixed effects, you restrict the regression by dropping all entity fixed effects and adding a single constant. The relevant F-statistic is 135.28. What are the degrees of freedom? What is the critical value from your F table?
(d) Although the test rejects the hypothesis of eliminating the fixed effects from the regression, you want to analyze what happens to the coefficients and their standard errors when the equation is re-estimated without fixed effects. In the resulting regression, $\hat\beta _ { 2 }$ and $\hat \beta _ { 3 }$ do not change by much, although their standard errors roughly double. However, $\hat { \beta } _ { 1 }$ is now 1.340 with a standard error of 0.234. Why do you think that is?

Accepted Answer

(a) The regression contains a full set of 42 area binary variables and no separate constant. The area binary variables sum to one for every observation, and so would a full set of 22 year binary variables, so defining φ_t for t = 1, …, 22 would result in perfect multicollinearity. Excluding φ_1 avoids this. α picks up omitted variables that are specific to police regions and do not vary over time. φ picks up effects that are common to all police regions in a given year. Attitudes toward crime may vary between rural regions and metropolitan areas. These would be hard to capture through measurable variables. Common macroeconomic shocks that affect all regions equally will be captured by the time fixed effects. Although some of these variables could be explicitly introduced, the list of possible variables is long. By introducing time fixed effects, these effects are captured by a single set of binary variables.
(b) A higher male unemployment rate and a higher proportion of youths increase the crime rate, while a higher probability of punishment decreases the crime rate. The coefficients on the probability of punishment and the proportion of youths are statistically significant, while the coefficient on the male unemployment rate is not. The regression explains roughly 90 percent of the variation in crime rates in the sample. A ten percent increase in the number of convictions over the number of crimes reported decreases the crime rate by roughly six percent.
(c) The coefficients of the three regressors other than the entity coefficients would have been unaffected, had there been a constant in the regression and (n - 1) police region specific entity variables. In this case, the entity coefficients on the police regions would have indicated deviations from the constant for the first police region. Hence there are 41 restrictions imposed by eliminating the entity fixed effects and adding a constant. Since there are far more than 100 observations (42 × 22 = 924, which leaves 858 degrees of freedom in the unrestricted regression), the critical value for F_{41,∞} ≈ F_{30,∞} = 1.70 at the 1% level. Since 135.28 far exceeds this value, the restrictions are rejected.
(d) This result would make the male unemployment rate coefficient significant. It suggests that male unemployment rates change slowly over the years in a given police district and that this effect is picked up by the entity fixed effects. Of course, there are other slowly changing variables, such as attitudes towards crime, that are captured by these fixed effects.
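The arithmetic behind parts (b) and (c) can be checked with a short sketch. The numbers come from the reported regression output; the variable names, the 1.96 threshold used for the comparison, and the exact (non-approximate) percentage calculation are illustrative additions rather than part of the accepted answer.

```python
# Reported coefficients and standard errors from part (b).
coef = {"unrtm": 0.063, "proyth": 3.739, "ln_pp": -0.588}
se = {"unrtm": 0.109, "proyth": 0.179, "ln_pp": 0.024}

# t-statistics: compare in absolute value with 1.96 (5% level, large sample).
t_stats = {name: coef[name] / se[name] for name in coef}

# Log-log elasticity: approximate and exact effect of a 10% rise in pp, in percent.
approx_pct = coef["ln_pp"] * 10
exact_pct = (1.10 ** coef["ln_pp"] - 1) * 100

# Part (c): restrictions and degrees of freedom of the unrestricted regression.
n_regions, n_years, n_slopes = 42, 22, 3
q = n_regions - 1                                    # entity effects replaced by one constant
n_obs = n_regions * n_years
k_unrestricted = n_regions + (n_years - 1) + n_slopes
df_unrestricted = n_obs - k_unrestricted

print(t_stats)
print(approx_pct, exact_pct)
print(q, n_obs, df_unrestricted)
```

Only the male unemployment coefficient has a t-statistic below 1.96 in absolute value, which matches the discussion in (b).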

Question 4

Consider estimating the effect of the beer tax on the fatality rate, using time and state fixed effects for the Northeast Region of the United States (Maine, Vermont, New Hampshire, Massachusetts, Connecticut and Rhode Island) for the period 1991-2001. If Beer Tax was the only explanatory variable, how many coefficients would you need to estimate, excluding the constant?

Accepted Answer

A) 18 
B) 17 
C) 7 
D) 11 

Question 5

Consider the following panel data regression with a single explanatory variable
Yit = β0 + β1Xit + uit.
In each of the examples below, you will be adding entity and time fixed effects. Indicate the total number of coefficients that need to be estimated.
(a) The effect of beer taxes on the fatality rate, annual data, 1982-1988, nine U.S. regions (New England, Pacific, Mid-Atlantic, East North Central, etc.).
(b) The effect of the minimum wage on teenage employment, annual data, 1963-2000, five Canadian Regions (Atlantic Provinces, Quebec, Ontario, Prairies, British Columbia).
(c) The effect of savings rates on per capita income, data for three decades (1960-1969, 1970-1979, 1980-1989; one observation per decade), 104 countries of the world.
(d) The effect of pitching quality in baseball (as measured by the Team ERA) on the winning percentage, annual data, 1998-1999 season, 1999-2000 season, 30 teams.

Accepted Answer

(a)16 coefficients (6 time fixed effects
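The counting rule can be sketched as a small helper. It assumes the convention implied by the 16 stated for part (a): one intercept, one slope, n - 1 entity binary variables and T - 1 time binary variables. The function name `n_coefficients` and the dictionary layout are hypothetical, and the results for (b) to (d) simply apply that same convention to the sample sizes given in the question.

```python
def n_coefficients(n_entities: int, n_periods: int, n_regressors: int = 1) -> int:
    """Intercept + slopes + (n - 1) entity dummies + (T - 1) time dummies."""
    return 1 + n_regressors + (n_entities - 1) + (n_periods - 1)

# (entities, time periods) for each part of the question.
cases = {
    "a": (9, 1988 - 1982 + 1),    # nine U.S. regions, annual data 1982-1988
    "b": (5, 2000 - 1963 + 1),    # five Canadian regions, annual data 1963-2000
    "c": (104, 3),                # 104 countries, three decades
    "d": (30, 2),                 # 30 teams, two seasons
}

counts = {part: n_coefficients(n, T) for part, (n, T) in cases.items()}
print(counts)
```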

Question 6

A study, published in 1993, used U.S. state panel data to investigate the relationship between minimum wages and employment of teenagers. The sample period was 1977 to 1989 for all 50 states. The author estimated a model of the following type: $$\ln \left( E _ { i t } \right) = \beta _ { 0 } + \beta _ { 1 } \ln \left( M _ { i t } / W _ { i t } \right) + \gamma _ { 2 } D 2 _ { i } + \ldots + \gamma _ { n } D 50 _ { i } + \delta _ { 2 } B 2 _ { t } + \ldots + \delta _ { \mathrm { T } } B 13 _ { t } + u _ { i t }$$ where E is the employment to population ratio of teenagers, M is the nominal minimum wage, and W is average hourly earnings in manufacturing. In addition, other explanatory variables, such as the adult unemployment rate, the teenage population share, and the teenage enrollment rate in school, were included.
(a) Name some of the factors that might be picked up by time and state fixed effects.
(b) The author decided to use eight regional dummy variables instead of the 49 state dummy variables. What is the implicit assumption made by the author? Could you test for its validity? How?
(c) The results, using time and region fixed effects only, were as follows: $\widehat{\ln E _ { i t }} = -0.182 \times \ln(M_{it}/W_{it}) + \ldots$; $R^2 = 0.727$
(standard error of the slope coefficient: 0.036)
Interpret the result briefly.
(d) State minimum wages do not exceed federal minimum wages often. As a result, the author decided to choose the federal minimum wage in his specification above. How does this change your interpretation? How is the original equation $$\ln \left( E _ { i t } \right) = \beta _ { 0 } + \beta _ { 1 } \ln \left( M _ { i t } / W _ { i t } \right) + \gamma _ { 2 } D 2 _ { i } + \ldots + \gamma _ { n } D 8 _ { i } + \delta _ { 2 } B 2 _ { t } + \ldots + \delta _ { \mathrm { T } } \mathrm { B } 13 _ { t } + u _ { i t }$$ affected by this?

Accepted Answer

(a)Time effects will pick up the effect

Question 7

A pattern in the coefficients of the time fixed effects binary variables may reveal the following in a study of the determinants of state unemployment rates using panel data:

Accepted Answer

A) macroeconomic effects, which affect all states equally in a given year. 
B) attitude differences towards unemployment between states. 
C) there is no economic information that can be retrieved from these coefficients. 
D) regional effects, which affect all states equally, as long as they are a member of that region. 

Question 8

Assume that for the T = 2 time periods case, you have estimated a simple regression in changes model and found a statistically significant positive intercept. This implies

Accepted Answer

A) a negative mean change in the LHS variable in the absence of a change in the RHS variable since you subtract the earlier period from the later period
B) that the panel estimation approach is flawed since differencing the data eliminates the constant (intercept) in a regression
C) a positive mean change in the LHS variable in the absence of a change in the RHS variable
D) that the RHS variable changed between the two subperiods
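What the intercept of a regression in changes measures can be illustrated with a minimal simulation. Everything below is hypothetical (200 entities, a common shift of 0.5 between the two periods, a slope of -1.0). The sketch compares the changes regression with a regression in levels that includes entity binary variables and a binary variable for the second period.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 200                                        # hypothetical number of entities
alpha = rng.normal(size=n)                     # entity fixed effects
x1 = rng.normal(size=n) + alpha                # regressor, period 1
x2 = rng.normal(size=n) + alpha                # regressor, period 2
delta, beta = 0.5, -1.0                        # assumed time effect and slope
y1 = alpha + beta * x1 + rng.normal(scale=0.1, size=n)
y2 = alpha + delta + beta * x2 + rng.normal(scale=0.1, size=n)

# Regression in changes: (y2 - y1) on a constant and (x2 - x1).
dX = np.column_stack([np.ones(n), x2 - x1])
intercept_fd, slope_fd = np.linalg.lstsq(dX, y2 - y1, rcond=None)[0]

# Levels regression with entity binary variables and a period-2 binary variable.
entity = np.tile(np.arange(n), 2)
D = (entity[:, None] == np.arange(n)[None, :]).astype(float)
period2 = np.concatenate([np.zeros(n), np.ones(n)])
X = np.column_stack([np.concatenate([x1, x2]), period2, D])
slope_fe, delta_fe = np.linalg.lstsq(X, np.concatenate([y1, y2]), rcond=None)[0][:2]

print(intercept_fd, delta_fe)                  # intercept in changes = period-2 effect
print(slope_fd, slope_fe)                      # identical slopes
```

In this simulated setting the intercept of the changes regression is the mean change in the dependent variable when the regressor does not change, and it matches the coefficient on the period-2 binary variable in the levels regression.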

Question 9

You learned in intermediate macroeconomics that certain macroeconomic growth models predict conditional convergence or a catch up effect in per capita GDP between the countries of the world. That is, countries which are further behind initially in per-capita GDP will grow faster than the leader. You gather data from the Penn World Tables to test this theory.
(a) By limiting your sample to 24 OECD countries, you hope to have a more homogeneous set of countries in your sample, i.e., countries that are not too different with respect to their institutions. To simplify matters, you decide to only test for unconditional convergence. In that case, the laggards catch up even without taking into account differences in some of the driving variables. Your scatter plot and regression for the time period 1975-1989 are as follows: $\widehat { g8975 }$ = 0.024 - 0.005 PCGDP75_US; R² = 0.025, SER = 0.006
(0.06) (0.008)
where $\widehat { g8975 }$ is the average annual growth rate of per capita GDP from 1975-1989, and PCGDP75_US is per capita GDP relative to the United States in 1975. Numbers in parentheses are heteroskedasticity-robust standard errors.
Interpret the results. Is there indication of unconditional convergence? What critical value did you use?
(b) Although you are quite discouraged by the result, you think that it might be due to the specific time period used. During this period, there were two OPEC oil price shocks with varying degrees of exposure for the OECD countries. You therefore repeat the exercise for the period 1960-1974, with the following results: $\widehat { g7460 }$ = 0.061 - 0.043 PCGDP60_US; R² = 0.613, SER = 0.008
(0.004) (0.007)
where $\widehat { g7460 }$ is the average annual growth rate of per capita GDP from 1960-1974, and PCGDP60_US is per capita GDP relative to the United States in 1960.
Compare this regression to the previous one.
(c) You decide to run one more regression in differences. The dependent variable is now the change in the growth rate of per capita GDP from 1960-1974 to 1975-1989 (diffg) and the regressor the difference in the initial conditions (diffinit). This produces the following graph and regression: $\widehat { diffg}$ = -0.006 - 0.096 × diffinit; R² = 0.468; SER = 0.009
(0.03) (0.021)
Interpret these results. Explain what has happened to unobservable omitted variables that are constant over time. Suggest what some of these variables might be.
(d) Given that there are only two time periods, what other methods could you have employed to generate the identical results? Why do you think that the slope coefficient in this regression is significant given the results over the sub-periods?

Accepted Answer

(a)Although the slope coefficient is neg

Question 10

Consider the case of time fixed effects only, i.e.,
Yit = β0 + β1Xit + β3St + uit,
First replace β0 + β3St with φt. Next show the relationship between the φt and δt in the following equation
Yit = β0 + β1Xit + δ2B2t + ... + δTBTt + uit,
where each of the binary variables B2, …, BT indicates a different time period. Explain in words why the two equations are the same. Finally show why there is perfect multicollinearity if you add another binary variable B1. What is the intuition behind the fact that the OLS estimator does not exist in this case? Would that also be the case if you dropped the intercept?

Accepted Answer

Y_it = β₁X_it + φ_t + u_it. The relationship is φ₁ = β

Question 11

Time Fixed Effects regressions are useful in dealing with omitted variables

Accepted Answer

A) even if you only have a cross-section of data available. 
B) if these omitted variables are constant across entities but vary over time. 
C) when there are more than 100 observations. 
D) if these omitted variables are constant across entities but not over time. 

Question 12

The Fixed Effects regression model

Accepted Answer

A) has n different intercepts.
B) the slope coefficients are allowed to differ across entities, but the intercept is "fixed" (remains unchanged).
C) has "fixed" (repaired) the effect of heteroskedasticity.
D) in a log-log model may include logs of the binary variables, which control for the fixed effects.

Question 13

(Requires Matrix Algebra) Consider the time and entity fixed effect model with a single explanatory variable
$Y_{it} = \beta_0 + \beta_1 X_{it} + \gamma_2 D2_i + \ldots + \gamma_n Dn_i + \delta_2 B2_t + \ldots + \delta_T BT_t + u_{it}$,
For the case of n = 4 and T = 3, write this model in the form $Y = X\beta + U$, where, in general,
$$Y = \left( \begin{array} { c } Y _ { 1 } \\ Y _ { 2 } \\ \vdots \\ Y _ { n } \end{array} \right), \quad U = \left( \begin{array} { c } u _ { 1 } \\ u _ { 2 } \\ \vdots \\ u _ { n } \end{array} \right), \quad X = \left( \begin{array} { c c c c } 1 & X _ { 11 } & \cdots & X _ { k 1 } \\ 1 & X _ { 12 } & \cdots & X _ { k 2 } \\ \vdots & \vdots & & \vdots \\ 1 & X _ { 1 n } & \cdots & X _ { k n } \end{array} \right) = \left( \begin{array} { c } x _ { 1 } ^ { \prime } \\ x _ { 2 } ^ { \prime } \\ \vdots \\ x _ { n } ^ { \prime } \end{array} \right), \quad \text{and} \quad \beta = \left( \begin{array} { c } \beta _ { 0 } \\ \beta _ { 1 } \\ \vdots \\ \beta _ { k } \end{array} \right)$$
How would the X matrix change if you added two binary variables, D1 and B1? Demonstrate that in this case the columns of the X matrix are not independent. Finally show that elimination of one of the two variables is not sufficient to get rid of the multicollinearity problem. In terms of the OLS estimator, $\hat \beta = ( X ^ { \prime } X ) ^ { -1 } X ^ { \prime } Y$, why does perfect multicollinearity create a problem?

Accepted Answer

For the case of n = 4 and T = 3, the gen
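The rank argument for n = 4 and T = 3 can be checked numerically. The sketch below is only an illustration: the ordering of observations (entity by entity) and the randomly drawn values of the regressor are assumptions made here, not part of the accepted answer.

```python
import numpy as np

n, T = 4, 3
entity = np.repeat(np.arange(n), T)            # 0,0,0,1,1,1,... (assumed stacking order)
period = np.tile(np.arange(T), n)              # 0,1,2,0,1,2,...
rng = np.random.default_rng(2)
x = rng.normal(size=n * T)                     # hypothetical regressor values

const = np.ones(n * T)
D = (entity[:, None] == np.arange(n)).astype(float)    # columns D1..D4
B = (period[:, None] == np.arange(T)).astype(float)    # columns B1..B3

X_base = np.column_stack([const, x, D[:, 1:], B[:, 1:]])   # D1 and B1 excluded: 7 columns
X_all = np.column_stack([const, x, D, B])                   # D1 and B1 added: 9 columns
X_drop_B1 = np.column_stack([const, x, D, B[:, 1:]])        # only B1 removed: 8 columns

rank = np.linalg.matrix_rank
print(X_base.shape[1], rank(X_base))           # full column rank
print(X_all.shape[1], rank(X_all))             # rank below the number of columns
print(X_drop_B1.shape[1], rank(X_drop_B1))     # still rank deficient

# The source of the dependence: each full set of dummies sums to the constant.
print(np.allclose(D.sum(axis=1), const), np.allclose(B.sum(axis=1), const))
```

Whenever the rank is below the number of columns, X'X is singular, so the inverse in the OLS formula does not exist.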

Question 14

If you included both time and entity fixed effects in the regression model which includes a constant, then

Accepted Answer

A) one of the explanatory variables needs to be excluded to avoid perfect multicollinearity. 
B) you can use the "before and after" specification even for T > 2. 
C) you must exclude one of the entity binary variables and one of the time binary variables for the OLS estimator to exist. 
D) the OLS estimator no longer exists. 

Question 15

When you add state fixed effects to a simple regression model for U.S. states over a certain time period, and the regression R² increases significantly, then it is safe to assume that

Accepted Answer

A) the included explanatory variables, other than the state fixed effects, are unimportant. 
B) state fixed effects account for a large amount of the variation in the data. 
C) the coefficients on the other included explanatory variables will not change. 
D) time fixed effects are unimportant. 

Question 16

In the panel regression analysis of beer taxes on traffic deaths, the estimation period is 1982-1988 for the 48 contiguous U.S. states. To test for the significance of time fixed effects, you should calculate the F-statistic and compare it to the critical value from your Fq,∞ distribution, where q equals

Accepted Answer

A) 6. 
B) 7. 
C) 48. 
D) 53. 

Question 17

In the Fixed Time Effects regression model, you should exclude one of the binary variables for the time periods when an intercept is present in the equation

Accepted Answer

A) because the first time period must always be excluded from your data set.
B) because there are already too many coefficients to estimate.
C) to avoid perfect multicollinearity.
D) to allow for some changes between time periods to take place.

Question 18

HAC standard errors and clustered standard errors are related as follows:

Accepted Answer

A) they are the same 
B) clustered standard errors are one type of HAC standard error 
C) they are the same if the data is differenced 
D) clustered standard errors are the square root of HAC standard errors 

Question 19

Consider the regression example from your textbook, which estimates the effect of beer taxes on fatality rates across the 48 contiguous U.S. states. If beer taxes were set nationally by the federal government rather than by the states, then

Accepted Answer

A) it would not make sense to use state fixed effects.
B) you can test state fixed effects using homoskedastic-only standard errors.
C) the OLS estimator will be biased.
D) you should not use time fixed effects since beer taxes are the same at a point in time across states.

Question 20

In the Fixed Effects regression model, using (n - 1) binary variables for the entities, the coefficient of the binary variable indicates

Accepted Answer

A) the level of the fixed effect of the ith entity. 
B) will be either 0 or 1. 
C) the difference in fixed effects between the ith and the first entity. 
D) the response in the dependent variable to a percentage change in the binary variable. 

Exam 10: Regression With Panel Data
