Exam 1: Sample Exam for Chapters 1-3

arrow
  • Select Tags
search iconSearch Question
  • Select Tags

Briefly identify the following in words or equations as appropriate: (a) Degrees of freedom (b) Estimated regression equation (c) The Six Steps in Applied Regression Analysis (d) Ordinary Least Squares (e) The meaning of β\beta 1 in: Yi = β\beta 0 + β\beta 1 X1i+ β\beta 2 X2i + ε\varepsilon i

Free
(Essay)
4.8/5
(33)
Correct Answer:
Answered by Examlex AI Copilot

(a) Degrees of freedom: In statistics, degrees of freedom refer to the number of values in the final calculation of a statistic that are free to vary. It is often denoted by the symbol "df" and is used in various statistical tests and analyses.

(b) Estimated regression equation: The estimated regression equation is a mathematical model that represents the relationship between a dependent variable and one or more independent variables. It is typically expressed in the form Y = a + bX, where Y is the dependent variable, X is the independent variable, a is the intercept, and b is the slope.

(c) The Six Steps in Applied Regression Analysis: The six steps in applied regression analysis include: 1) Formulating the research question, 2) Collecting and preparing the data, 3) Exploring and visualizing the data, 4) Fitting the regression model, 5) Assessing the model, and 6) Using the model for prediction or inference.

(d) Ordinary Least Squares: Ordinary Least Squares (OLS) is a method used in regression analysis to estimate the parameters of a linear regression model. It minimizes the sum of the squared differences between the observed and predicted values of the dependent variable.

(e) The meaning of β\beta 1 in the given equation is not clear and appears to be a string of characters without a specific statistical meaning. It does not correspond to any standard notation or formula in regression analysis.

A source of constant discussion among applied econometricians is the degree to which measures of the overall fit of an estimated equation also measure the quality of that regression. To date, we have introduced something like four different measures of overall fit, but the two most used are R2\mathrm { R } ^ { 2 } and R2\overline { \mathrm { R } } ^ { 2 } . (a) Carefully distinguish between R2\mathrm { R } ^ { 2 } and R2\overline { \mathrm { R } } ^ { 2 } . (b) Of the two, which do you recommend typically using? Why? (c) What drawbacks are there to the use of the measure you chose (as your answer to part (b) above) as the primary determinant of the overall quality of a regression?

Free
(Essay)
4.7/5
(43)
Correct Answer:
Answered by Examlex AI Copilot

(a) The main difference between R2R^2 and R2\overline{R}^2 lies in how they account for the number of independent variables in the regression model. R2R^2 measures the proportion of the variance in the dependent variable that is explained by the independent variables in the model. However, it does not account for the number of independent variables, which means that adding more variables to the model will always increase R2R^2 , even if those variables are not truly adding explanatory power. On the other hand, R2\overline{R}^2 adjusts for the number of independent variables in the model, providing a more accurate measure of the overall fit.

(b) Typically, it is recommended to use R2\overline{R}^2 as the primary measure of overall fit in a regression model. This is because R2\overline{R}^2 takes into account the number of independent variables in the model, providing a more accurate assessment of the model's quality. By adjusting for the degrees of freedom, R2\overline{R}^2 helps to prevent overfitting and provides a more reliable indication of the true explanatory power of the model.

(c) However, there are drawbacks to using R2\overline{R}^2 as the primary determinant of the overall quality of a regression. One drawback is that it may penalize models with a large number of independent variables, even if those variables are truly adding explanatory power. Additionally, R2\overline{R}^2 may not capture the full complexity of the relationship between the independent and dependent variables, as it is based on the assumption of a linear relationship. Therefore, while R2\overline{R}^2 is a more robust measure of overall fit compared to R2R^2 , it should be used in conjunction with other diagnostic tools and considerations to fully assess the quality of a regression model.

Consider the following estimated equation: C^t=18.50.07Pt+0.93YDt0.74Dt1.3D2t1.3D3t\hat { \mathrm { C } } _ { \mathrm { t } } = 18.5 - 0.07 \mathrm { P } _ { \mathrm { t } } + 0.93 \mathrm { YD } _ { \mathrm { t } } - 0.74 \mathrm { D } _ { \mathrm { t } } - 1.3 \mathrm { D } _ { 2 \mathrm { t } } - 1.3 \mathrm { D } _ { 3 \mathrm { t } } where: Ct= per-capita pounds of pork consumed in the United States in quarter t Pt =the price of a hundred pounds of pork (in dollars) in quarter t YDt =per capita disposable income (in dollars) in quarter t D1t =dummy equal to 1 in the first quarter (Jan.-Mar.) of the year and 0 otherwise D2t =dummy equal to 1 in the second quarter of the year and 0 otherwise D3t =dummy equal to 1 in the third quarter of the year and 0 otherwise (a) What is the meaning of the estimated coefficient of YD? (b) Specify expected signs for each of the coefficients. Explain your reasoning. (c) Suppose we changed the definition of D3t so that it was equal to 1 in the fourth quarter and 0 otherwise and re-estimated the equation with all the other variables unchanged. Which of the estimated coefficients would change? Would your answer to part (b) above change? Explain your answer.

Free
(Essay)
4.8/5
(44)
Correct Answer:
Verified

We'd expect pork consumption to be the highest in the fourth quarter due to holidays, so the expected signs of the dummy coefficients are negative. For part (c), the coefficients of P and YD shouldn't change, but the others should, because the omitted condition is now the third quarter and not the fourth quarter. Thus the expected signs of the coefficients of D1 and D2 are no longer negative. (Some of the best students will note that the estimate of the coefficient of D2 will be almost exactly zero.)

Two of the most important econometric concepts to date have been the stochastic error term and the residual. Carefully distinguish between these two concepts, being sure to: (a) Define both terms. (b) State how they are similar. (c) State how they are different. (d) Give an example of an equation that contains a stochastic error term. (e) Give an example of an equation that contains a residual.

(Essay)
5.0/5
(41)
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)