Exam 13: Nonlinear and Multiple Regression

arrow
  • Select Tags
search iconSearch Question
  • Select Tags

Incorporating a categorical variable with 5 possible categories into a multiple regression model requires the use of __________ dummy variables.

(Short Answer)
4.8/5
(46)

Which of the following statements are true?

(Multiple Choice)
4.9/5
(35)

Which of the following statements are not true?

(Multiple Choice)
4.8/5
(31)

Which of the following statements are true?

(Multiple Choice)
4.9/5
(34)

In multiple regression analysis with n observations and k predictors (or equivalently k+1 parameters), inferences concerning a single parameter βi\beta _ { i } are based on the standardized variable  In multiple regression analysis with n observations and k predictors (or equivalently k+1 parameters), inferences concerning a single parameter  \beta _ { i }  are based on the standardized variable   , which has a t-distribution with degrees of freedom equal to , which has a t-distribution with degrees of freedom equal to

(Multiple Choice)
4.8/5
(43)

The following data resulted from an experiment to assess the potential of unburnt colliery spoil as a medium for plant growth. The variables are x=acid extractable cations and y=exchangeable acidity/total cation exchange capacity. x -23 -5 16 26 30 38 52 x 1.50 1.46 1.32 1.17 .96 .78 .77 x 58 67 81 96 100 113 x .91 .78 .69 .52 .48 .55 Standardizing the independent variable x to obtain xt=(xxˉ)/sxx ^ { t } = ( x - \bar { x } ) / s _ { x } and fitting the regression function y=β0+β1xt+β2(xt)2y = \vec { \beta } _ { 0 } + \vec\beta _ { 1 } x ^ { t } + \vec { \beta } _ { 2 } ( x ^{t}) ^ { 2 } yielded the accompanying computer output.  The following data resulted from an experiment to assess the potential of unburnt colliery spoil as a medium for plant growth. The variables are x=acid extractable cations and y=exchangeable acidity/total cation exchange capacity.  \begin{array}{l} \begin{array} { c c c c c c c c }  \hline x & - 23 & - 5 & 16 & 26 & 30 & 38 & 52 \\ \hline x & 1.50 & 1.46 & 1.32 & 1.17 & .96 & .78 & .77 \\ \hline \end{array}\\\\ \begin{array} { l l l l l l l }  \hline x & 58 & 67 & 81 & 96 & 100 & 113 \\ \hline x & .91 & .78 & .69 & .52 & .48 & .55 \\ \hline \end{array} \end{array}  Standardizing the independent variable x to obtain  x ^ { t } = ( x - \bar { x } ) / s _ { x }  and fitting the regression function  y = \vec { \beta } _ { 0 } + \vec\beta _ { 1 } x ^ { t } + \vec { \beta } _ { 2 } ( x ^{t}) ^ { 2 }   yielded the accompanying computer output.    a. Estimate  \mu _ { y ^ { 50 } }   .  b. Compute the value of the coefficient of multiple determination. c. What is the estimated regression function  \hat { \beta } _ { 10 } + \hat { \beta } _ { 1 } x + \hat { \beta } _ { 2 } x ^ { 2 }   using the unstandardized variable x?  d. What is the estimated standard deviation of  \hat { \beta } _ { 2 }   computed in part ( c )?  e. Carry out a test using the standardized estimates to decide whether the quadratic term should be retained in the model. Repeat using the unstandardized estimates. Do your conclusions differ? a. Estimate μy50\mu _ { y ^ { 50 } } . b. Compute the value of the coefficient of multiple determination. c. What is the estimated regression function β^10+β^1x+β^2x2\hat { \beta } _ { 10 } + \hat { \beta } _ { 1 } x + \hat { \beta } _ { 2 } x ^ { 2 } using the unstandardized variable x? d. What is the estimated standard deviation of β^2\hat { \beta } _ { 2 } computed in part ( c )? e. Carry out a test using the standardized estimates to decide whether the quadratic term should be retained in the model. Repeat using the unstandardized estimates. Do your conclusions differ?

(Essay)
4.8/5
(31)

In general, with SSEαS S E _ { \alpha } is the error sum of squares from a kth degree polynomial, SSEkS S E _ { k } ____________ SSEkS S E _ { k } , and Rk2R _ { k } ^ { 2 } ____________ Rk2R _ {k} ^ { 2 } whenever ktk ^ { t } > k.

(Essay)
4.9/5
(43)

For the quadratic model with regression function μix=β0+β1x+β2x2\mu _ { i\cdot x } = \beta _ { 0 } + \beta _ { 1 } x + \beta _ { 2 } x ^ { 2 } , the parameters β0,β1, and β2\beta _ { 0 } , \beta _ { 1 } \text {, and } \beta _ { 2 } characterize the behavior of the function near

(Multiple Choice)
4.8/5
(31)

Suppose the variables x=commuting distance and y=commuting time are related according to the simple linear regression model with σ=10.\sigma = 10 . a. If n=5 observations are made at the x values x1=4,x2=9,x3=14,x4=19, and x5=24x _ { 1 } = 4 , x _ { 2 } = 9 , x _ { 3 } = 14 , x _ { 4 } = 19 , \text { and } x _ { 5 } = 24 calculate the standard deviations of the five corresponding residuals. b. Repeat part (a) for x1=4,x2=9,x3=14,x4=19, and x5=49x _ { 1 } = 4 , x _ { 2 } = 9 , x _ { 3 } = 14 , x _ { 4 } = 19 , \text { and } x _ { 5} = 49 c. What do the results of parts (a) and (b) imply about the deviation of the estimated line from the observation made at the largest sampled x value?

(Essay)
4.9/5
(35)

A function relating y to x is ___________ if by means of a transformation on x and / or y, the function can be expressed as y=β0+β1xy ^ { \prime } = \beta _ { 0 } + \beta _ { 1 } x ^ { \prime } , where xtx ^ { t } is the transformed independent variable and yty ^ {t } is the transformed dependent variable.

(Short Answer)
4.9/5
(39)

When the numbers of predictors is too large to allow for an explicit or implicit examination of all possible subsets, several alternative selection procedures generally will identify good models. The simplest such procedure is the __________, known as BE method.

(Short Answer)
4.7/5
(35)

For a multiple regression model, (yiyˉ)2=250\sum \left( y _ { i } - \bar { y } \right) ^ { 2 } = 250 , and (yiy^i)2=60\sum \left( y _ { i } - \hat { y } _ { i } \right) ^ { 2 } = 60 , then the proportion of the total variation in the observed yiy _ { i} 's that is not explained by the model is

(Multiple Choice)
4.8/5
(36)

In many multiple regression data sets, the predictors x1,x3,x4,,xkx _ { 1 } , x _ { 3 } , x _ { 4 } , \ldots \ldots , x _ { k } are highly interdependent. When the sample xix _ { i } values can be predicted very well from the other predictor values, for at least one predictor, the data is said to exhibit __________.

(Short Answer)
4.8/5
(38)

Which of the following statements are not true?

(Multiple Choice)
4.8/5
(35)

The regression coefficient β2\beta _ { 2 } in the multiple regression model Y=β0+β1x+β2x2++βkxk+εY = \beta _ { 0 } + \beta _ { 1 } x + \beta _ { 2 } x ^ { 2 } + \cdots \cdots + \beta _ { k } x ^ { k} + \varepsilon is interpreted as the expected change in ___________ associated with a 1-unit increase in ___________,while___________ are held fixed.

(Essay)
4.9/5
(44)

Wear resistance of certain nuclear reactor components made of Zircaloy-2 is partly determined by properties of the oxide layer. The following data appears in a study that proposed a new nondestructive testing method to monitor thickness of the layer. The variables are x =oxide-layer thickness ( (μm)( \mu m ) and y =eddy-current respond (arbitrary units). x 0 7 17 114 133 142 190 218 237 285 x 20.3 19.8 19.5 15.9 15.1 14.7 11.9 11.5 8.3 6.6 The equation of the least squares line is y^\hat { y } =20.6 - .047x. Calculate and plot the residuals against x and then comment on the appropriateness of the simple linear regression model.

(Essay)
4.8/5
(31)

Cardiorespiratory fitness is widely recognized as a major component of overall physical well-being. Direct measurement of maximal oxygen uptake (VO2max)\left( \mathrm { VO } _ { 2 } \max \right) is the single best measure of such fitness, but direct measurement is time-consuming and expensive. It is therefore desirable to have a prediction equation for VO2max\mathrm { VO } _ { 2 } \max in terms of easily obtained quantities. Consider the variables y=VO2max(L/min)y = V O _ { 2 } \max ( L / \min ) x1= weight (kg) x _ { 1 } = \text { weight (kg) } x2=age(y)x _ { 2 } = \operatorname { age } ( \mathrm { y } ) x3= time necessary to  walk 1 m ile (min) x _ { 3 } = \text { time necessary to } \text { walk } 1 \mathrm {~m} \text { ile (min) } x4= heart r ate at the end of the  walk (beats/min) x _ { 4 } = \text { heart } \mathrm { r } \text { ate at the end of the } \text { walk (beats/min) } Here is one possible model, for male students: Y=5.0+.015x1.053x2.134x3.011x4+εY = 5.0 + .015 x _ { 1 } - .053 x _ { 2 } - .134 x _ { 3 } - .011 x _ { 4 } + \varepsilon , and σ=.4\sigma = .4 a. Interpret β1 and β3\beta _ { 1 } \text { and } \beta _ { 3 } . b. What is the expected value of VO2max\mathrm { VO } _ { 2 } \max when weight 75 kg. age is 20 yr, walk time is 15 minutes, and heart rate is 140 b/m? c. What is the probability that VO2max\mathrm { VO } _ { 2 } \max will be between 1.00 and 2.60 for a single observation made when the values of the predictors are as stated in part (b)?

(Essay)
4.8/5
(29)

It is important to find characteristics of the production process that produce tortilla chips with an appealing texture. The following data on x = frying time (sec) and y = moisture content (%) are obtained: x 5 10 15 20 25 30 45 60 x 16.3 11.4 8.1 4.5 3.4 2.9 1.9 1.3 a. Construct a scatter plot of y versus x and comment. b. Construct a scatter plot of the (In(x), In(y)) pairs and comment. c. What probabilistic relationship between x and y is suggested by the linear pattern in the plot of part (b)? d. Predict the value of moisture content when frying time is 20 in a way that conveys information about reliability and precision.

(Essay)
4.8/5
(37)

If the regression parameters β0\beta _ { 0 } and β1\beta _ { 1 } are estimated by minimizing the expression fw(b0,b1)=wi[yi(b0+b1xi)]2f _ {w } \left( b _ { 0 } , b _ { 1 } \right) = \sum w _ { i } \left[ y _ { i } - \left( b _ { 0 } + b _ { 1 } x _ { i } \right) \right] ^ { 2 } , where the wiw _ { i } 's are weights that decrease with increasing xix _ { i } , this yields____________estimates.

(Short Answer)
4.8/5
(37)

In each of the following cases, decide whether the given function is intrinsically linear. If so, identify xt and ytx ^ { t } \text { and } y ^ { t } and then explain how a random error term ε \varepsilon can be introduced to yield an intrinsically linear probabilistic model. a. y=1/(α+βx)y = 1 / ( \alpha + \beta x ) b. y=1/(1+eα+βx)y = 1 / \left( 1 + e ^ { \alpha+ \beta x } \right) c.  In each of the following cases, decide whether the given function is intrinsically linear. If so, identify  x ^ { t } \text { and } y ^ { t }  and then explain how a random error term   \varepsilon     can be introduced to yield an intrinsically linear probabilistic model.  a.  y = 1 / ( \alpha + \beta x )   b.  y = 1 / \left( 1 + e ^ { \alpha+ \beta x } \right)   c.    (a Gompertz curve)  d.  y = \alpha + \beta e ^ { \lambda x } (a Gompertz curve) d. y=α+βeλxy = \alpha + \beta e ^ { \lambda x }

(Essay)
4.9/5
(42)
Showing 21 - 40 of 77
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)