Exam 2: Exploring Relationships Between Variables

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

If a data set has a relationship that is best described by a linear model, than the residual plot will

(Multiple Choice)
4.9/5
(41)

Researchers studying growth patterns of children collect data on the heights of fathers and sons. The correlation between the fathers' heights and the heights of their 16 year-old sons is most likely To be . . .

(Multiple Choice)
4.8/5
(27)

A plot of the residuals versus the fitted values for record-breaking times of female marathon runners for the years 1998 - 2003 is: A plot of the residuals versus the fitted values for record-breaking times of female marathon runners for the years 1998 - 2003 is:    Based on this residuals plot, does it seem reasonable to use linear regression for this model? Explain. Based on this residuals plot, does it seem reasonable to use linear regression for this model? Explain.

(Essay)
4.7/5
(45)

Which statement about re-expressing data is not true? I. Unimodal distributions that are skewed to the left will be made more symmetric by taking the Square root of the variable. II. A curve in which the direction of the association changes from negative to positive Will not benefit from re-expression. III. One goal of re-expression may be to make the variability of the response variable more Uniform.

(Multiple Choice)
4.8/5
(34)

A study examined the number of trees in a variety of orange groves and the corresponding number of oranges that each grove produces in a given harvest year. Linear regression was calculated and the results are below. linear regression results: Dependent Variable: oranges Independent Variable: trees Sample size: 9 -=0.886 =31394.7 Parameter Estimate Std. Err. Constant 390.59 16328.8 Trees 525.84 71.22  A study examined the number of trees in a variety of orange groves and the corresponding number of oranges that each grove produces in a given harvest year. Linear regression was calculated and the results are below. linear regression results: Dependent Variable: oranges Independent Variable: trees Sample size: 9   \begin{array}{l} \mathrm{R}-\mathrm{sq}=0.886 \\ \mathrm{~s}=31394.7 \end{array}     \begin{array}{|l|r|r|} \hline \text { Parameter } & \text { Estimate } & \text { Std. Err. } \\ \hline \text { Constant } & 390.59 & 16328.8 \\ \hline \text { Trees } & 525.84 & 71.22 \\ \hline \end{array}      -If these data were collected in California, would you feel confident in using this equation to make predictions about Florida orange groves also? Explain. -If these data were collected in California, would you feel confident in using this equation to make predictions about Florida orange groves also? Explain.

(Essay)
4.9/5
(46)

In an effort to decide if there is an association between the year of a postal increase and the new postal rate for first class mail, the data were gathered from the United States Postal Service. In 1981, the United States Postal Service changed their rates on March 22 and November 1. This information is shown in the table. Year Rate 1971 0.08 1974 0.10 1975 0.13 1978 0.15 1981 0.18 1981 0.20 1985 0.22 1988 0.25 1991 0.29 1995 0.32 -Interpret the slope of your model in context.

(Essay)
5.0/5
(39)

In an effort to decide if there is an association between the year of a postal increase and the new postal rate for first class mail, the data were gathered from the United States Postal Service. In 1981, the United States Postal Service changed their rates on March 22 and November 1. This information is shown in the table. Year Rate 1971 0.08 1974 0.10 1975 0.13 1978 0.15 1981 0.18 1981 0.20 1985 0.22 1988 0.25 1991 0.29 1995 0.32 -Interpret the intercept of your model in context.

(Essay)
5.0/5
(31)

Residuals are . . .

(Multiple Choice)
4.8/5
(30)

Using the equation in number #8 again, if a homeowner doubles the number of lights he uses from 500 to 1000, how much do we predict he will increase his electric bill by?

(Multiple Choice)
4.9/5
(39)

An article in the Journal of Statistics Education reported the price of diamonds of different sizes in Singapore dollars (SGD). The following table contains a data set that is consistent with this data, adjusted to US dollars in 2004: 2004 US \ Carat 494.82 0.12 768.03 0.17 1105.03 0.20 1508.88 0.25 1826.18 0.28 2096.89 0.33 2004 US\ Carat 688.24 0.15 944.90 0.18 1071.75 0.21 1504.44 0.26 1908.28 0.29 2409.76 0.35 2004 US \ Carat 748.10 0.16 1076.18 0.19 1289.20 0.23 1597.63 0.27 2038.09 0.32 -Interpret the intercept of your model in context.

(Essay)
4.8/5
(39)

On the axes below, sketch a scatterplot described: a. a strong negative association  On the axes below, sketch a scatterplot described: a. a strong negative association    b. a strong association but  r  is near 0    c. a weak but positive association     b. a strong association but rr is near 0  On the axes below, sketch a scatterplot described: a. a strong negative association    b. a strong association but  r  is near 0    c. a weak but positive association     c. a weak but positive association  On the axes below, sketch a scatterplot described: a. a strong negative association    b. a strong association but  r  is near 0    c. a weak but positive association

(Essay)
4.8/5
(28)

A consumer group collected information on HDTVs . They created a linear model to estimate the Cost of an HDTV (in $) based on the screen size (in inches). Which is the most likely value of the Slope of the line of best fit?

(Multiple Choice)
4.8/5
(37)

Storks Data show that there is a positive association between the population of 17 European countries and the number of stork pairs in those countries. a. Briefly explain what "positive association" means in this context. b. Wildlife advocates want the stork population to grow, and jokingly suggest that citizens should be encouraged to have children. As a statistician, what do you think of this suggestion? Explain briefly.

(Essay)
4.8/5
(40)

The correlation coefficient between the hours that a person is awake during a 24-hour period and The hours that same person is asleep during a 24-hour period is most likely to be

(Multiple Choice)
4.9/5
(38)

Current research states that a good diet should contain 20-35 grams of dietary fiber. Research also states that each day should start with a healthy breakfast. The nutritional information for 77 breakfast cereals was reviewed to find the grams of fiber and the number of calories per serving. The scatterplot below shows the relationship between fiber and calories for the cereals. Current research states that a good diet should contain 20-35 grams of dietary fiber. Research also states that each day should start with a healthy breakfast. The nutritional information for 77 breakfast cereals was reviewed to find the grams of fiber and the number of calories per serving. The scatterplot below shows the relationship between fiber and calories for the cereals.   -Do you think a model could accurately predict the number of calories in a serving of cereal that has 22 grams of fiber? Explain. -Do you think a model could accurately predict the number of calories in a serving of cereal that has 22 grams of fiber? Explain.

(Essay)
5.0/5
(44)

The bigger the stop sign, the more expensive it is. Here is a graph of the height of a sign in inches versus its cost in dollars. The bigger the stop sign, the more expensive it is. Here is a graph of the height of a sign in inches versus its cost in dollars.   -Describe why performing linear regression with these data is not a good decision. -Describe why performing linear regression with these data is not a good decision.

(Essay)
4.9/5
(35)

All but one of these statements contain a mistake. Which could be true?

(Multiple Choice)
4.8/5
(33)

Breaking strength A company manufactures polypropylene rope in six different sizes. To assess the strength of the ropes they test two samples of each size to see how much force (in kilograms) the ropes will hold without breaking. The table shows the results of the tests. We want to create a model for predicting the breaking strength from the diameter of the rope. Diameter () Strength () 4 60 76 7 157 153 10 254 262 12 334 388 15 551 529 20 938 893 a. Find a model that uses re-expressed data to straighten the scatterplot. b. The company is thinking of introducing a new 25mm rope. How strong should it be? (Write a sentence in context based on one of your models.)

(Essay)
4.9/5
(41)

Gas mileage again In the Data Desk lab last week you analyzed the association between a car's fuel economy and its weight. Another important factor in the amount of gasoline a car uses is the size of the engine. Called "displacement", engine size measures the volume of the cylinders in cubic inches. The regression analysis is shown. Dependent variable is: MPG 89 total cases of which 0 are missing R squared =60.9%= 60.9 \% \quad R squared (adjusted) =60.0%= 60.0 \% s=3.056\mathrm { s } = 3.056 with 892=8789 - 2 = 87 degrees of freedom Source Sum of Squares df Mean Square F-ratio Regression 696.744 1 696.744 74.6 Residual 448.236 48 9.33826 Variable Coefficient s.e. of Coeff t-ratio prob Constant 34.9799 1.231 28.4 0.0001 Eng. Displcmt -0.066196 0.0077 -8.64 S 0.0001 a. How many cars were included in this analysis? b. What is the correlation between engine size and fuel economy? c. A car you are thinking of buying is available with two different size engines, 190 cubic inches or 240 cubic inches. How much difference might this make in your gas mileage? (Show work)

(Essay)
4.9/5
(31)

Which statement about residuals plots is true? I. A curved pattern indicates nonlinear association between the variables. II. A pattern of increasing spread indicates the predicted values become less reliable as the Explanatory variable increases. III. Randomness in the residuals indicates the model will predict accurately.

(Multiple Choice)
4.7/5
(46)
Showing 141 - 160 of 165
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)