Exam 4: Describing Bivariate Numerical Data

The Des Moines Register reported the ratings of high school sportsmanship as compiled by the Iowa High School Athletic Association. The participants and coaches from each school were rated by referees (1 = superior, 5 = unsatisfactory). A regression analysis of data on the average scores given to wrestling spectators and coaches is shown below.
a) Interpret the value of the correlation between the ratings of spectators and wrestlers.
b) Interpret the value of the coefficient of determination.
c) Interpret the value of the standard deviation about the least squares line.

Free
(Essay)
4.7/5
(39)
Correct Answer:
Verified

a) Correlation is r = [value not shown]. This indicates a moderately strong linear relationship between rated sportsmanship for spectators and participants.

b) r² = .467 means that about 47% of the observed differences in sportsmanship among spectators can be explained by differences in participants' sportsmanship.

c) s = .322 is a measure of how far a typical point will be above or below the least squares line.
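
The three quantities interpreted above can be computed directly from paired data. The following is a minimal sketch, not the analysis behind the answer: the ratings in `participants` and `spectators` are hypothetical values made up for illustration, and `fit_summary` is a helper name introduced here.

```python
import math

def fit_summary(x, y):
    """Least squares fit of y on x, returning r, r^2 and s_e."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)  # total sum of squares
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    # Residual sum of squares about the fitted line
    ss_resid = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))

    return {
        "slope": slope,
        "intercept": intercept,
        "r": sxy / math.sqrt(sxx * syy),       # correlation coefficient
        "r2": 1 - ss_resid / syy,              # coefficient of determination
        "s_e": math.sqrt(ss_resid / (n - 2)),  # std. deviation about the line
    }

# Hypothetical average ratings (1 = superior, 5 = unsatisfactory)
participants = [1.2, 1.5, 1.8, 2.0, 2.4, 2.9, 3.1, 3.6]
spectators = [1.4, 1.3, 2.1, 1.9, 2.2, 3.2, 2.7, 3.5]

summary = fit_summary(participants, spectators)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

For a simple linear fit, `r2` equals the square of `r`, which is why a value such as r² = .467 can be read as the proportion of variability in one rating explained by the other.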

Assessing the "goodness" of a regression line involves considering several aspects of the fit. Consider the characteristics below. How does each contribute to an assessment of fit? That is, for each characteristic, what about the given characteristic would indicate that the regression line is "good"?
a) The shape of the scatter plot
b) The correlation coefficient
c) The standard deviation of the residuals
d) The coefficient of determination

Free
(Essay)
4.7/5
(31)
Correct Answer:
Verified

a) Points on the graph lined up in a pattern that is consistently increasing or decreasing, rather than curved.
b) Values of r that are close to -1 or 1.
c) Small value of the standard deviation of the residuals (close to zero).
d) Value of r² close to 1.
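
The numerical criteria in b) through d) rest on a few standard quantities. As a sketch in common textbook notation (the labels SSResid and SSTo are assumed here and do not appear in the question):

```latex
r = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}
         {\sqrt{\sum (x_i - \bar{x})^2 \, \sum (y_i - \bar{y})^2}},
\qquad
r^2 = 1 - \frac{\mathrm{SSResid}}{\mathrm{SSTo}},
\qquad
s_e = \sqrt{\frac{\mathrm{SSResid}}{n - 2}}
```

Here SSResid = \sum (y_i - \hat{y}_i)^2 and SSTo = \sum (y_i - \bar{y})^2. As SSResid shrinks toward zero, r² approaches 1 and s_e approaches 0, which is what the answers in c) and d) describe.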

If on average y increases as x increases, the correlation coefficient is positive.

Free
(True/False)
4.9/5
(36)
Correct Answer:
Verified

True
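
One way to see this, sketched under the assumption that "on average y increases as x increases" is read as a positive least squares slope b, with s_x and s_y the sample standard deviations:

```latex
b = r \, \frac{s_y}{s_x}, \qquad s_x > 0,\; s_y > 0
\;\Longrightarrow\; \operatorname{sign}(b) = \operatorname{sign}(r)
```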

Some delicate fabrics are natural silks, made of protein and biodegradable. It would be beneficial to be able to assess the delicacy of a fabric before making decisions about displaying it in a museum. Chemical analysis might give some evidence about the brittle nature of a fabric. Biochemical data were acquired from the linings of sixteen 19th and early 20th century Japanese kimonos. Investigators measured the concentration of certain amino acids (“Amino acid ratio”) as well as the breaking stress (“tenacity”) of the 16 kimono fabrics.
a) What is the equation of the least-squares line for predicting tenacity using amino acid ratio?
b) Graph the least squares best fit line on the scatter plot.
c) Approximately what proportion of the variability in tenacity is explained by the linear relationship between tenacity and the amino acid ratio?

(Essay)
4.7/5
(37)

One of the properties of Pearson's r is: "The value of r does not depend on which of the two variables is labeled as x." In your own words, what does this mean?

(Essay)
4.9/5
(31)
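
The quoted property can be checked numerically. This is a minimal sketch on hypothetical data; `pearson_r`, `heights`, and `weights` are names and values invented for illustration.

```python
import math

def pearson_r(u, v):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(u)
    mean_u = sum(u) / n
    mean_v = sum(v) / n
    suv = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    suu = sum((a - mean_u) ** 2 for a in u)
    svv = sum((b - mean_v) ** 2 for b in v)
    return suv / math.sqrt(suu * svv)

# Hypothetical paired measurements
heights = [150.0, 160.0, 165.0, 172.0, 180.0, 188.0]
weights = [52.0, 58.0, 63.0, 70.0, 77.0, 90.0]

r_xy = pearson_r(heights, weights)  # heights labeled as x
r_yx = pearson_r(weights, heights)  # weights labeled as x
print(r_xy, r_yx)
```

The formula treats the two variables symmetrically, so swapping the arguments leaves the result unchanged.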

As early as 3 years of age, children begin to show preferences for playing with members of their own sex, and report having more same-sex than opposite-sex friends. Researchers believe that this may be the result of perceived differences in personality. In a study of 3rd and 4th graders' views on a number of personality traits, children were asked to rate on a "5-point" scale:
-2 = "someone possessing that trait is probably a boy"
-1 = "someone possessing that trait might be a boy"
0 = "can't tell"
1 = "someone possessing that trait might be a girl"
2 = "someone possessing that trait is probably a girl"
A scatterplot of the data is presented below. A single point represents the (average girls' rating, average boys' rating) for a given trait.
a) Circle the single point that represents the most influential observation. What aspect of this point makes it the most influential?
b) Suppose a personality trait similar to those used in the survey was given an average of 0.0 ("can't tell") by the girls. The predicted boys' average rating would be closest to which of the 5 categories described above?
c) The traits plotted above are those the researchers believe are "positive" traits, such as "mature," "honest," and "polite." The researchers thought that on average girls would rate these positive traits as characteristic of girls to a greater extent than boys would. What aspects of the plot and/or regression analysis presented above are consistent with this thinking?

(Essay)
4.7/5
(26)

A transformation, or re-expression, of a variable is accomplished by substituting a function of the variable in place of the variable in further analyses.

(True/False)
4.7/5
(35)

Early humans were similar in shape to most modern large primates. The data below are average male hind limb and forelimb lengths for different species of early hominids (humans and their ancestors).
a) What is the value of the correlation coefficient for these data?
b) What is the equation of the least squares line describing the relationship between x = hind limb length and y = forelimb length?
c) Suppose these species are representative of all species of early human ancestors. If a new hominin species dating from about the same time were to be discovered with an average hind limb length of 500 mm, what would you predict to be the average forelimb length of this species?

(Essay)
4.8/5
(31)

The correlation coefficient, r, does not depend on the units of measurement of the two variables.

(True/False)
4.8/5
(33)

The value of the correlation coefficient, r, is always between 0 and 1.

(True/False)
4.8/5
(37)

The Des Moines Register reported the ratings of high school sportsmanship as compiled by the Iowa High School Athletic Association. The participants and coaches from each school were rated by referees (1 = superior, 5 = unsatisfactory). A regression analysis of data on the average scores given to football players and coaches is shown below.
a) Interpret the value of the correlation between the ratings of coaches and participants.
b) Interpret the value of the coefficient of determination.
c) Interpret the value of the standard deviation about the least squares line.

(Essay)
4.9/5
(38)

Does the transformed model appear to be no improvement over the linear model, a slight improvement, or a significant improvement? Justify your response with an appropriate statistical argument.

(Essay)
4.8/5
(30)

The standard deviation about the least squares line is roughly the typical amount by which an observation deviates from the least squares line.

(True/False)
4.9/5
(27)

The slope of the least squares line is the amount by which y increases, on average, as x increases by one unit.

(True/False)
4.8/5
(29)

A large value of r² indicates strong evidence for a causal relationship between x and y.

(True/False)
4.9/5
(37)

Hemorrhagic disease in white-tailed deer is caused by a virus known as EHD. Immunity is given to fawns by transfer of EHD antibodies from the mother. In a study to determine how long the maternal antibodies last, blood samples were taken from a large sample of fawns of varying ages. The mean levels of EHD antibody concentration and the associated ages of fawns are given in the table below. After using the data to fit a straight line model, Ê = a + bW, significant curvature was detected in the residual plot. Two nonlinear models were chosen for further analysis, the exponential and the power models. (For these data, common logs were used to perform the transformations.) The computer output for these models is given below, and the residual plots are on the next page.
Residual Plots
a) For the exponential model, calculate the predicted logarithm of the EHD antibody concentration for an age of 5 weeks.
b) Generally speaking, which of the two models, power or exponential, is a better choice for predicting the logarithm of the EHD antibody concentration? Provide statistical justification for your choice based on both the residual plot and the numerical summary statistics above.
c) The researchers want to use their model to predict EHD antibody concentrations for fawns up to 24 weeks of age. Do you think this would be reasonable? Explain why or why not.

(Essay)
4.8/5
(31)
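
The mechanics of fitting the two transformed models can be sketched as follows. The `weeks` and `antibody` values are hypothetical stand-ins, since the study's table and computer output are not reproduced here, and `least_squares` is a helper name introduced for illustration. The sketch assumes the usual textbook forms: the exponential model regresses log10(E) on W, and the power model regresses log10(E) on log10(W).

```python
import math

def least_squares(x, y):
    """Fit y = a + b*x and return intercept, slope, r^2 and s_e."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b = sxy / sxx
    a = mean_y - b * mean_x
    ss_resid = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))
    return {
        "a": a,
        "b": b,
        "r2": 1 - ss_resid / syy,
        "s_e": math.sqrt(ss_resid / (n - 2)),
    }

# Hypothetical data: fawn age in weeks and mean antibody concentration
weeks = [1, 2, 4, 6, 8, 10, 12]
antibody = [180.0, 150.0, 95.0, 70.0, 45.0, 30.0, 22.0]

# Common (base-10) logs, as stated in the question
log_e = [math.log10(v) for v in antibody]
log_w = [math.log10(w) for w in weeks]

exp_fit = least_squares(weeks, log_e)  # exponential: log10(E) = a + b*W
pow_fit = least_squares(log_w, log_e)  # power: log10(E) = a + b*log10(W)

# Predicted log10 concentration at 5 weeks under the exponential model
pred_log_5 = exp_fit["a"] + exp_fit["b"] * 5

print("exponential:", exp_fit)
print("power:      ", pow_fit)
print("predicted log10(E) at W = 5:", pred_log_5)
```

Comparing `r2` and `s_e` between the two fits, together with the residual plots, mirrors the kind of justification part b) asks for. Both fits describe only the range of ages present in the data.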

Assessing the "goodness" of a regression line involves considering several aspects of the fit. Consider the characteristics below. How does each contribute to an assessment of fit? That is, for each characteristic, what about the given characteristic would indicate that the regression line is "good"?
a) The shape of the residual plot
b) The correlation coefficient
c) The existence of outliers
d) The coefficient of determination

(Essay)
4.7/5
(36)

The slope of the least squares line for predicting y from x and the slope of the least squares line for predicting x from y are equal.

(True/False)
4.8/5
(38)

The theory of fiber strength suggests that the relationship between fiber tenacity and amino acid ratio is logarithmic, i.e., [equation not shown], where T is the tenacity and R is the amino acid ratio. Perform the appropriate transformation of variable(s) and fit this logarithmic model to the data. What is the best fit line using the transformed data?

(Essay)
4.9/5
(31)

The study of prehistoric birds depends on imprints of a prehistoric creature’s remains in stone, commonly known as fossils. To study ancient ecosystems effectively it would be useful to know the actual mass of individual birds, but this information is not preserved in the fossil record. It seems reasonable that the biomechanics of birds is much the same today as in the past. For example, today’s relationship between the wing length and total weight of a bird should be very similar to that for birds from the distant past. The wing lengths of ancient birds are readily obtainable from the fossil record, but the weight is not. A regression model expressing the relationship between wing length and total weight of modern birds could be used to estimate the mass of similar prehistoric birds. Data for some species of modern birds of prey are given below.
Investigators would like to model the relationship between Wing Length and Weight. The least squares line for predicting total weight using wing length as a predictor is of interest.
a) What is the equation of the least-squares line?
b) Graph the least-squares line on the scatter plot.
c) Approximately what proportion of the variability in weight is explained by the wing length?

(Essay)
4.9/5
(42)