Exam 2: Exploring Relationships Between Variables

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

Which is true? I. Random scatter in the residuals indicates a model with high predictive power. II. If two variables are very strongly associated, then the correlation between them will be near +1.0 Or -1.0. III. The higher the correlation between two variables the more likely the association is based in Cause and effect.

(Multiple Choice)
4.8/5
(42)

After conducting a marketing study to see what consumers thought about a new tinted contact lens they were developing, an eyewear company reported, "Consumer satisfaction is strongly correlated with eye color." Comment on this observation.

(Essay)
4.8/5
(32)

A study by a prominent psychologist found a moderately strong positive association between the number of hours of sleep a person gets and the person's ability to memorize information. a. Explain in the context of this problem what "positive association" means. b. Hoping to improve academic performance, the psychologist recommended the school board allow students to take a nap prior to any assessment. Discuss the psychologist's recommendations.

(Essay)
4.8/5
(38)

Halloween is a fun night. It seems that older children might get more candy because they can travel further while trick-or-treating. But perhaps the youngest kids get extra candy because they are so cute. Here are some data that examine this question, along with the regression output. Dependent Variable: candy Sample size: 9 RR (correlation coefficient) =0.19534425= \mathbf { 0 . 1 9 5 3 4 4 2 5 } Rsq=0.038159375\mathrm { R } - \mathrm { sq } = 0.038159375 s=11.297554s = 11.297554 Parameter Estimate Std. Err. Intercept 13.569231 9.0783516 Age 3.4038462 1.0175376  Halloween is a fun night. It seems that older children might get more candy because they can travel further while trick-or-treating. But perhaps the youngest kids get extra candy because they are so cute. Here are some data that examine this question, along with the regression output. Dependent Variable: candy Sample size: 9  R  (correlation coefficient)  = \mathbf { 0 . 1 9 5 3 4 4 2 5 }   \mathrm { R } - \mathrm { sq } = 0.038159375   s = 11.297554   \begin{array} { l r r } \text { Parameter } & \text { Estimate } & \text { Std. Err. } \\ \text { Intercept } & 13.569231 & 9.0783516 \\ \text { Age } & 3.4038462 & 1.0175376 \end{array}     -The next day, a young girl reveals that her older brother also went trick-or-treating, but didn't want to admit that he participated. He was added to the data set and these are the results. Dependent Variable: candy Sample size: 10  R  (correlation coefficient  ) = 0.76362369   \mathrm { R } - \mathrm { sq } = 0.58312115   \mathrm { s } = 12.709041   \begin{array} { l r r } \text { Parameter } & \text { Estimate } & \text { Std. Err. } \\ \text { Intercept } & 13.569231 & 9.0783516 \\ \text { Age } & 3.4038462 & 1.0175376 \end{array}      Describe the effect of this new candy collector on the regression model. -The next day, a young girl reveals that her older brother also went trick-or-treating, but didn't want to admit that he participated. He was added to the data set and these are the results. Dependent Variable: candy Sample size: 10 RR (correlation coefficient )=0.76362369) = 0.76362369 Rsq=0.58312115\mathrm { R } - \mathrm { sq } = 0.58312115 s=12.709041\mathrm { s } = 12.709041 Parameter Estimate Std. Err. Intercept 13.569231 9.0783516 Age 3.4038462 1.0175376  Halloween is a fun night. It seems that older children might get more candy because they can travel further while trick-or-treating. But perhaps the youngest kids get extra candy because they are so cute. Here are some data that examine this question, along with the regression output. Dependent Variable: candy Sample size: 9  R  (correlation coefficient)  = \mathbf { 0 . 1 9 5 3 4 4 2 5 }   \mathrm { R } - \mathrm { sq } = 0.038159375   s = 11.297554   \begin{array} { l r r } \text { Parameter } & \text { Estimate } & \text { Std. Err. } \\ \text { Intercept } & 13.569231 & 9.0783516 \\ \text { Age } & 3.4038462 & 1.0175376 \end{array}     -The next day, a young girl reveals that her older brother also went trick-or-treating, but didn't want to admit that he participated. He was added to the data set and these are the results. Dependent Variable: candy Sample size: 10  R  (correlation coefficient  ) = 0.76362369   \mathrm { R } - \mathrm { sq } = 0.58312115   \mathrm { s } = 12.709041   \begin{array} { l r r } \text { Parameter } & \text { Estimate } & \text { Std. Err. } \\ \text { Intercept } & 13.569231 & 9.0783516 \\ \text { Age } & 3.4038462 & 1.0175376 \end{array}      Describe the effect of this new candy collector on the regression model. Describe the effect of this new candy collector on the regression model.

(Essay)
4.9/5
(28)

A common objective for many school administrators is to increase the number of students taking SAT and ACT tests from their school. The data from each state from 2003 are reflected in the scatterplot.  A common objective for many school administrators is to increase the number of students taking SAT and ACT tests from their school. The data from each state from 2003 are reflected in the scatterplot.    a. Write a few sentences describing the association. b. Estimate the correlation.  r = __________  c. If the point in the top left corner (4, 1215) were removed, would the correlation become stronger, weaker, or remain about the same? Explain briefly. d. If the point in the very middle (38, 1049) were removed, would the correlation become stronger, weaker, or remain about the same? Explain briefly. a. Write a few sentences describing the association. b. Estimate the correlation. r=r = __________ c. If the point in the top left corner (4, 1215) were removed, would the correlation become stronger, weaker, or remain about the same? Explain briefly. d. If the point in the very middle (38, 1049) were removed, would the correlation become stronger, weaker, or remain about the same? Explain briefly.

(Essay)
4.9/5
(34)

Associations For each pair of variables, indicate what association you expect: positive(+),  negative (), curved (C), or none (N)\text { negative } ( - ) \text {, curved } ( \mathrm { C } ) \text {, or none } ( \mathrm { N } ) \text {. } a. power level setting of a microwave; number of minutes it takes to boil water b. number of days it rained in a month (during the summer); number of times you mowed your lawn that month c. number of hours a person has been up past a normal bedtime; number of minutes it takes the person to do a crossword puzzle d. number of hockey games played in Minnesota during a week; sales of suntan lotion in Minnesota during that week e. length of a student's hair; number of credits the student earned last year

(Essay)
4.9/5
(41)

Earning power A college's job placement office collected data about students' GPAs and the salaries they earned in their first jobs after graduation. The mean GPA was 2.9 with a standard deviation of 0.4. Starting salaries had a mean of $47,200 with a SD of $8500. The correlation between the two variables was r=0.72r = 0.72 The association appeared to be linear in the scatterplot. (Show work) a. Write an equation of the model that can predict salary based on GPA. b. Do you think these predictions will be reliable? Explain. c. Your brother just graduated from that college with a GPA of 3.30. He tells you that based on this model the residual for his pay is -$1880. What salary is he earning?

(Essay)
4.8/5
(41)

Math and Verbal Suppose the correlation between SAT Verbal scores and Math scores is 0.57 and that these scores are normally distributed. If a student's Verbal score places her at the 90th percentile, at what percentile would you predict her Math score to be? (SHOW WORK)

(Essay)
4.8/5
(44)

Which of the following is not a source of caution in regression between two variables?

(Multiple Choice)
4.8/5
(33)

The correlation coefficient between high school grade point average (GPA) and college GPA is 0)560. For a student with a high school GPA that is 2.5 standard deviations above the mean, we Would expect that student to have a college GPA that is ______ the mean.

(Multiple Choice)
4.8/5
(24)

To achieve linearity, the data was transformed using a square root function of cost. Here are the results and a residual plot. Dependent Variable: sqrt(cost) s: 0.21410.2141 Parameter coeff se Intercept 1.1857 0.4346 height 0.1792 0.0151  To achieve linearity, the data was transformed using a square root function of cost. Here are the results and a residual plot. Dependent Variable: sqrt(cost) s:  0.2141   \begin{array} { l r r } \text { Parameter } & \text { coeff } & \text { se } \\ \text { Intercept } & 1.1857 & 0.4346 \\ \text { height } & 0.1792 & 0.0151 \end{array}       -Interpret R-sq in the context of this problem. -Interpret R-sq in the context of this problem.

(Essay)
4.8/5
(35)

A regression analysis of students' college grade point averages (GPAs) and their high school GPAs Found R2=0.311R ^ { 2 } = 0.311 11) Which of these is true? I. High school GPA accounts for 31.1% of college GPA. II. 31.1% of college GPAs can be correctly predicted with this model. III. 31.1% of the variance in college GPA can be accounted for by the model

(Multiple Choice)
4.8/5
(32)

You are given the following costs to build a square deck for your house:  Width (ft) 456789101112131415 Cost ($) 150255350500650800100012001450170019502250\begin{array} { | l | c | c | c | c | c | c | c | c | c | c | c | c | } \hline \text { Width (ft) } & 4 & 5 & 6 & 7 & 8 & 9 & 10 & 11 & 12 & 13 & 14 & 15 \\\hline \text { Cost (\$) } & 150 & 255 & 350 & 500 & 650 & 800 & 1000 & 1200 & 1450 & 1700 & 1950 & 2250 \\\hline\end{array} a. Use re-expressed data to create a model that predicts the cost of the deck based on the width. b. Why do you think that your model is appropriate? c. Find the predicted cost of a square deck that is 10.5 feet wide. d. Is it reasonable to use this model to predict the cost of a square deck that is 20 feet wide? Explain.

(Essay)
5.0/5
(36)

During a science lab, students heated water, allowed it to cool, and recorded the temperature over time. They computed the difference between the water temperature and the room temperature. The results are in the table. Time (in minutes) 10 20 30 40 50 60 Difference in temp. (degrees F) 68 36 20 10 6 4 -Would it be better for customers for a year to have a negative residual or a positive residual from this model? Explain.

(Essay)
4.9/5
(37)

The residuals plot for a linear model is shown. Which is true? The residuals plot for a linear model is shown. Which is true?

(Multiple Choice)
4.8/5
(35)

After conducting a survey of his students, a professor reported that "There appears to be a strong correlation between grade point average and whether or not a student works." Comment on this observation.

(Essay)
4.8/5
(30)

Penicillin Doctors studying how the human body assimilates medication inject some patients with penicillin, and then monitor the concentration of the drug (in units/cc) in the patients' blood for seven hours. The data are shown in the scatterplot. First they tried to fit a linear model. The regression analysis and residuals plot are shown. Dependent variable is: Concentration No Selector R squared =90.8%= 90.8 \% \quad R squared (adjusted) =90.6%= 90.6 \% s=3.472s = 3.472 with 432=4143 - 2 = 41 degrees of freedom Source Sum of Squares df Mean Square F-ratio Regression 4900.55 1 4900.55 407 Residual 494.199 41 12.0536 Variable Coefficient s.e. of Coeff t-ratio prob Constant 40.3266 1.295 31.1 \leq0.0001 Time -5.95956 0.2956 -20.2 \leq0.0001  Penicillin Doctors studying how the human body assimilates medication inject some patients with penicillin, and then monitor the concentration of the drug (in units/cc) in the patients' blood for seven hours. The data are shown in the scatterplot. First they tried to fit a linear model. The regression analysis and residuals plot are shown. Dependent variable is: Concentration No Selector R squared  = 90.8 \% \quad  R squared (adjusted)  = 90.6 \%   s = 3.472  with  43 - 2 = 41  degrees of freedom    \begin{array}{llrrr} \text { Source } & \text { Sum of Squares } & \text { df } & \text { Mean Square } & \text { F-ratio } \\ \text { Regression } & 4900.55 & 1 & 4900.55 & 407 \\ \text { Residual } & 494.199 & 41 & 12.0536 & \end{array}      \begin{array}{lllrl} \text { Variable } & \text { Coefficient } & \text { s.e. of Coeff } & \text { t-ratio } & \text { prob } \\ \text { Constant } & 40.3266 & 1.295 & 31.1 & \leq 0.0001 \\ \text { Time } & -5.95956 & 0.2956 & -20.2 & \leq 0.0001 \end{array}        a. Find the correlation between time and concentration. b. Using this model, estimate what the concentration of penicillin will be after 4 hours. c. Is that estimate likely to be accurate, too low, or too high? Explain. Now the researchers try a new model, using the re-expression log(Concentration). Examine the regression analysis and the residuals plot below. Dependent variable is: LogCnn No Selector R squared  = 98.0 \% \quad  R squared (adjusted)  = 98.0 \%   s = 0.0451  with  43 - 2 = 41  degrees of freedom    \begin{array}{llrrr} \text { Source } & \text { Sum of Squares } & \text { df } & \text { Mean Square } & \text { F-ratio } \\ \text { Regression } & 4.11395 & 1 & 4.11395 & 2022 \\ \text { Residual } & 0.083412 & 41 & 0.002034 & \end{array}    \begin{array}{llllc} \text { Variable } & \text { Coefficient } & \text { s.e. of Coeff } & \text { t-ratio } & \text { prob } \\ \text { Constant } & 1.80184 & 0.0168 & 107 & \leq 0.0001 \\ \text { Time } & -0.172672 & 0.0038 & -45.0 & \leq 0.0001 \end{array}       d. Explain why you think this model is better than the original linear model. e. Using this new model, estimate the concentration of penicillin after 4 hours. a. Find the correlation between time and concentration. b. Using this model, estimate what the concentration of penicillin will be after 4 hours. c. Is that estimate likely to be accurate, too low, or too high? Explain. Now the researchers try a new model, using the re-expression log(Concentration). Examine the regression analysis and the residuals plot below. Dependent variable is: LogCnn No Selector R squared =98.0%= 98.0 \% \quad R squared (adjusted) =98.0%= 98.0 \% s=0.0451s = 0.0451 with 432=4143 - 2 = 41 degrees of freedom Source Sum of Squares df Mean Square F-ratio Regression 4.11395 1 4.11395 2022 Residual 0.083412 41 0.002034 Variable Coefficient s.e. of Coeff t-ratio prob Constant 1.80184 0.0168 107 \leq0.0001 Time -0.172672 0.0038 -45.0 \leq0.0001  Penicillin Doctors studying how the human body assimilates medication inject some patients with penicillin, and then monitor the concentration of the drug (in units/cc) in the patients' blood for seven hours. The data are shown in the scatterplot. First they tried to fit a linear model. The regression analysis and residuals plot are shown. Dependent variable is: Concentration No Selector R squared  = 90.8 \% \quad  R squared (adjusted)  = 90.6 \%   s = 3.472  with  43 - 2 = 41  degrees of freedom    \begin{array}{llrrr} \text { Source } & \text { Sum of Squares } & \text { df } & \text { Mean Square } & \text { F-ratio } \\ \text { Regression } & 4900.55 & 1 & 4900.55 & 407 \\ \text { Residual } & 494.199 & 41 & 12.0536 & \end{array}      \begin{array}{lllrl} \text { Variable } & \text { Coefficient } & \text { s.e. of Coeff } & \text { t-ratio } & \text { prob } \\ \text { Constant } & 40.3266 & 1.295 & 31.1 & \leq 0.0001 \\ \text { Time } & -5.95956 & 0.2956 & -20.2 & \leq 0.0001 \end{array}        a. Find the correlation between time and concentration. b. Using this model, estimate what the concentration of penicillin will be after 4 hours. c. Is that estimate likely to be accurate, too low, or too high? Explain. Now the researchers try a new model, using the re-expression log(Concentration). Examine the regression analysis and the residuals plot below. Dependent variable is: LogCnn No Selector R squared  = 98.0 \% \quad  R squared (adjusted)  = 98.0 \%   s = 0.0451  with  43 - 2 = 41  degrees of freedom    \begin{array}{llrrr} \text { Source } & \text { Sum of Squares } & \text { df } & \text { Mean Square } & \text { F-ratio } \\ \text { Regression } & 4.11395 & 1 & 4.11395 & 2022 \\ \text { Residual } & 0.083412 & 41 & 0.002034 & \end{array}    \begin{array}{llllc} \text { Variable } & \text { Coefficient } & \text { s.e. of Coeff } & \text { t-ratio } & \text { prob } \\ \text { Constant } & 1.80184 & 0.0168 & 107 & \leq 0.0001 \\ \text { Time } & -0.172672 & 0.0038 & -45.0 & \leq 0.0001 \end{array}       d. Explain why you think this model is better than the original linear model. e. Using this new model, estimate the concentration of penicillin after 4 hours. d. Explain why you think this model is better than the original linear model. e. Using this new model, estimate the concentration of penicillin after 4 hours.

(Essay)
4.8/5
(38)

When using midterm exam scores to predict a student's final grade in a class, the student would Prefer to have a

(Multiple Choice)
4.8/5
(38)

Put to Work Some students have to work part time jobs to pay for college expenses. A researcher examined the academic performance of students with jobs versus those without. He found a positive association between the number of hours worked and GPA. Explain what "positive association" means in this context.

(Essay)
4.8/5
(31)

Education research consistently shows that students from wealthier families tend to have higher SAT scores. The slope of the line that predicts SAT score from family income is 6.25 points per $1000, And the correlation between the variables is 0.48. Then the slope of the line that predicts family Income from SAT score (in $1000 per point) …

(Multiple Choice)
4.9/5
(32)
Showing 21 - 40 of 165
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)