Question 1

Repeated use of the same test typically results in different scores. How does classical test theory account for this?
A) poor test validity
B) systematic variability
C) random error
D) inattention

Accepted Answer

C

Question 2

If you have three clocks in your house, and every clock is 10 minutes fast, this is an example of
A) systematic error.
B) random error.
C) measurement error.
D) a rubber yardstick.

Accepted Answer

A

Question 3

When talking about errors in terms of psychological testing, we are referring to the fact that:
A) someone got an answer incorrect.
B) there is always some inaccuracy in the measurement.
C) the test was inappropriate for that particular group.
D) the score is too subjective to be accurate.

Accepted Answer

B

Question 4

What is Cronbach known for?
A) Developing measures to evaluate sources of error
B) Creating the basics of multivariate analysis
C) Developing the basics of contemporary measurement theory
D) Distinguishing between objective and subjective measures

Accepted Answer

The answer of What is Cronbach known for? A) Developing measures...

Question 5

The work of Charles Spearman combined what two measurement concepts?
A) mean and variance
B) sample statistics and population parameters
C) sampling error and correlation
D) reliability and validity

Accepted Answer

The answer of The work of Charles Spearman combined what...

Question 6

We can get an idea of how much measurement error is present in a score through the
A) true score.
B) observed score.
C) standard error of the mean.
D) standard error of measurement.

Accepted Answer

The answer of We can get an idea of how...

Question 7

The basic theory of reliability was first worked out by
A) Karl Pearson.
B) Charles Spearman.
C) Julian Stanley.
D) Lee Cronbach.

Accepted Answer

The answer of The basic theory of reliability was first...

Question 8

When creating a test, one generally uses a subset of items to represent a larger construct. This is known as
A) a population parameter.
B) a domain sampling.
C) a sampling error.
D) descriptive statistics.

Accepted Answer

The answer of When creating a test, one generally uses a...

Question 9

Classical Test Theory assumes that
A) errors are systematic.
B) errors are random.
C) true scores cannot be estimated.
D) the length of a test has no bearing on its reliability.

Accepted Answer

The answer of Classical Test Theory assumes that A) errors are...

Question 10

Because classical test theory assumes a person's true score is the same over time, repeating the same test over and over gives a distribution of scores that reflects what?
A) systematic error
B) random error
C) reliability
D) internal consistency

Accepted Answer

The answer of Because classic test theory assumes a person's...

Question 11

What is Spearman known for?
A) Working out the basics of reliability theory
B) Developing the notion of sampling error
C) Creating methods for measuring error
D) Developing multivariate analysis

Accepted Answer

The answer of What is Spearman known for? A) Working out...

Question 12

Which of the following is an important distinction between systematic errors and random errors?
A) Random errors are more likely than systematic errors to cause errors in conclusions.
B) Systematic errors occur only in objective measures and random errors occur only in subjective measures.
C) Random errors can be eliminated by careful wording of test items.
D) Systematic errors are extremely rare among psychological tests.

Accepted Answer

The answer of Which of the following is an important...

Question 13

Assuming the "rubber yardstick" shrinks and expands at random, what can be said about the distribution of scores from the rubber yardstick?
A) It will have a mean of zero (0).
B) It will be normal.
C) It will have a standard error of zero (0).
D) It will be skewed.

Accepted Answer

The answer of Assuming the "rubber yardstick" shrinks and expands...

Question 14

Classical Test Theory assumes
A) the length of a test has no bearing on its reliability.
B) measurement errors occur systematically.
C) it is not possible to estimate true scores.
D) the distribution of random errors is the same for every respondent.

Accepted Answer

The answer of Classical Test Theory assumes A) the length of...

Question 15

Who developed methods for evaluating sources of error in behavioral research?
A) Edward Thorndike
B) Kuder and Richardson
C) Charles Spearman
D) Cronbach

Accepted Answer

The answer of Who developed methods for evaluating sources of...

Question 16

Theoretically, reliability is
A) the correlation of the observed test score with the true score.
B) the square root of the ratio of true to the observed score.
C) the ratio of true to the observed score squared.
D) not possible to define.

Accepted Answer

The answer of Theoretically, reliability is A) the correlation of the observed...

Question 17

Theoretically, if Susie repeatedly took the 6th-grade achievement test, you would be able to find her true score by finding the ____ of the distribution of her scores.
A) mean
B) standard deviation
C) variance
D) standard error of measurement

Accepted Answer

The answer of Theoretically, if Susie repeatedly took the 6th grade...

Question 18

If we repeatedly administered the same test to the same individual, the standard deviation of the person's score would be the
A) standard error of the mean.
B) variance.
C) reliability of the test.
D) standard error of measurement.

Accepted Answer

The answer of If we repeatedly administered the same test...

Question 19

According to classical test theory, errors of measurement are
A) always overestimates of true score.
B) always underestimates of true score.
C) random.
D) constant.

Accepted Answer

The answer of According to classical test theory, errors of measurement...

Question 20

An observed score is composed of
A) the residual and the true score.
B) the criterion and the predictor.
C) the measurement error and the predictor.
D) the true score and the measurement error.

Accepted Answer

The answer of An observed score is composed of A) the...

Question 21

A split-half correlation, KR 20, and coefficient alpha are all used to evaluate
A) standard errors of measurement.
B) internal consistency.
C) variance.
D) validity.

Accepted Answer

The answer of A split-half correlation, KR 20, and coefficient alpha are...

Question 22

Sources of error associated with time sampling are measured using
A) the test-retest method.
B) the split-half method.
C) KR 20.
D) the alpha method.

Accepted Answer

The answer of Sources of error associated with time sampling...

Question 23

Suppose you were trying to estimate the reliability of a whole test on the basis of the correlation between scores on the two halves of the test. In order to correct for using scores based on the halves, you might use the

A) KR 20.
B) alpha method.
C) Spearman-Brown formula.
D) split half method.

Accepted Answer

The answer of Suppose you were trying to estimate the...

Question 24

Federal government guidelines require that a test be
A) standardized for use among all U.S. subpopulations.
B) factor analyzed before it can be used to make employment decisions.
C) reliable before it can be used to make employment decisions.
D) reliable above the .90 level.

Accepted Answer

The answer of Federal government guidelines require that a test...

Question 25

The Spearman-Brown formula corrects for deflated reliability due to
A) half-length tests.
B) small sample size.
C) systematic error.
D) poor test item construction.

Accepted Answer

The answer of The Spearman Brown formula corrects for deflated...

Question 26

In the domain sampling model, the error that is being considered is the error caused by
A) choosing the wrong domain.
B) systematic error.
C) using a limited sample of items.
D) random error.

Accepted Answer

The answer of In the domain sampling model, the error that...

Question 27

Why might different random samples of domain items yield different estimates of the true score?
A) sampling error
B) poor reliability
C) respondent error
D) item bias

Accepted Answer

The answer of Why might different random samples of domain...

Question 28

Which of the following would tend to provide the most conservative estimate of split-half reliability?
A) the Phillips method
B) the Spearman-Brown formula
C) coefficient alpha
D) the odd-even reliability coefficient

Accepted Answer

The answer of Which of the following would tend to...

Question 29

Dr. Janine developed two equivalent forms of a test and administered them both, in counterbalanced order, to a group of people on the same day in order to assess reliability. What is this called?
A) test-retest
B) parallel forms
C) split-half
D) KR 20

Accepted Answer

The answer of Dr. Janine developed two equivalent forms of a...

Question 30

How does the domain sampling model conceptualize reliability?
A) The absolute value of the difference between the standard error of measurement and the variance
B) The ratio of variance of the observed scores on the short version of a test and the variance of the long-run true scores
C) The sum of squares of the difference between the observed and true scores
D) The ratio of the number of sample items to the number of domain items, multiplied by the mean of the sample distribution

Accepted Answer

The answer of How does the domain sampling model conceptualize...

Question 31

The method for estimating the internal consistency of a test that simultaneously considers all possible ways of splitting the items is the
A) Spearman-Brown formula.
B) Kuder-Richardson formula.
C) Cronbach's alpha.
D) odd-even method.

Accepted Answer

The answer of The method for estimating the internal consistency...

Question 32

As opposed to reliability based on classical test theory, ____ focuses on the range of item difficulty that is useful in assessing an individual's ability.
A) domain sampling
B) internal consistency
C) coefficient alpha
D) item response theory

Accepted Answer

The answer of As opposed to reliability based on the...

Question 33

Dr. Smith is trying to determine the reliability of a new personality test. Two randomly parallel tests, A and B, have a correlation of .81. What is the estimated reliability of the new personality test?
A) .81
B) -.9
C) .9
D) .81/t

Accepted Answer

The answer of Dr. Smith is trying to determine the reliability...

Question 34

The problems created by using a limited number of items to represent a larger and more complicated construct are explicitly considered in the ____ model.
A) multivariate
B) random sampling
C) domain sampling
D) standard error of measurement

Accepted Answer

The answer of The problems created by using a limited...

Question 35

A reliability coefficient of .60 suggests that
A) 64% of the variance on the test is error.
B) 40% of the variance on the test is error.
C) 78% of the variance on the test is error.
D) the test can be used for clinical purposes but not for research.

Accepted Answer

The answer of A reliability coefficient of .60 suggests that A)...

Question 36

Upon repeated applications of the same test, performance on the second application may be affected by previous experience on the test. This is known as
A) attenuation.
B) a carryover effect.
C) shrinkage.
D) selected recall.

Accepted Answer

The answer of Upon repeated applications of the same test, performance...

Question 37

Professor Pine constructed five different short history tests by randomly drawing questions from the huge pool of all possible questions about the current material. He has created
A) randomly parallel tests.
B) a large sample size.
C) systematic errors.
D) attenuation effects.

Accepted Answer

The answer of Professor Pine constructed five different short history...

Question 38

Tests designed according to item response theory
A) are no longer considered useful.
B) can only be used with non-objective material.
C) yield more reliable results with fewer items.
D) provide low-tech methods for field use.

Accepted Answer

The answer of Tests designed according to item response theory A)...

Question 39

The difference between David's two typing tests, one at the beginning of the semester and one at the end, reflects the fact that he typed quite a few term papers during the semester. This reflects
A) attenuation.
B) random error.
C) practice effects.
D) domain sampling.

Accepted Answer

The answer of The difference between David's two typing tests, one...

Question 40

If a researcher is attempting to assess the reliability of a measure of depression, the method of choice would be
A) internal consistency.
B) time sampling.
C) the test-retest method.
D) more than one of these.

Accepted Answer

The answer of If a researcher is attempting to assess...

Question 41

Measures of test-retest reliability are sometimes considered inappropriate for the evaluation of health status because
A) health status tests should not be given at multiple points in time.
B) variations in health status may be related to true changes over time rather than measurement error.
C) there is no domain of health status.
D) health status is too complicated to measure.

Accepted Answer

The answer of Measures of test-retest reliability are sometimes considered...

Question 42

The standard error of measurement allows us to
A) estimate the degree to which a test provides inaccurate readings.
B) have an acceptable margin of error.
C) determine the source of error.
D) avoid any measurement error.

Accepted Answer

The answer of The standard error of measurement allows us...

Question 43

If the same test, given at different points in time to the same test takers, yields different scores, then the method typically used to assess this source of error is
A) test-retest.
B) alternate forms/parallel forms.
C) split-half.
D) KR 20.

Accepted Answer

The answer of If the same test, given at different points...

Question 44

What is the impact of carryover effects on test-retest reliability?
A) Test-retest reliability is not influenced by carryover effects.
B) Carryover effects result in an overestimation of reliability.
C) Carryover effects result in an underestimation of reliability.
D) Test-retest reliability increases carryover effects.

Accepted Answer

The answer of What is the impact of carryover effects...

Question 45

Items are probably measuring the same thing when the correlation between an item and the total score
A) is high.
B) is low.
C) approaches 0.
D) is negative.

Accepted Answer

The answer of Items are probably measuring the same thing...

Question 46

Jennifer read a report in which the agreement between raters of children's aggressive behavior was .50, indicating
A) the raters agreed at chance levels.
B) agreement was poor.
C) agreement was excellent.
D) agreement was moderate.

Accepted Answer

The answer of Jennifer read a report in which the...

Question 47

Difference scores are created by
A) subtracting one test score from another.
B) subtracting the true score from a predicted score.
C) eliminating error from true scores.
D) giving a test to two different individuals.

Accepted Answer

The answer of Difference scores are created by A) subtracting one...

Question 48

Which of the following is used to estimate the number of items that should be added to a test to achieve a specified reliability?
A) KR 20
B) coefficient alpha
C) Spearman-Brown prophecy formula
D) split-half technique

Accepted Answer

The answer of Which of the following is used to...

Question 49

The kappa statistic is used to
A) assess the level of agreement among several observers.
B) estimate the correlation between a continuous variable and an artificially dichotomous variable.
C) estimate the percentage of disagreement between observers.
D) estimate the validity of behavioral observation.

Accepted Answer

The answer of The kappa statistic is used to A) assess...

Question 50

The difference between KR 20 and coefficient alpha is
A) KR 20 can be used to evaluate time sampling problems while alpha cannot.
B) Alpha can be used to evaluate time sampling problems while KR 20 cannot.
C) KR 20 can only be used for items scored right or wrong but alpha can be used for items in any format.
D) Alpha can only be used for items scored right or wrong but KR 20 can be used for items in any format.

Accepted Answer

The answer of The difference between KR 20 and coefficient...

Question 51

Correction for attenuation is used
A) to estimate the validity of a test.
B) to correct for tests that are short.
C) to correct for tests that are long.
D) to estimate the true correlation between variables that have been measured with error.

Accepted Answer

The answer of Correction for attenuation is used A) to estimate...

Question 52

Standard errors of measurement are used to
A) determine whether an observed score is the "true" score.
B) determine the standard deviation of the scores.
C) calculate the exact true score.
D) create confidence intervals around specific observed test scores.

Accepted Answer

The answer of Standard errors of measurement are used to A)...

Question 53

Which of the following is a problem in evaluating the agreement between observers in behavioral studies?
A) The observers are usually not trained.
B) The behaviors being studied are usually not directly observable.
C) There will always be some agreement by chance.
D) There is no method for evaluating the agreement between observers.

Accepted Answer

The answer of Which of the following is a problem...

Question 54

Which of the following is a source of measurement error?
A) respondent sampling
B) scorer sampling
C) internal consistency
D) external consistency

Accepted Answer

The answer of Which of the following is a source...

Question 55

Test constructors can improve test reliability by
A) increasing the number of items.
B) decreasing the number of items.
C) retaining items that have the most face validity.
D) reducing the item-to-total correlation.

Accepted Answer

The answer of Test constructors can improve test reliability by A)...

Question 56

The reliability of a difference score is
A) equal to the reliability of the more reliable of the two measures.
B) equal to the reliability of the less reliable of the two measures.
C) the average reliability of the two measures.
D) expected to be lower than the reliability of either of the two measures.

Accepted Answer

The answer of The reliability of a difference score is A)...

Question 57

Which of the following is true of the parallel forms method?
A) It is the most often used method for estimating reliability.
B) It provides one of the most rigorous methods for estimating reliability.
C) It is largely ineffective with psychological tests.
D) Sophisticated computer programs have made it unnecessary.

Accepted Answer

The answer of Which of the following is true of...

Question 58

In order to determine the unidimensionality of a test, you can use
A) factor analysis.
B) split-half reliability.
C) parallel forms assessment.
D) the Spearman-Brown prophecy formula.

Accepted Answer

The answer of In order to determine the unidimensionality of...

Question 59

The preferred method for assessing the level of agreement between observers is the
A) kappa statistic
B) Spearman coefficient
C) coefficient alpha
D) rank-order statistic

Accepted Answer

The answer of The preferred method for assessing the level...

Question 60

Approximately what value must a reliability coefficient have for most purposes in basic research?
A) .90
B) .50
C) .70
D) .30

Accepted Answer

The answer of Approximately what value must a reliability coefficient...

Question 61

The prophecy formula is used to
A) predict expected values.
B) estimate how long a test must be to achieve a desired level of reliability.
C) estimate how long a test must be to achieve a desired level of validity.
D) calculate test reliability.

Accepted Answer

The answer of The prophecy formula is used to A) predict...

Question 62

Describe some of the advantages and disadvantages associated with behavioral observation techniques. Provide examples.

Accepted Answer

The answer of Describe some of the advantages and disadvantages...

Question 63

Tests will be most reliable if they are
A) multidimensional.
B) unidimensional.
C) brief.
D) criterion-referenced.

Accepted Answer

The answer of Tests will be most reliable if they...

Question 64

What is the most useful indicator of reliability for the interpretation of individual scores?
A) split-half variance
B) test-retest
C) item sampling
D) standard error of measurement

Accepted Answer

The answer of What is the most useful indicator of...

Deck 4: Reliability