Deck 15: Data Preparation and Description

Full screen (f)
exit full mode
Question
In the example,which of the following characteristics of coding rules did the researcher fail to address adequately?

A) Appropriate categories
B) Exhaustive options
C) Mutual exclusivity
D) Single dimension
E) Closed-ended
Use Space or
up arrow
down arrow
to flip the card.
Question
A codebook generally contains all of the following information except _____.

A) variable number
B) record number
C) code description
D) instructions for coding
E) variable name
Question
Which of the following code descriptions is most appropriate for the employment status question?

A) 1 = not employed, 2 = employed part-time, 3 = employed full-time, 4 = full-time student, 5 = other, 9 = missing
B) 1 = checked, 0 = not checked
C) 1 = not employed, full-time student, 2 = employed full-time, 3 = employed part-time, full-time student, 4 = not employed, other, 5 = other
D) All of the above are correct
E) None of the above are correct
Question
All of the following activities except _____ are included in the data preparation process.

A) editing
B) coding
C) data entry
D) analyzing frequency distributions
E) evaluating reliability estimates
Question
The process of ensuring the accuracy of data and their conversion from raw form into classified forms appropriate for analysis is called _____.

A) coding
B) data entry
C) data preparation
D) data measurement
E) quality control
Question
For the vacation survey question,how many variables would be represented in the codebook with this question?

A) 1
B) 2
C) 3
D) 4
E) 6
Question
Which term below refers to an analytical process for measuring the semantic content of a communication?

A) ANOVA
B) Regression
C) Content analysis
D) Factor analysis
E) Reliability analysis
Question
In the example,the use of the response option "other" is used to meet the requirement that categories be _____.

A) appropriate
B) exhaustive
C) mutually exclusive
D) of a single dimension
E) numeric
Question
What percentage of respondents is contacted typically during data validation?

A) 10%
B) 15%
C) 25%
D) 33%
E) 40%
Question
Which type of variable is coded using a 0 for one response and a 1 for the other?

A) Nonmetric
B) Ordinal
C) Interval
D) Dummy
E) Ratio
Question
Field editing is necessary to _____.

A) send data back to the analysis team each day
B) elaborate on abbreviations commonly used during field work
C) validate that interviews took place
D) screen for qualified respondents
E) all of the above
Question
What type of data results from the employment status question?

A) Dichotomous
B) Nominal
C) Ordinal
D) Interval
E) Ratio
Question
Closed questions are favored over open-ended questions by researchers for all of the following reasons except _____.

A) ease of coding
B) ease of recording
C) ease of analysis
D) insight provided
E) efficiency
Question
Allison is editing a data set and finds that a response of 7 has been entered for a question that requests a response between 1 and 5.What should Allison do?

A) Enter a response between 1 and 5
B) Enter a response of unknown
C) Enter a mode for the question
D) Enter a 3 as a neutral response
E) None of the above
Question
Which of the following code descriptions is most appropriate for coding responses to the vacation survey question?

A) 1 = beach, 2 = spa, 3 = adventure, 4 = big city, 5 = foreign travel, 6 = cruise
B) 1 = beach only, 2 = beach and spa, 3 = beach, spa, and adventure, 4 = beach, spa, adventure, and big city, 5 = beach, spa, adventure, big city, and foreign travel, 6 = all of the above
C) 1 = one item checked, 2 = two items checked, 3 = three items checked, 4 = four items checked, 5 = five items checked, 6 = six items checked
D) 1 = yes, 0 = no
E) None of the above
Question
Which of the following guidelines must be followed in the development and use of coding rules?

A) Categories within a single variable must be appropriate
B) Categories within a single variable must be exhaustive
C) Categories within a single variable must be mutually exclusive
D) Categories must be derived from one classification principles
E) All of the above
Question
What type of data results from the vacation question?

A) Dichotomous
B) Nominal
C) Ordinal
D) Ratio
E) Both a and b
Question
_____ involves assigning numbers or other symbols to answers so that the responses can be grouped into a limited number of categories.

A) Editing
B) Data entry
C) Coding
D) Measurement
E) Operationalization
Question
In the example,employment levels and student status are both included as response options.This indicates that the requirement for categories to be _____ may be in violation.

A) appropriate
B) exhaustive
C) mutually exclusive
D) of a single dimension
E) numeric
Question
Which number below is frequently used to code missing responses?

A) 0
B) 9
C) 11
D) 100
E) Missing responses are not coded
Question
All of the following techniques are used in content analysis except _____.

A) counting words
B) categorizing phrases
C) assessing the structure of expressions
D) triangulation
E) all of the above are used with content analysis
Question
Which term below refers to an ordered array of all values for a variable?

A) Correlation table
B) Frequency distribution
C) Frequency table
D) Scatter diagram
E) Histogram
Question
If we were using content analysis software to analyze the PLAVIX advertisement,which of the following words would require the use of stemming?

A) Protect
B) Help
C) Aspirin
D) Form
E) Heart attack
Question
Which of the following statements is not true of the mode?

A) The mode is the most frequently occurring value
B) There may be more than one mode in a distribution
C) Distributions may not be multimodal
D) Some distributions have no mode
E) The mode is most appropriate for nominal data
Question
In a content analysis of the PLAVIX advertisement,the words "adding" or "added" are used four times and the phrase "heart attack" is used five times.These are _____ units in our analysis.

A) syntactical
B) referential
C) propositional
D) thematic
E) semiotic
Question
When viewing data entered into a spreadsheet,the columns identify _____.

A) records
B) variables
C) categories
D) coding instructions
E) variable labels
Question
The broad topic of "protection" featured in the PLAVIX advertisement is a _____ unit.

A) syntactical
B) referential
C) propositional
D) thematic
E) semiotic
Question
If we were to conduct a content analysis of the PLAVIX advertisement,platelets would be considered _____ units.

A) syntactical
B) referential
C) propositional
D) thematic
E) semiotic
Question
A collection of data organized for computerized retrieval is called a _____.

A) data mine
B) data field
C) database
D) data code
E) computer recognition retrieval system
Question
Which of the following groups of words indicates a series of words for which aliasing would be used in the analysis?

A) be, is, of, the
B) pretty, beautiful, attractive
C) search, searching, searches, searched
D) going, walking, running, flying
E) all of the above
Question
Which missing data technique is most appropriate when missing data are considered MCAR?

A) Pair-wise deletion
B) Replacement
C) Listwise deletion
D) Minimized deletion
E) Weighting
Question
The value obtained by summing all elements in a set and dividing by the number of elements is the _____.

A) mean
B) median
C) mode
D) range
E) standard score
Question
Which of the following groups of words indicates a series of words for which stemming would be used in the analysis?

A) be, is, of, the
B) pretty, beautiful, attractive
C) search, searching, searches, searched
D) going, walking, running, flying
E) all of the above
Question
In a standard normal distribution,the mean is _____ and the standard deviation is _____.

A) 0, 1
B) 5, 1
C) 3, 1
D) 1, 5
E) it varies based on the sample
Question
Which data entry technique uses a software program to transfer printed text into a computer file for editing?

A) Data mining
B) OCR
C) Keyboarding
D) Transcripting
E) Voice recognition
Question
With the _____ method,missing values are estimated using all cases that had data for each variable in the analysis.

A) pair-wise deletion
B) replacement
C) listwise deletion
D) minimized deletion
E) case-specific deletion
Question
Which of the following can be determined using a frequency distribution?

A) How close a sample comes to the null hypothesis
B) How does one variable, X, relate another variable, Y
C) Whether a systematic association exist between two variables
D) The shape of the variable's distribution
E) The systematic variance associated with the sample
Question
In the PLAVIX advertisement,the claim that adding PLAVIX to aspirin helps raise a person's protection against future heart attack is a _____ unit,because it makes an assertion about the benefits of aspirin alone.

A) syntactical
B) referential
C) propositional
D) thematic
E) semiotic
Question
Which of the following shapes best represents a normal distribution as it is depicted graphically?

A) Square
B) Bell
C) Triangle
D) Star
E) Hat
Question
Which of the following is an accurate interpretation for a "don't know" response from the respondent?

A) It isn't important enough to answer
B) I don't know
C) I don't want to answer that question
D) I'm too ambivalent to figure out the best answer
E) All of the above
Question
In the array provided,what is the median?

A) 7
B) 9
C) 10
D) 15
E) Both A and B
Question
A _____ is equal to the observation minus the mean.

A) standard deviation
B) deviation score
C) range
D) mode
E) quartile
Question
What are the four rules that guide the coding and categorization of a data set? Explain why each one is important for researchers.
Question
Why might a data set suffer from missing data? Explain the techniques researchers may use to handle missing data during data analysis.
Question
The difference between the smallest and the largest values in a distribution is the _____.

A) mean
B) median
C) mode
D) range
E) deviation
Question
The measure of deviation from the mean such that cases stretch toward one tail or the other is called _____.

A) kurtosis
B) platykurtic
C) skewness
D) ku
E) mesokurtic
Question
In the array provided,what is the mean?

A) 7
B) 9
C) 10
D) 15
E) Both A and B
Question
What is the range for the array provided?

A) 5
B) 7
C) 9
D) 10
E) 16
Question
Common measures of _____ include the range,interquartile range,variance or standard deviation,and variation.

A) central tendency
B) variability
C) shape
D) location
E) significance
Question
_____ is a measure of the relative peakedness or flatness of the curve defined by the frequency distribution.

A) Kurtosis
B) Deviation
C) Point estimate
D) Standard error
E) Z score
Question
In the array provided,what is the mode?

A) 7
B) 9
C) 10
D) 15
E) Both A and B
Question
The median is an appropriate measure of central tendency for _____ data.

A) interval
B) ratio
C) ordinal
D) nominal
E) all of the above
Question
What is meant by stemming,aliasing,and the use of exclusion filters in content analysis software applications?
Unlock Deck
Sign up to unlock the cards in this deck!
Unlock Deck
Unlock Deck
1/53
auto play flashcards
Play
simple tutorial
Full screen (f)
exit full mode
Deck 15: Data Preparation and Description
1
In the example,which of the following characteristics of coding rules did the researcher fail to address adequately?

A) Appropriate categories
B) Exhaustive options
C) Mutual exclusivity
D) Single dimension
E) Closed-ended
C
2
A codebook generally contains all of the following information except _____.

A) variable number
B) record number
C) code description
D) instructions for coding
E) variable name
B
3
Which of the following code descriptions is most appropriate for the employment status question?

A) 1 = not employed, 2 = employed part-time, 3 = employed full-time, 4 = full-time student, 5 = other, 9 = missing
B) 1 = checked, 0 = not checked
C) 1 = not employed, full-time student, 2 = employed full-time, 3 = employed part-time, full-time student, 4 = not employed, other, 5 = other
D) All of the above are correct
E) None of the above are correct
A
4
All of the following activities except _____ are included in the data preparation process.

A) editing
B) coding
C) data entry
D) analyzing frequency distributions
E) evaluating reliability estimates
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
5
The process of ensuring the accuracy of data and their conversion from raw form into classified forms appropriate for analysis is called _____.

A) coding
B) data entry
C) data preparation
D) data measurement
E) quality control
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
6
For the vacation survey question,how many variables would be represented in the codebook with this question?

A) 1
B) 2
C) 3
D) 4
E) 6
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
7
Which term below refers to an analytical process for measuring the semantic content of a communication?

A) ANOVA
B) Regression
C) Content analysis
D) Factor analysis
E) Reliability analysis
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
8
In the example,the use of the response option "other" is used to meet the requirement that categories be _____.

A) appropriate
B) exhaustive
C) mutually exclusive
D) of a single dimension
E) numeric
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
9
What percentage of respondents is contacted typically during data validation?

A) 10%
B) 15%
C) 25%
D) 33%
E) 40%
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
10
Which type of variable is coded using a 0 for one response and a 1 for the other?

A) Nonmetric
B) Ordinal
C) Interval
D) Dummy
E) Ratio
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
11
Field editing is necessary to _____.

A) send data back to the analysis team each day
B) elaborate on abbreviations commonly used during field work
C) validate that interviews took place
D) screen for qualified respondents
E) all of the above
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
12
What type of data results from the employment status question?

A) Dichotomous
B) Nominal
C) Ordinal
D) Interval
E) Ratio
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
13
Closed questions are favored over open-ended questions by researchers for all of the following reasons except _____.

A) ease of coding
B) ease of recording
C) ease of analysis
D) insight provided
E) efficiency
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
14
Allison is editing a data set and finds that a response of 7 has been entered for a question that requests a response between 1 and 5.What should Allison do?

A) Enter a response between 1 and 5
B) Enter a response of unknown
C) Enter a mode for the question
D) Enter a 3 as a neutral response
E) None of the above
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
15
Which of the following code descriptions is most appropriate for coding responses to the vacation survey question?

A) 1 = beach, 2 = spa, 3 = adventure, 4 = big city, 5 = foreign travel, 6 = cruise
B) 1 = beach only, 2 = beach and spa, 3 = beach, spa, and adventure, 4 = beach, spa, adventure, and big city, 5 = beach, spa, adventure, big city, and foreign travel, 6 = all of the above
C) 1 = one item checked, 2 = two items checked, 3 = three items checked, 4 = four items checked, 5 = five items checked, 6 = six items checked
D) 1 = yes, 0 = no
E) None of the above
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
16
Which of the following guidelines must be followed in the development and use of coding rules?

A) Categories within a single variable must be appropriate
B) Categories within a single variable must be exhaustive
C) Categories within a single variable must be mutually exclusive
D) Categories must be derived from one classification principles
E) All of the above
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
17
What type of data results from the vacation question?

A) Dichotomous
B) Nominal
C) Ordinal
D) Ratio
E) Both a and b
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
18
_____ involves assigning numbers or other symbols to answers so that the responses can be grouped into a limited number of categories.

A) Editing
B) Data entry
C) Coding
D) Measurement
E) Operationalization
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
19
In the example,employment levels and student status are both included as response options.This indicates that the requirement for categories to be _____ may be in violation.

A) appropriate
B) exhaustive
C) mutually exclusive
D) of a single dimension
E) numeric
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
20
Which number below is frequently used to code missing responses?

A) 0
B) 9
C) 11
D) 100
E) Missing responses are not coded
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
21
All of the following techniques are used in content analysis except _____.

A) counting words
B) categorizing phrases
C) assessing the structure of expressions
D) triangulation
E) all of the above are used with content analysis
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
22
Which term below refers to an ordered array of all values for a variable?

A) Correlation table
B) Frequency distribution
C) Frequency table
D) Scatter diagram
E) Histogram
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
23
If we were using content analysis software to analyze the PLAVIX advertisement,which of the following words would require the use of stemming?

A) Protect
B) Help
C) Aspirin
D) Form
E) Heart attack
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
24
Which of the following statements is not true of the mode?

A) The mode is the most frequently occurring value
B) There may be more than one mode in a distribution
C) Distributions may not be multimodal
D) Some distributions have no mode
E) The mode is most appropriate for nominal data
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
25
In a content analysis of the PLAVIX advertisement,the words "adding" or "added" are used four times and the phrase "heart attack" is used five times.These are _____ units in our analysis.

A) syntactical
B) referential
C) propositional
D) thematic
E) semiotic
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
26
When viewing data entered into a spreadsheet,the columns identify _____.

A) records
B) variables
C) categories
D) coding instructions
E) variable labels
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
27
The broad topic of "protection" featured in the PLAVIX advertisement is a _____ unit.

A) syntactical
B) referential
C) propositional
D) thematic
E) semiotic
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
28
If we were to conduct a content analysis of the PLAVIX advertisement,platelets would be considered _____ units.

A) syntactical
B) referential
C) propositional
D) thematic
E) semiotic
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
29
A collection of data organized for computerized retrieval is called a _____.

A) data mine
B) data field
C) database
D) data code
E) computer recognition retrieval system
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
30
Which of the following groups of words indicates a series of words for which aliasing would be used in the analysis?

A) be, is, of, the
B) pretty, beautiful, attractive
C) search, searching, searches, searched
D) going, walking, running, flying
E) all of the above
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
31
Which missing data technique is most appropriate when missing data are considered MCAR?

A) Pair-wise deletion
B) Replacement
C) Listwise deletion
D) Minimized deletion
E) Weighting
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
32
The value obtained by summing all elements in a set and dividing by the number of elements is the _____.

A) mean
B) median
C) mode
D) range
E) standard score
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
33
Which of the following groups of words indicates a series of words for which stemming would be used in the analysis?

A) be, is, of, the
B) pretty, beautiful, attractive
C) search, searching, searches, searched
D) going, walking, running, flying
E) all of the above
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
34
In a standard normal distribution,the mean is _____ and the standard deviation is _____.

A) 0, 1
B) 5, 1
C) 3, 1
D) 1, 5
E) it varies based on the sample
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
35
Which data entry technique uses a software program to transfer printed text into a computer file for editing?

A) Data mining
B) OCR
C) Keyboarding
D) Transcripting
E) Voice recognition
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
36
With the _____ method,missing values are estimated using all cases that had data for each variable in the analysis.

A) pair-wise deletion
B) replacement
C) listwise deletion
D) minimized deletion
E) case-specific deletion
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
37
Which of the following can be determined using a frequency distribution?

A) How close a sample comes to the null hypothesis
B) How does one variable, X, relate another variable, Y
C) Whether a systematic association exist between two variables
D) The shape of the variable's distribution
E) The systematic variance associated with the sample
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
38
In the PLAVIX advertisement,the claim that adding PLAVIX to aspirin helps raise a person's protection against future heart attack is a _____ unit,because it makes an assertion about the benefits of aspirin alone.

A) syntactical
B) referential
C) propositional
D) thematic
E) semiotic
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
39
Which of the following shapes best represents a normal distribution as it is depicted graphically?

A) Square
B) Bell
C) Triangle
D) Star
E) Hat
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
40
Which of the following is an accurate interpretation for a "don't know" response from the respondent?

A) It isn't important enough to answer
B) I don't know
C) I don't want to answer that question
D) I'm too ambivalent to figure out the best answer
E) All of the above
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
41
In the array provided,what is the median?

A) 7
B) 9
C) 10
D) 15
E) Both A and B
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
42
A _____ is equal to the observation minus the mean.

A) standard deviation
B) deviation score
C) range
D) mode
E) quartile
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
43
What are the four rules that guide the coding and categorization of a data set? Explain why each one is important for researchers.
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
44
Why might a data set suffer from missing data? Explain the techniques researchers may use to handle missing data during data analysis.
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
45
The difference between the smallest and the largest values in a distribution is the _____.

A) mean
B) median
C) mode
D) range
E) deviation
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
46
The measure of deviation from the mean such that cases stretch toward one tail or the other is called _____.

A) kurtosis
B) platykurtic
C) skewness
D) ku
E) mesokurtic
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
47
In the array provided,what is the mean?

A) 7
B) 9
C) 10
D) 15
E) Both A and B
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
48
What is the range for the array provided?

A) 5
B) 7
C) 9
D) 10
E) 16
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
49
Common measures of _____ include the range,interquartile range,variance or standard deviation,and variation.

A) central tendency
B) variability
C) shape
D) location
E) significance
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
50
_____ is a measure of the relative peakedness or flatness of the curve defined by the frequency distribution.

A) Kurtosis
B) Deviation
C) Point estimate
D) Standard error
E) Z score
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
51
In the array provided,what is the mode?

A) 7
B) 9
C) 10
D) 15
E) Both A and B
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
52
The median is an appropriate measure of central tendency for _____ data.

A) interval
B) ratio
C) ordinal
D) nominal
E) all of the above
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
53
What is meant by stemming,aliasing,and the use of exclusion filters in content analysis software applications?
Unlock Deck
Unlock for access to all 53 flashcards in this deck.
Unlock Deck
k this deck
locked card icon
Unlock Deck
Unlock for access to all 53 flashcards in this deck.