Exam 7: Quantitative Data Preparation and Descriptive Statistics
List and describe the three measures of central tendency. Then, describe a situation where you would choose to use each central tendency.
The three measures of central tendency are mean, median, and mode.
1. Mean: The mean is the average of a set of numbers. It is calculated by adding up all the numbers in the set and then dividing by the total number of values. The mean is useful when you want to find the average value of a data set. For example, if you want to find the average score of a class on a test, you would use the mean.
2. Median: The median is the middle value in a data set when the values are arranged in ascending order. If there is an even number of values, the median is the average of the two middle values. The median is useful when you want to find a value that represents the middle of the data set, especially when there are outliers that could skew the mean. For example, if you want to find the middle income in a group of people, you would use the median.
3. Mode: The mode is the value that appears most frequently in a data set. A data set can have one mode, more than one mode, or no mode at all. The mode is useful when you want to find the most common value in a data set. For example, if you want to find the most popular color in a survey, you would use the mode.
In a situation where the data set is normally distributed and does not have any extreme outliers, the mean would be the best measure of central tendency to use. In a situation where the data set has extreme outliers, the median would be the best measure of central tendency to use. In a situation where you want to find the most common value in a data set, the mode would be the best measure of central tendency to use.
Describe what needs to be done to prepare data for analysis. Why are these steps necessary?
To prepare data for analysis, several steps need to be taken. First, the data needs to be collected from various sources and organized into a structured format. This may involve cleaning the data to remove any errors, duplicates, or inconsistencies. Once the data is cleaned, it needs to be transformed into a format that is suitable for analysis, such as a spreadsheet or database.
Next, the data needs to be analyzed to identify any patterns, trends, or relationships. This may involve using statistical techniques or data visualization tools to gain insights from the data. Additionally, the data may need to be aggregated or summarized to make it more manageable for analysis.
Finally, the data needs to be interpreted and communicated to stakeholders. This may involve creating reports, dashboards, or presentations to convey the findings from the analysis.
These steps are necessary to ensure that the data is accurate, reliable, and relevant for analysis. By cleaning and organizing the data, it becomes easier to identify patterns and trends. Additionally, transforming the data into a suitable format and analyzing it allows for meaningful insights to be gained. Finally, interpreting and communicating the findings ensures that the analysis is actionable and valuable to stakeholders. Overall, these steps are essential for preparing data for analysis and making informed decisions based on the insights gained.
Interval variables are often referred to as grouping variables.
False
What is the median of the following set of numbers: 3, 5, 5, 8,6, 10?
Interval levels of measurement have a true zero while ratio measures do not.
If you have a kurtosis value of zero, the shape of the distribution would be described as _________________.
When measuring central tendency by calculating the mean, you should also consider the impact of _______________ which may skew the result.
The highest value of your data set is 155 and the lowest number is 55. The difference between them is 100, which is also known as the ______________.
A ________________ is a graph which visually represents a frequency distribution.
In Emily's story, she will create a measure of cultural competence by averaging the responses to multiple items in the survey. This new measure is known as a __________________.
_______________ and _______________ are both considered continuous levels of measurement.
Prior to data being analyzed it is referred to as ________________ data.
A frequency distribution in which the mode is greater than the median and mean is skewed in which direction?
The difference between each value and the mean is known as the __________________.
_______________ is a validation process researchers use to check the data for errors and screen for accuracy.
You ask respondents to fill out a survey question that asks, "have you ever been to Washington D.C.?" to which they either respond yes or no. This is an example of a ________________ variable.
_____________________ is the study and set of tools and techniques used to quantitatively describe, organize, analyze, interpret, and present data.
Double entry is a procedure in which duplicate responses are removed from a data set.
Mode is a useful measure of central tendency to use with categorical variables.
Filters
- Essay(0)
- Multiple Choice(0)
- Short Answer(0)
- True False(0)
- Matching(0)