Exam 2: Data Management and Wrangling

arrow
  • Select Tags
search iconSearch Question
  • Select Tags

In R, to sort data in descending order, we use a negative parameter in the order function.

(True/False)
4.8/5
(37)

Four observations were binned into one group. In this group, the values are: 40, 45, 66, & 33. What is the average of the group?

(Multiple Choice)
4.8/5
(27)

A foreign key (FK) is the only unique identifier in a table structure.

(True/False)
4.8/5
(35)

Megan took a phone survey where each question posed had an answer range of unsatisfied to completely satisfied describing her purchase experience. Because the categories are in equal increments, the category can be recoded into a number transforming the category into what is called a category score.

(True/False)
4.7/5
(45)

Subsetting is a technique used to convert numerical values into categorical variables.

(True/False)
4.7/5
(35)

When too many variables are categorized in an analysis, several potential issues may occur. Which of the following is not one of the issues that may occur?

(Multiple Choice)
4.8/5
(30)

Using the simple mean imputation strategy, what value would be placed in the missing observation in x1? Using the simple mean imputation strategy, what value would be placed in the missing observation in x<sub>1</sub>?

(Multiple Choice)
4.8/5
(36)

In a data set with 20 variables, if 8% of the values, randomly spread across observations, are missing (blank), what is the probable percent of complete and usable observations?

(Multiple Choice)
4.8/5
(37)

Using R, what function is used to evaluate the categories in the variable to identify the dummy variables?

(Multiple Choice)
4.7/5
(38)

Ann is analyzing a data set that contains two variables, Job Title and 401K. 401K contains the name of the three companies that carry the retirement accounts. It is mandatory to have an account, thus no observation is blank. If 401K was transformed to dummy variables, how many should be created?

(Multiple Choice)
4.9/5
(40)

Kara is reviewing categories where a series of numbers represent the type of loan. She would prefer the actual name of the loan be retained when running her analysis. Using Analytic Solver, what function will allow Kara to retain the category name instead of recording them in numbers?

(Multiple Choice)
4.9/5
(32)

A dummy variable takes on a value of 1 or 0 to describe two categories of a variable.

(True/False)
4.9/5
(35)

In the following table, there are four observations with three variables. Which category is the best fit to be transferred into dummy variables? In the following table, there are four observations with three variables. Which category is the best fit to be transferred into dummy variables?

(Multiple Choice)
4.9/5
(38)

Marcus wants to include the month of the year in the analysis as categories. How many dummy variables will be needed?

(Multiple Choice)
4.9/5
(39)

Which term represents data items, events, or things stored in a database file?

(Multiple Choice)
4.8/5
(36)

A non-relational database structure that can support the storage of a wide ranges of data, including structured, semi-structured, and unstructured is called ___________.

(Multiple Choice)
4.8/5
(39)

The primary purpose of a(n) _____________ is to support decision-making and provide a composite view of the organization.

(Multiple Choice)
4.9/5
(35)

Four observations were binned into one group. In this group, the values are: 40, 45, 38, & 33. What is the average of the group?

(Multiple Choice)
5.0/5
(42)

Select, From, and Where keywords are statements used in __________.

(Multiple Choice)
4.8/5
(47)

Simple mean imputation is the best route for replacing large quantities of missing variables within a data set without distorting the relationship among variables.

(True/False)
4.9/5
(39)
Showing 21 - 40 of 46
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)