Exam 2: Data Management

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

In the context of database management systems, primary keys and foreign keys are important in relational databases because they help

Free
(Multiple Choice)
4.8/5
(34)
Correct Answer:
Verified

A

A company combined monthly sales data into a single group to calculate the total sales by a quarter or a year. This is an example of the ________ step of data transformation.

Free
(Multiple Choice)
4.9/5
(28)
Correct Answer:
Verified

B

Describe and explain the four processes of data transformation.

Free
(Essay)
4.9/5
(36)
Correct Answer:
Answered by Examlex AI Copilot

The four processes of data transformation are extraction, transformation, loading, and validation.

1. Extraction: This process involves extracting data from various sources such as databases, files, and applications. The data is collected and brought together in a centralized location for further processing.

2. Transformation: Once the data is extracted, it needs to be transformed into a format that is suitable for analysis and reporting. This process involves cleaning the data, removing duplicates, and standardizing formats. It may also involve applying business rules and calculations to the data.

3. Loading: After the data has been transformed, it needs to be loaded into a data warehouse or other storage system. This process involves moving the data from the extraction location to the destination, and ensuring that it is organized and accessible for analysis.

4. Validation: The final step in the data transformation process is validation. This involves checking the accuracy and completeness of the transformed data. It may involve running quality checks, verifying data integrity, and ensuring that the data meets the requirements of the end users.

Overall, the four processes of data transformation are essential for ensuring that data is accurate, consistent, and ready for analysis and reporting. By following these processes, organizations can make better-informed decisions and gain valuable insights from their data.

Identify the second step the MapReduce platform performs to solve problems of big data computation.

(Multiple Choice)
4.9/5
(44)

Compare and contrast relational and non-relational databases.

(Short Answer)
4.9/5
(35)

Companies that adopt ________ typically achieve up to 6 percent higher productivity and output than their peer companies.

(Multiple Choice)
4.9/5
(30)

In a dataset, values that are at a considerable distance from any of the other data clusters are treated as outliers.

(True/False)
4.8/5
(40)

Most traditional Extract, Transform, and Load (ETL) tools can process only relational datasets for unstructured data, semi-structured, and machinery sensor data, but newer systems are much more flexible.

(True/False)
4.8/5
(34)

In data transformation, ________ involves creating a dichotomous value from a categorical value.

(Multiple Choice)
4.7/5
(33)

The veracity characteristic of big data provides businesses a holistic understanding of their customers and market situations.

(True/False)
4.8/5
(45)

The ________ characteristic of big data enables real-time business responses and strategies.

(Multiple Choice)
4.9/5
(37)

Unlike relational databases, non-relational databases

(Multiple Choice)
4.9/5
(30)

Explain how the Extract, Transform, and Load (ETL) integration process uses Hadoop to capture, store, process, secure, and then analyze complex data.

(Essay)
4.8/5
(34)

A ________ of data can contain as much information as half of the contents of all U.S. academic research libraries.

(Multiple Choice)
4.7/5
(42)

Which of the following characteristics of big data increases complexity and reduces confidence in the data?

(Multiple Choice)
4.8/5
(34)

In the context of data quality, ________ is the extent to which information is adequately presented or delivered for efficient and effective understanding.

(Multiple Choice)
4.8/5
(45)

In data transformation, ________ involves subtracting a variable from the mean and then dividing it by the standard deviation.

(Multiple Choice)
4.8/5
(31)

Imagine a company wants to send a follow-up email to its best customers immediately after a purchase. However, email addresses were not collected from the customers at the time of purchase. This is an example of missing the data quality of ________.

(Multiple Choice)
4.8/5
(35)

Asking the question "Is the data correct, reliable, and precisely measured?" will help in determining the data quality of ________.

(Multiple Choice)
4.8/5
(42)

A major challenge of today's data management is that

(Multiple Choice)
4.7/5
(30)
Showing 1 - 20 of 31
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)