Exam 7: Text and Web Mining

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

Stop words,such as a,am,the,and was,are words that are filtered out prior to or after processing of natural language data.

Free
(True/False)
4.9/5
(26)
Correct Answer:
Verified

True

One of the main approaches to text classification is ________ in which an expert's knowledge is encoded into the system either declaratively or in the form of procedural classification rules.

Free
(Short Answer)
4.9/5
(37)
Correct Answer:
Verified

knowledge engineering

Why will computers probably not be able to understand natural language the same way and with the same accuracy that humans do?

Free
(Multiple Choice)
4.8/5
(37)
Correct Answer:
Verified

A

Describe a marketing application of text mining.

(Essay)
4.8/5
(32)

What are three of the challenges for effective and efficient knowledge discovery posed by the Web?

(Essay)
4.8/5
(39)

In the text mining process,the output of task two is a flat file called a ________ where the cells are populated with the term frequencies.

(Short Answer)
4.8/5
(37)

In linguistics,a(n)________ is a large and structured set of texts prepared for the purpose of conducting knowledge discovery.

(Short Answer)
4.9/5
(36)

All of the following are popular application areas of text mining except:

(Multiple Choice)
4.7/5
(43)

List two options for managing or reducing the dimensionality (size)of the term-document matrix (TDM).

(Essay)
4.8/5
(43)

A vast majority of all business data are captured and stored in structured text documents.

(True/False)
4.9/5
(41)

A simple keyword-based search engine suffers from several deficiencies,which include all of the following except:

(Multiple Choice)
4.9/5
(37)

________ is the process of reducing inflected words to their base or root form.

(Short Answer)
4.8/5
(36)

Commercial software tools include all of the following except:

(Multiple Choice)
4.8/5
(33)

Define the three main areas of Web mining and each area's source of information.

(Essay)
4.9/5
(31)

Using ________ as a rich source of knowledge and a strategic weapon,Kodak not only survives but excels in its market segment defined by innovation and constant change.

(Multiple Choice)
4.9/5
(36)

________ mining is the process of extracting useful information from the links embedded in Web documents.

(Short Answer)
4.8/5
(24)

Unstructured data has a predetermined format.It is usually organized into records as categorical,ordinal,and continuous variables and stored in databases.

(True/False)
4.8/5
(42)

Which of the following correctly defines a text mining term?

(Multiple Choice)
4.9/5
(40)

________ applications focus on "who and how" questions by gathering and reporting direct feedback from site visitors,by benchmarking against other sites and offline channels,and by supporting predictive modeling of future visitor behavior.

(Short Answer)
4.8/5
(34)

Amazon.com leverages Web usage history usage dynamically and recognizes the user by reading a cookie written by a Web site on the visitor's computer.

(True/False)
4.9/5
(27)
Showing 1 - 20 of 69
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)