Exam 7: Text and Web Mining

arrow
  • Select Tags
search iconSearch Question
flashcardsStudy Flashcards
  • Select Tags

NLP has successfully been applied to a variety of tasks via computer programs to automatically process natural human language that previously could only be done by humans.List three of the most popular of these tasks.

(Essay)
4.8/5
(33)

The purpose and processes of text mining are different from those of data mining because with text mining the input to the process are data files such as Word documents,PDF files,text excerpts,and XML files.

(True/False)
5.0/5
(35)

________ words or noise words are words that are filtered out prior to or after processing of natural language data.

(Short Answer)
4.8/5
(42)

________ mining is the extraction of useful information from data generated through Web page visits and transactions.

(Short Answer)
4.8/5
(37)

The two main approaches to text classification are ________ and ________.

(Multiple Choice)
4.8/5
(26)

________ is the discovery and analysis of interesting and useful information from the Web,about the Web,and usually though Web-based tools.

(Short Answer)
4.7/5
(39)

The goal of natural language processing (NLP)is syntax-driven text manipulation.

(True/False)
4.7/5
(32)

Web crawlers are Web content mining tools that are used to read through the content of a Web site automatically.

(True/False)
4.9/5
(31)

In ________,the problem is to group an unlabelled collection of objects,such as documents,customer comments,and Web pages into meaningful groups without any prior knowledge.

(Multiple Choice)
4.9/5
(30)

Web pages consisting of unstructured textual data coded in HTML or XML,hyperlink information,and logs of visitors' interactions provide rich data for effective and efficient knowledge discovery:

(True/False)
4.8/5
(39)

The term "stop-words" are used by text mining to ________ commonly used words.

(Short Answer)
4.8/5
(44)

Why will computers probably not be able to understand natural language the same way and with the same accuracy that humans do?

(Essay)
4.7/5
(40)

The main categories of knowledge extraction methods are recall,search,and signaling.

(True/False)
4.9/5
(27)

When registered users revisit Amazon.com,they are greeted by name.This task involves recognizing the user by ________.

(Multiple Choice)
4.8/5
(34)

________ is a technique used to detect favorable and unfavorable opinions toward specific products and services using textual data sources,such as customer feedback in Web postings and the detection of unfavorable rumors.

(Short Answer)
4.7/5
(28)

A vast majority of business data are stored in text documents that are ________.

(Multiple Choice)
4.8/5
(43)

A(n)________ is one or more Web pages that provide a collection of links to authoritative pages.

(Short Answer)
5.0/5
(26)

List three business applications of Web mining.

(Essay)
5.0/5
(35)

Which of the following refers to developing useful information from the links included in the Web documents?

(Multiple Choice)
4.7/5
(31)

It has been shown that the bag-of-word method may not produce good enough information content for text mining tasks.More advanced techniques such as ________ are needed.

(Multiple Choice)
4.8/5
(29)
Showing 41 - 60 of 69
close modal

Filters

  • Essay(0)
  • Multiple Choice(0)
  • Short Answer(0)
  • True False(0)
  • Matching(0)