Question 1

The natural logarithm of the odds is called the logit.

Accepted Answer

True
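
As a minimal sketch of the definition in Question 1, the logit maps a probability p to ln(p / (1 - p)). The function name `logit` is an illustrative choice and does not come from the deck.

```python
import math

def logit(p: float) -> float:
    """Return the natural log of the odds p / (1 - p) for 0 < p < 1."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must be strictly between 0 and 1")
    return math.log(p / (1.0 - p))

# A probability of 0.5 means odds of 1, so the logit is 0.
print(logit(0.5))
```

Probabilities above 0.5 give a positive logit and probabilities below 0.5 give a negative one.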

Question 2

Linear discriminate analysis does not assume that for each particular category being considered, the joint probability distribution of the predictor variables is a multivariate normal distribution.

Accepted Answer

False

Question 3

The penalty weight used in the neural network models controls the trade-off between overfitting and underfitting.

Accepted Answer

True

Question 4

It is not possible to apply a logistic regression using a neural network algorithm.

Accepted Answer

The answer of It is not possible to apply a...

Question 5

A regression technique for analyzing large data sets is neural network modeling.

Accepted Answer

The answer of A regression technique for analyzing large data...

Question 6

Forward selection with simultaneous validation is when one predictor variable is added at a time to a model where the variable added to the model has the smallest p-value when added to the model.

Accepted Answer

The answer of Forward selection with simultaneous validation is when...

Question 7

Linear discriminate analysis can classify accurately even when the assumptions are not valid.

Accepted Answer

The answer of Linear discriminate analysis can classify accurately even...

Question 8

A major drawback of neural network modeling is that its parameters are usually uninterpretable.

Accepted Answer

The answer of A major drawback of neural network modeling...

Question 9

In logistic regression the goal is to predict the true value of the independent variable.

Accepted Answer

The answer of In logistic regression the goal is to...

Question 10

The odds of an event occurring is the probability that the event will not occur divided by the probability that the event will occur.

Accepted Answer

The answer of The odds of an event occurring is...

Question 11

The logistic regression equation involves the exponential function and it estimates the probability that an observation described by a specified set of predictor variable values will fall into a particular class.

Accepted Answer

The answer of The logistic regression equation involves the exponential...

Question 12

A neural network is nonlinear.

Accepted Answer

The answer of A neural network is nonlinear....

Question 13

A dummy variable is a continuous variable.

Accepted Answer

The answer of A dummy variable is a continuous variable....

Question 14

A single-layer perceptron neural network model consists of only one layer called the hidden layer.

Accepted Answer

The answer of A single-layer perceptron neural network model consists...

Question 15

The confusion matrix helps you assess your model's accuracy and avoid overfitting.

Accepted Answer

The answer of The confusion matrix helps you assess your...
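
A small sketch of how a confusion matrix summarizes classification accuracy, assuming 0/1 labels. The data and the helper names `confusion_counts` and `accuracy` are hypothetical and only illustrate the idea.

```python
def confusion_counts(actual, predicted):
    """Count true/false positives and negatives for 0/1 labels."""
    pairs = list(zip(actual, predicted))
    return {
        "tp": sum(1 for a, p in pairs if a == 1 and p == 1),
        "tn": sum(1 for a, p in pairs if a == 0 and p == 0),
        "fp": sum(1 for a, p in pairs if a == 0 and p == 1),
        "fn": sum(1 for a, p in pairs if a == 1 and p == 0),
    }

def accuracy(counts):
    """Share of observations that were classified correctly."""
    return (counts["tp"] + counts["tn"]) / sum(counts.values())

# Hypothetical labels: 6 of the 8 observations are classified correctly.
actual = [1, 0, 1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 0, 1, 1, 0]
counts = confusion_counts(actual, predicted)
print(counts, accuracy(counts))
```

Computing the same table on held-out data, rather than on the training data, is what lets it flag overfitting.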

Question 16

Linear regression requires the assumptions of independence, linearity, normality, homogeneity of variance and non-multicollinearity.

Accepted Answer

The answer of Linear regression requires the assumptions of independence,...

Question 17

If the outcome variable is quantitative and all explanatory variables take values 0 or 1, a logistic regression model is appropriate.

Accepted Answer

The answer of If the outcome variable is quantitative and...

Question 18

To avoid overfitting in the neural network model, the parameter estimates that are used minimize the least squares criterion.

Accepted Answer

The answer of To avoid overfitting in the neural network...

Question 19

The AIC never penalizes the model for the number of model coefficients; therefore we do not favor a minimum AIC value.

Accepted Answer

The answer of The AIC never penalizes the model for...

Question 20

Neural network modeling represents the response variable as a linear function of the predictor variables.

Accepted Answer

The answer of Neural network modeling represents the response variable...

Question 21

In logistic regression, ________ is used as a goodness-of-fit test.

A) deviance
B) Pearson
C) Hosmer-Lemeshow
D) All of the choices are correct.

Accepted Answer

The answer of In logistic regression, ________ is used as...

Question 22

Neural networks are common for large data mining projects to analyze a data set involving millions of observations.

Accepted Answer

The answer of Neural networks are common for large data...

Question 23

After calculating the squared differences in linear discriminate analysis, you can estimate the probability of how that specific observation fares with regard to the categorical outcome.

Accepted Answer

The answer of After calculating the squared differences in linear...

Question 24

In linear discriminate analysis we calculate the means of the predictor variable values for the observations belonging to both categories that are in the data set.

Accepted Answer

The answer of In linear discriminate analysis we calculate the...

Question 25

In logistic regression,

A) the dependent variable is continuous.
B) the dependent variable is divided into two equal subcategories.
C) the dependent variable is categorical.
D) there is no dependent variable.

Accepted Answer

The answer of In logistic regression, A) the dependent variable is...

Question 26

In logistic regression, deviance is a measure of difference between a ________ model and the ________ model.

A) saturated, fitted
B) fitted, saturated

Accepted Answer

The answer of In logistic regression, deviance is a measure...
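
As a hedged illustration of Question 26, deviance can be written as -2 times the difference between the log-likelihood of the fitted model and that of the saturated model. The sketch below assumes ungrouped 0/1 outcomes, for which the saturated model reproduces every outcome exactly and has a log-likelihood of 0. The fitted probabilities are hypothetical.

```python
import math

def binary_deviance(y, p_hat):
    """Deviance of a fitted model for ungrouped 0/1 outcomes.

    Assumes the saturated log-likelihood is 0, so the deviance reduces
    to -2 times the fitted log-likelihood.
    """
    log_lik = sum(
        math.log(p) if yi == 1 else math.log(1.0 - p)
        for yi, p in zip(y, p_hat)
    )
    return -2.0 * log_lik

y = [1, 0, 1, 0]
p_hat = [0.9, 0.2, 0.7, 0.4]  # hypothetical fitted probabilities
print(binary_deviance(y, p_hat))
```

A smaller deviance means the fitted model is closer to the saturated model.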

Question 27

The dependent variable in logistic regression can take on values of 0 and 1.

Accepted Answer

The answer of The dependent variable in logistic regression can...

Question 28

The probability of a new employee passing a test is .20. What are the odds of the employee passing the test?

A) .20
B) .25
C) .80
D) .75

Accepted Answer

The answer of The probability of a new employee passing...
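
The arithmetic behind Question 28 can be sketched as follows: the odds are the probability that the event occurs divided by the probability that it does not. The function name is an illustrative choice.

```python
def odds_from_probability(p: float) -> float:
    """Convert a probability p (0 <= p < 1) into odds p / (1 - p)."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must satisfy 0 <= p < 1")
    return p / (1.0 - p)

# A pass probability of .20 corresponds to odds of .20 / .80.
print(odds_from_probability(0.20))
```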

Question 29

When we compare the squared differences in linear discriminate analysis we are looking for the larger value to predict the outcome of that data point.

Accepted Answer

The answer of When we compare the squared differences in...
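
A simplified sketch of the comparison described in Question 29: compute the squared distance from an observation to each category's mean and assign the observation to the category with the smaller value. This is an assumption-laden illustration. It uses plain squared Euclidean distance, whereas discriminant analysis software typically uses a covariance-adjusted (Mahalanobis) distance. The category names and means are hypothetical.

```python
def squared_distance(x, mean):
    """Sum of squared differences between an observation and a mean vector."""
    return sum((xi - mi) ** 2 for xi, mi in zip(x, mean))

def classify(x, category_means):
    """Return the category whose mean has the smallest squared distance to x."""
    return min(category_means, key=lambda c: squared_distance(x, category_means[c]))

# Hypothetical category means for two predictor variables.
means = {"default": [2.0, 8.0], "no_default": [6.0, 3.0]}
observation = [5.0, 4.0]
print(classify(observation, means))
```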

Question 30

The odds of an event occurring is the probability that the event will occur divided by the probability that the event will not occur.

Accepted Answer

The answer of The odds of an event occurring is...

Question 31

Since the neural network model employs many parameters, there is a danger that we will overfit the model.

Accepted Answer

The answer of Since the neural network model employs many...

Question 32

In linear discriminate analysis we compare squared differences to see where a particular data point lies in relation to the categorical outcome.

Accepted Answer

The answer of In linear discriminate analysis we compare squared...

Question 33

Squared distances can be used in two different ways in linear discriminate analysis.

Accepted Answer

The answer of Squared distances can be used in two...

Question 34

Which of the following methods do we use to best fit the data in logistic regression?

A) Least square error
B) Maximum likelihood
C) Exponential decay
D) Both least square error and exponential decay

Accepted Answer

The answer of Which of the following methods do we...
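
To make the idea of likelihood-based fitting concrete, here is a minimal sketch that fits a one-predictor logistic regression by gradient ascent on the log-likelihood. The data, step size, and iteration count are hypothetical, and gradient ascent is used only because it is short to write; statistical software usually relies on Newton-type iterations instead.

```python
import math

def sigmoid(z):
    """Logistic function 1 / (1 + exp(-z))."""
    return 1.0 / (1.0 + math.exp(-z))

def log_likelihood(b0, b1, xs, ys):
    """Bernoulli log-likelihood of the coefficients (b0, b1)."""
    total = 0.0
    for x, y in zip(xs, ys):
        p = sigmoid(b0 + b1 * x)
        total += y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total

def fit_logistic(xs, ys, learning_rate=0.05, steps=5000):
    """Maximize the log-likelihood by plain gradient ascent."""
    b0, b1 = 0.0, 0.0
    for _ in range(steps):
        residuals = [y - sigmoid(b0 + b1 * x) for x, y in zip(xs, ys)]
        b0 += learning_rate * sum(residuals)
        b1 += learning_rate * sum(r * x for r, x in zip(residuals, xs))
    return b0, b1

# Hypothetical data with overlapping classes, so the estimates stay finite.
xs = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0]
ys = [0, 0, 0, 1, 0, 1, 1, 1]
b0, b1 = fit_logistic(xs, ys)
print(b0, b1, log_likelihood(b0, b1, xs, ys))
```

The fitted coefficients are the values that make the observed 0/1 outcomes most probable under the model, which is what distinguishes this approach from minimizing squared error.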

Question 35

The ________ regression method is used when the response variable is a qualitative or a categorical variable.

A) quadratic
B) logistic
C) multiple
D) simple

Accepted Answer

The answer of The ________ regression method is used when...

Question 36

Which of the following tests can be used to assess whether the logistic regression model is a good fit?

A) Hosmer-Lemeshow test
B) Pearson
C) ROC curve
D) All of the choices are correct.

Accepted Answer

The answer of Which of the following tests can be...

Question 37

In logistic regression, the goal is to find

A) the actual value of the dependent variable.
B) the odds of the dependent variable.
C) the log of the dependent variable.
D) None of the choices are correct.

Accepted Answer

The answer of In logistic regression, the goal is to...

Question 38

A soccer player takes a shot on goal 15 times and scores a goal 3 of those attempts. What are the odds of the soccer player scoring a goal?

A) 3/15
B) 5
C) 4
D) 3/12

Accepted Answer

The answer of A soccer player takes a shot on...

Question 39

The hidden node function in neural networks is usually linear.

Accepted Answer

The answer of The hidden node function in neural networks...

Question 40

Logistic regression is used when you want to

A) predict a categorical variable from continuous or categorical variables.
B) predict a continuous variable from categorical variables.
C) predict any categorical variable from other categorical variables.
D) predict a continuous variable from a categorical variable.

Accepted Answer

The answer of Logistic regression is used when you want...

Question 41

Maximum likelihood estimation is an advanced statistical procedure which provides point estimates, and it is typically

A) ahead of the logistic regression.
B) a part of the logistic regression.
C) after the logistic regression to ensure statistical accuracy.
D) a simple process, completed to provide comparison estimates.

Accepted Answer

The answer of Maximum likelihood estimation is an advanced statistical...

Question 42

Logistic regression analysis can answer what three major questions about a dataset?

A) Causal analysis, forecasting of an outcome, and trend forecasting
B) Causal analysis, odds of success, and trend forecasting
C) Odds of success, trend forecasting, and the confusion matrix
D) Maximum likelihood, causal analysis, and odds of success

Accepted Answer

The answer of Logistic regression analysis can answer what three...

Question 43

Which method will we use to best fit the data in logistic regression?

A) Maximum likelihood
B) Least effect likelihood
C) Least square error
D) Jaccard distance

Accepted Answer

The answer of Which method will we use to best...

Question 44

A dummy variable is used in categorical data applications. An example might include

A) assigning the number 1 to Democrats and 2 to Republicans.
B) assigning the number 1 to the first place runner, and 2 to the second place runner.
C) assigning a 0 to students who can run a 10-minute mile, and a number 1 to students who run a 7-minute mile.
D) assigning a 1 to the probability of the customer receiving a $10 gift card and a 0 to the customer receiving a $20 gift card, when 10 of each denomination are being sent to a random sample of customers.

Accepted Answer

The answer of An example of a dummy variable is...

Question 45

One good way to assess the performance of a logistic regression model is AIC, which plays a role similar to that of R-squared in linear regression. Which model does AIC lead us to prefer?

A) The model with the minimum AIC value
B) The model with the maximum AIC value
C) Both, but it depends on the situation
D) Both, regardless of the situation

Accepted Answer

The answer of One of the very good methods to...
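
A short sketch of how AIC trades fit against complexity, using the standard definition AIC = 2k - 2 ln(L), where k is the number of estimated coefficients and ln(L) is the maximized log-likelihood. The two candidate models and their log-likelihood values are hypothetical.

```python
def aic(log_likelihood: float, num_parameters: int) -> float:
    """Akaike information criterion: 2k - 2 * ln(L)."""
    return 2 * num_parameters - 2 * log_likelihood

# Hypothetical candidates given as (maximized log-likelihood, number of coefficients).
candidates = {"small": (-52.1, 3), "large": (-51.8, 6)}
scores = {name: aic(ll, k) for name, (ll, k) in candidates.items()}
best = min(scores, key=scores.get)
print(scores, best)
```

Here the larger model fits slightly better, but the penalty for its extra coefficients outweighs the gain.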

Question 46

In a neural network, the penalty weight controls the trade-off between ________ and ________.

A) overfitting; underfitting
B) overfitting; not fitting
C) underfitting; fitting
D) fitting; not fitting

Accepted Answer

The answer of In a neural network, the penalty weight...
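
One way to picture the role of the penalty weight is a penalized least squares criterion: the sum of squared errors plus the penalty weight times a measure of parameter size. The squared-parameter penalty used below is an assumption made for illustration, since the deck does not state the exact penalty form, and all numbers are hypothetical.

```python
def penalized_sse(y, y_hat, weights, penalty_weight):
    """Sum of squared errors plus penalty_weight times the sum of squared parameters."""
    sse = sum((a - b) ** 2 for a, b in zip(y, y_hat))
    penalty = penalty_weight * sum(w ** 2 for w in weights)
    return sse + penalty

# Hypothetical observed values, predictions, and network parameters.
y = [1.0, 2.0]
y_hat = [1.0, 3.0]
weights = [2.0, -1.0]
print(penalized_sse(y, y_hat, weights, penalty_weight=0.1))
```

A penalty weight of 0 leaves ordinary least squares, which risks overfitting, while a very large penalty weight shrinks the parameters toward zero and risks underfitting.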

Question 47

Deviance, Pearson and Hosmer-Lemeshow are all ________ that can be used in ________.

A) goodness of fit tests; logistic regression
B) predictive analysis tests; logistic regression
C) probability tests; neural network modeling
D) goodness of fit tests; neural network modeling

Accepted Answer

The answer of Deviance, Pearson and Hosmer-Lemeshow are all ________...

Question 48

You want to add new features to data to which you applied a logistic regression model and got a training accuracy of X and a testing accuracy of Y. Which options are correct, if you consider the remaining parameters are the same?

A) Training accuracy increases and testing accuracy increases or remains the same.
B) Training accuracy increases or remains the same.
C) Testing accuracy decreases and training accuracy remains the same.
D) Testing accuracy increases or remains the same and training accuracy decreases.

Accepted Answer

The answer of You want to add new features to...

Question 49

What is the natural logarithm of the odds called?

A) The logit
B) The odds logarithm
C) The odds logit
D) The natural odds

Accepted Answer

The answer of What is the natural logarithm of the...

Question 50

In a neural network model, the ________ is a nonlinear function of the linear predictor values.

A) response variable
B) null variable
C) predictor variable
D) quantitative response variable

Accepted Answer

The answer of In a neural network model, the ________...

Question 51

Which type of predictive analytic would be used to determine whether players on a football team, either historical or future, would perform successfully or unsuccessfully in a certain position on the field?

A) Linear discriminate
B) Logistic regression
C) Neural networks
D) Any of the choices could be used.

Accepted Answer

The answer of Which type of predictive analytic would be...

Question 52

Consider the following model for logistic regression: P(y = 1 | x; h) = g(h₀ + h₁x), where g(z) is the logistic function. View P(y = 1 | x; h) as a function of x whose shape changes as the parameters h change. What is the range of P in that case?

A) (0, 1)
B) (0, infinity)
C) (−infinity, 0)
D) (−infinity, infinity)

Accepted Answer

The answer of Consider the following model for logistic regression:...
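
A quick numerical sketch of the logistic function g(z) from Question 52, evaluated at a few values of z = h₀ + h₁x. The branch on the sign of z is an implementation detail added here to avoid overflow, and the sample z values are arbitrary.

```python
import math

def g(z: float) -> float:
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

# The output approaches 0 and 1 for very negative and very positive z.
for z in (-30.0, -2.0, 0.0, 2.0, 30.0):
    print(z, g(z))
```

Whatever values h₀ and h₁ take, the linear predictor can be any real number, and g maps it to a value strictly between 0 and 1.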

Question 53

In logistic regression, the difference between a fitted model and the saturated model can be measured with what test?

A) Deviance
B) Pearson
C) Hosmer-Lemeshow
D) All of the choices are correct.

Accepted Answer

The answer of In logistic regression, the difference between a...

Question 54

A logistic curve initially grows exponentially and

A) is theoretical.
B) is a summary of the probabilities.
C) is a consistent curve regardless of the data.
D) reflects the confidence intervals.

Accepted Answer

The answer of A logistic curve initially grows exponentially and A)...

Question 55

What are used to avoid overfitting in the neural network model, and are used to minimize a penalized least squares criterion?

A) Parameter estimates
B) Least squares estimates
C) Probability estimates
D) Any of the choices could be used.

Accepted Answer

The answer of What are used to avoid overfitting in...

Question 56

The odds of an event occurring is:

A. The probability that the event will occur divided by the probability that the event will not occur.
B. The probability that the event will not occur divided by the probability that the event will occur.
C. The probability that the event will occur divided by the total of the probability that it will occur, plus the probability of the event not occurring.

A) Option A
B) Option B
C) Option C
D) Either option A or B would create the same outcome.

Accepted Answer

The answer of The odds of an event occurring is:...

Question 57

A ________ consists of three layers: input, hidden, and output.

A) single-layer perceptron neural network model
B) double-layer perceptron neural network model
C) triple-layer perceptron neural network model
D) neural network model

Accepted Answer

The answer of A ________ consists of three layers: input,...

Question 58

If the odds of a student passing a class are .25, what is the probability that the student will pass the class?

A) .20
B) .25
C) .75
D) .80

Accepted Answer

The answer of If the odds of a student passing...
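
Question 58 runs the odds calculation in reverse. Since odds = p / (1 - p), solving for p gives p = odds / (1 + odds). A minimal sketch, with an illustrative function name:

```python
def probability_from_odds(odds: float) -> float:
    """Convert non-negative odds into a probability: odds / (1 + odds)."""
    if odds < 0:
        raise ValueError("odds must be non-negative")
    return odds / (1.0 + odds)

# Odds of .25 mean one pass for every four non-passes, i.e. .25 / 1.25.
print(probability_from_odds(0.25))
```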

Question 59

Parametric predictive analysis can be used to classify qualitative variables into several groups. The processes might include:

A) logistic regression.
B) simple regression.
C) vertical discriminate analysis.
D) stratification and clustering techniques.

Accepted Answer

The answer of Parametric predictive analysis can be used to...

Question 60

What is a major drawback of neural network modeling?

A) The parameters are typically uninterpretable.
B) The model is not easily utilized with existing data sets.
C) The parameters minimize the least squares criterion.
D) This model only uses a single-layer perceptron.

Accepted Answer

The answer of What is a major drawback of the...

Question 61

Even if a residual may be unusually large, the standardized residual rule might not identify the observation as being an outlier. How can this difficulty be circumvented?

A) by using studentized deleted residuals
B) by performing a logistic regression
C) through linear discriminate analysis
D) It cannot be circumvented and should be considered a component of the validation process.

Accepted Answer

The answer of Even if a residual may be unusually...

Question 62

The chart provided shows the probability that a customer will use a coupon offered in a local paper by Lucky Shirts Store. Compute the odds ratio for the BankOne variable x₂ = 1, holding annual spending constant at x₁ = 4000.

[Chart: Annual Spending at Lucky Shirts Store]

A) 1.3787
B) .4594
C) 1.4845
D) .3148

Accepted Answer

The answer of The chart provided shows the probability that...

Question 63

The data below concern (1) the dependent variable Default, which equals 1 if a customer defaults on their loan and 0 if they do not; (2) the independent variable Price of Home, which is the price of the home (in tens); and (3) the independent variable First Purchase, which equals 0 if the customer has owned a home before and 1 if this is their first home. Identify and interpret the odds ratio estimate for First Purchase.

A) Odds ratio: 10.8675; a first-time home buyer is 10 times less likely to default than a buyer who has bought a home before.
B) Odds ratio: 10.8675; a first-time home buyer is 11 times more likely to default than a buyer who has bought a home before.
C) Odds ratio: 10.8675; a first-time home buyer is 10% times more likely to default than a buyer who has bought a home before.
D) Odds ratio: 10.8675; a first-time home buyer is 11% times more likely to default than a buyer who has bought a home before.

Accepted Answer

The answer of Below gives the data concerning (1) the...
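
As background for Question 63, the odds ratio for a 0/1 predictor in logistic regression is the exponential of its estimated coefficient. The data table is not reproduced here, so the coefficient below is not taken from any output; it is simply backed out from the odds ratio quoted in the answer options and should be read as a hypothetical value.

```python
import math

def odds_ratio(coefficient: float) -> float:
    """Odds ratio implied by a logistic regression coefficient: exp(b)."""
    return math.exp(coefficient)

# Hypothetical coefficient implied by the quoted odds ratio of 10.8675.
b_first_purchase = math.log(10.8675)
print(b_first_purchase, odds_ratio(b_first_purchase))
```

An odds ratio above 1 means the odds of the outcome are multiplied by that factor when the predictor changes from 0 to 1, holding the other predictors fixed.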

Question 64

The data below concern (1) the dependent variable Default, which equals 1 if a customer defaults on their loan and 0 if they do not; (2) the independent variable Price of Home, which is the price of the home (in tens); and (3) the independent variable First Purchase, which equals 1 if the customer has owned a home before and 0 if this is their first home. Estimate the probability that a buyer who spent $435,000 on their first home purchase will default.

A) .99636
B) .9899
C) .9768
D) .9695

Accepted Answer

The answer of Below gives the data concerning (1) the...

Deck 16: Predictive Analytics II: Logistic Regression, Discriminate Analysis,