Multiple Choice
A Data Scientist is building a model to predict customer churn using a dataset of 100 continuous numerical features. The Marketing team has not provided any insight about which features are relevant for churn prediction. The Marketing team wants to interpret the model and see the direct impact of relevant features on the model outcome. While training a logistic regression model, the Data Scientist observes that there is a wide gap between the training and validation set accuracy. Which methods can the Data Scientist use to improve the model performance and satisfy the Marketing team's needs? (Choose two.)
A) Add L1 regularization to the classifier
B) Add features to the dataset
C) Perform recursive feature elimination
D) Perform t-distributed stochastic neighbor embedding (t-SNE)
E) Perform linear discriminant analysis
Correct Answer:

Verified
Correct Answer:
Verified
Q137: A manufacturer of car engines collects data
Q138: A telecommunications company is developing a mobile
Q139: A machine learning specialist is developing a
Q140: A data scientist is using an Amazon
Q141: A Data Scientist wants to gain real-time
Q143: A company offers an online shopping service
Q144: A large mobile network operating company is
Q145: A Machine Learning Specialist is building a
Q146: A technology startup is using complex deep
Q147: A financial services company is building a