Multiple Choice
A Data Scientist is developing a binary classifier to predict whether a patient has a particular disease on a series of test results. The Data Scientist has data on 400 patients randomly selected from the population. The disease is seen in 3% of the population. Which cross-validation strategy should the Data Scientist adopt?
A) A k-fold cross-validation strategy with k=5
B) A stratified k-fold cross-validation strategy with k=5
C) A k-fold cross-validation strategy with k=5 and 3 repeats
D) An 80/20 stratified split between training and validation
Correct Answer:

Verified
Correct Answer:
Verified
Q32: A data scientist needs to identify fraudulent
Q33: A company is setting up an Amazon
Q34: A company is building a predictive maintenance
Q35: A Data Scientist needs to analyze employment
Q36: Given the following confusion matrix for a
Q38: A retail company is using Amazon Personalize
Q39: An insurance company is developing a new
Q40: A manufacturing company has structured and unstructured
Q41: A library is developing an automatic book-borrowing
Q42: A Machine Learning Specialist receives customer data