Hi,
Thanks for your reply. Even if we add new data with missing values of Y variable to the training data set but the logistic model needs to rerun to score the new observations, right? In that case also it increase the size of the data, given that in predictive modeling the input data is huge.
In the explanation written in blue states the same if I understand correctly and that is why I thought option A is correct.
However, I am not sure why option C is selected as not an appropriate way as that is a one of the standard procedures to save the estimates from logistic and use that estimate to score new observations.
Kindly let me know what I am missing here.
Thanks,
Siuli Basu