Hello , All Suppose I have a dataset with several variables ( say Var1-Var20);and suppose Var20 is a categorical variable with following values: low, medium, high. I have two choices: Choice 1: build one single model using all variables (Var1-Var20) Choice 2: split the original dataset into three so that dataset1 only contain those observations of which Var20 falls in "low", dataset2 only contain those observations of which Var20 falls in "medium", and dataset3 only contain those observations of which Var20 falls in "high"; and build three models using Var1-Var19 for dataset1, dataset2, and dataset3, respectively. My question: which choice should I go for? If it is choice1, what is the advantage of this method? what dose choice2 miss? Thank you very much for educating me.
... View more