Hi, There are few things that might be helpful in your design. 1. If your variables have lot of missing values let us 50% or more then it's better to drop those variable for further analysis. We can't generalize all the time that missing will always be 'No'. 2. Not sure what do you mean by drop missing values using impute node.? 3. After oversampling you will have a clustered data based on your target variable. You can perform cluster analysis for two clusters solution based on independent variables and correlate target and non-target customers within each cluster. This will give you an idea how significant are the independent variables in clustering target and non- target customers.
... View more