12-21-2015 04:58 PM
My data set has a variable user_id and others such as v1, v2, ..., v5.
There are some duplicate cases which have the same user_id.
Is there a way to label the duplicate cases by adding a new variable, with value 1 for the primary case and value 0 for the second case with the same user_id? That way I do not need to delete the duplicate cases, but when analyzing I can select the distinct cases using this new variable.
Thanks in advance.
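To make the request concrete, here is a minimal sketch of the flagging logic in plain Python. The post does not name the software in use, so this is only an illustration, and it assumes "primary" means the first case encountered for each user_id in the current row order. The names flag_primary and primary are hypothetical and do not come from any package.

```python
def flag_primary(rows, key="user_id"):
    """Return copies of rows with a new 'primary' field.

    Assumption: the first row seen for a given key value is the
    primary case (1); every later row with that value gets 0.
    """
    seen = set()
    flagged = []
    for row in rows:
        is_first = row[key] not in seen
        seen.add(row[key])
        # Copy the row so the input data is left unchanged.
        flagged.append({**row, "primary": 1 if is_first else 0})
    return flagged


# Made-up example data: user_id 101 appears twice.
rows = [
    {"user_id": 101, "v1": 3},
    {"user_id": 102, "v1": 5},
    {"user_id": 101, "v1": 4},
]

flagged = flag_primary(rows)

# No rows are deleted; distinct cases are selected by the flag.
distinct = [r for r in flagged if r["primary"] == 1]
```

If a different case should count as primary (for example the most recent one), sorting the rows before flagging would be one way to control that.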