About Centaurea

Centaurea · 09-29-2021

Thank you for your answer! All the A variables are character, all the B variables are numeric, and all the C variables have a date format (not in this example, but in my real dataset). The A variables contain first names and last names (of people receiving subsidies). The variables come from different data providers and were merged into one big dataset, so not every A variable contains a name in every row, as I indicated in the example. These cells are completely empty; I designated them as missing values. These missing values are what cause me difficulty. As you pointed out, treating them is the key issue here, because they could 'cheat' or mislead me when comparing the values. My goal is to compare, row by row, every cell that contains a value (i.e. is not empty); in my real dataset the purpose is data correction and imputation. Is there a quicker or simpler solution than comparing the variables one by one using, e.g., an 'ID' variable that indicates which of the A1-A4 variables contain a value (e.g. 0101 for A2 and A4, 1100 for A1 and A2)?
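To make the 'ID' idea concrete, here is a minimal sketch of the logic in Python rather than SAS, so it illustrates the idea and is not a ready-made SAS solution. The function names `presence_pattern` and `non_missing_agree` are hypothetical, and missing cells are assumed to arrive as `None` or an empty string. It suggests that the presence pattern is not needed for the comparison itself: dropping the empty cells first and then checking whether more than one distinct value remains gives the same answer in one step.

```python
from typing import Optional, Sequence


def is_missing(value: Optional[object]) -> bool:
    # Assumption: a missing cell is represented as None or an empty string.
    return value is None or value == ""


def presence_pattern(values: Sequence[Optional[object]]) -> str:
    # Builds the 'ID' string from the post: '1' where a cell holds a value,
    # '0' where it is missing, in column order.
    return "".join("0" if is_missing(v) else "1" for v in values)


def non_missing_agree(values: Sequence[Optional[object]]) -> bool:
    # True when all cells that hold a value hold the same value.
    # Rows with zero or one filled cell count as agreeing.
    present = {v for v in values if not is_missing(v)}
    return len(present) <= 1


# Hypothetical cell layouts; the values are taken from the example in the thread.
pattern_a = presence_pattern([None, "Ha", None, "Ha"])
agree_a = non_missing_agree([None, "Ha", None, "Ha"])
pattern_b = presence_pattern(["Li", "Lo", None, None])
agree_b = non_missing_agree(["Li", "Lo", None, None])
```

The pattern string can still be kept as a diagnostic column, for example to see which data provider supplied a value, but the agreement check does not depend on it.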

Centaurea · 09-29-2021

Dear All, could you please help me find out how to compare 11 x 4 columns in a dataset? For simplicity, let us use just 3 x 4 columns, like this:

A1 A2 A3 A4 B1 B2 B3 B4 C1 C2 C3 C4
Ba Ba 11 11 15 Pap Pap Pat Li Lo 58 58 Ha Ha Nin Nin Nid 87 87 Cet Cot

I have to compare the A1-A4, B1-B4 and C1-C4 variables. These contain missing values as well. I would like to have A_compare, B_compare and C_compare variables which show whether, in that row, the values are all the same (e.g. A_compare = 0) or differ in one or more letters or digits (e.g. A_compare = 1). Or, if I should use the compare function, then I suppose these flag columns can be skipped. Do you have any idea how to solve this? Thank you very much in advance!
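The flag variables described here can be sketched as follows. This is an illustration of the logic in Python, not SAS code, and it rests on several assumptions: the names `compare_flag` and `row_flags` are hypothetical, missing cells are represented as `None` or an empty string, and the sample row is made up from values in the example because the original cell positions cannot be recovered from the post. Missing cells are skipped, so a flag is 1 only when at least two filled cells in a group differ.

```python
from typing import Dict, Optional, Sequence


def compare_flag(values: Sequence[Optional[object]]) -> int:
    # 0 if all non-missing values in the group are identical, 1 otherwise.
    # Assumption: None or "" marks a missing cell and is ignored.
    present = [v for v in values if v is not None and v != ""]
    return 0 if len(set(present)) <= 1 else 1


def row_flags(row: Dict[str, object],
              prefixes: Sequence[str] = ("A", "B", "C"),
              width: int = 4) -> Dict[str, int]:
    # Builds one <prefix>_compare flag per variable group (A1-A4, B1-B4, ...).
    flags = {}
    for prefix in prefixes:
        cells = [row.get(f"{prefix}{i}") for i in range(1, width + 1)]
        flags[f"{prefix}_compare"] = compare_flag(cells)
    return flags


# Hypothetical row: the placement of the values in the columns is assumed.
row = {
    "A1": "Ba", "A2": "Ba", "A3": None, "A4": None,
    "B1": 11, "B2": 11, "B3": 15, "B4": None,
    "C1": "Pap", "C2": "Pap", "C3": "Pat", "C4": None,
}
flags = row_flags(row)
print(flags)
```

For this row the A group agrees while the B and C groups each contain a differing value. Extending the check from 3 to 11 groups only requires passing a longer `prefixes` list, which is the main reason to loop over groups rather than write each comparison by hand.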


Re: Comparing values in many columns of a dataset

Comparing values in many columns of a dataset
