Sorry if I am being a bit unclear in my actual needs. I took a screenshot of the actual data. Some of the variables are in Danish, but I will give a translation to each of them Order_ID Item_ID / Order date / shipping date Supplier Price return N Thanks for your help Arthur, but in the actual data set we have 800.000 observations, can I somehow use the code you mentioned above for the entire data set? Im thinking of the part you mention after "Card;", this will be impossible to do for 800.000 observations. (I tried to highlight it with bold) data have; informat item_id $10.; infile cards dlm='09'x; input Order_ID Item_ID; cards; 212913 5577-RE 212888 5877-MA 212888 9780 212790 855-140-CP 212790 5877-MA ;
... View more