Hello,
I'm working with large datasets. With only a part of the dataset I want to do calculations. How can I filter or restrict my data set in such a manner that only the important data is included. I have columns with labels on which I can select.
e.g. a dataset:
Names dataA dataB
NameA 1 2
NameB 1 3
NameC 1 4
NameB 3 6
NameC 5 9
NameB 5 9
NameA 3 0
NameB 7 0
I want only the observations with the label NameB, thus the result must be the dataset:
Names dataA dataB
NameB 1 3
NameB 3 6
NameB 5 9
NameB 7 0
I tried some things with proc sort, but then the dataset is sorted according to the labels, but I want to exclude the data which have not the right label.
Thanks in advance,
Govert