Dear all,
I am running codes and get following results,
141
142 DATA Step1.Publicationsnew1 ;
143 SET Pat_ori.Publicationsnew ;
NOTE: Data file PAT_ORI.PUBLICATIONSNEW.DATA is in a format that is native to another host, or the
file encoding does not match the session encoding. Cross Environment Data Access will be used,
which might require additional CPU resources and might reduce performance.
144 IF YEAR(publn_date) NE 9999 ;
145 RUN;
NOTE: Missing values were generated as a result of performing an operation on missing values.
Each place is given by: (Number of times) at (Line):(Column).
4 at 144:7
NOTE: There were 100491575 observations read from the data set PAT_ORI.PUBLICATIONSNEW.
NOTE: The data set STEP1.PUBLICATIONSNEW1 has 97135175 observations and 10 variables.
NOTE: DATA statement used (Total process time):
real time 14:43.50
cpu time 3:40.86
Should I rebuild the data set or do something else ? Could you please give me some suggestion?
thanks in advance.
Regarding CEDA: as it states, the worst that could happen is some performance penalty in the first step where you read it.
Regarding the missings: since the 4 observations will be included in your new dataset, filter them out and inspect them. 4 missings out of ~100 million might also indicate other bad data in those observations.
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.