About s_lassen

s_lassen · ‎02-19-2018

The means you could get using PROC MEANS or SQL. But when you also want last minus first for another variable, you may as well do the whole thing in a data step, e.g.:

    data want;
      Sum_var1=0;
      Count_var1=0;
      do until(last.customer);
        set have;
        by customer;
        if not missing(Variable1) then do;
          Sum_var1=Sum_var1+Variable1;
          Count_var1=Count_var1+1;
        end;
        if first.customer then First_var2=Variable2;
      end;
      Diff_var2=Variable2-First_var2; /* last value minus first value */
      if Count_var1 then Mean_var1=Sum_var1/Count_var1;
      keep customer Mean_var1 Diff_var2;
    run;

The data should be sorted by customer and month, of course. It can also be done in SQL, though:

    proc sql;
      create table want as
      select a.*,
             mean(a.Variable1) as mean_var1,
             Last.Variable2-First.variable2 as Diff_var2
      from have a, have first, have last
      where first.customer=a.customer and last.customer=a.customer
      group by a.customer
      having first.month=min(a.month) and last.month=max(a.month)
      order by a.customer, a.month
      ;
    quit;

Here, you can easily get all the original rows without remerging with the original data afterwards. If you only want the two summary variables (one row per customer), you should add a DISTINCT after SELECT, replace a.* with a.customer, and drop a.month from the ORDER BY clause.

s_lassen · ‎02-19-2018

I would make a view containing a subset of the data, and then use standard security to give users access to the view instead of the original table. The only problem is that SAS does not do that (yet). In SAS you have to have read access to the table in order to use the view.

So my suggestion is that you move your data to a database system that has that kind of built-in row-level security, e.g. SQL Server. Of course, there may be an additional cost here if you do not already have a SAS/Access license for the database you decide to use (on the other hand, you can often get the databases themselves for free, e.g. the Express Edition of SQL Server).

The alternative could be to encrypt and password-protect the table, and hide the password in a view declaration (data step views compiled with the SOURCE=NOSAVE option, for instance, will not let the user see the underlying code), e.g.:

    data user1/view=user1(source=nosave);
      set mydata(where=(country='SWEDEN') read=mysecretpassword);
    run;

This will allow users with access to the USER1 view to see the Swedish records, but not the others, unless they know the secret password.

s_lassen · ‎02-06-2018

Here is a possible solution. I am not sure how fast it runs with real-life data, but it seems to work:

    data want;
      set name_value;
      if 0 then set original; /* "declare" variables, for use with CALL SET later */
      array names(*) var_name1-var_name2;
      array values(*) var_value1-var_value2;
      length cond $100;
      do _N_=1 to dim(names);
        call catx(' and ',cond,cats(names(_N_),'=',values(_N_)));
      end;
      dsid=open(cats('original(where=(',cond,'))'));
      call set(dsid);
      do while(fetch(dsid)=0);
        output;
      end;
      dsid=close(dsid);
      drop dsid;
    run;

I did not drop the condition (COND variable), as it is nice to have for checking, at least initially.

s_lassen · ‎02-05-2018

The problem is probably that you use the "o" option on your expression, meaning that it will be compiled only once in the data step. Try changing "/io" at the end of the expression to just "/i".
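A minimal sketch of the effect, assuming the pattern is built from a variable that changes from row to row (the data set CHECK and the variables PATTERN, TEXT, HIT_I and HIT_IO are made up for the illustration, they are not from the original question):

```sas
/* Hypothetical test data: a different pattern on each row */
data check;
  input pattern $ text $;
  /* without "o": the expression is compiled again when it changes */
  hit_i  = prxmatch(cats('/', pattern, '/i'),  text) > 0;
  /* with "o": the expression built on the first row is compiled once and reused */
  hit_io = prxmatch(cats('/', pattern, '/io'), text) > 0;
cards;
abc xxABCxx
xyz xxXYZxx
;
run;

proc print data=check;
run;
```

If the "o" option is the culprit, I would expect HIT_I and HIT_IO to differ on the second row, because the "/io" version is still matching against the pattern from the first row.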

s_lassen · ‎02-01-2018

I think the COALESCE function does what you want:

    proc sql;
      create table test3 as
      select test1.id,
             coalesce(test2.age,test1.age) as age
      from test1 left join test2
        on test1.id=test2.id;
    quit;

s_lassen · ‎02-01-2018

I think an array approach will work:

    data want;
      set have;
      array arr (*) zkz1_2 zkz2_3 zkz3_4 zkz4_5;
      zkz=arr(laktnr);
    run;

s_lassen · ‎02-01-2018

I think you will have to figure out your requirements correctly first. Take data like this:

    data have;
      input Child_ID $ Mother_ID $ Father_ID $;
    cards;
    A1 Y1 X1
    A2 Y1 X1
    A3 Y2 X1
    A4 Y3 X1
    A5 Y3 X2
    A6 Y4 X2
    A7 Y5 X2
    A8 Y5 X3
    ;run;

According to your rule, A5 and A7 should have the same family ID, because they have the same father. And A7 and A8 should have the same family ID because they have the same mother. So A5 and A8 will end up with the same family ID, even though they have no parents in common. Is this really what you want?

s_lassen · ‎01-31-2018

The KEEP statement is not used to limit the number of observations, but the number of variables. The WHERE statement does what you want, but you should use only one WHERE statement that combines the two conditions (a second WHERE statement replaces the first one instead of adding to it). And then you need to change your SAS date constants (I assume that your dates are SAS dates in the format DDMMYY10.):

    data saspms.datensatz_new;
      set saspms.datensatz_pms_neu;
      where '01JAN2014'd <= notificationdate <= '31DEC2016'd
        and Art_der_Anzeige='Erstanzeige'; /* value quoted, assuming Art_der_Anzeige is a character variable */
    run;

s_lassen · ‎01-31-2018

This seems to match your specification:

    proc sql;
      create table want as
      select a.key0, a.key1, a.key2, a.value_A, b.key,
             cats(b.key=a.key1,b.key=a.key2) length=2 as keymatch
      from a,b
      where a.key0=b.key0
        and (a.key1=b.key or a.key2=b.key)
      ;
    quit;

But it does not quite match the data you present. Why is the second last row in B not matched to the last row in A in your example?

s_lassen · ‎01-30-2018

That's not so easy. You basically need to do an outer join, on the condition that one or more words from one name are in the other name, or vice versa. Here is a possible solution that uses a data step to create the outer product (by reading every observation from the second dataset with POINT=) and compare:

    data one;
      input name1 $40. /amount1;
    cards;
    wwwamazoncom
    100.5
    toysrus
    50.25
    OLIVE GARDEN
    61.85
    walMart
    86.24
    ;run;

    data two;
      input name2 $40. /amount2;
    cards;
    US AMAZON AR USA
    25.68
    online toysrus newjersey us
    126.98
    ORDER olivegarden Washington DC
    29.99
    us wwwwalmartcom toys texas
    75.86
    ;run;

    data want;
      set one;
      score=0;
      length common $60;
      do _N_=1 to nobs;
        set two nobs=nobs point=_N_;
        common=' ';
        score=0;
        /* words from name1 (3+ characters) found anywhere in name2 */
        do i=1 to countw(name1);
          if length(scan(name1,i))<3 then continue;
          if find(name2,scan(name1,i),'i') then do;
            score=score+1;
            if not findw(common,scan(name1,i),' ','i') then
              call catx(' ',common,scan(name1,i));
          end;
        end;
        /* words from name2 (3+ characters) found anywhere in name1 */
        do i=1 to countw(name2);
          if length(scan(name2,i))<3 then continue;
          if find(name1,scan(name2,i),'i') then do;
            score=score+1;
            if not findw(common,scan(name2,i),' ','i') then
              call catx(' ',common,scan(name2,i));
          end;
        end;
        if score>0 then output;
      end;
    run;

Note that one pair of values was joined just on the word "toys"; you may want to increase the length value in the lines with "continue".

s_lassen · ‎01-30-2018

I would use CALL CATS, which appends the second parameter (stripped of blanks) to the first variable:

    data want;
      set have;
      length longstring $5000;
      retain longstring;
      call cats(longstring,string);
    run;

s_lassen · ‎01-26-2018

If I understand your problem correctly, DSNB is always the leftmost substring of DSNA. Then something like this may work:

    data a;
      length dsna $20 vsna 8;
      input dsna vsna;
    cards;
    ABC.DEF.GHI.DEU 5
    ABC.DEF.GHI.KYZ 1
    ABC.DEF.GHI.LMB 2
    ADD.XYZ.GHI.ABD 1
    ;run;

    data b;
      length dsnb $20 vsnb 8;
      input dsnb vsnb;
    cards;
    ABC.DEF.GHI 3
    ADD.XYZ.GHI 5
    ;run;

    proc sort data=a;
      by dsna;
    run;

    proc sort data=b;
      by dsnb;
    run;

    data want;
      set a(in=a) b(rename=(dsnb=dsna vsnb=vsna) in=b);
      by dsna;
      if b then do;
        dsnb=dsna;
        vsnb=vsna;
      end;
      retain dsnb vsnb;
      if a;
      if dsna=:trim(dsnb);
    run;

s_lassen · ‎01-26-2018

If you have the program for one dataset, I would suggest trying this: write another SAS program which writes the first program, and put the variable parameters in data step variables, e.g.:

    filename tempsas temp; /* you can also allocate a permanent file here */
    data _null_;
      input dsname $;
      file tempsas;
      put 'data outlib.' dsname ';'
        / '  set inlib.' dsname '(keep=a b c d f);'
        / '  where c<33;'
        / 'run;'
        / ;
    cards;
    mydata
    yourdata
    hisdata
    herdata
    ourdata
    ;run;

You can then take a look at the code that is generated (open the TEMPSAS file in an editor window). If it looks OK, try submitting one code section (one data step in the example), and see if the results are what you expected. If they are, submit some more and check the log and the output. If something goes wrong, go back and change the original program (the one that wrote the code) and submit it again. When all looks OK, you can insert the line "%include tempsas;" at the bottom of your original program.

A lot of people want to use macros and CALL EXECUTE. That may work too, but you have a lot less control of the process, and it is much harder to debug.
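A small sketch of that last step, in case it is useful. The one-line generator here is only a stand-in for the real code-writing step, and the SOURCE2 option on %INCLUDE is my addition (it echoes the included lines to the log, which helps when checking what was actually run):

```sas
filename tempsas temp;

/* Stand-in for the generator step: write a one-line program to the file */
data _null_;
  file tempsas;
  put '%put NOTE: generated code is running;';
run;

/* Run the generated program; SOURCE2 prints the included lines in the log */
%include tempsas / source2;
```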

s_lassen · ‎01-26-2018

While checking the performance of single "typical" jobs may shed some light on the capacity of the server, what really matters is how the server stacks up when many processes are running in parallel. So you would probably want to start a number of jobs simultaneously, and check how fast it goes with different numbers of users (jobs).

You may want to monitor the performance on the server at the same time - use the "Resource surveillance" or whatever it is called (running on a Danish version of Windows right now) at the bottom of the "Performance" tab in the Windows job list (which you get by pressing ctrl-alt-delete). One interesting measure is the number of "Page Faults" in the Memory tab - if you have many of those, you are running short on memory. Another is simply checking if the CPUs are working full out, or if disk performance is limiting the speed of the server.

I think you can run the tests by setting SAS/EG up to run in parallel, importing varying programs (and varying numbers of them) into different process flows, and then running the various process flows one by one (or all of them with "Run Project"), and seeing what happens.

s_lassen · ‎01-24-2018

Sorry, did not know that. Is it possible to use the MSTORED and SASMSTORE options, saving the compiled macro in a permanent library? Then it may be available for all the parallel sessions.
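A rough sketch of what that could look like. The library path and the names MYMACS and HELLO are made up for the example, and I have not tested how this behaves with your parallel setup:

```sas
/* Session 1: compile the macro once and store it in a permanent library. */
/* The path and the names MYMACS and HELLO are hypothetical.              */
libname mymacs '/shared/sasmacros';
options mstored sasmstore=mymacs;

%macro hello / store source;
  %put Hello from the stored compiled macro;
%mend hello;

/* Any other session: point to the same library and call the macro */
libname mymacs '/shared/sasmacros' access=readonly;
options mstored sasmstore=mymacs;
%hello
```

The SOURCE option on %MACRO keeps the macro source together with the compiled version, so it can be retrieved later with %COPY. Assigning the library read-only in the sessions that only call the macro seems the safer choice when several sessions use it at once.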

Re: Macro MINOPERATOR Help

Re: how to print all obs for an ID that has a flag value > 0 in any of...

Re: Array- PCT change from month to next month

Re: Add in additional rows to fill missing data

Re: How exactly dose SAS process if/do/end in a data step?

Re: How to generate dataset with macro function and looping?

Re: SAS installation is stuck

Re: Convert Text Date in a Proc SQL Where

Re: How to check if data set is sorted

Re: Counting decimals in numeric fields.

Re: special characters

Bug in proc sql and fcmp variable scope (SAS 9.4M8)

Re: nth max value using proc sql and datastep

Re: Please explain me this macro below bolded line

Re: Extracting line from free text that contains a keyword

Re: Creating new aggregated variables for base table

Re: code to implement row level security to data set using enterprise...

Re: SQL on name of variable and value of variable

Re: PRXMATCH, Regular expression id not changing

Re: Proc Sql + Left Join - Overwrite values of existing variables

Re: Create a new variable from three other variables

Re: Creating family IDs

Re: Keep 2 Variables based on conditions

Re: match on one of possible keys

Re: fuzzy data merge or join

Re: how to join observations into one string?

Re: Search a dataset using a substring of a variable

Re: How to run the same code for different data sets?

Re: Simulation of SAS program execution in SAS server

Re: Macros and Parallel processing