Sarah,
Here is some DATA step code. Please note:
1. If you have 18M records shaped as presented, you only need to load 3 variables into the hash table, which is roughly 3 * 8 * 18 million / 1024 / 1024 ≈ 412 MB. So in theory, if you have a gigabyte or more of memory (which is common), you should be able to fit the whole thing into the hash table; if you want to sanity-check that figure first, see the sketch after this list. If it does not fit, we will have to break the data up by ID, which takes two passes, meaning it will take at least double the amount of time to finish.
2. In theory the code could still be tweaked to be more efficient, for example by using a hash iterator (hiter) together with the SETCUR method (see the untested sketch at the end of this post); however, I am NOT sure how much benefit you would actually gain from it.
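Before loading all 18M rows, you can sanity-check the memory estimate from point 1: the hash object's ITEM_SIZE attribute reports the actual bytes per item, per-item overhead included, so the real footprint will run somewhat above the raw 3 * 8 bytes. A minimal sketch (the OBS= value is arbitrary, just enough to build the table):

data _null_;
   if 0 then set sample;   /* define host variables for the hash */
   dcl hash id(dataset:'sample (obs=100000 keep=individual_id prescriber_num date_dispensed)', multidata:'y');
   id.definekey('individual_id');
   id.definedata('individual_id','prescriber_num','date_dispensed');
   id.definedone();
   bytes_per_item = id.item_size;                     /* actual bytes per hash item, overhead included */
   projected_mb   = bytes_per_item * 18e6 / 1024**2;  /* scale up to the full 18M records */
   put bytes_per_item= projected_mb= comma12.1;
   stop;
run;

And here is the main step: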
data want;
   if _n_=1 then do;
      /* Compile-only SET: defines the host variables (_pre, _date) with the */
      /* correct types without reading any data.                             */
      if 0 then set sample (keep=individual_id prescriber_num date_dispensed rename=(prescriber_num=_pre date_dispensed=_date));

      /* Hash ID holds every (individual_id, prescriber, date) triplet;      */
      /* MULTIDATA:'y' allows duplicate keys, i.e. many rows per ID.         */
      dcl hash id(dataset:'sample (keep=individual_id prescriber_num date_dispensed rename=(prescriber_num=_pre date_dispensed=_date))', multidata:'y');
      id.definekey('individual_id');
      id.definedata('individual_id', '_pre', '_date');
      id.definedone();

      /* Hash PRE collects the distinct prescribers found in the window.     */
      dcl hash pre();
      pre.definekey('_pre');
      pre.definedone();
   end;

   set sample;

   /* Start of the one-year look-back window, computed once per observation. */
   _beg = intnx('year', date_dispensed, -1, 's');

   /* Walk all rows for this individual_id; ADD() silently rejects           */
   /* duplicates, so PRE ends up with one item per distinct prescriber.      */
   rc = id.find();
   do while (rc = 0);
      if _beg <= _date <= date_dispensed then rc = pre.add();
      rc = id.find_next();
   end;

   prov_cnt_back = pre.num_items;
   rc = pre.clear();   /* empty PRE for the next observation */

   drop rc _:;
run;
Update: INTNX() is now called only once per obs instead of up to 365 times, which may save you some time.
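For what it's worth, here is an untested sketch of the hiter + SETCUR idea from note 2. Assumptions: the hash must be declared ORDERED:'a' so that items with the same key sit next to each other in iteration order, and the key also goes into the data portion under a renamed variable _id so that NEXT() does not overwrite the current record's individual_id:

data want2;
   if _n_=1 then do;
      if 0 then set sample (keep=individual_id prescriber_num date_dispensed rename=(individual_id=_id prescriber_num=_pre date_dispensed=_date));
      dcl hash id(dataset:'sample (keep=individual_id prescriber_num date_dispensed rename=(individual_id=_id prescriber_num=_pre date_dispensed=_date))', multidata:'y', ordered:'a');
      id.definekey('_id');
      id.definedata('_id', '_pre', '_date');
      id.definedone();
      dcl hiter ih('id');
      dcl hash pre();
      pre.definekey('_pre');
      pre.definedone();
   end;
   set sample;
   _beg = intnx('year', date_dispensed, -1, 's');
   /* jump straight to this ID's first item, then walk forward until the key changes */
   rc = ih.setcur(key: individual_id);
   do while (rc = 0 and _id = individual_id);
      if _beg <= _date <= date_dispensed then rc = pre.add();
      rc = ih.next();
   end;
   prov_cnt_back = pre.num_items;
   rc = pre.clear();
   drop rc _:;
run;

Whether this actually beats FIND()/FIND_NEXT() depends on your data, so I would benchmark both on a subset before committing.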