About Haikuo

Haikuo · ‎03-12-2012

Nikos, It seems to me that there is NO two '_' in a row addressed by your first post.

Haikuo · ‎03-12-2012

Hi Mspak, My solution to your another question could be applied to this senario without many changes: data want (drop=_:); if _n_=1 then do; set geog.mutualcom (obs=1 rename=(year=fyear zip=_zip)); dcl hash h(dataset:'geog.mutualcom(rename=(year=fyear zip=_zip))', multidata:'yes', hashexp:16); h.definekey('fyear'); h.definedata('_zip'); h.definedone(); end; set geog.comp; _ct=0; _rc=find(); do while (_rc=0); if .< zipcitydistance(zipcode, _zip)<=60 then _ct+1; _rc=find_next(); end; mutual_density=log(1+_ct); run; of couse not tested, so my code is subjected to typo and other errors. Regards, Haikuo Edit: Before running the code, I would clean up the tables to make sure all of the zip codes are legit per zipcitydistance().

Haikuo · ‎03-12-2012

Array() seems to have a straigh shot on this one: data have; infile cards; input OFFSET TEMP_1 TEMP_2 TEMP_3; cards; 1 1 2 3 3 4 5 6 2 7 8 9 ; data want; set have; array tmp (*) temp_1-temp_3; new_column=tmp(offset); run; proc print;run; Regards, Haikuo

Haikuo · ‎03-11-2012

Patrick's direction is definitely applaudable. I have tried SQL first, and stop it after 30-mins of running. Here is my hash approach, I think we may not need to use hiter, as both table are identical. After making sure that all of the zipcodes are included in the sashelp.zipcodes, 407 data want (drop=_:); 408 if _n_=1 then do; 409 set comp( rename=(tic=_tic zip=_zip) obs=1); 410 dcl hash h(dataset: 'comp (rename=(tic=_tic zip=_zip))', multidata: 'yes'); 411 h.definekey('fyear','sic3'); 412 h.definedata('_tic','_zip'); 413 h.definedone(); 414 end; 415 416 set comp; 417 _ct=0; 418 _rc=h.find(); 419 do while (_rc=0); 420 if .<zipcitydistance(_zip,zip) <=60 then _ct+1; 421 h.has_next(result: _r); 422 if _r ne 0 then _rc=h.find_next(); 423 else leave; 424 end; 425 DENSITY_INDUSTRY=log(_ct); 426 427 run; NOTE: There were 66452 observations read from the data set WORK.COMP. NOTE: There were 1 observations read from the data set WORK.COMP. NOTE: There were 66452 observations read from the data set WORK.COMP. NOTE: The data set WORK.WANT has 66452 observations and 5 variables. NOTE: DATA statement used (Total process time): real time 5:26.81 cpu time 5:17.99 Less than 6mins, Not too shady. BTW, the has_next() method is not a must-have here. You can just use: data want (drop=_:); if _n_=1 then do; set comp( rename=(tic=_tic zip=_zip) obs=1); dcl hash h(dataset: 'comp (rename=(tic=_tic zip=_zip))', multidata: 'yes'); h.definekey('fyear','sic3'); h.definedata('_tic','_zip'); h.definedone(); dcl hiter hi('h'); end; set comp ; _ct=0; _rc=h.find(); do while (_rc=0); if .<zipcitydistance(_zip,zip) <=60 then _ct+1; _rc=h.find_next(); end; DENSITY_INDUSTRY=log(_ct); run; I was hoping has_next will help the efficiency, well, it turned out no difference. Regards, Haikuo

Haikuo · ‎03-09-2012

%macro test; %if &dxlist in 'I420' 'I425' 'I426' %then %let dxtypelista='1' 'W' 'X' 'Y' '3'; %else %let dxtypelista='1' 'W' 'X' 'Y'; %mend; %test; Reeza is right. you have a 'then' , which does not belong to macro, therefore need to removed. Sorry I overlooked that.

Haikuo · ‎03-09-2012

%macro test; %if &dxlist in 'I420' 'I425' 'I426' then %then %let dxtypelista='1' 'W' 'X' 'Y' '3'; %else %let dxtypelista='1' 'W' 'X' 'Y'; %mend; %test; I have removed %do in your code, it is not necessary in your context. if you use it, you need to add %end to complete the loop, %macro test; %if &dxlist in 'I420' 'I425' 'I426' then %then %do; %let dxtypelista='1' 'W' 'X' 'Y' '3'; %end; %else %let dxtypelista='1' 'W' 'X' 'Y'; %mend; %test;

Haikuo · ‎03-09-2012

":" here is wildcards. 'col:' represents all of the variable names starting with 'col'.

Haikuo · ‎03-09-2012

Many macro statements can NOT be used out of macro definition. So you will have to put all of these macro statements inside the macro definition: %macro test; /*this is the beginning of macro definition, the name of macro is 'test'*/ blah; blah ; %mend; /*this is the end of macro definition*/ /*then if you call it when you decide to excute it*/ %test Therefore you need to put %if &dxlist in 'I420' 'I425' 'I426' then %then %do: %let dxtypelista='1' 'W' 'X' 'Y' '3'; %end; inside your macro definition. BTW, in this case, you need %end to complete the %do loop. HTH, Haikuo

Haikuo · ‎03-09-2012

A minor modification on LinLin's code, in case you don't want to count the number of your tables: proc sql ; select memname into : a1 separated by ' ' from dictionary.tables where libname='YOURLIBRARY'; quit; %macro csv; %do i=1 %to %sysfunc(countw(&a1)) ; proc export data=YOURLIBRARY.&&a&i outfile="c:\temp\&&a&i...csv " dbms=csv replace;run; %end; %mend csv; %csv HTH, Haikuo

Haikuo · ‎03-09-2012

%macro test; %if &dxlist in 'I420' 'I425' 'I426' then %then %do: %let dxtypelista='1' 'W' 'X' 'Y' '3'; %mend; %test Open code means anywhere out of the box of %macro xxx; to %mend; Haikuo

Haikuo · ‎03-09-2012

I don't think format matters. data _null_; X=94365;/*x=1994365*/ a=put(datejul(x),mmddyy10.); b=datejul(x); put a; put b / b date9.; run; OP, please show some raw data of your julian dates, as you can see, I can't repeat your problem. Regards, Haikuo

Haikuo · ‎03-09-2012

If you use max(), min(), sum(), count(), freq, n, range, nmiss, std and many many others, which are so called summary functions, you HAVE to use 'group by' at the same time. Regards, Haikuo

Haikuo · ‎03-09-2012

Efficiency wise, I also suggest you take a look into hash object. After SAS 9.2, Hash is capable to do lots of summary and does support non-unique key. Regards, Haikuo

Haikuo · ‎03-09-2012

Thanks, Ksharp. I thought about it. I have a 'return' inside the do-loop, will that lead to 'leave' action in the context of this code? Haikuo

Haikuo · ‎03-09-2012

And another one, featuring intnx(): data have; input ID $ EffDate :ddmmyy10. ExpDate :ddmmyy10.; format EffDate date9. ExpDate date9.; cards; A 01/04/2008 01/08/2008 B 01/02/2008 01/07/2010 ; data want (drop=exp); set have end=done; exp=expdate; if expdate <= intnx('year',effdate,0,'e') then output; else do until (intnx('year',effdate,0,'e') > exp); expdate= intnx('year',effdate,0,'e'); output; effdate=intnx('year',effdate,1,'b'); if (intnx('year',effdate,0,'e') > exp) then do; expdate=exp; output; end; end; run;

Online Status	Offline
Date Last Visited	‎02-27-2023 12:47 PM

Re: How to convert GPS Coordinates with degree into longitudes and lat...

Re: Multiple Observations from Single Record

Re: Filling in missing sequence number and linearly interpolating betw...

Re: repeat observation by id

Re: repeat observation by id

Re: Output from Proc Datasets as a data file

Re: Warning in log - why?

Re: complicated conversion

Re: assign unscheduled visits

Re: Use a multi-selection macro variables from a prompt in pass throug...

Re: PROC JSON - create a hierarchical file

Re: Copy values based on certain criteria

Re: Importing a file with multiple delimiters per record

How to handle line breaks in csv files

Re: Importing a .CSV file from R with NA's as missing

Re: capitalize first letter of first word

Re: Copy dataset structure

Re: how to check if user is running any session and using SAS metadata...

Re: Remove leading and trailing zeros from character field

Re: how to get valid data

Character variables with varying number of "words" and how to parse th...

Re: PROC SQL PROCESSING

Help dynamically selecting columns to assign values to a new column

Re: Density of industrial firms

% If statement not valid in open code

% If statement not valid in open code

problem catx with macro variable passed in

Re: % If statement not valid in open code

Export multiple tables from SAS to .csv

% If statement not valid in open code

Re: Julian Dates conversion

MAX Help

PROC SQL OR TABULATE

multiple variables use the same condition, how to write them in the sa...

Re: Splitting a row into multiple records based on dates

SAS Analytics Explorers