About Daryl

Daryl · ‎06-22-2023

Maybe show us some sample code?

Daryl · ‎06-22-2023

You can generalize your macro for multiple segments if there are going to be segments beyond the first two: %macro update_historyN(snapshot,segment); data historic_info_customers; set &snapshot; valid_to_date='31dec9999'd; segment_generation = trim("segment_generation_&segment"); modify historic_info_customers key=idx2; if not _iorc_ then do; if segment_belonging ne segment_generation_&segment then do; valid_to_date=date-1; replace; valid_from_date=date; valid_to_date='31dec9999'd; segment_generation=trim("segment_generation_&segment"); segment_belonging=segment_generation_&segment; output; end; end; else do; _error_=0; valid_from_date=date; valid_to_date='31dec9999'd; segment_generation=trim("segment_generation_&segment"); segment_belonging=segment_generation_&segment; output; end; run; %mend; data customer_table_snapshot_6; input date :yymmdd10. customer_id segment_generation_1 $ segment_generation_2 $ segment_generation_3 $; format date yymmdd10.; datalines; 2023-09-27 111 A C D 2023-09-27 222 C C E 2023-09-27 333 B B F ; run; data customer_table_snapshot_7; input date :yymmdd10. customer_id segment_generation_1 $ segment_generation_2 $ segment_generation_3 $; format date yymmdd10.; datalines; 2023-10-21 111 A C F 2023-10-21 222 C C E 2023-10-21 333 B B D ; run; %update_historyN(customer_table_snapshot_6,3); %update_historyN(customer_table_snapshot_7,3); data actual_result; set historic_info_customers; run; proc sort data=actual_result; by customer_id segment_generation valid_from_date; run;

Daryl · ‎06-22-2023

@SasStatistics wrote: This works, thanks. Why does it not work to use the same thing for idx as for idx2? proc sql; create index idx on historic_info_customers(customer_ID,segment_generation,valid_to_date); quit; I imagine it could. I just created an idx2 because you had previously created an index with the name idx.

Daryl · ‎06-22-2023

@SasStatistics wrote: How do you see that that is actually happening? I ran your code by submitting it step by step. When you run the 2nd macro for the first time on snapshot 3, and look at the resulting file, you can see that rows for segment 1 were modified.

Daryl · ‎06-21-2023

See code suggestion above.

Daryl · ‎06-20-2023

If you only want to update rows from segment generation 2, you could add the segment generation to the index. proc sql; create index idx2 on historic_info_customers(customer_ID,segment_generation,valid_to_date); quit; %macro update_history2(snapshot); data historic_info_customers; set &snapshot; valid_to_date='31dec9999'd; segment_generation = 'segment_generation_2'; modify historic_info_customers key=idx2; if not _iorc_ then do; if segment_belonging ne segment_generation_2 then do; valid_to_date=date-1; replace; valid_from_date=date; valid_to_date='31dec9999'd; segment_generation='segment_generation_2'; segment_belonging=segment_generation_2; output; end; end; else do; _error_=0; valid_from_date=date; valid_to_date='31dec9999'd; segment_generation='segment_generation_2'; segment_belonging=segment_generation_2; output; end; run; %mend;

Daryl · ‎06-20-2023

Is macro update_history2 meant to update rows where segment_generation = 'segment_generation_1' ? Because that's what is happening.

Daryl · ‎11-02-2022

I once wrote a SAS program that would shell out to the operating system, run an operating system directory listing to create a list of subdirectories in a folder (for example, in Windows:) prompt:> cd C:\path\to\directory prompt:> DIR *. I may have redirected the output to a text file or somehow dumped the output (subdirectory names) into a SAS table, and then I just wrote a macro to iterate through the table of available directories. Looking for that code now but it might be gone.

Daryl · ‎06-16-2021

Nice presentation Michele. Yes, documentation is key for these procedures! If available, I use a point-and-click code generator to get a starting point for correct syntax, and then I tinker with options after browsing SAS doc.

Daryl · ‎04-28-2020

So HAVE1 is what is loaded as of April 25. You want to drop the last 10 days of data from HAVE1 and insert the last 11 days (10 prior days, plus April 26 data) from the incremental set?

Daryl · ‎04-25-2020

In PROC SQL: data have; infile datalines; input time $ anomaly; datalines; A1 42 A2 43 A3 45 A4 48 A5 55 A5 51 A6 65 A7 75 A8 63 A9 50 A10 48 A10 47 A10 51 A10 55 A11 42 A12 44 B1 125 B2 128 B3 125 B4 132 B5 139 B6 141 B7 158 B7 159 B7 161 B7 147 B7 144 B8 150 B9 142 B10 147 B11 122 B12 123 C1 1135 C2 1139 C3 1135 C4 1144 C5 1147 C6 1151 C6 1144 C7 1159 C8 1144 C9 1140 C10 1138 C11 1133 C12 1129 ; run; proc sql; create table need (drop=ob) as select distinct(time),mean(anomaly) as anomaly, monotonic() as ob from have group by time order by ob; quit; run;

Daryl · ‎04-24-2020

Probably not the most efficient but I think it will work for you. data work.have; infile datalines; input Action $ State $ Date1 :mmddyy10. Date2 :mmddyy10.; format Date1 mmddyy10. Date2 mmddyy10.; datalines; Walk AK 04/01/2020 04/09/2020 Run AL 03/30/2020 04/03/2020 Walk AR 03/26/2020 04/02/2020 Walk AZ 04/03/2020 04/08/2020 Sit CA 03/28/2020 04/01/2020 Run CO 04/01/2020 04/09/2020 ; run; data expand_have; set have; format date mmddyy10.; do date = date1 to date2; action_value=0; if action="Run" then action_value=2; else if action="Walk" then action_value=1; else if action="Sit" then action_value=3; output; end; drop action date1 date2; run; proc sql noprint; create table min_max as select min(date1) as min_date1 format=mmddyy10., max(date1) as max_date1 format=mmddyy10., min(date2) as min_date2 format=mmddyy10., max(date2) as max_date2 format=mmddyy10. from have; select min(min_date1,min_date2) into :min_date from min_max; select max(max_date1,max_date2) into :max_date from min_max; select distinct(state) into :states separated by ',' from have; select count(distinct(state)) into :statecount from have; quit; run; data _null_; put "States is &states."; put "Count is &statecount."; put "Min date is &min_date."; put "Max date is &max_date."; stop; run; data stage; format date mmddyy10.; format state $2.; states = "&states."; do date = &min_date to &max_date; do i = 1 to &statecount; state = trim(scan(states,i)); output; end; end; stop; drop states i; run; proc sql; create table need as select a.*,coalesce(b.action_value,0) as action from stage a left join expand_have b on a.state = b.state and a.date=b.date order by date, state; quit; run;

Daryl · ‎04-24-2020

Rahul, Can you give a more concrete example with a sample table please? Daryl

Daryl · ‎04-24-2020

Check into PROC TRANSPOSE. DATA mcook_has; INFILE DATALINES DSD; INPUT VAR1 $ VAR2 $ nCount; DATALINES; item1,A,1 item1,B,2 item1,C,2 item2,A,3 item2,B,5 item2,C,9 item3,A,3 item3,B,2 item3,C,5 ; run; proc transpose data=mcook_has out=mcook_transposed prefix=num_; by var1; id var2; run;

Daryl · ‎04-24-2020

Here's a basic solution that works for small "families" (3 or less observations that share a common "survivor"). If you have larger families, you would need to to make this a macro with a do loop and keep running the sort & merge until no survivor swaps were made. If you have a circular association in a family, I worry that this would run infinitely. DATA have; INFILE DATALINES DSD; INPUT NAME1 $ RECORDS1 NAME2 $ RECORDS2; DATALINES; TOM,5243,TOMMY,4 BRAD,873,BRADLEY,219 BRADLEY,219,BRAD,873 JOHN,61017,JOHNNY,905 JOHNNY,905,JOHN,61017 JONATHAN,500,JOHNNY,905 ; run; /* name with highest count will become "survivor" and always stored in name1 */ /* move the more frequent name into name1 and less frequent name to name2 */ data have2; set have; if records2 > records1 then do; tempname = name2; temprecords = records2; name2 = name1; records2 = records1; name1 = tempname; records1 = temprecords; end; drop tempname temprecords; run; /* name2 is the less frequently used name; call it "nickname" */ /* drop out the redundant pairs */ proc sort data=have2 out=have3 nodupkey; by name1 records1 name2 records2; run; /* create merge key */ data have3; set have3; rename name1=namekey; rename records1=recordskey; run; /* make a copy of the file and sort it on the name2 ("nickname") */ proc sort data=have3 out=have4; by name2 records2 name1 records1; run; /* create a merge key */ data have4; set have4; rename name2=namekey; rename records2=recordskey; run; /* merge dataset onto itself. if nickname also appears in the table as a survivor name, assign real survivor to nickname as new survivor (where nickname appears as survivor). */ data have5; merge have3 (in=a) have4 (in=b); by namekey; if a and not b then do; output; end; if a and b then do; namekey = name3; recordskey = records3; output; end; rename namekey=name1; rename recordskey=records1; drop name3 records3; run;

Online Status	Offline
Date Last Visited	‎04-19-2024 06:58 PM

Re: Delete _null_ dataset

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Create a 'DO' loop to run macro Based on the Folder names present ...

Re: Visualizing Your Data with SG Procedures

Re: Incremental load with removing 10 days of data

Wolfensäs: The FPS game that was written in SAS

3 Grid Technologies at SAS

2023 Customer Awards: Parexel - Curious Thinker

Visualizing Your Data with SG Procedures

Missing GeneratePieChart.sas for Practice in module 1, course 2, lesso...

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Creating a table based on prior records

Get Table Attributes from metadata with metadata_* functions

Get Table Attributes from metadata with metadata_* functions

Re: Delete _null_ dataset

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Creating a from_to table - My program does not satisfy the test ca...

Re: Create a 'DO' loop to run macro Based on the Folder names present ...

Re: Visualizing Your Data with SG Procedures

Re: Incremental load with removing 10 days of data

Re: Replacing semi-duplicated rows with average column values

Re: Transforming a dataset and calculating variables with dates

Re: Incremental load with removing 10 days of data

Re: Creating a table of Grouped Data.

Re: Creating a table based on prior records