About Zen_aly

Zen_aly · 07-22-2025

Thank you, I appreciate your response! I thought I responded on Saturday, but I can't seem to find my post for some reason. The code you provided works with a slight modification, which is needed because of my own mistake. For flag1 I specified that type 10 or 4 cannot be in between; type 11 should have been in that list as well. Once I add the 11, the code works correctly.

Zen_aly · 07-18-2025

Hi, I am looking for an efficient way to create two flags based on ID-level patterns of observation types, and I am struggling to find pertinent documentation. With this data the focus tends to be on the most recent observation for an ID, but in some cases I need to look back when certain criteria are met.

The first flag I would like to create is when there are two observations with type=99, but only when there are no observations with type 10 or 4 in between them. The second flag I would like to create is when value=2 (type2 in the sample data), but only in cases where the following two observations have value=1 and then value=0 respectively (no missing values may be in between). Ideally in this instance I would also pull the date from the most recent observation with value=0 onto the observation with value=2.

With respect to flag1: in the past I kept only type=99 observations and used lag to pull in values from the prior observation. However, that approach will no longer work, because if there are one or more observations in between that do have type 10 or 4, then that ID is disqualified.

Flag2 is a new need, so I haven't coded it yet. My thought was to use a double lag, but in the rare instance that there are multiple zeros, ideally I would pull in the date from the most recent 0.

Any suggestions are much appreciated! Here is simplified input data for the have and want datasets:

data have;
input ID $ date date9. type $ type2;
format date date9.;
datalines;
A 06JUN2025 11 2
A 13JUN2025 1 1
A 13JUN2025 10 2
A 15JUN2025 1 1
A 23JUN2025 99 0
A 27JUN2025 99 0
B 13JUN2025 1 .
B 14JUN2025 99 .
B 16JUN2025 11 2
B 23JUN2025 1 1
B 26JUN2025 99 0
C 15JUN2025 99 .
C 23JUN2025 10 .
C 27JUN2025 99 .
D 01JUN2025 10 2
D 5JUN2025 99 1
D 20JUN2025 3 0
D 29JUN2025 99 0
;
run;

data want;
infile datalines truncover; /* rows without a date2 value stay on their own line */
input ID $ date date9. type $ flag1 type2 flag2 date2 :date9.;
format date date2 date9.;
datalines;
A 06JUN2025 11 . 2 .
A 13JUN2025 1 . 1 .
A 13JUN2025 10 . 2 1 27JUN2025
A 15JUN2025 1 . 1 .
A 23JUN2025 99 1 0 .
A 27JUN2025 99 1 0 .
B 13JUN2025 1 . . .
B 14JUN2025 99 0 . .
B 16JUN2025 11 . 2 1 26JUN2025
B 23JUN2025 1 . 1 .
B 26JUN2025 99 0 0 .
C 15JUN2025 99 0 . .
C 23JUN2025 10 . . .
C 27JUN2025 99 0 . .
D 01JUN2025 10 . 2 1 29JUN2025
D 5JUN2025 99 1 1 .
D 20JUN2025 3 . 0 .
D 29JUN2025 99 1 0 .
;
run;
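For readers of this thread, here is one way the two flags could be approached. It is a minimal sketch and not necessarily the code from the reply mentioned in the follow-up. It assumes the have dataset is already sorted by ID and date, treats type 11 as a blocker alongside 10 and 4 (per the follow-up post), and reads "most recent 0" as the last 0 in the run of zeros that directly follows the 2, 1 pair. The names seq, segment, state, zero_date, seg, f1, f2 and want_sketch are made up for illustration.

```sas
/* Step 1: remember the original order and start a new segment at each blocker row */
data seg;
    set have;
    by ID;
    seq = _n_;
    if first.ID then segment = 0;
    if type in ('10', '4', '11') then segment + 1;  /* sum statement retains segment */
run;

/* Step 2: two 99 rows in the same segment have no blocker between them.
   SAS remerges the group count back onto the detail rows.
   Output is ordered newest-first within ID for the look-back in step 3. */
proc sql;
    create table f1 as
    select ID, date, type, type2, seq,
           case when type = '99' then (sum(type = '99') >= 2) else . end as flag1
    from seg
    group by ID, segment
    order by ID, seq desc;
quit;

/* Step 3: walk each ID backwards looking for 0 (one or more), then 1, then 2.
   state: 0 = nothing matched, 1 = in a run of zeros, 2 = zeros followed by a 1 */
data f2;
    set f1;
    by ID;
    retain state 0 zero_date .;
    if first.ID then do;
        state = 0;
        zero_date = .;
    end;
    if type2 = 0 then do;
        if state ne 1 then zero_date = date;  /* first zero seen backwards = most recent */
        state = 1;
    end;
    else if type2 = 1 and state = 1 then state = 2;
    else if type2 = 2 and state = 2 then do;
        flag2 = 1;
        date2 = zero_date;
        state = 0;
    end;
    else state = 0;  /* missing or out-of-pattern values break the chain */
    format date2 date9.;
    drop state zero_date;
run;

/* Step 4: restore the original order */
proc sort data=f2 out=want_sketch;
    by seq;
run;
```

With the sample data this should flag the same rows as the want dataset. How an ID with three or more 99 observations should be treated is not covered by the post, so the segment rule here is only one reading of it.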

Zen_aly · 03-21-2025

Thanks for the responses! I am adding an Excel file and some code that I tried based on responses:

%macro loop (varlist=);
    %let varlist=%cmpres(&varlist);
    %do i=1 %to %sysfunc(countW(&varlist, ' '));
        %let x=%scan(&varlist,&i,' ');
        data work.example1;
            set work.example;
            if . < Measure_&x._value < 0 then Measure_&x._value=0;
            else if Measure_&x._value > 40 then Measure_&x._value=40;
            if visit_&x._count < 40 then do;
                Measure_&x._value=.;
                visit_&x._count=0;
        run;
    %end;
%mend;
*loop();
%let CY25=100 abc def 222 ghi jklmno 300 350 399 pqrs;
%loop(varlist=&CY25);

The above code runs with no error but the changes I asked for did not take place. I just don't seem to get some of the details that I need to successfully create a wide variety of macros.
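One possible reason the changes do not show up: each pass of the %do loop rereads work.example and overwrites work.example1, so only the edits for the last name in the list (pqrs) would survive. The sketch below moves the loop inside a single DATA step so that every variable is handled in one pass. It keeps the posted dataset and variable names, adds the END that the posted DO block lacks, and swaps the ' ' delimiter for %str( ), which is my substitution rather than something from the thread.

```sas
%macro loop(varlist=);
    %local i x;
    %let varlist = %cmpres(&varlist);

    data work.example1;
        set work.example;
        /* The macro loop only generates statements; the DATA step runs once */
        %do i = 1 %to %sysfunc(countw(&varlist, %str( )));
            %let x = %scan(&varlist, &i, %str( ));
            if . < Measure_&x._value < 0 then Measure_&x._value = 0;
            else if Measure_&x._value > 40 then Measure_&x._value = 40;
            if visit_&x._count < 40 then do;
                Measure_&x._value = .;
                visit_&x._count = 0;
            end;
        %end;
    run;
%mend loop;

%let CY25 = 100 abc def 222 ghi jklmno 300 350 399 pqrs;
%loop(varlist=&CY25);
```

Note that a missing visit count also satisfies `visit_&x._count < 40`, because SAS sorts missing values below any number; whether that is wanted depends on the data.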

Zen_aly · 03-17-2025

I have searched around, but had no luck finding an answer (perhaps I am using the wrong terms). I have arrays set up to reference full lists of variables, but they are becoming quite bulky. Is it possible to reference only a few characters in the middle of a variable name? In Stata it would look like this:

foreach x in cath fall {
    use `datafile', clear
    collapse (mean) measure_`x'_value ///
        pred_measure_`x'_value [aweight = measure_`x'_count], by (state)
}

I am not worried about the collapse or aweight portions, but I am trying to find out if it is possible to replicate that reference to `x' and cycle through just the middle of the variable name. In Stata, `x' would be replaced to give measure_cath_value, measure_fall_value, and so on. Thanks for any help!
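A rough SAS counterpart to that loop uses a macro variable in place of `x'. This is only a sketch: the input dataset work.example, the output names work.mean_cath and work.mean_fall, and the macro name collapse are placeholders, and PROC MEANS with a WEIGHT statement stands in for the weighted collapse.

```sas
%macro collapse(stems=);
    %local i x;
    %do i = 1 %to %sysfunc(countw(&stems, %str( )));
        %let x = %scan(&stems, &i, %str( ));

        /* &x. resolves inside the name: measure_&x._value -> measure_cath_value.
           The period marks where the macro variable name ends. */
        proc means data=work.example noprint nway;
            class state;
            var measure_&x._value pred_measure_&x._value;
            weight measure_&x._count;
            output out=work.mean_&x mean=;
        run;
    %end;
%mend collapse;

%collapse(stems=cath fall);
```

An array cannot substitute text into a variable name, since array references are resolved by position, so the macro language is the usual tool when the varying part sits in the middle of the name.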

Zen_aly · 07-12-2021

It looks like I do have SAS/IML!

Zen_aly · 07-12-2021

Hi,

Currently this selection is done manually, so I have no code to share. I believe I can automate the process in SAS, and I thought it would be good to ask here in case anyone can give me some pointers. I have looked for similar questions but didn't find anything.

The end goal is to identify a sample that represents group characteristics with as few observations as possible. Based on group totals, formulas and rounding are used to generate a list of "need" characteristic totals. After an observation is selected to be part of the sample, its characteristics need to be deducted from the needed counts (Need_Initial), until a sample is selected that represents the total desired count for each characteristic.

I have added a pretend dataset that represents a group. In the table below I put the initial total counts (Need_Initial), an example of an observation selected for the sample (Selection_1), and then how that first selection impacted the needed totals (Need_2).

Sometimes subcategory counts may not be in line with the category total. For example, in this case the sample only needs 1 observation with patterns, but the sample should include 1 pattern_dots and 1 pattern_plaid regardless of whether that combination can be identified within one observation.

Characteristic   Need_Initial   Selection_1   Need_2
Dinner           14             1             13
Likes_talking    8              1             7
Visiting         8              1             7
Hates_talking    7              0             7
Green            3              0             3
Dark_Yellow      3              1             2
Dark_Green       2              1             1
Dark_White       2              0             2
Blue             1              0             1
Yellow           1              1             0
White            1              0             1
Red              1              0             1
Black            1              1             0
Grey             1              0             1
Light_Blue       1              0             1
Light_orange     1              0             1
Light_red        1              0             1
Dark_Black       1              1             0
Patterns         1              1             0
Pattern_Dots     1              0             1
Pattern_Plaid    1              1             0

Date Last Visited	‎09-16-2025 05:22 PM

Re: Identify patterns across observations

Identify patterns across observations

Re: Can a SAS array be used to reference the middle few characters of ...

Can a SAS array be used to reference the middle few characters of a va...

Re: Iteratively identify sample based on changing totals

Iteratively identify sample based on changing totals