About ANKH1

ANKH1 · ‎02-02-2024

In SAS I have two datasets.

Dataset1: the age_event variable is the age at which each ID reported an event. The same ID can have one or more events. Not all IDs in the sample reported events; IDs that did not report any event are not included in this dataset.

data ds1;
input ID $ age_event;
datalines;
a1 32
b2 54
b2 67
c3 34
c3 45
c3 78
;
run;

Dataset2: all IDs of the sample are reported. This dataset contains one row per ID. The variable "last_agerecorded" is the age at which each ID reported their last record for the whole study.

data main;
input ID $ last_agerecorded;
datalines;
a1 56
a2 67
b1 68
b2 72
b3 132
c2 121
c3 124
c4 58
d1 89
d2 95
e2 74
;
run;

We would like to create a count_event variable that counts the number of events per ID. If an ID reports more than one event, each event keeps its own row and count_event holds the running total of events up to that row's age_event (see IDs b2 and c3 below). If an ID did not report any event at all, age_event should equal "last_agerecorded" from the main dataset and count_event should be zero.

This is the required output:

ID age_event count_event
a1 32 1
a2 67 0
b1 68 0
b2 54 1
b2 67 2
b3 132 0
c2 121 0
c3 34 1
c3 45 2
c3 78 3
c4 58 0
d1 89 0
d2 95 0
e2 74 0

How can we get the required output? Thanks!
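One possible approach, offered as a minimal sketch rather than the accepted answer from the thread: merge the main dataset with the event dataset by ID and keep a running count with a sum statement. It assumes ID is read as a character variable (`$`), since the values contain letters, and that both datasets are sorted by ID.

```sas
/* Event data from the post; ID is read as character (assumption) */
data ds1;
  input ID $ age_event;
  datalines;
a1 32
b2 54
b2 67
c3 34
c3 45
c3 78
;
run;

/* One row per ID, from the post */
data main;
  input ID $ last_agerecorded;
  datalines;
a1 56
a2 67
b1 68
b2 72
b3 132
c2 121
c3 124
c4 58
d1 89
d2 95
e2 74
;
run;

proc sort data=ds1;  by ID age_event; run;
proc sort data=main; by ID;           run;

data want;
  merge main(in=inmain) ds1(in=inev);
  by ID;
  if inmain;                              /* keep every ID in the sample */
  if first.ID then count_event = 0;       /* restart the count for each ID */
  if inev then count_event + 1;           /* running total of events */
  else age_event = last_agerecorded;      /* no events: use last recorded age */
  keep ID age_event count_event;
run;
```

Because `count_event + 1` is a sum statement, the variable is retained across rows, so resetting it at `first.ID` is what keeps the totals separate per ID.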

ANKH1 · ‎01-30-2024

Thank you so much!

ANKH1 · ‎01-30-2024

I'm using a macro. The input parameters look like this:

%macro1(indata=din(where=(group1=1 or group2=1)), outdata=dsout, var1=age);

I need to add another filter for indata. The condition is:

if pair in(2,4,5) then delete;

I know I can do this in a data step, but I want to know whether it is possible to add the condition in the macro parameters, alongside the where= option. Thanks!
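A hedged sketch of one way this could work: since `where=` is a dataset option, the extra condition can be joined to the existing one with `and`. The body of `macro1` is not shown in the post, so the macro below is a hypothetical stand-in that only copies the filtered data; the `din` rows are made up for illustration. The `in` list is written with spaces instead of commas, which SAS accepts and which avoids any question about commas inside a macro parameter value.

```sas
/* Hypothetical stand-in for macro1: it only copies indata to outdata */
%macro macro1(indata=, outdata=, var1=);
  data &outdata;
    set &indata;
    keep &var1 group1 group2 pair;
  run;
%mend macro1;

/* Made-up input data for illustration */
data din;
  input group1 group2 pair age;
  datalines;
1 0 1 30
0 1 2 41
1 1 3 52
0 0 1 63
1 0 4 25
;
run;

/* Both conditions sit inside the same where= dataset option */
%macro1(indata=din(where=((group1=1 or group2=1) and pair not in (2 4 5))),
        outdata=dsout, var1=age);
```

With these made-up rows, dsout would keep only the rows with pair 1 (age 30) and pair 3 (age 52). Rows with a missing `pair` are kept by `pair not in (2 4 5)`, which matches what `if pair in(2,4,5) then delete;` would do.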

ANKH1 · ‎12-28-2023

Thank you!

ANKH1 · ‎12-28-2023

Thank you!

ANKH1 · ‎12-27-2023

Hi, we have this raw data, which reports age at interventions:

data have;
input ID AGE;
datalines;
1 0
1 0
1 3
2 0
2 9
4 3
4 6
4 7
4 10
4 10
;
run;

The variable "AGE" is the age at which each ID had an intervention. We need to create a dataset that:

1) counts and accumulates the number of interventions at each age per ID.
2) accounts for all ages from the minimum to the maximum age reported by any ID. In this dataset the minimum age is 0 and the maximum age is 10.
3) has a record for each ID from the minimum age to the maximum age, even when they did not report any intervention.
4) accounts for all IDs. If an ID is missing from the intervention dataset (the main dataset contains all IDs in the study), it means the subject didn't have any intervention for the study duration.
5) has a variable "INTERVENTION" where 0 = no intervention and 1 = intervention.

This dataset could be the "temp" dataset. Notice that IDs 1 and 4 each had 2 interventions at the same age.

data temp;
input ID AGE INTERVENTION COUNT_INT;
datalines;
1 0 1 2
1 0 1 .
1 1 0 2
1 2 0 2
1 3 1 3
1 4 0 3
1 5 0 3
1 6 0 3
1 7 0 3
1 8 0 3
1 9 0 3
1 10 0 3
2 0 1 1
2 1 0 1
2 2 0 1
2 3 0 1
2 4 0 1
2 5 0 1
2 6 0 1
2 7 0 1
2 8 0 1
2 9 1 2
2 10 0 2
3 1 0 0
3 2 0 0
3 3 0 0
3 4 0 0
3 5 0 0
3 6 0 0
3 7 0 0
3 8 0 0
3 9 0 0
3 10 0 0
4 0 0 0
4 1 0 0
4 2 0 0
4 3 1 1
4 4 0 1
4 5 0 1
4 6 1 2
4 7 1 3
4 8 0 3
4 9 0 3
4 10 1 5
4 10 1 .
;
run;

The final dataset should look like this:

data want;
input ID AGE INTERVENTION COUNT_INT;
datalines;
1 0 1 2
1 1 0 2
1 2 0 2
1 3 1 3
1 4 0 3
1 5 0 3
1 6 0 3
1 7 0 3
1 8 0 3
1 9 0 3
1 10 0 3
2 0 1 1
2 1 0 1
2 2 0 1
2 3 0 1
2 4 0 1
2 5 0 1
2 6 0 1
2 7 0 1
2 8 0 1
2 9 1 2
2 10 0 2
3 1 0 0
3 2 0 0
3 3 0 0
3 4 0 0
3 5 0 0
3 6 0 0
3 7 0 0
3 8 0 0
3 9 0 0
3 10 0 0
4 0 0 0
4 1 0 0
4 2 0 0
4 3 1 1
4 4 0 1
4 5 0 1
4 6 1 2
4 7 1 3
4 8 0 3
4 9 0 3
4 10 1 5
;
run;
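The five requirements could be approached along these lines. This is a sketch under stated assumptions, not the solution given in the thread: `main_ids` is a hypothetical stand-in for the main dataset (the post does not show it), and the intermediate names `per_age`, `grid` and `n_int` are made up. Following rule 3, the sketch emits every age from the minimum to the maximum for every ID, so ID 3 also gets a row at age 0.

```sas
data have;
  input ID AGE;
  datalines;
1 0
1 0
1 3
2 0
2 9
4 3
4 6
4 7
4 10
4 10
;
run;

/* Hypothetical stand-in for the main dataset listing all study IDs */
data main_ids;
  input ID;
  datalines;
1
2
3
4
;
run;

proc sql noprint;
  /* number of interventions per ID and age */
  create table per_age as
  select ID, AGE, count(*) as n_int
  from have
  group by ID, AGE
  order by ID, AGE;

  /* overall age range across all IDs */
  select min(AGE), max(AGE) into :minage trimmed, :maxage trimmed
  from have;
quit;

/* one row per ID and age from the minimum to the maximum */
data grid;
  set main_ids;
  do AGE = &minage to &maxage;
    output;
  end;
run;

proc sort data=grid; by ID AGE; run;

data want;
  merge grid(in=ingrid) per_age;
  by ID AGE;
  if ingrid;
  if first.ID then COUNT_INT = 0;     /* restart the running total per ID */
  INTERVENTION = (n_int > 0);         /* 1 if any intervention at this age */
  COUNT_INT + coalesce(n_int, 0);     /* accumulate, counting same-age repeats */
  drop n_int;
run;
```

Counting interventions per ID and age first means the duplicate rows in the intermediate "temp" layout (two interventions at the same age) never need to be created and then removed.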

ANKH1 · ‎12-19-2023

Hi! Sorry for not getting back earlier. I deleted the second output, but the issue is that it only reports the 3's that were originally in the dataset and not the 3's derived from counting the observations from 1 and 2.

ANKH1 · ‎12-14-2023

Hi, I was presented with another scenario of data.

data output;
input time group var1 count percentage;
datalines;
0 1 1 3 42.8
0 1 2 4 57.14
0 2 1 5 45.45
0 2 2 5 45.45
0 2 3 1 9.09
3 1 1 2 28.6
3 1 2 5 71.5
3 2 1 1 50
3 2 2 1 50
6 1 2 3 100
6 2 1 2 100
9 1 2 2 100
9 2 1 2 66.6
9 2 2 2 33.3
;
run;

The difference is that some datasets will have data for var1=3 (see above). How can you add the counts for the same var1=3? Right now, if I run the code as it is, I get this:

data out;
input time group var1 count percentage;
datalines;
0 1 1 3 25
0 1 2 4 33.33333333
0 1 3 5 41.66666667
0 2 1 5 41.66666667
0 2 2 5 41.66666667
0 2 3 1 8.333333333
0 2 3 1 8.333333333
3 1 1 2 16.66666667
3 1 2 5 41.66666667
3 1 3 5 41.66666667
3 2 1 1 8.333333333
3 2 2 1 8.333333333
3 2 3 10 83.33333333
6 1 2 3 25
6 1 3 9 75
6 2 1 2 16.66666667
6 2 3 10 83.33333333
9 1 2 2 16.66666667
9 1 3 10 83.33333333
9 2 1 2 16.66666667
9 2 2 2 16.66666667
9 2 3 8 66.66666667
;
run;

But what I need is this:

data want;
input time group var1 count percentage;
datalines;
0 1 1 3 25
0 1 2 4 33.33333333
0 1 3 5 41.66666667
0 2 1 5 41.66666667
0 2 2 5 41.66666667
0 2 3 2 16.66666667
3 1 1 2 16.66666667
3 1 2 5 41.66666667
3 1 3 5 41.66666667
3 2 1 1 8.333333333
3 2 2 1 8.333333333
3 2 3 10 83.33333333
6 1 2 3 25
6 1 3 9 75
6 2 1 2 16.66666667
6 2 3 10 83.33333333
9 1 2 2 16.66666667
9 1 3 10 83.33333333
9 2 1 2 16.66666667
9 2 2 2 16.66666667
9 2 3 8 66.66666667
;
run;

Can the code provided be modified? Or how can data want be accomplished?
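The code referred to in the post is not shown here, so the following is an independent sketch rather than a modification of it. It assumes a fixed sample size of 12 per time-by-group cell (taken from the earlier post) and treats the var1=3 count as whatever remains after the var1=1 and var1=2 counts, which yields a single var1=3 row whether or not one already exists in the input. The names `freq_out`, `known` and the macro variable `n` are hypothetical, and only a subset of the posted rows is used.

```sas
%let n = 12;   /* assumed sample size per time-by-group cell */

/* A few rows from the post, including a cell that already has var1=3 */
data freq_out;
  input time group var1 count percentage;
  datalines;
0 1 1 3 42.8
0 1 2 4 57.14
0 2 1 5 45.45
0 2 2 5 45.45
0 2 3 1 9.09
6 1 2 3 100
;
run;

data want;
  set freq_out;
  by time group;
  if first.group then known = 0;        /* restart per time-by-group cell */
  if var1 in (1, 2) then do;
    known + count;                      /* total of answered categories */
    percentage = count / &n * 100;      /* percentage out of the full sample */
    output;
  end;
  if last.group then do;                /* one var1=3 row per cell */
    var1 = 3;
    count = &n - known;
    percentage = count / &n * 100;
    output;
  end;
  drop known;
run;
```

For the cell time=0, group=2 this gives a single var1=3 row with count 2 (12 minus 5 minus 5), matching the desired output in the post. If the sample size differs between cells, the fixed `&n` would need to be replaced by a per-cell value.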

ANKH1 · ‎12-06-2023

Thank you so much!

ANKH1 · ‎12-05-2023

Hi, I ran the following proc and got the data below.

proc freq data=ds1;
table var1;
by time group;
run;

data output;
input time group var1 count percentage;
datalines;
0 1 1 3 42.8
0 1 2 4 57.14
0 2 1 7 58.3
0 2 2 5 41.6
3 1 1 2 28.6
3 1 2 5 71.5
3 2 1 1 50
3 2 2 1 50
6 1 2 3 100
6 2 1 2 100
9 1 2 2 100
9 2 1 2 66.6
9 2 2 2 33.3
;
run;

Var1 is a categorical variable (1=yes, 2=no, 3=unknown). My question is whether there is a way to use proc freq, or whether a data step is needed, to get the desired output. These are the points to consider:

1) This is data grouped by window and by group. The percentage for each category of var1 is calculated in the above proc freq as: (count of category/sum of counts of categories with answers)*100.
2) However, we need to calculate the percentages taking into account that the sample size is 12. That is, for the first row the percentage should be calculated as (count of category/12)*100 = 25%. For the second row the percentage should be 33.33%.
3) A third category should be created to account for the unknowns in the sample of 12 (12-(3+4)=5). That means that for the first group the percentages should be 1=25%, 2=33.3%, 3=41.6%.

But since this step comes just before other data wrangling, new rows have to be created (all rows where var1=3):

data want;
input time group var1 count percentage;
datalines;
0 1 1 3 25
0 1 2 4 33.33333333
0 1 3 5 41.66666667
0 2 1 7 58.33333333
0 2 2 5 41.66666667
0 2 3 0 0
3 1 1 2 16.66666667
3 1 2 5 41.66666667
3 1 3 5 41.66666667
3 2 1 1 8.333333333
3 2 2 1 8.333333333
3 2 3 10 83.33333333
6 1 2 3 25
6 1 3 9 75
6 2 1 2 16.66666667
6 2 3 10 83.33333333
9 1 2 2 16.66666667
9 1 3 10 83.33333333
9 2 1 2 16.66666667
9 2 2 2 16.66666667
9 2 3 8 66.66666667
;
run;

ANKH1 · ‎11-30-2023

Worked! Thank you so much!

ANKH1 · ‎11-30-2023

Hi, I am trying to open and work with a SAS dataset provided by the CDC at this link: https://www.cdc.gov/nccdphp/dnpao/growthcharts/resources/sas.htm The dataset is the one under step 1 of the instructions for SAS users. I downloaded the file and copied it to a folder in SAS Studio. I tried opening it directly from the folder and by calling it in a data step, but it says it doesn't exist. The error when clicking on the file (from the folder) says: File_TEMP1.CDCREF_D.DATA does not exist. However, the file opens in SAS Universal Viewer. How can I open it from SAS Studio?
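One thing that may be worth checking, offered as a hedged sketch rather than a confirmed diagnosis: on a Unix-based SAS Studio server, SAS expects the physical file name of a dataset to be all lowercase, so a file whose name contains uppercase letters may not be found even though it is in the folder. The folder path and the libref `cdc` below are hypothetical and need to be replaced with the actual location.

```sas
/* Hypothetical folder path: replace with the SAS Studio folder holding the file */
libname cdc "/home/your_user_id/cdc";

/* Assumes the file in that folder has been renamed to lowercase,
   i.e. cdcref_d.sas7bdat */
proc contents data=cdc.cdcref_d;
run;
```

If `proc contents` lists the variables, the dataset can then be referenced as `cdc.cdcref_d` in a data step or procedure.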

ANKH1 · ‎11-27-2023

Oh! Thank you so much for your explanation! I really appreciate it.

ANKH1 · ‎11-27-2023

Thank you!

ANKH1 · ‎11-27-2023

Thank you! The output is exactly what we need. I have a question: how does "if first.window;" account for the first row of each window value (e.g., the first row with window=0) without deleting the window=6, 12, etc. rows? I would've thought that first.window keeps only the very first row of the variable window. Thanks!
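A small illustration of how `first.` variables behave with BY-group processing. The data and the `by id window;` statement are assumptions made for this sketch, since the original code is not shown in the post.

```sas
/* Made-up data: two IDs, some window values repeated */
data have;
  input id window value;
  datalines;
1 0 10
1 0 11
1 6 12
1 12 13
1 12 14
2 0 20
2 6 21
;
run;

data first_rows;
  set have;
  by id window;        /* input must already be sorted by id and window */
  if first.window;     /* true on the first row of EACH by-group of window */
run;
```

`first.window` is set to 1 every time the value of window changes (or the value of an earlier BY variable, here id, changes), not just on the first row of the dataset. With the made-up rows above, the subsetting `if` keeps one row each for window 0, 6 and 12 under id 1, and one row each for window 0 and 6 under id 2.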

