About stancemcgraw

Quentin · ‎12-21-2023

Good tip @Tom , thanks. I didn't realize Windows Excel had an option to use 1904 as day 0, but there it is: https://support.microsoft.com/en-us/office/date-systems-in-excel-e7fe7167-48a9-4b96-bb53-5612a800b487 Makes me wonder if all the SAS engines that read excel are smart enough to honor this option. I never learned the history of why Mac Excel used the 1904 base date. The internet says that microsoft intentionally coded the 1900 leapyear bug into Excel to make it compatible with Lotus 1-2-3. But maybe when they got around to making a Mac version, Steve Jobs wasn't convinced it was a helpful bug, so went with 1904?

Kurt_Bremser · ‎12-14-2022

Try PROC TRANSPOSE: proc transpose data=have out=want ( drop=_name_ rename=(col1=icd_code) where=(icd_code ne "") ) ; by studynum; var icd:; run;

tarheel13 · ‎12-13-2022

you may find this paper useful. make an array of the ICD codes and then loop over them. https://www.sas.com/content/dam/SAS/support/en/sas-global-forum-proceedings/2019/3117-2019.pdf data example; input studynum icd10code_1 $ icd10code_2 $ icd10code_3 $; datalines; 1 S02.028A S00.03XA S06.5X9A 2 S14.5X9A S00.83XA S22.028A 3 S06.5X9A S61.412A S34.5X9A ; proc print; run; data two; set example; spinal_injury=0; /*initialize to 0*/; tbi_injury=0; /*initialize to 0*/; array diagvar $ icd10code_1-icd10code_3; do over diagvar; if diagvar in: ("S14","S24","S34") then spinal_injury=1; if ("S06.0" <=: diagvar <=: "S06.6") or (diagvar in: ("S02.0","S02.1","S02.9", "S06.8","S06.9","S06.A","S07.1")) then tbi_injury=1; end; proc print; run; The above code achieves your desired results.

Tom · ‎08-11-2022

@stancemcgraw wrote: I tried this and it sets the new variable to missing and says, "NOTE: Invalid argument to function INPUT at line 24 column 16." So perhaps the answer to my question was that it was the First of March and not January Third. Did you try using DDMMYY as the informat instead of MMDDYY? Or perhaps there are just some values that are blank? You could try only trying to create a date when the string is not empty. it not missing(arrival_date) then arrival_date_ = input(arrival_date,mmddyy10.);

Kurt_Bremser · ‎07-29-2022

proc sql; create table want as select studynum, (max(med_admin_date) - min(med_admin_date)) / 3600 as difference from have group by studynum ; quit;

stancemcgraw · ‎02-22-2021

Thank you! That worked!!!

Kurt_Bremser · ‎01-27-2021

Use the SCAN() function with -2 and -1.

Reeza · ‎01-19-2021

I'm assuming that you'd only keep the last record. You're assuming that you only remove the first record. Given the data, I highly suspect it would make sense to only include the last record but only OP can clarify that issue.

FreelanceReinh · ‎11-14-2020

Hello @stancemcgraw, Try this: week=ceil((date-'15MAR2019'd)/7); Equivalently (for your date range) you can use the INTCK function (with the shifted interval 'week.7' -- weeks starting on Saturdays), as suggested by ballardw: week=intck('week.7','15MAR2019'd, date); Or the WEEK function, as suggested by SASKiwi: week=week(date-69); The "magic" number 69 can be computed as '16MAR2019'd-nwkdom(1,1,1,2019), i.e., the difference (in days) between the first day of week 1 in your date range and the first day of week 1 of 2019 according to the WEEK function (with the default "descriptor").

stancemcgraw · ‎10-29-2020

Thank you I always forget the going from wide to long step, you're right. You've really helped my day, thanks so much!

FreelanceReinh · ‎10-23-2020

Hi @stancemcgraw, As you can see from @ballardw's suggestion, this is no problem technically. But before taking a single step towards changing the IDs, I would rather take ten steps to investigate why there are duplicates (check source data, read documentation, ask co-workers, etc.). IDs, in particular patient IDs, are crucial and must not be changed in an ad-hoc manner. They are likely to occur in several datasets and are typically used as key variables to join tables. (That is, a change in one dataset would require consistent changes in other datasets.) There is also a risk of incorrectly splitting the observations from a single patient by assigning different IDs. Age can change over time, errors in the data are possible and the same combination of Age and Sex may or may not belong to different patients. One possible reason for duplicate IDs is that only the combination of two (or more) key variables is unique. For example, in multi-center clinical trials it is common to use the combination of center ID and patient number as a unique key on patient level. Duplicates in character variables can also result from truncation in an earlier step: What if you notice that all duplicates start with "ID-10," whereas the "ID-9..." cases are unique in a Studyid variable with length 7? ID-9998 ID-9999 ... ID-10361 ID-10370 ID-10364 ID-10372 ID-10369 ID-10375

Reeza · ‎10-04-2020

I think this is identical to a previous question you asked recently... Given that diseases aren't coded cleanly, I'd definitely recommending using the dynamic approach instead. 1. Separate out terms into individual rows (data step) 2. Clean them up here for the issues mentioned in your second post. 3. Add an indicator variable to get the 1 4. Flip the clean version wide using PROC TRANSPOSE

mklangley · ‎10-02-2020

@stancemcgraw Since you already have the number of days in character strings, give this a try. data have; input hlos $31.; datalines; 2.293055555555555556 8.490277777777777 8.0291667 ; run; data want; set have; hlos_numeric = round(input(hlos, 12.4), 1); run; (Also, I believe your second example would round off to 8 days, not 9.)

Tom · ‎09-27-2020

Excel stores time as fraction of a day. So just multiple by 24 hours. data have; input arrival $20.; cards; .73194444444 .14652777778 .34444444444 .3 ; data want; set have; time = '24:00't * input(arrival,32.); format time tod5.; run; Results: Obs arrival time 1 .73194444444 17:34 2 .14652777778 03:31 3 .34444444444 08:16 4 .3 07:12

Tom · ‎09-25-2020

To change the string values to date values first convert to a number and then adjust for the difference in how Excel and SAS count dates. Then you can attach any date type format you want to have the values display in a human readable way. date = input(doa,32.)+'30DEC1899'd ; format date date9.;

Online Status	Offline
Date Last Visited	‎12-21-2023 05:08 PM

Character dates and times to numeric dates and times

Code out wide data with an array

Coding data wide to long

Re: Character dates

Re: Character dates

Character dates

Trying to find difference between first and last date of same subject

Re: Identify and remove hidden characters in character variable

Re: Identify and remove hidden characters in character variable

Re: Identify and remove hidden characters in character variable

Re: Convert character hours and minutes to numeric days

Re: Formatting 24 hr character time to 24hr numeric time

Re: Change character date in format $13. to ddmmyy10. numeric

Re: Create new dummy variables from multiple columns of character vari...

Re: Scan or Find or Count option

Code out wide data with an array

Re: Character dates and times to numeric dates and times

Re: Coding data wide to long

Re: Code out wide data with an array

Re: Character dates

Re: Trying to find difference between first and last date of same subj...

Re: Identify and remove hidden characters in character variable

Re: Pull out last two decimals of number

Re: Repeating values

Re: Identifying week numbers from specific dates

Re: Create variables from multiple character text

Re: Assign values to a duplicate ID

Re: Transposing wide data

Re: Convert character hours and minutes to numeric days

Re: Formatting 24 hr character time to 24hr numeric time

Re: Change character date in format $13. to ddmmyy10. numeric