About mh04

mh04 · ‎08-28-2023

Thank you! This is a bit over my head but I look forward to testing it and learning new things.

mh04 · ‎08-28-2023

Thank you!

mh04 · ‎08-28-2023

Yes, your interpretation is correct. Thank you!

mh04 · ‎08-28-2023

Thank you and sorry about my formatting!

mh04 · ‎08-24-2023

I have a long dataset with a 36-char STRING variable by ID and CLASS level. Some ID's CLASS values have multiple STRING values and I need to collapse the STRING by unique ID-CLASS group. Each character in the string represents a month and values should be collapsed as follows: if month X is 0 across all STRING within an ID-CLASS group, then month X should be 0 in final string; if month X is a 1 in any STRING within an ID-CLASS group, then month X should be 1; and if there are only 2s or 2s and 0s in month X in all STRING within an ID-CLASS group, then month X should be 2. So for ID #1002, the final data should contain one row for CLASS #973, with the same info as before, and only one row for CLASS #934 where the STRING would be 010122111110000000000000000000000000. ID CLASS STRING 10002 973 000000000000011111000000000000111110 10002 934 010122121110000000000000000000000000 10002 934 020222111100000000000000000000000000 I tried creating 36 month-level indicators based on the string and using them below. It's working in some cases, but not in others. I think when an earlier string contains a 2 and a later one has a 1, it updates to 1 correctly. But when a 1 appears in an earlier string and a 2 in a later string, it changes the 1 to 2. data dsn; length newstring $36; set dsn; by id class; array mstr (36) STR01-STR36; array nstr (36) NSTR01-NSTR36; do i=1 to 36; if first.unitid then nstr[i] = 0; end; retain nstr01--nstr36; do i=1 to 36 ; if mstr[i] > 0 then nstr[i] = min(of mstr[i]); end; newstring = cats(of nstr:); run; I'd appreciate any tips or suggestions. Thanks.

mh04 · ‎09-07-2020

Thank you! If GRADDT is before 2017, EMPDT should be set to the first month&year in the string that equals 1/2/3. I had very few cases in that scenario so to save time I hardcoded based on position of first 1/2/3.

mh04 · ‎09-07-2020

Thank you! I was able to make this work for me.

mh04 · ‎09-04-2020

Yes, sorry for leaving out important details about the data. GRADDT ranges between 200901 and 201912. EMPDT doesn't exist in the data and should be missing if GRADDT is missing. If there's no 1/2/3 on or after GRADDT, EMPDT should be 0. My data are always in SAS datasets so I know nothing about reading in raw data. I'm not sure how to edit your code to make it work for me. Thanks anyway though.

mh04 · ‎09-04-2020

Similar to the issue mentioned earlier, if there are no values of 1/2/3 after GRADDT, it returns the date of the prior month instead of 0 or missing. Not sure how to edit the code to fix that.

mh04 · ‎09-04-2020

Thank you! This works pretty well. There are a few places where it doesn't work because the data are a bit more messy than mentioned earlier. I have some cases with GRADDT before 201701. Using this code sets EMPDT to the month before GRADDT instead of the first occurrence of 1/2/3 in the string. In the example below, EMPDT gets set to 201608 instead of 201702. MONTHSTR GRADDT EMPDT 011110000000000000000000000000000000 201609 201608

mh04 · ‎09-04-2020

I meant chronological order between 201701 and 201912..

mh04 · ‎09-04-2020

The string for ID #3 is 111001111000000000000000001111100022; the first 1/2/3 that comes on or after the GRADDT of 201704 (so in 4th position or after) is in the 6th position (in red). Since the string represents months in chronological order between 201601 and 201912, the 6th position means 201706. Hope this makes sense. GRADDT is not a SAS date, it's character. I'd like the new variable to also be character.

mh04 · ‎09-04-2020

Hi, I have these 36-character strings where each character represents a month between 201701 and 201912 (in chronological order). I need to create EMPDT which would capture the date of the first 1, 2, or 3 in the string that occurs on or after GRADDT. For ID #1, EMPDT would be 201702 because the second character in the string (which corresponds to 201702) is a 1; for ID #3, EMPDT should be 201706. I had initially created 36 month-level variables, but I haven't figured out how to avoid writing clunky code and loop over them in a way that would get me what I need… I would appreciate any tips. ID MONTHSTR GRADDT EMPDT 1 210000000000000000000000000000010111 201702 201702 2 000000000000000000000000000000011111 201805 201908 3 111001111000000000000000001111100022 201704 201706 4 222200000000000030000000000000000011 201903 201911 5 000001111111000200000000000000011111 201706 201706 6 001110000000000000000000001000001111 201707 201903 7 100000022000000000000000000000022211 201809 201908 8 000220000001111111111000000000000000 201712 201712 9 033330000000000000000000000001111100 201801 201806

mh04 · ‎07-30-2020

Thank you! You both make it look so easy...

mh04 · ‎07-30-2020

Thank you!

Online Status	Offline
Date Last Visited	‎09-09-2023 01:02 AM

Re: Collapsing string values within by-groups

Re: Collapsing string values within by-groups

Re: Collapsing string values within by-groups

Re: Collapsing string values within by-groups

Collapsing string values within by-groups

Re: Getting dates from same length strings

Re: Getting dates from same length strings

Re: Getting dates from same length strings

Re: Getting dates from same length strings

Re: Getting dates from same length strings

Re: Collapsing string values within by-groups

Re: Collapsing string values within by-groups

Re: Collapsing string values within by-groups

Re: Collapsing string values within by-groups

Re: Getting dates from same length strings

Re: Collapsing string values within by-groups

Re: Collapsing string values within by-groups

Re: Collapsing string values within by-groups

Re: Collapsing string values within by-groups

Collapsing string values within by-groups

Re: Getting dates from same length strings

Re: Getting dates from same length strings

Re: Getting dates from same length strings

Re: Getting dates from same length strings

Re: Getting dates from same length strings

Re: Getting dates from same length strings

Re: Getting dates from same length strings

Getting dates from same length strings

Re: Counting groups of consecutive characters in string

Re: Counting groups of consecutive characters in string