About Krysia24

Krysia24 · ‎05-26-2020

Oh wow! Such a silly mistake on my part. Thank you!

Krysia24 · ‎05-22-2020

Hello SAS Users! I am using a PROC EXPORT step to export a txt file; however, the output keeps exporting without the ".txt" file extension.

    PROC EXPORT DATA=survey
        OUTFILE="\\fakepath\survey_&today.txt"
        DBMS=tab REPLACE;
        PUTNAMES=YES;
    RUN;

The output file looks like this: "survey_22May2020", without the ".txt" extension. So I have to open it in Notepad and save it as a .txt file as a manual workaround... Thoughts?
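A likely explanation, offered as a sketch rather than a confirmed diagnosis: in a macro variable reference, a period marks the end of the variable name and is consumed when the reference resolves. So "&today.txt" resolves to the value of today followed directly by "txt", with no dot. Writing two periods keeps one of them as a literal character. The %LET assignment below is an assumption for illustration; the post does not show how &today is created.

```sas
/* Assumption: &today holds a date string; the original post does not show its definition. */
%let today = %sysfunc(today(), date9.);

/* Write both spellings to the log to compare how they resolve. */
%put One period:  survey_&today.txt;
%put Two periods: survey_&today..txt;

/* Same export step as in the post, with the doubled period in the file name. */
PROC EXPORT DATA=survey
    OUTFILE="\\fakepath\survey_&today..txt"
    DBMS=tab REPLACE;
    PUTNAMES=YES;
RUN;
```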

Krysia24 · ‎04-16-2020

Thank you! This worked perfectly!

Krysia24 · ‎04-15-2020

Hi all, I'd like to create a table where the count of something is displayed by day. I am interested in also displaying the days where the count was zero. Here's what I have, which obviously only yields dates where the count for that day was >= 1.

    PROC SQL;
        CREATE TABLE graph_data AS
        SELECT dtevent, count(*) AS totalnum
        FROM latestcounts
        GROUP BY dtevent
        ORDER BY dtevent;
    QUIT;
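One common way to get the zero-count days is to build a table with one row per calendar day and left join the events onto it. The sketch below assumes dtevent is a numeric SAS date (not a datetime); the sample rows and the names date_range and all_days are hypothetical and not from the original thread.

```sas
/* Hypothetical sample data standing in for latestcounts; 02APR2020 has no events. */
data latestcounts;
    format dtevent date9.;
    do dtevent = '01APR2020'd, '01APR2020'd, '03APR2020'd;
        output;
    end;
run;

/* First and last event dates. */
proc sql;
    create table date_range as
    select min(dtevent) as first_day, max(dtevent) as last_day
    from latestcounts;
quit;

/* One row per calendar day between the first and last event. */
data all_days;
    set date_range;
    format dtevent date9.;
    do dtevent = first_day to last_day;
        output;
    end;
    keep dtevent;
run;

/* count(l.dtevent) counts only matched rows, so days without events get 0. */
proc sql;
    create table graph_data as
    select d.dtevent, count(l.dtevent) as totalnum
    from all_days as d
    left join latestcounts as l
        on d.dtevent = l.dtevent
    group by d.dtevent
    order by d.dtevent;
quit;
```

Counting l.dtevent rather than using count(*) matters here: count(*) would count the unmatched calendar row itself and report 1 instead of 0.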

Krysia24 · ‎10-30-2019

As a follow-up: if a lot of individuals don't have all three time points, it sounds like a mixed model (PROC MIXED) would be my best bet so that I don't lose all those data. However, if I just want to study individuals who have all 3 time points (to see what they specifically look like), can a repeated measures ANOVA (2-factor?) be used, and will it take into account treatment vs. control group as well as the time point differences?

Krysia24 · ‎10-30-2019

Hi Rick, Sorry for the confusion. The IDs were supposed to be listed as 1-8.

Krysia24 · ‎10-29-2019

I am trying to compare repeated measures on a continuous variable between 2 different groups over time. So let's say the data look like this:

ID Group Time Point 1 Time Point 2 Time Point 3 Time Point 4
1 Treatment 97 92 90
2 Treatment 103 104 107 102
3 Treatment 99 98 96
4 Treatment 92 96 99
5 Control 100 95 94 90
6 Control 99 96
7 Control 95 94 93
8 Control 89 88 85

I want to compare the mean score at the different time points between the 2 groups. Notice how there might be missing data at the different time points (most people are not going to have all four time points), so I'm thinking a repeated measures ANOVA of some sort would delete a lot of the responses. What's my best option in SAS?
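A mixed model fitted to long-format data uses every observed score, so subjects with some missing time points still contribute. The sketch below is one way to set that up, not a recommendation of a specific model. The post does not say which time points are missing for each subject, so the sample data assume the missing scores fall at the end; the variable names and the compound-symmetry covariance structure (type=cs) are also assumptions.

```sas
/* Hypothetical wide data; the positions of the missing scores are assumed. */
data wide;
    input id group :$9. tp1-tp4;
    datalines;
1 Treatment 97 92 90 .
2 Treatment 103 104 107 102
3 Treatment 99 98 96 .
4 Treatment 92 96 99 .
5 Control 100 95 94 90
6 Control 99 96 . .
7 Control 95 94 93 .
8 Control 89 88 85 .
;
run;

/* Reshape to one row per subject per observed time point. */
data long;
    set wide;
    array tp{4} tp1-tp4;
    do time = 1 to 4;
        score = tp{time};
        if not missing(score) then output;
    end;
    keep id group time score;
run;

/* Group, time, and their interaction as fixed effects;
   repeated measures within subject with an assumed covariance structure. */
proc mixed data=long;
    class id group time;
    model score = group time group*time / solution;
    repeated time / subject=id type=cs;
    lsmeans group*time / slice=time;
run;
```

The SLICE=time option tests the group difference separately at each time point, which matches the stated goal of comparing group means at the different time points.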

Krysia24 · ‎05-07-2019

Okay thanks! I think I'll explore analyses with both formats and see what works better.

Krysia24 · ‎05-07-2019

So one way we'd be using this is trend analysis, to see if a patient's severity/risk increases over time across their multiple visits. (That's not something you can tell from what I plugged in; I just put in some random discharge codes.)

Krysia24 · ‎05-07-2019

So again, the data in the cells is just an example of what I have. What I'll be doing is repeated measures analysis, so I thought it'd be easier to have one line for each patient with multiple visits.

Krysia24 · ‎05-07-2019

Having a hard time trying to get my transpose right. Perhaps it's not even the right thing to be going for. I have something that looks like this (just an example):

ID  Visit_Date  Discharge_Dx  Sex  DOB         Chief_Complaint
1   3/1/2019    G44           M    3/24/1957   headache
2   2/22/2019   W34           F    8/23/1967   GSW
2   4/15/2019   S01           F    8/23/1967   cut
3   1/17/2019   T50           M    9/6/1991    overdose
4   3/27/2019   J02           F    11/3/2012   sore throat
5   2/3/2019    R11           F    12/19/2008  vomiting
6   1/6/2019    M25           M    7/7/1977    body aches
6   2/11/2019   M54           M    7/7/1977    back pain
6   3/7/2019    R05           M    7/7/1977    cough

But what I'd want is something more like this, where IDs with multiple visit dates are condensed into a single line. DOB and sex can stay the same, but I'd like multiple columns for visit date, chief complaint, and discharge dx if a patient has multiple records for those:

ID  Visit_Date_1  Visit_Date_2  Visit_Date_3  Discharge_Dx_1  Discharge_Dx_2  Discharge_Dx_3  Sex  DOB         Chief_Complaint_1  Chief_Complaint_2  Chief_Complaint_3
1   3/1/2019      .             .             G44             .               .               M    3/24/1957   headache           .                  .
2   2/22/2019     4/15/2019     .             W34             S01             .               F    8/23/1967   GSW                cut                .
3   1/17/2019     .             .             T50             .               .               M    9/6/1991    overdose           .                  .
4   3/27/2019     .             .             J02             .               .               F    11/3/2012   sore throat        .                  .
5   2/3/2019      .             .             R11             .               .               F    12/19/2008  vomiting           .                  .
6   1/6/2019      2/11/2019     3/7/2019      M25             M54             R05             M    7/7/1977    body aches         back pain          cough
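One way to reach that layout, sketched here under assumptions, is to run PROC TRANSPOSE once per repeating variable and merge the results by ID. The sample rows are a subset of the example above, and the dataset names visits, dates, dx, cc, and wide are hypothetical. The sketch assumes Sex and DOB are constant within an ID, as in the example.

```sas
/* A few rows from the example, read as comma-delimited data. */
data visits;
    infile datalines dsd dlm=',';
    input id visit_date :mmddyy10. discharge_dx :$3. sex :$1.
          dob :mmddyy10. chief_complaint :$20.;
    format visit_date dob mmddyy10.;
    datalines;
1,3/1/2019,G44,M,3/24/1957,headache
2,2/22/2019,W34,F,8/23/1967,GSW
2,4/15/2019,S01,F,8/23/1967,cut
6,1/6/2019,M25,M,7/7/1977,body aches
6,2/11/2019,M54,M,7/7/1977,back pain
6,3/7/2019,R05,M,7/7/1977,cough
;
run;

/* Visits must be in date order within each ID so the suffixes 1, 2, 3 are chronological. */
proc sort data=visits;
    by id visit_date;
run;

/* One transpose per repeating variable; PREFIX= supplies the column name stem. */
proc transpose data=visits out=dates(drop=_name_) prefix=Visit_Date_;
    by id sex dob;
    var visit_date;
run;

proc transpose data=visits out=dx(drop=_name_) prefix=Discharge_Dx_;
    by id;
    var discharge_dx;
run;

proc transpose data=visits out=cc(drop=_name_) prefix=Chief_Complaint_;
    by id;
    var chief_complaint;
run;

/* Combine the three wide tables into one row per ID. */
data wide;
    merge dates dx cc;
    by id;
run;
```

Separate transposes keep the numeric dates and the character codes in their own columns; transposing them in a single step would force everything to character.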

Krysia24 · ‎12-13-2018

Hello, I would like to create more efficient code for truncating dozens of variables in a list. So, for instance, I have this:

DX1 DX2 DX3 DX4 DX5 DX6 DX7 …
F15.151 S32.602A S72.401B T79.4XXA R78.81 G82.20 S22.39XA J96.90 M79.5 W34.00XA R57.0 N28.9 S81.801A K21.9 E87.6 S71.132A …

And I would like just the first three characters from each string: F15, S32, S72, T79, etc. Is there a way to do this without individually creating new variables for each of the dx vars using the SUBSTR function? In actuality I have 79 of these Dx variables, so that would be a time-consuming process, and I imagine there's a better way. Thanks!
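A pair of arrays lets one loop apply SUBSTR to every diagnosis variable. This is a minimal sketch with three variables and made-up dataset and variable names (dx_codes, dx_short, dxcat1-dxcat3); with the poster's data the ranges would run to 79.

```sas
/* Small hypothetical sample with three diagnosis variables. */
data dx_codes;
    input (dx1-dx3) (:$8.);
    datalines;
F15.151 S32.602A S72.401B
T79.4XXA R78.81 G82.20
;
run;

data dx_short;
    set dx_codes;
    array dx{*} dx1-dx3;               /* existing variables; dx1-dx79 in the real data */
    array dxcat{3} $3 dxcat1-dxcat3;   /* new 3-character variables, one per dx variable */
    do i = 1 to dim(dx);
        dxcat{i} = substr(dx{i}, 1, 3);
    end;
    drop i;
run;
```

Writing to new variables keeps the full codes available; assigning back into dx{i} instead would overwrite them in place.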

Krysia24 · ‎10-10-2018

Comes to us as a SAS dataset. Hospital Name is actually "FAC_ID." I did not include the rest of the variables for simplicity but let me know if that's helpful.

Krysia24 · ‎10-10-2018

This is the dataset - it comes in like this automatically. And the number of rows per hospital changes each month, so this is not something I want to manually code each month by saying "if it's rows 1:6 then hospital name = Hospital A". Is there an array or something that we can do so that "Hospital Name" gets populated with the correct hospital name, by saying fill it in with Hospital A until there is no longer a blank, then fill it in with the next name?

Hospital Name  Month
Hospital A     Jan
               Feb
               March
               April
               May
               June
Hospital B     Jan
               Feb
               March
Hospital C     Jan
               Feb
               March
               April

This is what I want:

Hospital Name  Month
Hospital A     Jan
Hospital A     Feb
Hospital A     March
Hospital A     April
Hospital A     May
Hospital A     June
Hospital B     Jan
Hospital B     Feb
Hospital B     March
Hospital C     Jan
Hospital C     Feb
Hospital C     March
Hospital C     April

Any insight would be appreciated!
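Filling a value down until the next non-blank value is usually done with a RETAIN statement rather than an array. The sketch below assumes the hospital name is a character variable (a follow-up post says it is called FAC_ID); the sample rows and the helper variable last_fac_id are hypothetical.

```sas
/* Hypothetical sample: the name appears only on the first row of each hospital. */
data hospitals;
    infile datalines dsd dlm=',' truncover;
    input fac_id :$20. month :$10.;
    datalines;
Hospital A,Jan
,Feb
,March
Hospital B,Jan
,Feb
;
run;

data filled;
    set hospitals;
    length last_fac_id $20;
    retain last_fac_id;                        /* keeps its value across rows */
    if not missing(fac_id) then last_fac_id = fac_id;   /* remember a new name */
    else fac_id = last_fac_id;                 /* fill a blank with the remembered name */
    drop last_fac_id;
run;
```

Because the logic only depends on where the blanks are, it keeps working when the number of rows per hospital changes from month to month.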

Krysia24 · ‎07-30-2018

Hi again, Apologies for resurfacing this, but upon some QA I am realizing I have an issue with the code, and I'm not sure what the cause is. Here is the code that I ran based on what you provided:

    data match ;
      if _n_ = 1 then do ;
        dcl hash x () ;
        x.definekey ("MOMSSN") ;
        x.definedata ("p") ;
        x.definedone () ;
        dcl hash y () ;
        y.definekey ("BMOMFIRSTNAME", "BMOMLASTNAME", "momyeardob") ;
        y.definedata ("p") ;
        y.definedone () ;
        if 0 then set deaths2 ;
        do p = 1 by 1 until (y2) ;
          set births2 end = y2 ;
          x.ref() ;
          y.ref() ;
        end ;
      end ;
      call missing (of _all_) ;
      set deaths2 ;
      /* first try to match on SSN alone; do not want matches on missing SSNs */
      if (SSN) = . then Match = 0 ;
      else Match = x.find(key:SSN) eq 0 ;
      /* if there was no match on SSN, try to match on first name, last name, and year of birth */
      if not Match then Match = y.find(key:firstname,key:lastname,key:dbirthyear) eq 0 ;
      if Match then set births2 point = p ;
    run ;

Now, in the output dataset MATCH, there is a key variable (datedeath) that I need for further analysis, and it went missing for a lot of observations. The source dataset for this variable is deaths2, and in deaths2 none of the observations have missing data for this variable. It appears that most of the observations that have missing data for datedeath in MATCH are the observations that have Match = 1. (And again, given that the source data does not have any observations missing this information, this is odd.) Is there something in the match step that would have caused this, even though I do not reference that specific variable at all? Also, one thing of note (not sure if it's relevant): there should only be one death record for each individual, BUT they can potentially be in the births dataset (births2) multiple times if they've had multiple births...

Thank you for your help! I thought I was grasping this, but this error is leaving me a bit confused.
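One possible cause, which the post does not confirm: when two SET statements in the same DATA step read datasets that share a variable name, the later read overwrites the earlier value. If births2 happens to contain its own datedeath variable with missing values, the final "set births2 point = p" would replace the value read from deaths2 only on matched rows, which fits the symptom described. The sketch below illustrates that behavior with small hypothetical datasets; all dataset names and values are made up.

```sas
/* Hypothetical stand-ins for deaths2 and births2 that share the variable datedeath. */
data deaths_demo;
    id = 1;
    datedeath = '15JUN2018'd;
    format datedeath date9.;
run;

data births_demo;
    id = 1;
    datedeath = .;       /* same name, missing value */
    babyname = 'A';
run;

/* The second SET overwrites datedeath with the missing value from births_demo. */
data overwritten;
    set deaths_demo;
    p = 1;
    set births_demo point=p;
run;

/* Dropping the shared variable on the second read keeps the deaths value. */
data protected;
    set deaths_demo;
    p = 1;
    set births_demo(drop=datedeath) point=p;
run;
```

Comparing the variable lists of the two real datasets (for example with PROC CONTENTS) would show whether any names overlap; a RENAME= dataset option is an alternative to DROP= when both values are needed.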


Re: Proc Report - Only summarize one column

Proc Report - Only summarize one column

Re: Perl Reg Expression in Array

Perl Reg Expression in Array

Re: Proc SQL duplicate rows group by

Proc SQL duplicate rows group by

Re: Pull in the last modified file

Pull in the last modified file

Re: Extracting text from string

Extracting text from string

Re: Proc SQL duplicate rows group by

Re: Proc SQL duplicate rows group by

Re: SAS skips %include statements

Re: SAS skips %include statements

Re: More efficient way to truncate many variables at once

Re: Help with table displaying dates with zero counts

Re: Help with Proc Transpose

Re: Proc Export - .txt being excluded from file name

Proc Export - .txt being excluded from file name

Re: Help with table displaying dates with zero counts

Help with table displaying dates with zero counts

Re: Repeated ANOVA or Proc Mixed?

Re: Repeated ANOVA or Proc Mixed?

Repeated ANOVA or Proc Mixed?

Re: Help with Proc Transpose

Re: Help with Proc Transpose

Re: Help with Proc Transpose

Help with Proc Transpose

More efficient way to truncate many variables at once

Re: Array for populating rows

Array for populating rows

Re: Matching data