About kmardinian

kmardinian · ‎10-10-2017

Thank you so much, this has been super helpful!

kmardinian · ‎10-10-2017

Hi mkeintz, That sounds like it would work perfectly, do you think you could provide me with sample code to see how SAS likes it to be wrriten out? Thank you!

kmardinian · ‎10-10-2017

I have a dataset that I created from merging two different datasets by ID number. It resembles something like this; ID N N2 1232 KRAS TIR 1232 KRAS EGF 1111 KRAS MET 1111 EGF PTEN 1111 EGF PTEN 2342 PTEN LKR 2323 ERK MET 2323 MET TER 2222 MET REK 2222 MET MET Unfortunately, they're are many duplicates of each ID number and N and N2, so my issue is I'd like to find out how many unique observations are there through proc freq. So for ID 1232, it would count KRAS only once and for ID 1111 it would count EGF and PTEN only one as well. Is there anyway to do this through Proc freq? Thank you!

kmardinian · ‎10-10-2017

I have a dataset that I created from merging two different datasets by ID number. It resembles something like this; ID N N2 1232 KRAS TIR 1232 KRAS EGF 1111 KRAS MET 1111 EGF PTEN 1111 EGF PTEN 2342 PTEN LKR 2323 ERK MET 2323 MET TER 2222 MET REK 2222 MET MET Unfortunately, they're are many duplicates of each ID number and N and N2, so my issue is I'd like to find out how many unique observations are there through proc freq. So for ID 1232, it would count KRAS only once and for ID 1111 it would count EGF and PTEN only one as well. Is there anyway to do this through Proc freq? Thank you!

kmardinian · ‎10-10-2017

Ok, thank you! I was able to get it to work.

kmardinian · ‎10-10-2017

If I were to change ID to just be read as character, where would it best to write that code?

kmardinian · ‎10-10-2017

What's strange is the ID in both excel sheets are in the exact same format. The cells are formatted as "General" and the ID numbers are all 8 numbers long. So I'm not sure why SAS is reading them as different?

kmardinian · ‎10-10-2017

LOG: 1 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK; 61 62 63 Data T2; 64 Set T1 (keep= ID Name); 65 IDchar = put(ID, z8.); WARNING: Variable ID has already been defined as numeric. 66 drop ID; 67 rename IDchar=ID; 68 Run; NOTE: There were 26932 observations read from the data set WORK.T1. NOTE: The data set WORK.T2 has 26932 observations and 2 variables. NOTE: DATA statement used (Total process time): real time 0.01 seconds cpu time 0.01 seconds 69 70 71 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK; 84 The Log isn't giving me too much information, so it probably didn't work in fixing this issue. I'm not really sure what next steps to take

kmardinian · ‎10-10-2017

Data T2; Set T1 (keep= ID Name); IDchar = put(ID, z8.); drop ID; rename IDchar=ID; Run; Data BT (keep= ID Name); Merge T2 MCC1; By ID; *If inT2 and inMCC1; Run; So adding in the set statement I think worked! But it did give me a warning saying "WARNING: Variable MRN has already been defined as numeric." Does that still mean it's ok? But now I have also another issue, when I merge T2 and MCC1 to create the dataset BT, I lose the ID the Names from MCC1...I took out my "if inT2 and inMCC1" statement for now because it is still giving me errors

kmardinian · ‎10-10-2017

I see, I assumed sas would pull the variables from the excel spreadsheet. What would be the easiest way to do that? Should I use a Keep statement when merging the two datasets? Thank you!

kmardinian · ‎10-10-2017

I used proc import to import the excel datasets and they seem to have imported correctly.

kmardinian · ‎10-10-2017

proc sort data= G out=G1; by ID; run; proc sort data= P out=P1; by ID; run; Proc sort data=T out=T1; by ID; Run; Data CombinedData; Merge G1 P1; By ID; Run; Data MCC1; Set CombinedData (keep= Name ID protocol); If missing(protocol) then delete; run; Data T1; IDchar = put(ID, $8.); drop ID; rename IDchar=ID; Run; Data BT; Merge T1 MCC1; By ID Name; If inT1 and inMCC1; Run; Here is my code above, I've been getting this error for the ID variable where SAS states that it is not defined as character or numeric, so I added in a part of the code above to try to transform the ID variable (IDchar = put(ID, $8.); drop ID; rename IDchar=ID;). But now my Data BT just shows up blank and no observations are printed or read through SAS. Any help is much appreciated, I am still pretty new to SAS. Thank you!

kmardinian · ‎09-20-2017

Yes, that worked nicely! Thank you everyone for all your help. This seemed to be the easiest way to do this, I appreciate the advice. Thank you!

kmardinian · ‎09-19-2017

John Doe EGFR RS2R John Doe EGFR 5539 Jane Williams BRCA1 6006 Jane Williams BRCA1 2002 Tom Ford BRCA1 4008 Tom Ford BRCA1 2343 Tom Ford BRCA1 2343 Tom Ford EGFR 6382 Luis Mo ALK1 8373 Luis Mo EGFR 3378 Katie Lu BRCA1 3873 katie Lu EGFR 8739 This is a tiny example of what my dataset looks like. I have 2,000 different individuals, each with their own different set of mutations. I want to find the frequency of each mutation (ex. EGFR, BRCA1, etc ) in my total population, regardless of what location the mutation is on the gene (so I don't care about the numbers). So I wanted to find an easy way to fine the freuquency of EGFR mutations, by grouping all the EGFR XXXX into one variable category (EGFR), the BRAC1 XXXX into another, etc... without having to manually do it for 2,000 people. I would like my output to show the percentage of each mutation in my population with the new variables I create. I'm sorry my explanations are poor, as you can see I am definitely a SAS beginner and have not mastered a lot of the data management aspect. Thank you!

kmardinian · ‎09-19-2017

So the only issue with the scan function, is that I have over 50 variables with EGFR as a prefix, so is the only way to write out each line of code for the 50 variables? Thank you!

Online Status	Offline
Date Last Visited	‎05-27-2021 02:24 PM

Re: SAS changing my character variable to dates from Excel

SAS changing my character variable to dates from Excel

Re: ERROR: Invalid value for width specified - width out of range

ERROR: Invalid value for width specified - width out of range

Re: PROC POWER Calculation

Re: PROC POWER Calculation

PROC POWER Calculation

Re: Proc Power without Reference MEANDIFF

Re: Proc Power without Reference MEANDIFF

Re: Proc Power without Reference MEANDIFF

Re: SAS/ACCESS and MySQL error

Re: NOTE: Character values have been converted to numeric values

Re: ERROR: Import cancelled. The dataset INPUT.X is being used and can...

Re: SAS/ACCESS and MySQL error

Re: If then statement involving two different dataset and two differen...

Re: Dealing with Duplicate observations with Proc Freq

Re: Dealing with Duplicate observations with Proc Freq

Dealing with Duplicate observations with Proc Freq

Dealing with Duplicates with Proc Freq

Re: ERROR: ID is has been defined as both character and numeric

Re: ERROR: ID is has been defined as both character and numeric

Re: ERROR: ID is has been defined as both character and numeric

Re: ERROR: ID is has been defined as both character and numeric

Re: ERROR: ID is has been defined as both character and numeric

Re: ERROR: ID is has been defined as both character and numeric

Re: ERROR: ID is has been defined as both character and numeric

ERROR: ID is has been defined as both character and numeric

Re: Renaming variables that have the same prefix

Re: Renaming variables that have the same prefix

Re: Renaming variables that have the same prefix