About Geoghegan

Geoghegan · ‎10-04-2025

Thank you! My example used fake numbers but I adjusted a few of mine to be in a descending order like my example and your suggestion worked, thank you! Thanks others for the ideas, if I have to do further I'll keep them in mind!

Geoghegan · ‎10-02-2025

I have a messy dataset where I have up to 16 iterations of a variable Result1-Result16. Some people in the dataset only have a few iterations, some have 16 and everywhere in between. I'm trying to create a result summary variable that takes into account each of these versions. The Result variable is in number form and I have a format for it that interprets what the numbers mean, I'm not sure if that matters for the coding. When interpreting the result options into the SummaryResult variable I'd like to create, some only match up with one interpretation but others should be grouped together since they mean essentially the same thing for the SummaryResult variable. For example: A result of 100 means UNKNOWN A result of 200 means NO But then a result of 300, 400, 500, etc means YES Many are also missing with . as the result, so the ResultSummary should be . also To complicate matters, I can't just look at the last/highest iteration of Result because the correct/best ResultSummary may not be there, it could be the first or middle or last. The ResultSummary should be determined by a hierarchy: If there are any results that mean YES, the ResultSummary variable should be YES. If there are no YES results, but there is at least one NO, the ResultSummary variable should be NO. If there are no YES or NO results, but there is at least one UNKNOWN, the ResultSummary variable should be UNKNOWN. If . is the only result for all iterations of Result for that person, the ResultSummary should be . (missing). Here is a fake example: Person Result1 Result2 Result3 Result4 Result5 SummaryResult 111 . . 100 . 100 UNKNOWN 112 200 200 . . 500 YES 113 . . 100 200 200 NO 114 . . . . . . 115 . 300 100 100 400 YES I know only basics about assays from a class about 6 years ago and I just can't wrap my head around how I could get it to check each iteration while not keeping the last version instead of the best summary based on the hierarchy. Thank you for any help!!

Geoghegan · ‎05-14-2024

Ahh thank you! I opened a new program and typed it out and it worked just fine, thanks so much for your help!

Geoghegan · ‎05-14-2024

Submission: May 14, 2024 4:29:12 PM 1 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK; 72 73 data underfive; 74 length Group $15 Test $3; 75 input Group Test N; 76 datalines; NOTE: Invalid data for N in line 79 1-6. RULE: ----+----1----+----2----+----3----+----4----+----5----+----6----+----7----+----8----+----9----+----0 79 NonWor Yes 71 Group=Worcester Test=No N= _ERROR_=1 _N_=2 NOTE: SAS went to a new line when INPUT statement reached past the end of a line. NOTE: The data set WORK.UNDERFIVE has 3 observations and 3 variables. NOTE: Compressing data set WORK.UNDERFIVE increased size by 100.00 percent. Compressed is 2 pages; un-compressed would require 1 pages. NOTE: DATA statement used (Total process time): real time 0.01 seconds user cpu time 0.00 seconds system cpu time 0.01 seconds memory 567.62k OS Memory 31904.00k Timestamp 05/14/2024 08:29:13 PM Step Count 201 Switch Count 2 Page Faults 0 Page Reclaims 208 Page Swaps 0 Voluntary Context Switches 24 Involuntary Context Switches 0 Block Input Operations 832 Block Output Operations 264 81 ; 82 83 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK; 95

Geoghegan · ‎05-14-2024

data underfive; length Group $15 Test $3; input Group Test N; datalines; Worcester Yes 55 Worcester No 45027 NonWor Yes 71 NonWor No 311726 ;

Geoghegan · ‎05-14-2024

Do the amount of spaces on the lines where it has CountyA Yes etc.. matter? I'm trying to figure out why it's telling me it only has three obs: NOTE: SAS went to a new line when INPUT statement reached past the end of a line. NOTE: The data set WORK.UNDERFIVE has 3 observations and 3 variables.

Geoghegan · ‎05-14-2024

Thank you! That helped make it create a dataset, though now the variables don't have the values they should (only had 3 obs and one is blank for N)

Geoghegan · ‎05-14-2024

</> data underfive; length Group $9 Test $3; input Group Test N; datalines; Worcester Yes 55 Worcester No 45027 NonWor Yes 71 NonWor No 311726 ; </> Sorry, I had copied part of an old version in, this is my current code. It gives this error: NOTE: Invalid data for N in line 79 1-6. But does create a dataset but the numbers aren't all included

Geoghegan · ‎05-14-2024

I'm trying to follow the code on this site Test for the equality of two proportions in SAS - The DO Loop for the section called A chi-square test for association in SAS. I basically need to compare the proportion in one area which was tested for something to the proportion in another area which was tested and see if they are significantly different proportions, but I can't get the code to work right. I get this error: NOTE: Invalid data for N in line 79 1-6. RULE: ----+----1----+----2----+----3----+----4----+----5----+----6----+----7----+----8----+----9----+----0 79 CountyB Yes 71 Group=CountyA Seq=No N=. _ERROR_=1 _N_=2 NOTE: SAS went to a new line when INPUT statement reached past the end of a line. My full code is: data underfive; length Group $15 Test $3; input Group Test N; datalines; CountyA Yes 55 CountyA No 45027 CountyB Yes 71 CountyB No 311726; Once I had that in I figured I would run this: proc freq data=underfive order=data; weight N; tables Group*Test/chisq; run;

Geoghegan · ‎04-24-2024

oh, thank you! I think the proc logistic may be very helpful!

Geoghegan · ‎04-24-2024

Thank you, I'll look through those!

Geoghegan · ‎04-24-2024

thank you for the explanation, I think I may be thinking about this wrong. I wanted to compare the distribution of ages between one location and the other, not between age groups within one location. So it seems like I already have that (just comparing all of the age group distribution in one to all of the age group distribution in the other). One last question - does the chi-squared test comparing these just compare the percentages in each age group between the two locations or does it take into account whether or not the sample size in each age group is enough to determine if the age distribution really was different? For example, if one location only had 2 people in one age group, that could make it difficult to be certain if the age distribution really was different. I'm sorry if that doesn't make sense!

Geoghegan · ‎04-23-2024

I'm using SAS Studio and trying to compare the proportions of people in seven age groups within two populations (location variable is 1/0) and see if there is a significant different in distribution. Currently my code is: proc freq data=demographics; table agegroup*location/chisq nocum norow nopercent; run; Am I correct in thinking that this will show me if there is a statistically significant difference in the proportions in the age groups comparing the two location options (1 vs 0)? Also, is there a reasonable way to compare each age group (for example, to see if there is a significant difference between the proportion in the first age group in the 1 location compared to the first age group in the 0 location, etc for each age group? Thank you!!

Geoghegan · ‎07-06-2022

Perfect, thank you! That fixed it!

Geoghegan · ‎07-06-2022

I am trying to transpose from wide to long because I have 30 variables (dx1-dx30) which I'd like to put into one column with a different row for each dx variable. I want all other variables in that dataset copied for each row for those dx variables. I was able to do most of this (my dx variables worked the way I want them to and it created a new row for each of them etc) but the other variables which I'd like copied into the other rows are only being filled into the first row. In the rows for dx2-dx30, it just has a period for the other variables. This is the code I have: Proc transpose data=totranspose2016 out=transposed2016 name=diagnosis prefix=code; By uniqueadmission; copy patientuhin cdiff primary readmit agecat died; /*want these copied to each corresponding row but only shows on first row for each uniqueadmission number, then has a period for these variables in all the other rows*/ Var dx1-dx30; /*I want these to go from being 30 different columns to one column with one row for each dx number - this part worked*/ Run; Is it possible to change this so the variables listed in the copy statement are copied to all of the corresponding rows?

Online Status	Offline
Date Last Visited	‎10-07-2025 02:15 PM

Re: Need help with array to check multiple iterations of one variable ...

Need help with array to check multiple iterations of one variable to c...

Re: creating dataset to use for chi squared test of proportions

Re: creating dataset to use for chi squared test of proportions

Re: creating dataset to use for chi squared test of proportions

Re: creating dataset to use for chi squared test of proportions

Re: creating dataset to use for chi squared test of proportions

Re: creating dataset to use for chi squared test of proportions

creating dataset to use for chi squared test of proportions

Re: question about using/interpreting the chi-squared option in proc f...

Re: Need help with array to check multiple iterations of one variable ...

Re: creating dataset to use for chi squared test of proportions

Re: creating dataset to use for chi squared test of proportions

Re: How do I use an array to check for several values in multiple vari...

Re: How do I use an array to check for several values in multiple vari...

Re: How do I use an array to check for several values in multiple vari...

How do I use an array to check for several values in multiple variable...

Re: Need help with array to check multiple iterations of one variable ...

Need help with array to check multiple iterations of one variable to c...

Re: creating dataset to use for chi squared test of proportions

Re: creating dataset to use for chi squared test of proportions

Re: creating dataset to use for chi squared test of proportions

Re: creating dataset to use for chi squared test of proportions

Re: creating dataset to use for chi squared test of proportions

Re: creating dataset to use for chi squared test of proportions

creating dataset to use for chi squared test of proportions

Re: question about using/interpreting the chi-squared option in proc f...

Re: question about using/interpreting the chi-squared option in proc f...

Re: question about using/interpreting the chi-squared option in proc f...

question about using/interpreting the chi-squared option in proc freq ...

Re: How do I get variables copied into all the new rows when transposi...

How do I get variables copied into all the new rows when transposing f...