About amarikow57

PaigeMiller · ‎04-27-2021

The Type III test is testing to see if the levels of each variable are different from one another. So the levels of TEMP are different from one another, the levels of PRESSURE are not. The parameter estimate tests are testing to see if the estimate is different from zero. So for PRESSURE, the estimates are different from zero.

SteveDenham · ‎04-16-2021

Thanks @jiltao for pointing out the error with my code, and even better, posting a solution that did use PROC GLM. I wish I could mark your reply as correct. SteveDenham

amarikow57 · ‎04-13-2021

Nevermind... It's the commas. Saw the mistake as it was posting.

gcjfernandez · ‎03-08-2021

Please do a variable selection , optimal binning of interval inputs and then try Gradient Boosting. Finally compare the performance with the Decision Tree model.

PGStats · ‎03-06-2021

Please note that proc optnet was introduced with SAS 9.3TS1M2.

ghosh · ‎02-20-2021

Hard to give an answer without any sample data. I would transpose the data into three columns (eg. ADOS2_3_type, ADOS2_3_val ATEAM_AUTISM_YES_NO) and use sgpanel to display them side by side: proc sgpanel data=mydat; panelby ADOS2_3_type / rows=6 columns=5 ; vbox ADOS2_3_val / category=ATEAM_AUTISM_YES_NO; run;

Astounding · ‎02-15-2021

Yes, assigning numeric values that match EVENT would be a good idea, necessary for sorting in the proper order. And yes again, UPDATE ignores missing values, but replaces the current value with any nonmissing values it encounters. That's what makes the order essential. Whatever appears in the BY statement, you will automatically get one observation for each (in this case one observation for each ID). The obs=1 is a tricky concoction. UPDATE requires two data sets. So using obs=1 creates a second version of the data to use (so that UPDATE won't complain about having just one data set). There is only one observation brought in from that obs=1 data set, and it is immediately overwritten by an observation from the complete HAVE data set. It looks like you are well on your way to understanding the UPDATE process, so don't get bogged down in the tricky aspects at this point. In all likelihood, this program eliminates the need to process intermediate data sets by dropping variables that are always missing. If it's desirable, you could apply that logic to the final data set.

Astounding · ‎02-09-2021

PROC TABULATE automatically removes observations where any CLASS variable has a missing value. You can override that behavior by adding the MISSING option to the PROC statement.

Ksharp · ‎02-09-2021

data have; input ID EVENT SCORE; cards; 1 1 3 1 2 3 2 1 7 2 2 7 3 1 8 3 2 6 4 1 . 4 2 5 ; proc sql; create table want as select * from have group by id having count(distinct score) > 1 or ( n(score) and nmiss(score)) ; quit;

Kurt_Bremser · ‎02-09-2021

You suffer from a bad dataset structure, causing you to use name literals, waste space for missing values, and write bad code. First, transpose your dataset to a long layout: options validvarname=any; data have; input ID SURVEY "1Q1"n "1Q2"n "1Q3"n "2Q1"n "2Q2"n "3Q1"n "3Q2"n "3Q3"n "3Q4"n "3Q5"n "4Q1"n "4Q2"n "4Q3"n "4Q4"n "5Q1"n "5Q2"n "5Q3"n ; datalines; 1 1 1 2 3 . . . . . . . . . . . . . . 1 2 . . . 8 0 . . . . . . . . . . . . 1 3 . . . . . 4 0 1 5 2 . . . . . . . 1 4 . . . . . . . . . . 1 4 2 1 . . . 1 5 . . . . . . . . . . . . . . 1 3 1 2 1 0 3 6 . . . . . . . . . . . . . . 2 2 . . . 7 1 . . . . . . . . . . . . 2 3 . . . . . 5 1 1 5 3 . . . . . . . 2 5 . . . . . . . . . . . . . . 0 2 0 3 1 1 3 3 . . . . . . . . . . . . . . 3 2 . . . 6 0 . . . . . . . . . . . . 3 3 . . . . . 3 1 0 5 1 . . . . . . . 3 4 . . . . . . . . . . 1 4 2 1 . . . 3 5 . . . . . . . . . . . . . . 0 3 1 4 1 1 1 5 . . . . . . . . . . . . . . 4 2 . . . 4 0 . . . . . . . . . . . . 4 3 . . . . . 6 0 1 5 2 . . . . . . . 5 1 0 3 4 . . . . . . . . . . . . . . 5 3 . . . . . 9 0 0 4 1 . . . . . . . 5 4 . . . . . . . . . . 1 4 2 0 . . . 5 5 . . . . . . . . . . . . . . 1 3 1 ; proc transpose data=have out=l1 ( rename=(col1=answer) where=(answer ne .) ) ; by id survey; var "1Q1"n--"5Q3"n; run; data long; set l1; surv = input(scan(_name_,1,"Q"),best.); question = input(scan(_name_,2,"Q"),best.); if surv ne survey then putlog "ERROR"; keep id survey question answer; run; In the resulting dataset, it is easy to group by (or select for) ID's, surveys, questions. For reporting purposes, you can use the question as an ACROSS variable in proc report to create a wide layout. For processing, a long layout is always preferred.

Reeza · ‎01-25-2021

1. Fill in birth date for all missing records proc sql; create table temp1 as select *, max(dob) as DOB_ALL from have group by patient_Id order by 1, 2, 3; quit; 2. Calculate age at diagnosis, adding on to step 1 (replaces step1) proc sql; create table temp1 as select *, max(dob) as DOB_ALL, yrdif(dob_all, visit_date, 'AGE') as AGE_DIAGNOSIS from have group by patient_Id order by 1, 2, 3; quit; 3. Merge main table with filtered table for diagnosis = 1 data want; merge have temp1 (where=(diagnosis=1) keep = (patient_ID DOB_ALL AGE_DIAGNOSIS)); by patient_ID; run; @amarikow57 wrote: My dataset has multiple visit dates for a given patient (in long format). Let's say: PATIENT_ID VIST VISIT_DATE DOB DIAGNOSIS 1 1 01/12/2011 02/15/1997 1 1 2 02/17/2011 1 3 06/23/2011 02/15/1997 1 4 11/12/2011 2 1 01/13/2011 09/21/1995 2 2 09/17/2011 0 3 1 02/03/2011 3 2 04/15/2011 11/19/2001 3 3 07/06/2011 11/19/2001 1 4 1 01/29/2011 4 2 05/30/2011 4 3 08/22/2011 07/16/2003 0 4 4 12/01/2011 I want to (a) make sure that I have the birthdate repeated for the corresponding PATIENT_ID, then I want to (b) create an AGE_AT_DIAG variable for when they were assessed. That is VISIT_DATE - DOB when DIAGNOSIS = 1 or DIAGNOSIS = 0, and then repeated for the corresponding ID. How can I go about this?

Astounding · ‎12-07-2020

It sounds like the structure of your data is slightly different from run to run. One (or more) variable is supposed to use the CDTYPEI format. As long as that variable is numeric, the program works fine. But sometimes that variable is character, causing SAS to look for the $CDTYPEI format (which does not exist). Similarly, one (or more) variable is supposed to use the HCFASAF format. As long as that variable is numeric, the program works fine. But sometimes that variable is character, causing SAS to look for the $HCFASAF format (which does not exist). If you fix the data and make those variables numeric, the program should work. However, that may not be a simple task. The variables may be character because they occasionally contain characters such as "N/A". It's not clear what the proper fix would be to make the variable numeric. So there are decisions to make. But the task remains the same. Fix the data.

Kurt_Bremser · ‎12-06-2020

proc sort data=library.data; /* PROC SORT statement needs a DATA= option */ by id date; run; data library.data_dates; set library.data (keep = id date); by id; format startdate enddate yymmdd10.; retain startdate; if first.id then startdate = date; if last.id; /* will output only the last observation per id group */ enddate = date; drop date; run; No need to shout at SAS, it will work when being talked to in normal voice. In PROC SQL, it looks like this: proc sql; create table library.data_dates as select id, min(date) as startdate format=yymmdd10. max(date) as enddate format = yymmdd10. from library.dates group by id ; quit;

amarikow57 · ‎11-03-2020

Yes! I figured out how to do it with PROC SUMMARY too, and the numbers match up. Thank you!

Online Status	Offline
Date Last Visited	‎07-20-2021 06:17 PM

Additive Two-Way ANOVA Model - nonsignificant model but significant pa...

Re: Specify Pairwise Comparisons Using PROC GLM

Specify Pairwise Comparisons Using PROC GLM

Re: Perform Contrasts with PROC GLM

Perform Contrasts with PROC GLM

Re: Keep rows that are mostly complete

Re: Keep rows that are mostly complete

Alternative to HPFOREST in SAS University (data contains missing obser...

Re: Keep rows that are mostly complete

Re: Keep rows that are mostly complete

Re: Specify Pairwise Comparisons Using PROC GLM

Re: Want to keep latest date when merging data

Re: Want to keep latest date when merging data

Re: Additive Two-Way ANOVA Model - nonsignificant model but significan...

Re: Specify Pairwise Comparisons Using PROC GLM

Re: Perform Contrasts with PROC GLM

Re: Alternative to HPFOREST in SAS University (data contains missing o...

Re: Keep rows that are mostly complete

Re: Efficiently graph multiple variables (columns) stratified by dich...

Re: Accept first value when merging datasets

Re: PROC TABULATE not reading in all data

Re: Identify rows based on two conditions

Re: Collapse ID while keeping the max value of the column

Re: Calculate age from dates based on a condition

Re: Variable copying over empty

Re: Want to keep latest date when merging data

Re: 2x2 Frequency Table - Add Totals