About GreggB

GreggB · ‎03-09-2020

/* split the data randomly with 50/50 split */
data train valid;
  set twoyears; /* 2 years of data combined */
  if ranuni(7) <= .5 then output train;
  else output valid;
run;

/* compare the 2 data sets */
proc logistic data = train outest=estimates_train;
  model camp_flag = rit;
run;
quit;

proc logistic data = valid outest=estimates_valid;
  model camp_flag = rit;
run;
quit;

Based on what I have studied I believe this is the next step. Here is the % concordant for train and valid, respectively. Is PROC SCORE my next step, using "twoyears"? I'm not sure which portion of the output to look at to determine if I have a model that's good for prediction.

Association of Predicted Probabilities and Observed Responses
Percent Concordant    94.3    Somers' D    0.892
Percent Discordant     5.1    Gamma        0.898
Percent Tied           0.6    Tau-a        0.099
Pairs                29455    c            0.946

Association of Predicted Probabilities and Observed Responses
Percent Concordant    89.0    Somers' D    0.788
Percent Discordant    10.1    Gamma        0.795
Percent Tied           0.9    Tau-a        0.063
Pairs                23648    c            0.894
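One possible next step, offered as a sketch rather than a definitive answer: instead of fitting a separate model on each half, fit once on train and let the SCORE statement of PROC LOGISTIC apply that fit to valid. The output data set names valid_scored and new_scored, and the data set year2019 standing in for the new term's RIT scores, are hypothetical names used only for illustration.

```sas
/* Sketch only: valid_scored, new_scored and year2019 are hypothetical names. */
/* By default PROC LOGISTIC models the lower ordered response value,          */
/* so the event being modeled here is camp_flag = 0.                          */
proc logistic data=train;
  model camp_flag = rit;
  /* Apply the train fit to the holdout half. FITSTAT prints fit statistics   */
  /* for the scored data, which is possible because valid holds camp_flag.    */
  score data=valid out=valid_scored fitstat;
  /* Score the new term, where camp_flag is not yet known. */
  score data=year2019 out=new_scored;
run;
```

The scored data sets should carry the predicted probabilities for each response level (for a 0/1 response, variables such as P_0 and P_1), which can then be sorted or thresholded to pick out the students of interest.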

GreggB · ‎03-08-2020

The summer camp is for grade 3 only. The only way a student would attend twice would be if they were retained in grade 3 and scored low enough both times to be flagged for attendance at the summer camp. Since all the data sets have a unique student ID, I can easily find scenarios like this if they occurred.

GreggB · ‎03-08-2020

Updated code:

proc logistic data = twoyears outest=estimates_2yrs;
  class termname;
  model camp_flag = termname rit;
run;
quit;

Since termname is not numeric I used a CLASS statement. Is this correct? If so, I interpret this as TermName not being significant.

Analysis of Maximum Likelihood Estimates
Parameter                   DF   Estimate   Standard Error   Wald Chi-Square   Pr > ChiSq
Intercept                    1    18.7084           1.8919           97.7879       <.0001
TermName  Fall 2016-2017     1    -0.1980           0.1377            2.0676       0.1505
rit                          1    -0.1225           0.0113          118.1675       <.0001

GreggB · ‎03-08-2020

My mistake. It is the fall reading score I referred to as f_read earlier.

GreggB · ‎03-08-2020

proc logistic data = twoyears outest=estimates_2yrs;
  model camp_flag = RIT;
run;
quit;

twoyears looks like so (ID is unique; termName has 2 possible values; camp_flag is 0 or 1):

termName    ID    RIT   camp_flag
2016-2017   001   249   0
2017-2018   002   279   1

1. You're saying my model should be camp_flag = termName RIT ?
2. I want to make sure my objective is clear: I have a 3rd data set (termName = 2019-2020) that contains RIT and I want to predict the camp_flag value so that students most likely to have a value of 0 based on their end-of-year test can be identified now and receive academic intervention. My next step?

GreggB · ‎03-08-2020

They would attend only once. To be sure, I can deduplicate by Student ID. I think I read about what you're saying: the data is divided into 2 sets using ranuni. One set is used to create the model and the other is used for prediction?

GreggB · ‎03-08-2020

Is the time issue because the 2 tests are several months apart or because my 2 data sets are from 2 different years?

GreggB · ‎03-07-2020

My objective is to predict if a student will be flagged to attend a summer reading camp that is determined by a test score generated during end-of-year testing in May. The variable used to predict is a reading score earned in the Fall. I call the response variable camp_flag and the fall score f_read. My model (I’m assuming) is something like:

model camp_flag = f_read

I have 2 years of data, so I want to use one year to create the model and use the other year to test the accuracy of the model’s ability to predict camp_flag. Camp_flag is 0 or 1. My online search is a bit overwhelming. I just need a suggestion on which procedure to learn to accomplish this.

GreggB · ‎02-14-2020

proc print data=sashelp.cars (obs=1);
  title "Report for &sysdate";
run;

No idea how to fix this....

Report for 13FEB20

Obs  Make   Model  Type  Origin  DriveTrain  MSRP     Invoice  EngineSize  Cylinders  Horsepower  MPG_City  MPG_Highway  Weight  Wheelbase  Length
1    Acura  MDX    SUV   Asia    All         $36,945  $33,337  3.5         6          265         17        23           4451    106        189
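A likely explanation: &SYSDATE holds the date on which the SAS session or job started, not the current date, so a session that was presumably opened on 13 February keeps reporting 13FEB20 the next day. A minimal sketch that resolves the date when the TITLE statement is processed instead, using %SYSFUNC with the TODAY function:

```sas
/* TODAY() is evaluated when the TITLE statement is processed, */
/* not when the SAS session started.                           */
proc print data=sashelp.cars (obs=1);
  title "Report for %sysfunc(today(), date9.)";
run;
```

DATE9. writes a four-digit year; DATE7. would keep the two-digit year style that &SYSDATE uses.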

GreggB · ‎01-27-2020

Data set one:

first   last       UID
tom     jones      1
elvis   pressley   2
frank   sinatra    3

Data set two:

first   last
Ella    Fitzgerald
Doris   Day

I want to combine the 2 data sets and start assigning the UID where data set one leaves off. So, Ella has UID = 4, Doris has UID = 5 and so on.
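One way this could be done in a single DATA step, sketched under the assumptions that the data sets are literally named one and two, that UID in one is numeric, and that 20 characters is long enough for the names (all three are assumptions for illustration):

```sas
/* Sketch only: data set names one, two, combined and the $20 length are assumed. */
data combined;
  /* Character lengths otherwise come from the first data set listed, */
  /* which could truncate longer names arriving from two.             */
  length first last $ 20;
  set one two(in=from_two);
  retain _last_uid 0;
  if from_two then do;
    /* Rows from two: continue counting from the highest UID seen in one. */
    _last_uid + 1;
    UID = _last_uid;
  end;
  /* Rows from one: remember the largest existing UID. */
  else _last_uid = max(_last_uid, UID);
  drop _last_uid;
run;
```

Because the running maximum is tracked while reading one, the step does not depend on one being sorted by UID.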

GreggB · ‎01-14-2020

The xls file comes from an outside vendor. Not sure why they won't switch over to xlsx format.

GreggB · ‎01-14-2020

When I do that it reads column 1 only (in a weird fashion) like so:

should have: 3701551
instead: 3 7 0 1

GreggB · ‎01-14-2020

I'm running Base SAS 9.4

GreggB · ‎01-14-2020

I changed my code to dbms=excel and got a different error.

ERROR: Connect: Class not registered
ERROR: Error in the LIBNAME statement.
ERROR: Connection Failed. See log for details.
NOTE: The SAS System stopped processing this step because of errors.
NOTE: PROCEDURE IMPORT used (Total process time):
      real time 0.20 seconds
      cpu time 0.07 seconds

Online Status	Offline
Date Last Visited	‎05-18-2022 06:46 PM

comparing dates in different formats

using SCAN

Re: show cell values in heat map of a correlation matrix

show cell values in heat map of a correlation matrix

Re: proc sgplot

Re: using SCAN

finding columns that are blank

SAS profile error on load

accidental change in my cursor

Re: proc import vs. libname when reading an xlsx file

Re: Predicting a binary response variable

Predicting a binary response variable

&SYSDATE boycotting Valentines Day

adding a counter to a data set

Re: PROC IMPORT and xls file