Hi, thanks for answering my question! Prod is a new data file that contains the same information as the data file that the model was trained on. The only difference between the two files is that the column 'loan status' in prod is empty. I want to use the logistic regression model to make new predictions about the loan status(default or current) in new prod data. Essentially, I expect the previously empty 'loan status' column to be filled with 0 or 1 in the final outputted prod data file However when I tried to use the score statement as described in your response, the model failed to predict anything. I'm unsure if it is due to some problem with my data. These are the columns of the data on which the model was trained on PROC SQL;
CREATE TABLE WORK.query AS
SELECT loanId , memberId , 'date'n ,
purpose , isJointApplication , loanAmount ,
term , interestRate , monthlyPayment ,
grade , loanStatus , residentialState ,
yearsEmployment , homeOwnership , annualIncome ,
incomeVerified , dtiRatio , lengthCreditHistory ,
numTotalCreditLines , numOpenCreditLines ,
numOpenCreditLines1Year , revolvingBalance , revolvingUtilizationRate ,
numDerogatoryRec , numDelinquency2Years , numChargeoff1year ,
numInquiries6Mon , bad_good FROM WORK.MERGED_LABEL;
RUN;
QUIT; and these are the columns in prod data PROC SQL;
CREATE TABLE WORK.query AS
SELECT loanId , memberId , 'date'n ,
purpose , isJointApplication , loanAmount ,
term , interestRate , monthlyPayment ,
grade , loanStatus , residentialState ,
yearsEmployment , homeOwnership , annualIncome ,
incomeVerified , dtiRatio , lengthCreditHistory ,
numTotalCreditLines , numOpenCreditLines ,
numOpenCreditLines1Year , revolvingBalance , revolvingUtilizationRate ,
numDerogatoryRec , numDelinquency2Years , numChargeoff1year ,
numInquiries6Mon, loanStatus FROM WORK.PROD;
RUN;
QUIT; Thank you for your help!
... View more