Hello everyone, I have an assignment that simply states, to "include predictive statistics for store sales and store orders and to include charts, graphs, and tables, and analysis of your findings." I am not really sure what type of predictive statistics I should include. So far I have used proc corr and proc reg shown below, where log_average is the average amount spent per visit. proc sgplot data=data1; histogram log_average; run; proc corr data=data1; var log_average; with 'Customer Id'n ZIP_CODE FRE MON CC_CARD PSWEATERS PKNIT_TOPS PKNIT_DRES PBLOUSES PJACKETS PCAR_PNTS PCAS_PNTS PSHIRTS PDRESSES PSUITS POUTERWEAR PJEWELRY PFASHION PLEGWEAR PCOLLSPND GMP PROMOS DAYS MARKDOWN CLUSTYPE PERCRET 'ln days between purchases'n 'ln lifetime ave time betw visits'n; run; proc reg data=data1 outest=est1; model log_average= 'Customer Id'n ZIP_CODE FRE MON CC_CARD PSWEATERS PKNIT_TOPS PKNIT_DRES PBLOUSES PJACKETS PCAR_PNTS PCAS_PNTS PSHIRTS PDRESSES PSUITS POUTERWEAR PJEWELRY PFASHION PLEGWEAR PCOLLSPND GMP PROMOS DAYS MARKDOWN CLUSTYPE PERCRET 'ln days between purchases'n 'ln lifetime ave time betw visits'n / slstay=0.15 slentry=0.15 selection=forward ss2 sse aic; output out=out1 p=p r=r; run; quit; I was told that I need to provide more information about the results. Basically I am finding which variables have a positive correlation with the average amount spent per visit, is that correct? How can I use this information, or what simpler methods of predictive analytics can I use? My results are shown below. I also get the fit diagnostics plots and residual by regressors. I really don't understand how this tells me anything besides the correlation between all my variables. Thank you for any input you may have.
... View more