About therock

japelin · ‎04-12-2021

It would be helpful if you could provide some sample data for these questions. Now, in the following code, 16 rows from 1997 to 2012 are extracted as the key. /* create test data */ data have; length cusip $10 year 8; do cusip='A','C'; do year=1997 to 2012; output; end; end; cusip='B'; do year=1997 to 2000; output; end; do cusip='D'; do year=1998 to 2013; output; end; end; run; /* extract */ proc sort data=have nodupkey; by cusip year; where 1997<=year<=2012; run; proc freq data=have noprint; table cusip / out=freq(where=(COUNT=16)); run; proc sql; create table want as select have.* from have, freq where have.cusip=freq.cusip ; quit;

PaigeMiller · ‎05-10-2020

@therock wrote: By putting the Class variables in model statement, SAS tells me that: The X’X matrix has been found to be singular. This always happens when you have CLASS variables. It does not indicate you did anything wrong. It does not indicate a problem with the analysis, which (if everything is done correctly) is the correct analysis. Everyone who has ever used CLASS variables in a SAS model with the SOLUTION option has gotten this message.

RichardDeVen · ‎04-01-2020

The report is not a quick 'one off' due to several factors: Each regression has output tables that don't 'align' for the desired report Two regressions The output tables can't be combined to be in a shape that tabulate can perform as expected. TABULATE is nice, in so far as: when a variable is placed in a table the format can be specified. The downside of tabulate is that a cell can display only A class level, class value or aggregation result from values belonging to a dimensional intersection. The `estimate` you want shown is actually combination of the estimate and probt 'category'. Tabulate can't do that. A Proc REPORT step can be coded to have a COMPUTE block that computes a complex value rendering based on more than one column (such as aforementioned estimate and probt). But the data alignments are still a little off and each column would need a CALL DEFINE to render the numbers according the the row specific format. The tricky part of your report is that it is showing the estimate table output (which has estimate, stderr and probt in a single row) in a row-wise pivot fashion (estimate on one row and stderr on next row) stacked with rows from FitStatistic and DesignSummary output tables. One way to deal with all the issues is take them on one by one in a series of steps in which you can control the data shape and cell value renderings and do a final simple PRINT or REPORT. Example: Fake data data have(drop=_:); call streaminit(1234); length industry $25; _n_=-1; do industry = 'Mining', 'Automotive', 'Pharmaceutical'; _n_+1;_m_=-1; do year = 2017 to 2019; _m_+1; do id = 1 to 1221; array iv iv1-iv7; do over iv; iv = ceil(rand('norm', _n_*3+_m_, 1)); if iv < 1 or iv > 9 then iv = .; end; dv1 = iv1 * 1.15 + iv2 / 2 + iv3 * (1.00+rand('uniform',1)) - iv4 / 3 + iv5 * 2 - iv6 / (0.01+rand('uniform',3)) - iv7 * (1.00+rand('uniform')/7) ; dv2 = iv1/1 + iv2/2 - iv3/(0.01+rand('uniform',3)) - iv4/(3.00+rand('uniform',4)) + iv5/1.5 - iv6/2 - iv7/2 ; output; end; end; end; format dv: 9.2; run; Custom formats and regressions proc format; picture estimatef (round) low - < 0 = ' 9.999' (prefix='-') 0 <- high=' 9.999' .=' '; picture stderrf (round) low-high=' 9.999)' (prefix='(') .=' '; value probtstars 0 - 0.01 = ' ***' 0.01 <- 0.05 = ' **' 0.05 <- 0.10 = ' *' other = ' !!!'; run; ods output ParameterEstimates (persist) = est_dv1 DesignSummary (persist) = smy_dv1 FitStatistics (persist) = fit_dv1 ; proc surveyreg data = HAVE ; cluster id; class industry year; model dv1 = iv2 iv3 iv4 iv5 iv6 iv7 / noint ADJRSQ solution; run; ods output close; ods output ParameterEstimates (persist) = est_dv2 DesignSummary (persist) = smy_dv2 FitStatistics (persist) = fit_dv2 ; proc surveyreg data = HAVE ; cluster id; class industry year; model dv2 = iv2 iv3 iv4 iv5 iv6 iv7 / noint ADJRSQ solution; run; ods output close; Computing formatted renderings of estimate and stderr (instead of TABULATE *F=...) data est_dv1; set est_dv1; estimate_fmt = put(estimate,estimatef.) || put(probt,probtstars.); stderr_fmt = put(stderr,stderrf.); run; data est_dv2; set est_dv2; estimate_fmt = put(estimate,estimatef.) || put(probt,probtstars.); stderr_fmt = put(stderr,stderrf.); run; Reshaping estimates (row-wise transpose), stacking with other regression output and their computed renderings (PUT(...)) for each modeled dependent variable proc transpose data=est_dv1 out=est_T_dv1; by parameter; var estimate_fmt stderr_fmt; run; proc transpose data=est_dv2 out=est_T_dv2; by parameter; var estimate_fmt stderr_fmt; run; data t1 (keep=model var _name_ col1 row); model = 'dv1'; length var label1 $32; set est_T_dv1 fit_dv1 smy_dv1; var = coalesceC (parameter, label1); if label1 =: 'Adj' then col1 = put (nvalue1, 5.3); if label1 =: 'Num' then col1 = left(put (nvalue1, comma12.)); if not missing(parameter) or label1 in ('Adjusted R-Square', 'Number of Clusters' ); row+1; run; data t2 (keep=model var _name_ col1 row); model = 'dv2'; length var label1 $32; set est_T_dv2 fit_dv2 smy_dv2; var = coalesceC (parameter, label1); if label1 =: 'Adj' then col1 = put (nvalue1, 5.3); if label1 =: 'Num' then col1 = left(put (nvalue1, comma12.)); if not missing(parameter) or label1 in ('Adjusted R-Square', 'Number of Clusters' ); row+1; run; data t; set t1 t2; run; Transposing again to get one column per modeled variable proc sort data=t; by row model; run; proc transpose data=t out=report; by row var _name_; id model; var col1; run; and lastly a REPORT with a /order to hide repeated var values. ODS Excel instead of tagsets.excelxp ods excel file='surveyreg-report.xlsx'; ods html file='surveyreg-report.html'; ods listing; proc report data=report(drop=row _name_); define var / order order=data; run; ods _all_ close; Excel output

therock · ‎02-11-2020

Hi, I forgot to mention that I use industry and size as control variables in both models. The model that I am using is confirmed by my professor. What I need advice is regarding the interpretation. Please see my new post regarding interpretation. Am I interpreting it correctly? Thank you so much!

ballardw · ‎10-22-2018

@therock wrote: I have to break the post in two part. Sorry about that. Going back to the previous post, I thought that the UCLA would be helpful but the modified code did not work. My request can someone help me with a code to put the proc surveyreg results into the format that I want? Have you determined which data you want (table name) and how to direct that to an output data set? That would be the first step. For example if you want the parameter estimates which would be in the ParameterEstimates table you would use something like proc surveyreg data=example; cluster id; class year siblings; model write = sex income sex*income math science / noint adjsq solution; ods output ParameterEstimates = work.myparameters; run; Each table of output has a different name. You see which ones are generated by your procedure using ODS TRACE=ON; before running the procedure or check in the details of the procedure documentation for table names generated by which statement. With a data set you can then manipulate it with either data step code or a reporting procedure, possibly combining multiple output sets.

therock · ‎02-23-2016

I guess that makes sense. I will try the graphing and see what comes out of it. Thanks!

alexchien · ‎02-09-2016

The one i mentioned would allow different X3 slopes for different X3 percentile values. Which one is better? I would start with the model you proposed and add additional complexity such as the terms included in the model i mentioned to see if they add any significant value in terms of goodness-of-fit or validation using a holdout data. Typically, however, the simplier the model, the better. Cheers

therock · ‎11-24-2015

Hi Roger, Thanks so much for help. It worked perfectly. Now I can use this and modify on my data with over 30000 data fields and 100 variables! I have to do at least 100 analysis. Have a wonderful holiday season! Thanks, The rock

therock · ‎09-24-2015

I forgot to add. I want to test the interaction term between all three variables.

therock · ‎08-26-2015

Thanks Mark Johnson for the updated info. It worked perfectly. Thanks ndp, your code also worked.

therock · ‎07-28-2015

Thanks to both Xia Keshan and EM@sas on info about the SAS SQL documentation.

Patrick · ‎07-21-2015

It's may be not your intention but what your doing here is asking us to do the job for you. I suggest you give it a go first and then come back with some targeted questions in the bits where you've got stuck. Please also post a data step creating sample data, the code you've already developed and then tell us what's not working and what you need.

Online Status	Offline
Date Last Visited	‎04-20-2021 12:15 AM

Unbalanced to Balanced Panel Data

Re: PROC Surveyreg, PROC GLM, or PROC Mixed?

PROC Surveyreg, PROC GLM, or PROC Mixed?

Re: Creating results table from Proc Survereg

Creating results table from Proc Survereg

Re: Help interpreting the results

Re: Help interpreting the results

Re: Help interpreting the results

Help interpreting the results

Re: Proc tabulate from proc surveyreg

Re: Unbalanced to Balanced Panel Data

Re: PROC Surveyreg, PROC GLM, or PROC Mixed?

Re: Creating results table from Proc Survereg

Re: Help interpreting the results

Re: Proc tabulate from proc surveyreg

Re: Help with Regression & Interaction term

Re: Help with Proc Surveyreg and Interaction terms

Re: Help with Proc SQL step

Re: Help with Proc Surveyreg and Test Statement

Re: Missing Values need to be Filled

Re: Help with File using Data & Simple Steps

Re: Help with File using Data & Simple Steps