I think the bravo might be premature. In my understanding, sample-size bias in a statistic is simply this: the mean of the statistic over samples of a given size is not equal to the population (very-large-sample) value of the statistic. That is, the sense of magnitude you would get by looking at a lot of samples would differ from the true, population, or very-large-sample magnitude. Hence, for example, the sample standard deviation is biased low, because the mean of the SDs of samples of a given size is less than the population SD, and with really small samples the shortfall is big enough to give the impression that the SD is distinctly too small.

Surely exactly the same thing applies to the Pearson correlation coefficient? It is not a question of what transformation you apply to the statistic before you consider whether the transformed value is biased. I can transform the SD by squaring it; the resulting variance is unbiased, in that the mean of a lot of small-sample variances equals the population variance. When I back-transform that mean of the variances, I am back to a biased statistic, but with much less bias, because it is now effectively based on a much bigger sample.

The bottom line is that the Pearson correlation coefficient, as observed in samples of finite size, is biased low. Isn't that the end of the story? When SAS shows a "correlation estimate" that is less than the sample correlation, it is quite simply wrong. I submit that the authors of the papers cited here have misunderstood what small-sample bias is all about. The original authors, Olkin & Pratt (1958), got it right.
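To make the point concrete, here is a minimal simulation sketch (Python with numpy/scipy; none of this code comes from the original discussion, and the population correlation of 0.5, sample size of 10, and 100,000 replications are arbitrary illustrative choices). It checks all three claims at once: the mean sample SD falls below the population SD, the mean sample variance does not, and the mean sample r falls below the population correlation, while the Olkin & Pratt (1958) estimator roughly removes that bias.

import numpy as np
from scipy.special import hyp2f1

rng = np.random.default_rng(0)
rho, n, reps = 0.5, 10, 100_000              # population correlation, sample size, replications
cov = np.array([[1.0, rho], [rho, 1.0]])     # unit variances, so the population SD is 1

r_vals, sd_vals, var_vals = [], [], []
for _ in range(reps):
    x, y = rng.multivariate_normal([0.0, 0.0], cov, size=n).T
    r_vals.append(np.corrcoef(x, y)[0, 1])   # sample Pearson correlation
    sd_vals.append(x.std(ddof=1))            # sample SD (Bessel-corrected)
    var_vals.append(x.var(ddof=1))           # sample variance (unbiased)

r = np.array(r_vals)
# Olkin & Pratt (1958) estimator: G(r) = r * 2F1(1/2, 1/2; (n-2)/2; 1 - r^2)
r_op = r * hyp2f1(0.5, 0.5, (n - 2) / 2, 1.0 - r**2)

print(f"mean sample SD:       {np.mean(sd_vals):.4f}   (population SD = 1, biased low)")
print(f"mean sample variance: {np.mean(var_vals):.4f}   (population variance = 1, unbiased)")
print(f"mean sample r:        {r.mean():.4f}   (population rho = {rho}, biased low)")
print(f"mean Olkin-Pratt r:   {r_op.mean():.4f}   (approximately unbiased)")

With these settings the mean sample SD should come out around 0.97 and the mean sample r a couple of hundredths below 0.5, while the mean variance and the mean Olkin & Pratt estimate should both sit close to their population values. Note in particular that the bias-corrected r is larger in magnitude than the raw r, not smaller.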