BookmarkSubscribeRSS Feed
deleted_user
Not applicable
someone asked the question here

how is the r-squared from the proc reg computed and what is it used for?

I tried to explain but failed.

any ideas appreciated
9 REPLIES 9
Doc_Duke
Rhodochrosite | Level 12
This note may help.

http://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/statug_introreg_sect009.htm

It doesn't get to the history of the term. If the regression has just one continuous predictor, then the r-squared is algebraically identical to the squared Pearson correlation coefficient, though the interpretation (causation vs association) is different.
deleted_user
Not applicable
ah but when they calculated the correlation in excel the number was not the same

anyone know what excel is really doing?
Paige
Quartz | Level 8
I don't know what Excel is really doing ... but ... no one should be using Excel for statistical calculations. There have been paper after paper showing flaws in Excel's algorithm. Sometimes, it isn't able to compute variances properly.
deleted_user
Not applicable
I'd love a reference

I get tired of numbers not coming out the same when someone does the same thing in excel and points out that the numbers are different
Doc_Duke
Rhodochrosite | Level 12
just go to Google scholar and search for
excel statistics accuracy
and you will find a host of references.
Paige
Quartz | Level 8
From "On the accuracy of statistical procedures in Microsoft Excel 2007", B.D. McCullough and and David A. Heiser, Computational Statistics & Data Analysis
Volume 52, Issue 10, 15 June 2008, Pages 4570-4578.

"The statistical literature has regularly identified flaws in Excel’s statistical procedures at least since Sawitzki (1994), and Microsoft has repeatedly proved itself incapable of providing reliable statistical functionality. It is little wonder that introductory texts on statistics warn students not to use Excel when the results matter (e.g., Keller (2001) and Levine et al. (2002))."
deleted_user
Not applicable
how bad are any of these problems with excel?
Paige
Quartz | Level 8
Bad enough so that "introductory texts on statistics warn students not to use Excel when the results matter". In other words, we're not talking about advanced methods in bioinformatics that involve millions of data points ... we are talking about every day statistics.

Do your results matter? Message was edited by: Paige
deleted_user
Not applicable
one of the things that has been done here and other places where I've worked is using excel addons for monte carlo simulation work like crystal ball. Do these problems persist there?

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 9 replies
  • 1411 views
  • 0 likes
  • 3 in conversation