BookmarkSubscribeRSS Feed
krishmar1
Fluorite | Level 6

Is there a proc or formula to calculate the magnitude of bias in regression parameter estimates?

Thanks,

Krishna.

4 REPLIES 4
SteveDenham
Jade | Level 19

Well, we know that the MSE is equal to the bias squared plus the variance for an estimator.  So now it all depends on what you know about the distribution of the estimator.  If the distribution has a known variance, you can calculate the MSE from the estimator's standard error, subtract the population variance, and get the squared bias. 

This all requires knowing the population mean and variance for the estimator in question.

Steve Denham

krishmar1
Fluorite | Level 6

Thank you, Steve and Paige.


Currently, I am working on a project that involves missing data analysis of sample data. I wanted to know if there is a way to measure bias for regression models for different "Missing Data" Deletion or Imputation methods -- I mean, in Listwise or Pairwise or Mean Substitution methods, I know the estimates are highly biased compared to those in Multiple Imputation or Expectation Maximization, but is it possible to calculate bias in the regression parameter estimates.


It seems since there is no "correct estimate" for non-missing sample data, the "amount of bias" cannot be calculated.


Please let me know if you have any other thoughts.


Thanks,

K



PaigeMiller
Diamond | Level 26

If you are talking about Ordinary Least Squares Regression, and you are estimating the correct model, then it is my understanding that the bias is zero.

--
Paige Miller
ranjan_mitre_org
Calcite | Level 5

Are you asking about Linear or Logistic regression? If Logistic Regression, then there is a very good paper by King and Zeng, "Logistic Regression in Rare Events Data" that gives a formula for bias to account for missing data.


sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 4 replies
  • 2946 views
  • 0 likes
  • 4 in conversation