BookmarkSubscribeRSS Feed
SAS_Student
Calcite | Level 5

Hi

I really need help because I have been looking for answers for my questions everywhere.

 

First I want to analyze my expenditure data which I know it is very highly skewed, as a result Procsurveyreg wasn't an option. Now I have been reading multiple articles about the issue of skewed data and how to deal with it. Now from my understanding there is what called Generalized Linear modeling (GLM). The GLM compared to the ordinary linear regression is more flexible such that the dependent variable (in my case the expenditure outcomes) may be of either a normal or non-normal distribution.

Three main components make up the GLM:

1) the dependent variable distribution known as the exponential family;

2) the independent variables which maybe linear in their relationship with the dependent variable

; and 3) a link function (http://www.sagepub.com/sites/default/files/upm-binaries/21121_Chapter_15.pdf). I learnt also there is what is called modified park test to tell you exactly about the three main component mentioned above.

 

After this brief introduction my questions are:

1- Is there a SAS procedure along with documentation explaining how to conduct GLM and park's test? I am not looking in the documentation how statistics are computed rather I am looking for an easy explanation about the procedure.

2- I have a SUEVEY data, so I would like to know a procedure in which I can include the strata, cluster and weight.

 

P.S. I am fully aware of Paper 1657-2014 (http://support.sas.com/resources/papers/proceedings14/1657-2014.pdf), however, I have to admit as a student that it is above my ability to comprehend.

 

Help is very needed.

Thanks

SAS_Studnet

 

 

2 REPLIES 2
Reeza
Super User

Double check the assumptions for regression. The errors must be randomly distributed - not the independent variable. 

 

Also, have you considered transformations?

 

 

 

SAS_Student
Calcite | Level 5

Hi Reeza

Thank you for your fast reply

 

I read that errors must be normaly distributed, however, how to check for that? I only was able to know how to check for normality for my dependent variable (see attachment)

Now regarding the log-transformation, I considered it until I read back transformation introduce a bias in the analysis. But if you have the equation and refernce for back transformation I might consider it again.


Attach.png

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1781 views
  • 0 likes
  • 2 in conversation