SAS Support Communities

David_M

Yes, the whole survey has 400 observations but I'm only working with 3 main observations per time period plus confounders.

Season

I am not an expert on deep learning, so I thought I should not respond to your question before. But now that three weeks have passed since you raised your question and no response have been offered, I would like to express my concern so that you may find a better place to ask your question. The concern is: are you sure you are dealing with SAS codes? I do not have access to SAS Viya, the software developed by SAS Institute capable of doing deep learning, but the formats of your code really makes me think that your code is highly unlikely to be a SAS code. For instance, SAS does not use hashes to mark the beginning of appendices; SAS does not have codes like "model.add" (with dots in the middle); SAS does not stuff multiple options in a space flanked by parentheses. So I really speculate that you are coming to the wrong forum for help. The users here are all SAS users. While being a SAS user never implies that he/she cannot use other softwares, the fact that your question has been here for three weeks and no one has ever responded urges me to issue my concern. Nobody in this forum might be able to answer your question because you might had raised it in the wrong place. Therefore, if your code is not a SAS code, then confirm the software on which the code should run and post your question on a forum on that software to minimize your time to get an answer; if I am wrong and your code really is a SAS code, then my response brings you back to the first place of the Statistical Procedures board so that other people visiting this board can find it more easily. Good luck!

BayzidurRahman

Hi, Thanks for the references. I got the solution in Stata.

SAS_Rob

One option might be to make use of the WEIGHT statement in Proc CAUSALMED to run with one of the replication methods. The general approach would be to use one of the SURVEYXXX procedures to compute the replicate weights and then to run CAUSALMED with each set of replicate weights. There is an example of doing something similar with Proc GENMOD. You could likely adapt this example by substituting the CAUSALMED step for the GENMOD step. You might need to do some research to see if this would be statistically valid, but I have seen others use a similar approach with BRR for mediation models. SAS/STAT Poisson Regressions for Complex Surveys

Season

By the way, section 9.3 of Amazon.com: SAS for Mixed Models, Second Edition: 9781590475003: Littell, Ramon C., Milliken, George A., Stroup, Walter W., Wolfinger, Russell D., Schabenberber, Oliver, Ph.D.: Books provides demonstrations and examples of dealing with heteroscedasticity in mixed models. In short, this is dealt with two approaches that in essence both belong to the joint mean and variance modeling approach. They are termed as "Power-of-X" and "Power-of-the-Mean" models. For Power-of-X models, the variance of residuals are modeled in this manner: Var(ei)=σ^2*exp(xγ), with γ being the regression coefficient. In SAS, this can be modeled by codes like: proc mixed data=xxx; /*other statements omitted*/ repeated /local=exp(x); run; Or proc nlmixed data=xxx; /*other statements omitted*/ model y ~ normal(mean,sig2*exp(gamma*x)); run; The Power-of-the-Mean model assumes that the residuals are proportional to y_hat. Let yi_hat=β0+β1xi1+...+βkxik be the predicted dependent variable for the ith observation. The residuals are assumed to take the form Var(ei)=σ^2*|yi_hat|^θ, where θ is an unknown power parameter. SAS codes for this modeling approach are also documented in the book but are not displayed here because of their complexity. Refer to the book for more details. By the way, this book now has a newer edition: SAS for Mixed Models: Introduction and Basic Applications: Stroup PH D, Walter W, Milliken PhD, George A, Claassen, Elizabeth a: 9781642951837: Amazon.com: Books. However, it is stated in the preface of the newer edition that contents regarding heterogeneous variance models are removed in that edition and are reserved to a later publication. For the time being, however, I have not found this publication.

Season

First of all, all of your questions concern logistic regression, so it is better to phrase your problem as "Help with logistic regression". Second, I am not sure if you wish to solve your problem with SAS or not. If the answer is "yes", then both Amazon.com: Logistic Regression Using SAS: Theory and Application, Second Edition and Categorical Data Analysis, 3rd Edition | Wiley are good choices. These books cover your questions and provide examples of building binary and multinomial logistic regression models in SAS. Other topics on categorical data analysis are also discussed in the duo. In addition, if you can understand Chinese, then 《医学研究中的logistic回归分析及SAS实现》【正版图书折扣优惠详情书评试读】 - 新华书店网上商城 is another book I would like to recommend. This book is a monograph specifically devoted to building logistic regression models in SAS, touching on topics not covered by the former two English books like building logistic regression models for complex survey data. On the other hand, if you only wish to learn something on logistic regression and may not build models with SAS, then I recommend Logistic Regression Models (Chapman & Hall/CRC Texts in Statistical Science) 1, Hilbe, Joseph M. - Amazon.com. This book is yet another monograph on logistic regression. What makes this book different from the aforementioned trio is that the last one primarily uses Stata and discusses more theoretical intricacies of logistic regression like the Fisher scoring method, an algorithm used in estimating the parameters in logistic regression.

Season

Thank you so much! I will try it.

Season

@StellaPals wrote: Also, if the assumptions were satisfied, and I made n separate models (grade = sex, grade= math_exam and so on), would I need to use bonferron correction (0.05/n)? Or would I still compare p-values to just 0.05? This has something to do with multiple comparisons and multiple tests, which is closely related to the specific question of the investigator. Briefly, if answering your question entails multiple hypothesis testing procedures, then it is necessary to employ such methods. For instance, if I have 20 groups of students that have finished an English exam and my research question is that whether the mean of English exams scores of all of the groups are different, then I have C(20,2)=20!/(2!*18!) comparisons to do. Therefore, I have to use some method to correct the "raw" P values and compare the corrected P values with the threshold of statistical significance, which is usually 0.05. On the other hand, if I am only interested in whether the scores of group 1 and group 5 are different, then there is only one hypothesis testing procedure needed. In this case, using the "raw" P value and compare it with the statistical significance threshold suffices. By the way, the Bonferroni method is only one way of dealing with multiple comparisons and multiple tests. You can employ this method for P value correction while other choices can also be used. For a more detailed introduction of multiple comparison methods, the circumstances that suitable for each of these methods and how to realize them in SAS, see Amazon.com: Multiple Comparisons and Multiple Tests Using SAS, Second Edition: 9781607647836: Westfall PhD, Peter H., Tobias PhD, Randall D., Wolfinger PhD, Russell D.: Books.

PanShuyao

Yes, it works. Thank you so much!

Season

Weighting is not necessarily needed in logistic regression, unless you are modeling complex survey data or dealing with rare events. See the documentation of PROC SURVEYLOGISTIC for more information of the former and Weighted logistic regression for large-scale imbalanced and rare events data - ScienceDirect and Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data - PMC for more information of the latter.

Season

It seems like you wish to validate a model on your data. Given that the dependent variable is binary, you can employ the concordance statistic (c statistic) to validate the model.

palolix

Ok, thank you Season!

Season

I guess you are using interchangeably the concept of components- a term in finite mixture models; and subgroup- a word coined by yourself and may be more suitable for your analysis. PROBMODEL Statement :: SAS/STAT(R) 14.1 User's Guide says that the PROBMODEL statement is used for building regression models (usually logistic regression) for component membership. Odds ratios and other common metrics reported in logistic regression can be calculated from the table named "Parameter Estimates for Mixing Probabilities" in the output.

jitb · ‎04-12-2025

Thank you...this code is very helpful. The problem was I had missing weights.

Season · ‎04-11-2025

I beg to disagree on substituting t-test for Wilcoxon sum-of-rank test unless the differences follow a normal distribution. I guess that assumption is highly unlikely to be tenable for @reubendon's data because if that is the case, @reubendon does not need to resort to Wilcoxon sum-of-rank test and therefore raise the question here.

Online Status	Offline
Date Last Visited	yesterday

SAS Support Communities

Follow Us

What is...