About Terry

Terry · ‎03-19-2012

I've got some ideas of what the problem might be. One level of the covariate is exactly the same as the level of another predictor in the model, that is, one level of the education was assigned to represent the education level for children<18 while the age variable also has a category for <18yr. Since they provide essentially the same information mathematically, one of them is omitted in the computation.

Terry · ‎03-19-2012

Hi to all, I ran a linear model with proc glm and got a very weired result that I cannot explain what is going on. I got one categorical variable in the model with 4 levels and I put it in the class statement. Normally I would expect that 3 levels except the reference level would have the estimates. But the output I got resulted in two levels of the variable with nothing estimated. educ 0 0.0000000 B . . . educ 1 1.1880336 B 0.99728636 1.19 0.2337 educ 2 -0.9403576 B 1.06206014 -0.89 0.3760 educ 3 0.0000000 B . . . There's no error message in the log. Besides, I also tried to create 3 dummy variables and replaced the categorical variable with them. Still, SAS gave no estimate for one of the 3 dummy variables. I just wonder if there is any statistical trick in the procedure that does not match my conventional thinking. Or there might be some other things I missed or did wrong? Mostly importantly, do I adjust for this categorical variable sufficiently given this result, since I include this covariate in the model as a confounder? Thanks so much for the help!

Terry · ‎01-23-2012

Thank you! It works. It is a great idea to separate then update the data in such a way.

Terry · ‎01-23-2012

Hi, everyone! I've got an ill-organized raw dataset and I met some troubles while trying to clean it. 1. There are multiple rows for one person and I'd like to combine them into one. It's like ID Gender Age Var1 Var2 Var3 1 F . . 5 10 1 . 25 . . . 1 . . 6 . . And the problem is that the missing pattern is not consistent across individuals. That is, for another individual, the data may look like ID Gender Age Var1 Var2 Var3 1 F 25 6 . . 1 . . . 5 . 1 . . . . 10 So by far what I can do with it is subsetting the data into many small non-missing datasets including ID and another variable, then remerging them by ID. Since there are many variables in the dataset, it is too time-consuming. Is there any simple way or command that can combine the data into one row for one individual? 2. The other scenario is that there are partly duplicated observations, which look like ID Gender Age Var1 Var2 Var3 1 F . . 5 10 1 F 25 6 5 10 I hope to retain the observation with the most complete information and delete the duplicates. The only way I know about eliminating duplicates is using PROC SORT with NODUP options, but it seems that it does not work here. I feel that it can be solved in a similar way as for problem 1, but I don't know how. Thank you!

Online Status	Offline
Date Last Visited	‎06-28-2016 11:31 AM

How to explain the regression results?

How to explain the regression results?

Combine multiple observations with missing values into one

Combine multiple observations with missing values into one

How to explain the regression results?

How to explain the regression results?

Combine multiple observations with missing values into one

Combine multiple observations with missing values into one