About morgalr

morgalr · ‎04-30-2015

The only way I know of is the manual way, which is probably what you are trying to avoid. The fastest way I know of is to insert all combinations into an "already used" table, then do a lookup on it.
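As a rough illustration of the "already used" lookup idea, here is a minimal sketch in Python rather than SAS, purely to show the logic. The sample rows and the names `rows`, `already_used`, and `unique_rows` are made up for the example; a set stands in for the lookup table.

```python
# Hypothetical sample data: each tuple is one combination of values.
rows = [("A", 1), ("B", 2), ("A", 1), ("C", 3), ("B", 2)]

already_used = set()   # stands in for the "already used" table
unique_rows = []       # first occurrence of each combination, in input order

for combo in rows:
    if combo not in already_used:   # the lookup
        already_used.add(combo)     # the insert
        unique_rows.append(combo)

print(unique_rows)
```

In SAS the same role could plausibly be played by a hash object or an indexed table, but that is an assumption and not something the post specifies.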

morgalr · ‎04-23-2015

Manthan, I believe this article shows what you want: http://blogs.sas.com/content/sasdummy/2014/01/29/using-filename-zip/

morgalr · ‎04-15-2015

To speed up your operation, be sure there are indexes on the num column in both tables. Also a question: you said the num column is character and may have leading "0"s. Is '0000123' the same as '123' for your purposes? If it is, you have a problem: the character field may produce unwanted results because values that differ only in leading zeros will not match. If that is the case, convert num in both tables to a numeric field. (see next part of comment)

We do a lot of our SAS work from large tables held natively in MS-SQL or Oracle (100s of GB or larger per table), and I tend to create temporary tables with the appropriate "keep" values in the set so that only the minimal dataset I need is returned to me. I can then make any changes or indexes I need on the local data, and if I need to upload anything back I can do so in a single stream without any local functions taking up time. This moves the processing completely into the realm of my control, and since it is all local, I don't have to worry about network time either. This scheme works faster than any other I have tried when I have to process any amount of the data locally.
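To make the leading-zero question concrete, here is a small sketch in Python (not SAS), with invented key values. It compares the same keys once as character strings and once after converting to numbers; the lists `left` and `right` are hypothetical.

```python
# Hypothetical key values from the two tables being joined.
left = ["0000123", "0456", "789"]
right = ["123", "456", "0789"]

# Character comparison: leading zeros make every pair differ.
char_matches = [(a, b) for a in left for b in right if a == b]

# Numeric comparison: convert both sides first, as suggested in the post.
num_matches = [(a, b) for a in left for b in right if int(a) == int(b)]

print(len(char_matches), len(num_matches))
```

If '0000123' and '123' are meant to be different keys, the character comparison is the right one and no conversion should be done.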

morgalr · ‎04-09-2015

Easiest way I know is to output manually:

    do while (not eof);
      set mylib.inputstuff end=eof;
      by myVar;
      if first.myVar then do;
        * do header stuff here;
      end;
      * whatever processing you want for regular body output;
    end;

Note: the data must be ordered by myVar, and the BY statement is needed for first.myVar to work.
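The same "header on the first row of each group" pattern, sketched in Python only to show the control flow. The rows and the header text are invented; `itertools.groupby` plays the part of BY-group processing and, like it, needs the input already ordered by the grouping value.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical input, already sorted by the grouping value (first element).
rows = [("east", 10), ("east", 20), ("west", 5)]

lines = []
for key, group in groupby(rows, key=itemgetter(0)):
    lines.append(f"== {key} ==")      # header, written once per group
    for _, value in group:
        lines.append(f"  {value}")    # regular body output

print("\n".join(lines))
```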

morgalr · ‎03-12-2015

You can make a table containing only the list of reps you want and nothing else, then do outer joins from it to the tables you are using, with the conditions. This gives you two result tables, both containing every rep you need, with null data (except the rep IDs) where data is missing. Since both tables have been prequalified, you can then join them and get the result you seek. (Yes, an ugly and messy way of doing it.)
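A minimal sketch of that idea in Python, assuming two made-up lookup tables (`sales` and `calls`) keyed by rep ID. Each list comprehension acts like an outer join from the rep list, so every rep survives even when one side has no data.

```python
# Hypothetical data: the list of reps you want, and two tables keyed by rep.
reps = ["r1", "r2", "r3"]
sales = {"r1": 100, "r3": 250}
calls = {"r2": 7, "r3": 4}

# "Outer join" from the rep list to each table: missing data becomes None.
sales_by_rep = [(rep, sales.get(rep)) for rep in reps]
calls_by_rep = [(rep, calls.get(rep)) for rep in reps]

# Both results hold exactly the same reps in the same order, so joining is trivial.
combined = [(rep, s, c) for (rep, s), (_, c) in zip(sales_by_rep, calls_by_rep)]

print(combined)
```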

morgalr · ‎01-30-2015

We often do that with the month of our observation period. We have our first month of observation start at 1, and then, month by month, we increment through our observation period. This gives a convenient way to track many things, including the fiscal year. Here is a format we use:

    proc format;
      value FYfmt
          1- 12 = 1998
         13- 24 = 1999
         25- 36 = 2000
         37- 48 = 2001
         49- 60 = 2002
         61- 72 = 2003
         73- 84 = 2004
         85- 96 = 2005
         97-108 = 2006
        109-120 = 2007
        121-132 = 2008
        133-144 = 2009
        145-156 = 2010
        157-168 = 2011
        169-180 = 2012
        181-192 = 2013
        193-204 = 2014
        205-216 = 2015
        217-228 = 2016
        229-240 = 2017
        241-252 = 2018
        253-264 = 2019
        265-276 = 2020
      ;
    run;

Please note: month 1 of our observation period is the first month of our fiscal year and does not follow the calendar year.
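Because every range in the format above is exactly 12 months wide, the same mapping can be written as arithmetic. A small Python sketch, where `fiscal_year` and `FIRST_FISCAL_YEAR` are names invented for the example:

```python
FIRST_FISCAL_YEAR = 1998  # fiscal year of observation months 1-12 in the format above


def fiscal_year(month_index: int) -> int:
    """Map an observation-month counter (1, 2, 3, ...) to its fiscal year."""
    if month_index < 1:
        raise ValueError("month_index starts at 1")
    return FIRST_FISCAL_YEAR + (month_index - 1) // 12


print(fiscal_year(1), fiscal_year(13), fiscal_year(276))
```

Unlike the format, this does not stop at month 276, which may or may not be what you want.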

morgalr · ‎01-29-2015

ballardw, yes, quite right: I always get the order of the first/last mixed up with the variable--too much object oriented programming--and indeed it does remove any ID that has only a single observation. Here is the corrected code:

    data myEdit;
      set test;
      by myID;
      if (first.myID or last.myID) then delete;
    run;

And here is the code with that extra bit. It still strips the first and last observations, but it leaves an observation that is the only one for its ID:

    data myJunk;
      set test;
      by myID;
      if (not (first.myID and last.myID)) and (first.myID or last.myID) then delete;
    run;
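For anyone who wants to check the two behaviours side by side, here is the same logic sketched in Python. The function name `strip_first_last`, its `keep_singletons` switch, and the sample rows are all invented for the illustration; the input must be sorted by ID, just as the BY statement requires.

```python
from itertools import groupby


def strip_first_last(rows, key, keep_singletons=True):
    """Drop the first and last row of each group; optionally keep lone rows."""
    kept = []
    for _, group in groupby(rows, key=key):
        group = list(group)
        if len(group) == 1:
            if keep_singletons:
                kept.extend(group)      # row is both first and last: keep it
        else:
            kept.extend(group[1:-1])    # drop first and last of the group
    return kept


# Hypothetical data, sorted by ID (first element).
rows = [(1, "a"), (1, "b"), (1, "c"), (2, "x"), (3, "p"), (3, "q")]

print(strip_first_last(rows, key=lambda r: r[0]))
print(strip_first_last(rows, key=lambda r: r[0], keep_singletons=False))
```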

morgalr · ‎01-29-2015

Jeff, I understand your view, but RW9 is also correct: the request was for an unprecedented mooch of everything--that, in and of itself, was extremely rude and shows a great deal of insensitivity and laziness on the part of the requester--new or otherwise.

Les

BTW: aydot24, welcome to the SAS world.

morgalr · ‎01-29-2015

Order your dataset by the variable, add a BY statement, and use first. and last.:

    if first.myvar or last.myvar then delete;

morgalr · ‎01-27-2015

    proc sql;
      create table test (myID num, myOne varchar(5), myTwo num, myThree varchar(5), myFour num, myFive num, mySix varchar(5));
      insert into test values(1, '', null, '3', 4, 5, '6');
      insert into test values(2, '1', null, '3', 4, 5, '');
      insert into test values(3, '', 2, '3', null, 5, '');
      insert into test values(4, '1', 2, '3', null, 5, '6');
      insert into test values(5, '1', 2, '3', 4, 5, '');
    quit;

    /* This will give you the user id along with a string containing the name
       of each column that was missing or null. The column list is in a funky
       order because of using two separate arrays. */
    data myOut (replace=yes compress=no);
      set test;
      array myChar _CHARACTER_;
      array myNum _NUMERIC_;
      myFlag = 0;
      length myMissing $ 300;
      length myMissingOne $ 300;
      do _i=1 to dim(myChar);
        if(myChar[_i] = '') then do;
          myMissingOne = VNAME(myChar[_i]);
          myMissing = catx(' ', myMissing, myMissingOne);
          myFlag = 1;
        end;
      end;
      do _i=1 to dim(myNum);
        if(myNum[_i] = .) then do;
          myMissingOne = VNAME(myNum[_i]);
          myMissing = catx(' ', myMissing, myMissingOne);
          myFlag = 1;
        end;
      end;
      if(myFlag = 1) then output;
      keep myID myMissing;
    run;

    /* This will give you a dataset listing all of the columns that contain
       either null or missing. The columns are tied to the client ID through a
       one-to-many relationship in this table: one client ID and many columns.
       The client ID/column relation is unique. */
    data myOutTwo (replace=yes compress=no);
      set test;
      array myChar _CHARACTER_;
      array myNum _NUMERIC_;
      myFlag = 0;
      length myMissing $ 300;
      do _i=1 to dim(myChar);
        if(myChar[_i] = '') then do;
          myMissing = VNAME(myChar[_i]);
          output;
        end;
      end;
      do _i=1 to dim(myNum);
        if(myNum[_i] = .) then do;
          myMissing = VNAME(myNum[_i]);
          output;
        end;
      end;
      keep myID myMissing;
    run;

morgalr · ‎01-27-2015

Here is how I would go about solving the problem:

make an input array for your input dataset;
make 2 variables for your output dataset: ID and Missing;
use VTYPE to check the data type of each variable in your input array;
use the appropriate number or character functions to do your checking;
concatenate the character representation of the array index onto an output string each time you find a missing or null value;
set a flag saying you have missing values;
output your string if you have any missing;
keep the client number and the string showing what is missing;
loop until done.

What you'll get is an ID field and a string field; the string field will contain all missing indexes:

[1] [2 4]
[2] [1 3 4 6]
[3] [2 3 5]
[4] [2 4 5]
[5] [2 3 4 5]
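The steps in this post can be sketched roughly as follows, in Python rather than SAS and with invented records. The type check inside `is_missing` stands in for the VTYPE step, and the field names (`id`, `one`, `two`, `three`) and helper names are hypothetical.

```python
import math


def is_missing(value):
    """Type-appropriate missing check (stands in for the VTYPE branch)."""
    if value is None:
        return True
    if isinstance(value, str):
        return value.strip() == ""     # blank character value
    if isinstance(value, float):
        return math.isnan(value)       # missing numeric value
    return False


def missing_indexes(record, id_field):
    """Return the 1-based positions of missing fields as one string."""
    fields = [name for name in record if name != id_field]
    hits = [str(i) for i, name in enumerate(fields, start=1)
            if is_missing(record[name])]
    return " ".join(hits)


# Hypothetical input records.
records = [
    {"id": 1, "one": "", "two": None, "three": "3"},
    {"id": 2, "one": "1", "two": 2.0, "three": "3"},
    {"id": 3, "one": "1", "two": float("nan"), "three": ""},
]

# Output only the IDs that have something missing.
report = [(r["id"], missing_indexes(r, "id"))
          for r in records if missing_indexes(r, "id")]

print(report)
```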

morgalr · ‎01-22-2015

Jesse, thank you for further defining your requirements. You need to move back to a dataset ordered by MemberID and Admit_Date, and use a manual "output" that is triggered when your MemberID or your Admit_Date changes. On each iteration you have to "retain" your calculation values, and keep only the calculated values for your output. Are you familiar with using "output" with a dataset?

Les

morgalr · ‎01-22-2015

This is not a trivial exercise. I developed the exact solution you ask for to meet a Federal requirement for a project called "Case Mix" when I worked at Washington State's DSHS Aging and Disability Services. You can request the code from them; it is publicly disclosable, but I no longer work for them. Their code is in VBA and MS Access, but it should be easily convertible to SAS. Our requirement was to track residents in and out of nursing facilities and apply assessments to the periods they were in the facility, according to Federal guidelines. There were timelines that had to be applied, and defaults if they did not do an assessment within the time given as a grace period by the Federal Government. If the resident left the facility before the "Grace Period" was over, then when they came back in a new grace period started, and any resulting rate, or default, had to be retroactively applied to all open grace periods. Fun times, and it took almost a year to work all the bugs out, but it still stands as the "go to way to do it" after more than a decade.

morgalr · ‎01-22-2015

Jse, you are correct, it does not group by Member ID and Date, but then neither does your example output data. That is why I added the comment: if you want to have your answer by MemberID and Admit_Date, then just add Admit_Date to the "group by" clause: "group by MemberID, Admit_Date". This will give:

    select MemberID, Admit_Date, sum(cost) as Episode_Cost, 1 as Episode_Count,
           Count(Distinct(Admit_Date)) as Episode_Span
      from MyStuff
      group by MemberID, Admit_date;

And if you want to be assured of sequential MemberIDs and Admit_Dates, you need an "order by" clause, which will give:

    select MemberID, Admit_Date, sum(cost) as Episode_Cost, 1 as Episode_Count,
           Count(Distinct(Admit_Date)) as Episode_Span
      from MyStuff
      group by MemberID, Admit_date
      order by MemberID, Admit_date;

Which I believe is what you asked for, but did not show, in the first place. I am still at a loss as to what you want to do with your group field: in your sample output it is taken from the first obs of the group by MemberID--in both cases shown, it has been ignored.

morgalr · ‎01-22-2015

For needs like you describe I tend to use Proc SQL. The following will create a table, populate it, and give you the described result using Proc SQL.

    proc sql;
      create table myStuff(MemberID Char(4), Admit_Date Date, cost int, Group char(16));
      insert into myStuff values('1111', '01-JAN-2010'd, 3, 'Unavoidable');
      insert into myStuff values('1111', '01-JAN-2010'd, 2, 'Avoidable');
      insert into myStuff values('1111', '02-JAN-2010'd, 4, 'Unavoidable');
      insert into myStuff values('2222', '20-JAN-2010'd, 1, 'Avoidable');
      select MemberID, Min(Admit_Date) as Admit_Date, sum(cost) as Episode_Cost,
             1 as Episode_Count, Count(Distinct(Admit_Date)) as Episode_Span
        from MyStuff
        group by MemberID;
    quit;

If you want to have your answer by MemberID and Admit_Date, then just add Admit_Date to the "group by" clause: "group by MemberID, Admit_Date".
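If you want to try the grouping query without a SAS session, the sketch below runs an equivalent query against an in-memory SQLite database from Python. Two things are assumptions made for SQLite and are not part of the post: dates are stored as ISO text, and the Group column is renamed `Grp` because GROUP is a reserved word there.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE myStuff (MemberID TEXT, Admit_Date TEXT, cost INTEGER, Grp TEXT)"
)
# Same four rows as in the post, with dates written as ISO strings.
conn.executemany(
    "INSERT INTO myStuff VALUES (?, ?, ?, ?)",
    [
        ("1111", "2010-01-01", 3, "Unavoidable"),
        ("1111", "2010-01-01", 2, "Avoidable"),
        ("1111", "2010-01-02", 4, "Unavoidable"),
        ("2222", "2010-01-20", 1, "Avoidable"),
    ],
)

# One row per member: earliest date, total cost, and number of distinct dates.
episodes = conn.execute(
    """
    SELECT MemberID,
           MIN(Admit_Date)            AS Admit_Date,
           SUM(cost)                  AS Episode_Cost,
           1                          AS Episode_Count,
           COUNT(DISTINCT Admit_Date) AS Episode_Span
    FROM myStuff
    GROUP BY MemberID
    ORDER BY MemberID
    """
).fetchall()
conn.close()

print(episodes)
```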

Online Status	Offline
Date Last Visited	‎06-12-2018 06:29 PM

Re: How to simplify the code in data step by macro and do loop？

Re: Char to Num

Re: Char to Num

Re: SAS SQL self join on large datasets

Re: SAS SQL self join on large datasets

Re: How to track changing values for same ID across 5 datasets

Re: Pulling data from SQL serever for a particular month

Re: How to save time to extract data from SQL Server.

Re: SAS Scheduling

Re: counting occurrences within a household

Re: How to save time to extract data from SQL Server.

Re: Question regarding SAS data storage options

Re: How to predict date of birth using First name in SAS? Please help ...

Re: Metadata: More space please

Re: Delete First/Last observation from multiple EU's in a dataset

Re: Multiple "Where" statements in a PROC SQL step?

Re: left join with more tables

Re: How we create only specific row(observations)/How do we create row...

Re: retaining unique combinations

Re: Reading SAS Datasets directly from a zip file...

Re: left join by character variable

Re: How we create only specific row(observations)/How do we create row...

Re: prob in left join

Re: I would like to know how to make fiscal year

Re: Delete First/Last observation from multiple EU's in a dataset

Re: Codes

Re: Delete First/Last observation from multiple EU's in a dataset

Re: How to output observations whose data are missing for certain vari...

Re: How to output observations whose data are missing for certain vari...

Re: Group by member ID and consecutive dates

Re: retain date when first instance of default

Re: Group by member ID and consecutive dates

Re: Group by member ID and consecutive dates