About newbatprocsql1

newbatprocsql1 · ‎09-05-2024

Hi, I have a table that has a variable ClosedDt of types Num, Len 8, Format DATETIME19. and Informat DATETIME19. I have a proc sql statement that would like to filter this table every month and this involves comparing against this variable. Could I please get some help on how best to write a simple macro variable to do this? Please see the below for what I mean %let curr_dt = 2407 /*this is in the form YYMM for month end*/ proc sql; create table output as select a.CaseID as count_caseID from source_data_&curr_dt. as a where ClosedDt < "01AUG2024"d and ClosedDT >= "01JUL2024"d ;quit; 1. I know that the WHERE statement will produce zero observations. How can I rewrite it? 2. Could someone please show me how I can use the curr_dt macro variable to automate the WHERE statement (i.e. change the ClosedDt < "01AUG2024"d into something like ClosedDt < &dt_start. where dt_start will resolve to "01AUG2024", based on my curr_dt variable?

newbatprocsql1 · ‎08-10-2023

I say that will give different results in general, because I remember taking away the "unique" keyword and it gave different results. Also, did you mean to type count(person_id) twice in the proc sql statements?

newbatprocsql1 · ‎08-10-2023

Thanks! That's really helpful. I think now I'm just not understanding what the difference between using count(*), count(unique person_id) and count(person_id) is when using that "group by category, ref_type". It seems like it will always give the same result (but I know that count(unique person_id) and count(person_id) would give different results in general, I just can't see why, since we are using the group by statement).

newbatprocsql1 · ‎08-10-2023

That makes sense - so if PERSON_ID is always not missing then count(*) = count(person_id)? Also, how does that work with the group by statement? I see @JosvanderVelden 's reply, but the post they linked only says there's a caveat with the aggregate function, but doesn't say what the caveat is.

newbatprocsql1 · ‎08-10-2023

I'm pretty new to proc sql, but I was trying to count the number of people. Each unique person can have multiple rows (multiple application submissions). An example of a table below is: Person_ID Category Ref_type Total_amt 100 Green 2 350 100 Blue 2 300 100 Red 3 100 200 Green 1 20 200 Black 3 500 300 Blue 2 200 I want to count the number of people in each Category*Ref_type, so it'll be Category Ref_type No_ppl Green 1 1 Green 2 2 Blue 2 2 Red 3 1 Black 3 1 I'm not at home right now so I can't check but I remember that I tried doing proc sql; create table want as select Category ,Ref_type ,count(unique Person_ID) from have group by Category, Ref_type ;quit; which gave me not exactly what I wanted (note that this was for a large dataset, probably 2 million rows). When I removed the "unique" keyword, I got what I wanted. Can someone explain to me what the code with the "unique" when I'm using it with the count function, and a group by statement, and also what the code without the "unique" function does when I'm counting a specific variable that's not in the group by statement?

Online Status	Offline
Date Last Visited	‎09-06-2024 02:19 AM

Comparing against a DATETIME19 variable and macros

Re: What does "unique" do here for the count function alongside group ...

Re: What does "unique" do here for the count function alongside group ...

Re: What does "unique" do here for the count function alongside group ...

What does "unique" do here for the count function alongside group by s...

Comparing against a DATETIME19 variable and macros

Re: What does "unique" do here for the count function alongside group ...

Re: What does "unique" do here for the count function alongside group ...

Re: What does "unique" do here for the count function alongside group ...

What does "unique" do here for the count function alongside group by s...