About KS99

KS99 · ‎02-03-2024

Thank you, sbxkoenk! This is great, detailed help! I can use it for the rest of my life, whenever I get troubled by Poisson! KS -

KS99 · ‎12-20-2023

Hi, nice to see you again, K-Sharp!! I opened your message and got your code. I inserted it into my old code after modifying it a little, and it works exactly as I wanted! Thank you, K-Sharp, and I wish you great holidays! KS -

mkeintz · ‎08-13-2023

My understanding is that you want one output record for each M&A record (dataset M_and_A in the code below). For each M&A record you want the earnings record posted on the same date or, failing that, on the earliest following date (from dataset EARNINGS).

If both of your datasets are sorted by ID (e.g. ticker, cusip, permco) and DATE, then you can take advantage of conditional SET statements (if some_condition then SET some_dataset) to carry forward (or in this case, carry backward) data of interest.

You might think you have to sort the data in reverse chronological order within each id, then use conditional SETs to logically carry forward (but in real calendar terms, carry backward) the needed matching data. Having done that, you'd have to re-sort back to chronological order.

You don't have to do that. Instead, just recognize that carrying a given earnings record dated, say, 02JUL2023 back to (but not including) the previous earnings record (say 31MAR2023) is the same as carrying the 02JUL2023 data forward from 01APR2023 (the day after the preceding earnings record) through 02JUL2023. An inexpensive (compared to sorting and re-sorting) dataset view can enable this approach. In other words, carrying data from date D{t} backwards is the same as carrying the same data forward from date D{t-1}+1, so just create a dataset view using this concept.

Something like this:

    /* View: _SORT_DATE is the day after the previous earnings date, i.e. the
       first date from which this earnings record should be carried forward */
    data earnings_next / view=earnings_next;
      set earnings;
      by id;
      _sort_date = ifn(first.id, '01jan1960'd, sum(lag(date), 1));
    run;

    data want (label='Every M&A record plus either the same-day earnings or earliest following earnings'
               drop=_:);
      /* Unconditional MERGE reads only the sort keys (KEEP= uses the pre-RENAME name) */
      merge m_and_a       (in=in_m_and_a  keep=id date rename=(date=_sort_date))
            earnings_next (in=in_earnings keep=id _sort_date);
      by id _sort_date;
      /* Conditional SETs: values are held until the next read of that dataset */
      if in_earnings then set earnings_next (rename=(date=earnings_date));
      if in_m_and_a  then set m_and_a (rename=(date=M_and_A_date));
      if in_m_and_a;   /* keep one output record per M&A record */
    run;

The MERGE is unconditional, which means all the variables it retrieves from the merged datasets are replaced (or set to missing) with every new distinct value of ID/_SORT_DATE. That is, all earnings data would be set to missing every time an M&A record is read (without an exact earnings match), and vice versa. But this unconditional MERGE reads only the sorting variables ID and _SORT_DATE, to avoid overwriting data from the subsequent conditional SETs.

The conditional SET statements later in the code mean that the relevant data is NOT replaced or set to missing until the condition is met. So earnings records are logically carried forward (but actually carried backward, using the modified _SORT_DATE in earnings_next) and held. The earnings data is still there when a subsequent M&A record is encountered (in_m_and_a=1), the M&A data is read, and the merged record is output.

I have a 2017 paper on this: History Carried Forward, Future Carried Back: Mixing Time Series of Differing Frequencies

A couple of notes on the code:

In making the earnings_next view, I use 01jan1960 as the default _SORT_DATE value for the first record of each ID. You can use whatever date value you want (including a missing value), as long as it precedes the earliest actual DATE value in both datasets.

The resulting data will have one merged record for each M_and_A record, with two DATE variables: M_and_A_date and earnings_date.
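To try the code above, two small input datasets are enough. The sketch below is only an illustration: the IDs, dates, and the variables eps and deal_value are made up and do not come from the original thread. Both datasets are entered already sorted by id and date, as the approach requires.

```sas
/* Hypothetical earnings data, sorted by id and date */
data earnings;
  input id $ date :date9. eps;
  format date date9.;
  datalines;
AAA 31MAR2023 1.10
AAA 02JUL2023 1.25
AAA 30SEP2023 1.40
BBB 15FEB2023 0.50
;
run;

/* Hypothetical M&A data, sorted by id and date */
data m_and_a;
  input id $ date :date9. deal_value;
  format date date9.;
  datalines;
AAA 01APR2023 100
AAA 02JUL2023 250
BBB 01JAN2023 75
;
run;
```

The three M&A records cover the cases discussed in the post: a date that falls between two earnings dates, a date that coincides with an earnings date, and a date that precedes the first earnings record for its ID.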

KS99 · ‎12-20-2022

Oh, is it? If PROC TIMEDATA is faster than PROC SQL, then I will study it and use it. Many thanks! KS -

Ksharp · ‎10-14-2022

    data have;
      infile cards expandtabs;
      input cusip $ year month x;
    datalines;
    000255 2004 1 .
    000255 2004 2 .
    000255 2004 3 1
    000255 2004 4 .
    000255 2004 5 .
    000255 2004 6 .
    000255 2004 7 .
    000255 2004 8 .
    000255 2004 9 .
    000255 2004 10 .
    000255 2004 11 .
    000255 2004 12 .
    000255 2005 1 .
    000255 2005 2 .
    000255 2005 3 .
    000255 2005 4 2
    000255 2005 5 2
    000255 2005 6 .
    000255 2005 7 1
    000255 2005 8 .
    000255 2005 9 .
    000255 2005 10 .
    000255 2005 11 .
    000255 2005 12 .
    000307 2015 1 .
    000307 2015 2 1
    000307 2015 3 1
    000307 2015 4 2
    000307 2015 5 3
    000307 2015 6 .
    000307 2015 7 3
    000307 2015 8 3
    000307 2015 9 .
    000307 2015 10 5
    000307 2015 11 .
    000307 2015 12 .
    ;
    run;

    data temp1;
      set have;
      by cusip year;
      retain x1;
      if first.year then call missing(x1);
      if not missing(x) then x1=x;
    run;

    proc sort data=temp1 out=temp2;
      by cusip year descending month;
    run;

    data temp3;
      set temp2;
      by cusip year;
      retain x2;
      if first.year then call missing(x2);
      if not missing(x) then x2=x;
    run;

    data want;
      set temp3;
      want=mean(x1,x2);
      drop x x1 x2;
    run;

    proc sort data=want;
      by cusip year month;
    run;

KS99 · ‎06-28-2022

Thank you Ksharp, long time no hear. I will copy your code. For a SAS novice like me, "data B merge A A" is a surprise. I think I can use it in a very efficient way. Sincerely, KS -

KS99 · ‎06-24-2022

Thank you, Tom. Your code works way faster than I expected! The logic you are using is the one I had in mind, but I had trouble writing workable code. Yours is a bit over my head, but I will try to learn it. I wish you a nice weekend! Sincerely, KS -

KS99 · ‎05-16-2022

Thank you mkeintz, I've encountered the coalesce function many times before, but only now, in this context, do I understand it. If you don't mind, could you give me a little explanation of how call missing(_days); is supposed to work? (I have never understood CALL routines.) Wish you a very good evening! KS -
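As a general illustration of the question (not the code from the thread): CALL MISSING is a CALL routine that sets each variable listed in its argument list to missing, numeric variables to . and character variables to blank, and leaves all other variables alone. The variable names below are made up for this sketch.

```sas
data _null_;
  length name $8;
  x = 5;
  y = 10;
  name = 'abc';
  /* Set x (numeric) and name (character) to missing in one call; y is not listed */
  call missing(x, name);
  put x= y= name=;   /* writes x=. y=10 name= to the log */
run;
```

A single statement such as call missing(_days); is therefore equivalent to _days = .; for a numeric variable, with the convenience that several variables of mixed type can be reset in one call.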

KS99 · ‎05-11-2022

Thank you, Reeza, for pointing that out. I abbreviated many obs. to present my dataset here. What I simply mean is to check all previous obs. within a 365-day range. Wish you a good evening, KS -

KS99 · ‎05-11-2022

Thank you, mkeintz, for helping out all the time! Your code works perfectly 😉 Sincerely, KS -

KS99 · ‎05-09-2022

Could anyone from the SAS community provide me with a solution, if you are available?

Dear PeterClemmensen,

In relation to my previous task, I have bumped into another obstacle. I have now sorted my dataset (PROC SORT) by cusip (= firm id), Analyst_id, Annc_date, and Span_closing. Span_closing marks the final 1-year span before a house closes.

    cusip     House_id  Analyst_id  Annc_date   Span_closing
    00030710  2143      134474      2/23/2016   0
    00030710  2143      134474      3/18/2016   0
    00030710  2143      134474      8/5/2016    1
    00030710  2143      134474      11/4/2016   1
    00030710  2143      134474      2/27/2017   1
    00030710  2143      134474      3/1/2017    1
    00030710  3945      165909      10/23/2015  1
    00030710  3945      165909      2/5/2016    1
    00030710  3945      165909      2/24/2016   1
    00032Q10  926       194378      8/30/2021   1
    00032Q10  926       194378      11/11/2021  1
    00036020  371       1112        3/22/1994   0
    00036020  371       1112        6/7/1994    0
    00036020  371       1112        11/21/1994  0
    00036110  32        136         6/29/1989   0
    00036110  32        136         1/4/1990    0
    00036110  32        136         3/29/1990   1
    00036110  32        136         5/10/1990   1
    00036110  32        136         8/2/1990    1
    00036110  32        136         8/15/1990   1
    00036110  32        136         8/30/1990   1
    00036110  32        136         10/24/1990  1
    00036110  32        136         10/31/1990  1
    00036110  32        136         1/10/1991   1
    00036110  479       136         5/2/1991    0
    00036110  479       136         8/27/1991   0
    00036110  479       136         9/12/1991   0
    00036110  479       136         10/1/1991   0
    00036110  479       136         10/31/1991  0
    00036110  479       136         12/30/1991  0
    00036110  479       136         4/2/1992    0
    00036110  479       136         7/1/1992    0
    00036110  479       136         12/21/1992  0
    00036110  479       136         1/26/1993   0
    00036110  165       278         9/8/1988    0

In this dataset, you can see that Analyst_id=134474, who was working for house 2143, stopped working (i.e., disappeared) as soon as house 2143 closed (Span_closing=1; see especially 3/1/2017). The same holds for Analyst_id=165909 working for house 3945, and for Analyst_id=194378 working for house 926. Such cases can simply be left as they are.

However, Analyst_id=136, who used to work for house 32, enters the final-year span of house 32 (Span_closing=1), and after the house closes, he continues to work for another house, 479.

For an analyst like him, I'd like to create a new variable "Analyst_persist" that highlights the 1-year span (5/2/1991~4/2/1992) in which he works for the new house. That is, I'd like to create a dataset like the one below:

    cusip     House_id  Analyst_id  Annc_date   Span_closing  Analyst_persist
    00030710  2143      134474      2/23/2016   0             0
    00030710  2143      134474      3/18/2016   0             0
    00030710  2143      134474      8/5/2016    1             0
    00030710  2143      134474      11/4/2016   1             0
    00030710  2143      134474      2/27/2017   1             0
    00030710  2143      134474      3/1/2017    1             0
    00030710  3945      165909      10/23/2015  1             0
    00030710  3945      165909      2/5/2016    1             0
    00030710  3945      165909      2/24/2016   1             0
    00032Q10  926       194378      8/30/2021   1             0
    00032Q10  926       194378      11/11/2021  1             0
    00036020  371       1112        3/22/1994   0             0
    00036020  371       1112        6/7/1994    0             0
    00036020  371       1112        11/21/1994  0             0
    00036110  32        136         6/29/1989   0             0
    00036110  32        136         1/4/1990    0             0
    00036110  32        136         3/29/1990   1             0
    00036110  32        136         5/10/1990   1             0
    00036110  32        136         8/2/1990    1             0
    00036110  32        136         8/15/1990   1             0
    00036110  32        136         8/30/1990   1             0
    00036110  32        136         10/24/1990  1             0
    00036110  32        136         10/31/1990  1             0
    00036110  32        136         1/10/1991   1             0
    00036110  479       136         5/2/1991    0             1
    00036110  479       136         8/27/1991   0             1
    00036110  479       136         9/12/1991   0             1
    00036110  479       136         10/1/1991   0             1
    00036110  479       136         10/31/1991  0             1
    00036110  479       136         12/30/1991  0             1
    00036110  479       136         4/2/1992    0             1
    00036110  479       136         7/1/1992    0             0
    00036110  479       136         12/21/1992  0             0
    00036110  479       136         1/26/1993   0             0
    00036110  165       278         9/8/1988    0             0

I tried to modify your previous code a little to achieve this, but unfortunately my understanding of the SAS function intnx is very limited and I couldn't do it. I'd really appreciate it if you could give me a hand one more time! Sincerely, KS -
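One possible way to approach this request is sketched below. It is not the answer given in the thread, and it rests on two assumptions that the post does not state outright: the 1-year window starts at the analyst's first announcement date at the new house, and a move counts only when the analyst's last record at the previous house has Span_closing=1. The sample rows are a small subset of the data above; the names _start and _prev_closing are made up for the sketch.

```sas
data have;
  input cusip :$8. house_id analyst_id annc_date :mmddyy10. span_closing;
  format annc_date mmddyy10.;
  datalines;
00030710 2143 134474 2/27/2017 1
00030710 2143 134474 3/1/2017 1
00036110 32 136 10/31/1990 1
00036110 32 136 1/10/1991 1
00036110 479 136 5/2/1991 0
00036110 479 136 4/2/1992 0
00036110 479 136 7/1/1992 0
;
run;

data want;
  set have;
  /* NOTSORTED: house_id need not be in ascending order within an analyst */
  by cusip analyst_id house_id notsorted;
  retain _start;                       /* assumed window start: first date at the new house */
  _prev_closing = lag(span_closing);   /* LAG is called on every row, never conditionally */
  if first.analyst_id then call missing(_start);
  else if first.house_id then do;
    /* Same analyst, new house: open a window only if the old house was closing */
    if _prev_closing = 1 then _start = annc_date;
    else call missing(_start);
  end;
  if missing(_start) then analyst_persist = 0;
  else analyst_persist = (annc_date < intnx('year', _start, 1, 'same'));
  drop _:;
run;
```

Here intnx('year', _start, 1, 'same') returns the same calendar day one year after _start, so with the sample rows the window opened on 5/2/1991 covers 4/2/1992 but not 7/1/1992, in line with the desired output shown in the post.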

KS99 · ‎04-19-2022

Thank you SASkiwi! I will keep them in mind! Sincerely, KS -,

KS99 · ‎03-09-2022

Oh, hi, Ballardw. Thank you for your code. I will keep it and use it for future research. I wish you a very good week! KS -

KS99 · ‎03-07-2022

Hi, Peter. Thank you so much for your revised code. It works perfectly, not only for this case but also for my subsequent variable creations. But soon afterwards I bumped into another problem. (I posted it on the Community board.) I'd greatly appreciate your help. Sincerely, KS -

Tom · ‎03-06-2022

So you are back to the method I posted. You can now use the BY statement to create the FIRST. and LAST. variables so you know when the LAG and "LEAD" variables are valid.
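A minimal sketch of this idea follows, using made-up data and variable names (id, t, x, lag_x, lead_x); it is not the code from the thread. The look-ahead ("lead") value comes from a second SET statement that reads the same dataset starting at observation 2, and FIRST.id / LAST.id from the BY statement mark the rows where the lagged or leading value belongs to a different group and should be blanked.

```sas
data have;
  input id $ t x;
  datalines;
A 1 10
A 2 20
A 3 30
B 1 5
B 2 7
;
run;

data want;
  set have;
  by id;
  /* Second read of the same data, one row ahead; the extra obs=1 piece with
     no variables keeps this SET from running out before the first one */
  set have (firstobs=2 keep=x rename=(x=lead_x))
      have (obs=1 drop=_all_);
  lag_x = lag(x);                           /* call LAG unconditionally */
  if first.id then call missing(lag_x);     /* no previous row within this id */
  if last.id  then call missing(lead_x);    /* no next row within this id */
run;
```

With this layout, the first row of each id has a missing lag_x and the last row of each id has a missing lead_x, so values never leak across id boundaries.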

Date Last Visited	‎02-03-2024 06:51 PM

Re: How can I compare two Poisson models (nested)?

How can I compare two Poisson models (nested)?

Re: Random sampling of n=500 by a random year btw. 1999 and 2022. Am I...

Random sampling of n=500 by a random year btw. 1999 and 2022. Am I doi...

How to resolve the error " Cannot remove record from a hash object whi...

Re: Extracting the (-120,-21) window stock returns from an event day d

Re: How can I compare two Poisson models (nested)?

Re: How can I compare two Poisson models (nested)?

Re: Filling in missing obs with conditions

Re: How to lag values as many rows as designated, iteratively over all...

How to merge two datasets by the equal or the next closest dates?

How to identify three-day (more more) windows around an event (and cal...

Re: How can I compare two Poisson models (nested)?

Re: Random sampling of n=500 by a random year btw. 1999 and 2022. Am I...

Re: How to resolve the error " Cannot remove record from a hash object...

Re: Extracting the (-120,-21) window stock returns from an event day d

Re: How to fill in missing obs. with the mean values of prior and post...

Re: Sieving only the 0-and-1 consecutive couplets from dataset

Re: Choosing one-year prior 'same day' or 'end day' or their closest d...

Re: Filling in missing obs with conditions

Re: Highlighting all one-year prior observations using intnx function

Re: My intnx function is not working

Re: Marking all the way back to 1 year ago from each designated date

Re: Proc-importing multiple excel files using %macro

Re: My code "p=max() to min()" does not provide repeated overlaps, whi...

Re: How to merge two datasets by the equal or the next closest dates?

Re: How to identify three-day (more more) windows around an event (and...