About hkim3677

mkeintz · ‎09-18-2021

I don't think you /* may need to sort have by YEAR if not actually so*/ Just use by year notsorted; in the TRANSPOSE procedue.

Ksharp · ‎10-16-2019

If you don't have a big table, otherwise I would try Hash Table. data test; input id $ group condition1 condition2; datalines; a 1 123 9 b 1 123 5 c 1 234 7 d 0 456 8 e 0 123 8 f 0 234 9 ; run; proc sql; create table want as select *,case when (select count(distinct group) from test where condition1=a.condition1)= (select count(distinct group) from test ) then 1 else 0 end as dummy1, case when (select count(distinct group) from test where condition2=a.condition2)= (select count(distinct group) from test ) then 1 else 0 end as dummy2 from test as a; quit;

Kurt_Bremser · ‎04-23-2018

Slight change on @novinosrin's take: data want; set have (rename=(cc=_cc)); by id; retain cc; if first.id then cc=_cc; drop _cc; run; Just to show another possible solution.

novinosrin · ‎04-16-2018

same as PG, did PG miss customer in group by? Data original; input supplier customer year post; datalines; 1001 8001 2000 1 1001 8001 2001 1 1002 8006 1995 0 1002 8006 1996 0 1002 8006 1997 1 1002 8006 1998 1 1002 8006 1999 1 1003 8008 2005 0 1003 8008 2006 0 1003 8009 2006 0 1003 8009 2007 1 ; run; proc sql; create table hope as select * from original group by supplier,customer having count(distinct post) = 2 order by supplier, year; quit;

PeterClemmensen · ‎04-14-2018

No problem 🙂

ChrisNZ · ‎03-27-2018

Or use a BY statements to avoid re-sorting if the tables are already sorted. data combined; set example1 example2; by id year; run;

Ksharp · ‎03-24-2018

Data test; input ID NAME :$16. NAME2 :$16. Year; datalines; 1 Highland HighlandBell 2008 1 Highland HighlandBell 2009 1 Highland HighlandBell 2010 1 Highland HighlandCorp 2008 1 Highland HighlandCorp 2009 1 Highland HighlandCorp 2010 1 Highland HighlandMalt 2008 1 Highland HighlandMalt 2009 1 Highland HighlandMalt 2010 2 HillBrosINC HillBrosINC 2011 2 HillBrosINC HillBrosINC 2012 3 HitachiLTD HitachLTD 2008 ; proc sql; create table test2 as select *,case when count(distinct catx(' ',name,name2)) ne 1 then 1 else 0 end as want from test group by name; quit;

ballardw · ‎03-23-2018

Note that you may not be able to let functions pick you "closest" without some judgment. If the company in one set is International Business Machines and IBM in the other then the functions may not get the smallest differences for this type of match.

hkim3677 · ‎03-16-2018

All above solutions are working good!! Thank you masters!

PGStats · ‎03-15-2018

The possibilities are endless... You can account for hyphenated names and firstnames easily, but frankly I don't even know what's what in "B. Evan Bayh, III". If such "names" are rare, you might be better sending them to a separate table and treating them by hand... Data test; input id fullname $64.; datalines; 1 Michael R. Boyce 2 James G. Brocksmith, Jr. 3 Gerald F. Fitzgerald, Jr. 4 Norman R. Bobins, BS, MBA 5 Peter Pace, USMC, Ret. 6 Ryan, Norman A. 7 Anne Newman Foreman 8 Michael R.S. Boyce 9 Ryan, Norman A., BS 10 Lavizzo-Mourey, Evans 11 B. Evan Bayh, III ; data want except(keep=id fullname); length firstName lastName $20 initials $6; if not prxId1 then prxId1 + prxParse("/^([\w-]+)\s+(([A-Z]\.)*\s*)([^,]+)/"); if not prxId2 then prxId2 + prxParse("/^([\w-]+),\s*([\w-]+)\s+(([A-Z]\.)*)/"); set test; if prxMatch(prxId1, fullname) then do; firstName = prxPosn(prxId1, 1, fullname); initials = prxPosn(prxId1, 2, fullname); lastName = prxPosn(prxId1, 4, fullname); output want; end; else if prxMatch(prxId2, fullname) then do; firstName = prxPosn(prxId2, 2, fullname); initials = prxPosn(prxId2, 3, fullname); lastName = prxPosn(prxId2, 1, fullname); output want; end; else output except; drop prxId: ; run; title "Matched names"; proc print data=want noobs; run; title "Unmatched names"; proc print data=except noobs; run;

Reeza · ‎03-14-2018

Then my previous answer was correct. @hkim3677 wrote: Sorry, I should have been clearer. My question here is how to create the column containing the percentage of manager having MD=1 by firm and year.

Ksharp · ‎03-14-2018

Data example; input firm manager $ year turnover post_turnover turnover_year; datalines; 1001 abc 2005 0 0 . 1001 abc 2006 0 0 . 1001 abc 2007 1 1 2007 1001 abc 2008 0 0 . 1001 abc 2008 0 1 . 1001 abc 2009 0 0 . 1001 abc 2009 0 1 . 1001 abc 2010 0 0 . 1001 abc 2010 0 1 . 1001 abc 2011 0 0 . 1001 abc 2011 0 1 . 1001 abc 2012 1 1 2012 1001 abc 2013 0 1 . 1002 def 1990 0 0 . 1002 def 1991 0 0 . 1002 def 1992 0 0 . 1002 def 1993 0 0 . 1002 def 1994 1 1 1994 1002 def 1995 0 0 . 1002 def 1995 0 1 . 1002 def 1996 1 1 1996 1002 def 1996 0 1 . ; run; data key; set example(where=(turnover_year is not missing)); do i=turnover_year-3 to turnover_year+3; year=i;output; end; keep firm manager year; run; data want; if _n_=1 then do; if 0 then set key; declare hash h(dataset:'key'); h.definekey('firm', 'manager', 'year'); h.definedone(); end; set example; if h.check()=0; run; proc print;run;

mkeintz · ‎03-10-2018

If I understand your question correctly, this is not a complicated task. Do a SET statement with TEST as the object 2 times the first time just for turnover years the second time for all year. The SET statement will pass through all the turnover years for a given id first, allowing generation of minyear (minyear=first turnover year minus 3) and maxyear (equal last turnover year +3) . The data step will then pass through ALL years for the same id, where comparison against minyear and maxyear can be tested. Data test; input ID MANAGER $ YEAR POST_TURNOVER; datalines; 1001 a 1990 0 1001 a 1991 0 1001 a 1992 0 1001 a 1993 0 1001 a 1994 0 1001 a 1995 1 1001 a 1996 1 1001 a 1997 1 1001 a 1998 1 1001 a 1999 1 1001 a 1995 0 1001 a 1996 0 1001 a 1997 0 1001 a 1998 0 1001 a 1999 0 1001 a 2000 0 1001 b 2000 1 1001 b 2001 1 1001 b 2002 1 1001 b 2003 1 1002 c 1989 0 1002 c 1990 0 1002 c 1991 0 1002 c 1992 0 1002 c 1993 1 1002 c 1994 1 1002 c 1995 1 1002 c 1996 1 1002 c 1997 1 run; data want; set test (where=(post_turnover=1) in=in1) test (in=in2); by id manager; retain min_year max_year; if first.manager then call missing (min_year,max_year); if first.manager and in1=1 then min_year=year-3; else if in1 then max_year=year+3; if in2=1 and (min_year<=year<=max_year); run; Notes The RETAIN statement tells SAS to not reset min_year and max_year to missing with every incoming record. Since all the turnover years are read first, testing on the associated IN1 dummy allows calculation of min_year and max_year. Then all the years are read (including a re-read of turnover years), allowing a subsetting IF of (min_year<=year<=max_year). The program allows for multi-year event spans (a sequence of post_turnover=1 records), but does not accommodate multiple events separated by non-event years.

hkim3677 · ‎03-08-2018

Thank you! However,.... Now I realized that I mis-specified turnover value. Can you help me find the turnover event? Like... ID YEAR Manager Turnover 1 1990 a 0 1 1991 a 0 1 1992 b 1 1 1993 b 0 1 1994 b 0 1 1995 c 1 1 1996 c 0 1 1997 c 0 2 1993 d 0 2 1994 d 0 2 1995 d 0 2 1996 e 1 2 1997 e 0 2 1998 e 0 2 1999 e 0 2 2000 e 0 2 2001 e 0 Can you help me create the turnover column? I really appreciate your help!

Online Status	Offline
Date Last Visited	‎09-18-2021 04:34 AM

Re: Reformating data

Re: Reformating data

Reformating data

How to create dummy variables for common codition

Replace values with the beginning value

Construct appropriate pre-and post- sample

Re: Remove after second-post period obs

Re: Remove after second-post period obs

Remove after second-post period obs

Merge two different period sample

Re: Reformating data

Re: How to create dummy variables for common codition

Re: Replace values with the beginning value

Re: Construct appropriate pre-and post- sample

Re: Remove after second-post period obs

Re: Merge two different period sample

Re: Identifying multiple possibilities

Re: Matching the closest name

Re: Dealing with close change

Re: Extract Name

Re: Calculate percentage by firm and year

Re: Retain only -3 and +3 period obs

Re: Event study window

Re: Post-treatment data manipulation