About gamotte

gamotte · ‎09-28-2018

Hello,
It would be easier to answer if you posted your "have" and "want" datasets in the form of data steps.

    data want;
        set have;
        array services(*) service2010-service2018;
        /* service2010 is element 1, so death_year=2010 maps to index 1 */
        startindex=death_year-2009;
        stopindex=dim(services);
        /* Sum the service counts from the year of death onwards */
        do i=startindex to stopindex;
            num_use_post_death=sum(num_use_post_death,services(i));
        end;
    run;

gamotte · ‎09-26-2018

@ChrisNZ's code works fine, although it creates macro variables m1 to m3. If you want the macro variable index to start from 0, just move the call symput above the N+1 instruction.
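
The effect of the statement order can be illustrated with a minimal sketch. The original code from the thread is not reproduced here, so the input dataset (sashelp.class, limited to three rows) and the stored value (name) are assumptions; only the counter N and the m prefix come from the post. Because a sum statement initializes its variable to 0, reading N before the increment yields the indices 0, 1, 2.

```sas
/* Counter incremented first: creates m1, m2, m3 */
data _null_;
    set sashelp.class(obs=3);   /* assumed input, for illustration only */
    n+1;
    call symputx(cats('m',n), name);
run;
%put m1=&m1 m2=&m2 m3=&m3;

/* Clean up so that the second variant starts from scratch */
%symdel m1 m2 m3;

/* CALL SYMPUTX moved above the increment: creates m0, m1, m2 */
data _null_;
    set sashelp.class(obs=3);
    call symputx(cats('m',n), name);
    n+1;
run;
%put m0=&m0 m1=&m1 m2=&m2;
```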

gamotte · ‎09-25-2018

Hello,
You forgot to resolve the macro variable in the where condition:

    where=(origin="&tt.")

Note that double quotes are necessary for the resolution to take place. As often said on this forum, splitting a dataset is rarely necessary, as most SAS procedures allow BY-group processing.
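
Both points can be sketched as follows. The dataset from the thread is not shown, so sashelp.cars (which has an Origin column), the value Asia and the analysis variable msrp are assumptions chosen for illustration; only the macro variable name tt and the where condition come from the post.

```sas
%let tt=Asia;   /* assumed value */

/* Double quotes: &tt. is resolved, so the condition is origin="Asia" */
data subset;
    set sashelp.cars(where=(origin="&tt."));
run;

/* BY-group processing: one result per origin without splitting the data */
proc sort data=sashelp.cars out=cars_sorted;
    by origin;
run;

proc means data=cars_sorted mean;
    by origin;
    var msrp;
run;
```

With single quotes, the condition would compare origin to the literal text &tt. and select nothing.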

gamotte · ‎09-21-2018

Please provide data in the form of an executable data step. The function you are looking for is call execute, which generates SAS code from the columns of a dataset. Here is an (untested) proposal for your problem.

    data _NULL_;
        /* Open the generated data step */
        call execute('data want; set data1;');
        do until(eof);
            set data2 end=eof;
            /* One IF/ELSE pair per rule stored in data2; */
            /* catx inserts the blanks between the tokens */
            call execute(catx(' ','if',operand1,rel_oper,operand2,'then',obs_rule_no,'="OK";'));
            call execute(catx(' ','else',obs_rule_no,'="KO";'));
        end;
        /* Close the generated data step */
        call execute('run;');
        stop;
    run;
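
To make the idea concrete, here is a small sketch of what such a rule table might contain and of the step that the generated code is meant to amount to. The column names operand1, rel_oper, operand2 and obs_rule_no come from the post; the two rules, the use of sashelp.class as data1 and the names rule1 and rule2 are assumptions made up for illustration.

```sas
/* Hypothetical rule table: one comparison per row (values are made up) */
data data2;
    length operand1 rel_oper operand2 obs_rule_no $8;
    input operand1 rel_oper operand2 obs_rule_no;
    cards;
age gt 12 rule1
height lt 60 rule2
;
run;

/* Hypothetical input data: sashelp.class stands in for data1 */
data data1;
    set sashelp.class;
run;

/* The step that CALL EXECUTE is expected to generate from the two rules */
data want;
    set data1;
    if age gt 12 then rule1="OK";
    else rule1="KO";
    if height lt 60 then rule2="OK";
    else rule2="KO";
run;
```

Each row of the rule table becomes one if/else pair, and the value of obs_rule_no becomes the name of the flag variable.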

gamotte · ‎09-21-2018

Thanks, I finally managed to overcome the difficulty. In particular, I needed the first.var and last.var variables created by the by statement, so I used a prior data step to save them as new columns of the dataset. The by statement thus became unnecessary and I could use the POINT= option.
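
A minimal sketch of that workaround, applied to the sashelp.class example from the opening post of the thread. The names class_flags and fst are assumptions; the rest follows the program in the question.

```sas
data have;
    input a;
    cards;
1
2
;
run;

proc sort data=sashelp.class out=class;
    by sex;
run;

/* Prior step: save FIRST.sex as an ordinary column */
data class_flags;
    set class;
    by sex;
    fst=first.sex;
run;

/* No BY statement is needed any more, so POINT= can be used and the */
/* second dataset is read again from observation 1 for each row of have */
data test;
    set have;
    do i=1 to nobs;
        set class_flags point=i nobs=nobs;
        if fst then total_weight=0;
        total_weight+weight;
        output;
    end;
    drop fst;
run;
```

No stop statement is required here because the step ends when the sequential set have runs out of observations.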

gamotte · ‎09-21-2018

Generalizing the program to identify the shareholders by Sector/Quarter proved a little more difficult than I expected, but I think the following program should work, though I haven't checked the resulting dataset thoroughly.

    data have;
        format Quarter date9.;
        input Shareholder $ Company $ Quarter :date9. Sector;
        cards;
    A AA 31MAR2006 2
    B AA 31MAR2006 2
    C AA 31MAR2006 2
    C BB 31MAR2006 2
    B BB 31MAR2006 2
    Z BB 31MAR2006 2
    A CC 31MAR2006 2
    B CC 31MAR2006 2
    C CC 31MAR2006 2
    Z CC 31MAR2006 2
    A DD 31MAR2006 3
    B DD 31MAR2006 3
    C DD 31MAR2006 3
    C EE 31MAR2006 3
    B EE 31MAR2006 3
    Z EE 31MAR2006 3
    A FF 31MAR2006 3
    B FF 31MAR2006 3
    C FF 31MAR2006 3
    Z FF 31MAR2006 3
    A AA 30JUN2006 2
    B AA 30JUN2006 2
    C AA 30JUN2006 2
    C BB 30JUN2006 2
    Y BB 30JUN2006 2
    Z BB 30JUN2006 2
    A CC 30JUN2006 2
    B CC 30JUN2006 2
    C CC 30JUN2006 2
    Z CC 30JUN2006 2
    A DD 30JUN2006 3
    B DD 30JUN2006 3
    C DD 30JUN2006 3
    A EE 30JUN2006 3
    B EE 30JUN2006 3
    Y EE 30JUN2006 3
    A FF 30JUN2006 3
    Y FF 30JUN2006 3
    C FF 30JUN2006 3
    Z FF 30JUN2006 3
    ;
    run;

    proc sql noprint;
        /* We generate all possible pairs as a Cartesian product */
        CREATE TABLE pairs AS
        SELECT DISTINCT a.Sector, a.Quarter, a.Company AS Comp1, b.Company AS Comp2
        FROM have a, have b
        WHERE a.Company<b.Company /* Avoid including both (AA,BB) and (BB,AA) */
          AND b.Quarter=a.Quarter
          AND b.Sector=a.Sector
        ;
    quit;

    /* Create a reference dataset of shareholders with a numeric id */
    /* that can be used later as an array index */
    proc sort data=have(keep=Shareholder) out=Shareholders nodupkey;
        by Shareholder;
    run;

    data Shareholders;
        Id=_N_;
        set Shareholders;
        call symput("nbsh",strip(_N_));
    run;

    /* We append the shareholder ids to the have dataset */
    proc sql noprint;
        CREATE TABLE have AS
        SELECT a.*, b.ID
        FROM have a
        LEFT JOIN Shareholders b
        ON b.Shareholder=a.Shareholder;
    quit;

    proc sort data=have out=have_S;
        by Sector Quarter Shareholder Company;
    run;

    /* Save first.Quarter and last.Quarter as ordinary columns */
    data have_S;
        set have_S;
        by Sector Quarter;
        fstq=first.Quarter;
        lstq=last.Quarter;
    run;

    /* For each pair, we identify the shareholders that have shares in both companies */
    data Common;
        set pairs;

        /* Array index is the Id of the shareholder */
        array SH (&nbsh.) SH1-SH&nbsh.;

        do i=1 to nobs;
            /* We rename Sector and Quarter to avoid overwriting the columns of the pairs dataset */
            set have_S (rename=(Sector=Sect Quarter=Quart)) point=i nobs=nobs;

            if fstq then call missing(of SH(*));

            if Sector=Sect and Quarter=Quart and (Company=Comp1 or Company=Comp2) then SH(Id)+1;

            if lstq then do idx=1 to dim(SH);
                /* If the shareholder with Id=idx has shares in both companies of the current pair, */
                /* we save the Id and output the result. */
                if SH(idx)=2 then do;
                    ShareholderId=idx;
                    output;
                end;
            end;
        end;

        keep Quarter Sector Comp1 Comp2 ShareholderId;
    run;

    /* We retrieve the shareholder name */
    proc sql noprint;
        CREATE TABLE Common AS
        SELECT a.*, B.Shareholder
        FROM Common a
        LEFT JOIN Shareholders b
        ON b.Id=a.ShareholderId
        ORDER BY Comp1, Comp2, Shareholder;
    quit;

gamotte · ‎09-21-2018

Thanks @data_null__ and @Astounding for your answers. I tried to simplify the problem in order to avoid distracting readers with unnecessary details, but I realize that it can give a misleading view of what I am trying to do. The computation of sums only served as an example to justify a by statement. Actually, what is done in the loop depends on the data in the currently processed row of the have dataset. That is, for each row in have, I want to read the whole other dataset and create new variables that depend on the columns of both datasets. I just can't figure out how to restart reading the second dataset from observation 1 for each new observation of the have dataset. If you want the full context, I was trying to generalize my last answer in the following thread: https://communities.sas.com/t5/SAS-Programming/Pairwise-comparisons-in-loops/m-p/497468#M131862 in order to take different sectors/quarters into account, and stumbled upon this difficulty.

gamotte · ‎09-21-2018

Sorry for not being clear enough. Here is what the resulting dataset should look like:

a   i  Name     Sex  Age  Height  Weight  total_weight
1   1  Alice    F     13    56.5    84.0          84.0
1   2  Barbara  F     13    65.3    98.0         182.0
1   3  Carol    F     14    62.8   102.5         284.5
1   4  Jane     F     12    59.8    84.5         369.0
1   5  Janet    F     15    62.5   112.5         481.5
1   6  Joyce    F     11    51.3    50.5         532.0
1   7  Judy     F     14    64.3    90.0         622.0
1   8  Louise   F     12    56.3    77.0         699.0
1   9  Mary     F     15    66.5   112.0         811.0
1  10  Alfred   M     14    69.0   112.5         112.5
1  11  Henry    M     14    63.5   102.5         215.0
1  12  James    M     12    57.3    83.0         298.0
1  13  Jeffrey  M     13    62.5    84.0         382.0
1  14  John     M     12    59.0    99.5         481.5
1  15  Philip   M     16    72.0   150.0         631.5
1  16  Robert   M     12    64.8   128.0         759.5
1  17  Ronald   M     15    67.0   133.0         892.5
1  18  Thomas   M     11    57.5    85.0         977.5
1  19  William  M     15    66.5   112.0        1089.5
2   1  Alice    F     13    56.5    84.0          84.0
2   2  Barbara  F     13    65.3    98.0         182.0
2   3  Carol    F     14    62.8   102.5         284.5
2   4  Jane     F     12    59.8    84.5         369.0
2   5  Janet    F     15    62.5   112.5         481.5
2   6  Joyce    F     11    51.3    50.5         532.0
2   7  Judy     F     14    64.3    90.0         622.0
2   8  Louise   F     12    56.3    77.0         699.0
2   9  Mary     F     15    66.5   112.0         811.0
2  10  Alfred   M     14    69.0   112.5         112.5
2  11  Henry    M     14    63.5   102.5         215.0
2  12  James    M     12    57.3    83.0         298.0
2  13  Jeffrey  M     13    62.5    84.0         382.0
2  14  John     M     12    59.0    99.5         481.5
2  15  Philip   M     16    72.0   150.0         631.5
2  16  Robert   M     12    64.8   128.0         759.5
2  17  Ronald   M     15    67.0   133.0         892.5
2  18  Thomas   M     11    57.5    85.0         977.5
2  19  William  M     15    66.5   112.0        1089.5

gamotte · ‎09-21-2018

Hello,
Consider the following program:

    data have;
        input a;
        cards;
    1
    2
    ;
    run;

    proc sort data=sashelp.class out=class;
        by sex;
    run;

    data test;
        set have;
        do i=1 to nobs;
            set class nobs=nobs;
            by sex;
            if first.sex then total_weight=0;
            total_weight+weight;
            output;
        end;
    run;

When execution reaches observation 2 of the have dataset, the class dataset has already been read entirely. Thus the next execution of "set class" stops the data step. I cannot use the point= option because of the by-group processing. Thanks for any advice to overcome this difficulty.

gamotte · ‎09-21-2018

Sure.

mod(month+7,12)+1 : mod is the modulo function (equivalent to the % operator in some other languages); it is used to get the right month number (1: Jan, 2: Feb, ...) from your month variable.

mdy(...,1,2017) : creates a date value for the 1st of the selected month.

intnx('month',...,0,'e') : moves the date to the last day ('e' parameter) of the same month ('month' and 0 parameters).
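
The three pieces can be tried together on all twelve values of the month variable. The original statement from the thread is not shown in this post, so the way the functions are nested below and the names cal_month and month_end are assumptions; the expressions themselves are the ones explained above.

```sas
data months;
    do month=1 to 12;
        /* Shift the month number: 1 becomes 9 (Sep), 5 becomes 1 (Jan) */
        cal_month=mod(month+7,12)+1;
        /* First day of that month in 2017, then moved to the end of the month */
        month_end=intnx('month',mdy(cal_month,1,2017),0,'e');
        output;
    end;
    format month_end date9.;
run;

proc print data=months noobs;
run;
```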

gamotte · ‎09-20-2018

Hello,
Your prxchange call takes as input a string that is itself the result of a prxchange and that is not given a length, hence the truncation. Giving the intermediate string an explicit length solves the problem:

    proc sql;
        create table c(drop=tempstr) as
        select origstring,
               prxchange('s/([AI])\/(S)/$1$2/',-1,origstring) as tempstr length=609,
               prxchange('s/\s?[&,\/\+]\s?/_/',-1,calculated tempstr) as xlatestring length=609
        from a;
    quit;

gamotte · ‎09-20-2018

You have a complex problem and we won't solve the whole thing for you. You made a first step by identifying the different steps involved. I propose to limit this thread to the identification of common shareholders. Here is a program that addresses this specific subject:

    data have;
        input Shareholder $ Company $;
        cards;
    A AA
    B AA
    C AA
    C BB
    B BB
    Z BB
    A CC
    B CC
    C CC
    Z CC
    E CC
    B DD
    F DD
    E EE
    F EE
    A EE
    ;
    run;

    proc sql noprint;
        /* We generate all possible pairs as a Cartesian product */
        CREATE TABLE pairs AS
        SELECT DISTINCT a.Company AS Comp1, b.Company AS Comp2
        FROM have a, have b
        WHERE a.Company<b.Company /* Avoid including both (AA,BB) and (BB,AA) */
        ;
    quit;

    /* Create a reference dataset of shareholders with a numeric id */
    /* that can be used later as an array index */
    proc sort data=have(keep=Shareholder) out=Shareholders nodupkey;
        by Shareholder;
    run;

    data Shareholders;
        Id=_N_;
        set Shareholders;
        call symput("nbsh",strip(_N_));
    run;

    /* We append the shareholder ids to the have dataset */
    proc sql noprint;
        CREATE TABLE have AS
        SELECT a.*, b.ID
        FROM have a
        LEFT JOIN Shareholders b
        ON b.Shareholder=a.Shareholder;
    quit;

    /* For each pair, we identify the shareholders that have shares in both companies */
    data Common;
        set pairs;

        /* Array index is the Id of the shareholder */
        array SH (&nbsh.) SH1-SH&nbsh.;
        call missing(of SH(*));

        /* Count, for each shareholder, how many companies of the pair they hold */
        do i=1 to nobs;
            set have point=i nobs=nobs;
            SH(Id)+sum(Company=Comp1,Company=Comp2);
        end;

        /* A count of 2 means the shareholder holds both companies */
        do i=1 to dim(SH);
            if SH(i)=2 then do;
                ShareholderId=i;
                output;
            end;
        end;

        keep Comp1 Comp2 ShareholderId;
    run;

    /* We retrieve the shareholder name */
    proc sql noprint;
        CREATE TABLE Common AS
        SELECT a.*, B.Shareholder
        FROM Common a
        LEFT JOIN Shareholders b
        ON b.Id=a.ShareholderId
        ORDER BY Comp1, Comp2, Shareholder;
    quit;

From that you will be able to move on to the other aspects of your task, and you can open new discussions if you get stuck.

gamotte · ‎09-20-2018

Hello,
What is the ultimate goal, and why should the relevant tools be forbidden? What criterion tells us where the letter should be inserted? This does what you want, but I doubt it really answers your needs.

    data s;
        name='sas programmer ';
        /* Split the string on 'g' and put 'Hg' back between the two parts */
        name2=cats(scan(name,1,'g'),'Hg',scan(name,2,'g'));
    run;

gamotte · ‎09-19-2018

Oops! Right. Sorry, I answered too fast. On your sorted dataset:

    data want;
        set have;
        lp=lag(phase);
        if lp ne phase;
        drop lp;
    run;
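
A small sketch of how this step behaves. The sample rows are made up, and the column obs is an assumption; only phase appears in the code above. A row is kept only when its phase differs from the phase of the row just before it, so a value that reappears later is kept again.

```sas
/* Hypothetical sample data */
data have;
    input obs phase $;
    cards;
1 A
2 A
3 B
4 B
5 A
;
run;

data want;
    set have;
    /* LAG is called on every row, so lp always holds the previous phase */
    lp=lag(phase);
    /* Keep the row only if the phase changed (lp is missing on row 1) */
    if lp ne phase;
    drop lp;
run;
```

With these rows, want keeps observations 1, 3 and 5.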

gamotte · ‎09-19-2018

Use proc sort with the nodupkey option.

    proc sort data=have nodupkey;
        by obs;
    run;

Online Status	Offline
Date Last Visited	‎08-13-2025 08:30 AM

Re: Stream a binary file

Stream a binary file

Re: Syntax error with parenthesis

Re: if-else if-else with do

Re: Macro Do Loop

Re: Top N values in sas

Re: How to add months in macro variable?

Re: Stored Process drop down lists

Re: If nobs is 0 then create a new observation

Re: IF THEN CONDITION

Re: SAS Enterprise Guide. What happens when one submits a SAS job

Re: Macro Do Loop

Re: Converting date to number forth and back

Re: where vs. if

Re: How to count from file inside SAS?

Re: add double quotes for words in string

Re: missing,nonmissing and unique value in column

Re: In a Stored Process, using _webout and <select> statement how do I...

Re: How to change the WORK library folder location based on user

Re: [Feature suggestion] Implement VIM in the SAS base and SAS Enterpr...

Re: Use a variable's value as part of another variable's name

Re: Create series if macro varaibles for dates values in data set

Re: split one data set into many

Re: Creating and resolving macro variables as a validation rule in the...

Re: Reset observation pointer for the set statement

Re: Pairwise comparisons in loops

Re: Reset observation pointer for the set statement

Re: Reset observation pointer for the set statement

Reset observation pointer for the set statement

Re: Month to Calendar Month

Re: PROC SQL truncates result of prxchange function

Re: Pairwise comparisons in loops

Re: how to insert letter in a string

Re: Removing duplicates depending on preceeding observation

Re: Removing duplicates depending on preceeding observation
