About Oligolas

Oligolas · ‎06-15-2017

Not directly ASFAIK but... in PROC SQL you can use SAS functions and among them, regular expression functions. You could therefore check if a given field conforms to your date/datetime requirements before retrieving the records. You could also preprocess your dataset in order to determine the fields you are interested in. These are 2 possibilities among others.

Oligolas · ‎06-15-2017

Hi, the MFILE MPRINT functionality is explained >here< Edit: maybe turning on these options is sufficient for you: options mprint mprintnest source source2; options nomprint nomprintnest nosource nosource2; %macro test(initial); proc print data=sashelp.class;where substr(name,1,1) eq "&initial.";run; %mend test; %test(A); %test(J); options mprint mprintnest source source2; %macro test(initial); proc print data=sashelp.class;where substr(name,1,1) eq "&initial.";run; %mend test; %test(A); %test(J);

Oligolas · ‎06-14-2017

Hi, the variable num is not created in 'macro' if the condition is not fulfilled but you still try to resolve it in 'rename' Proc means data=temp n; Var _numeric_; Output out=want sum =; Run; %macro doit; %let num=; data macro; set want; if substr(name,1, 6)='nad_dx' then call symputx('num', N1, 'G'); if substr(name,1, 8)='nad_cert' then call symputx ('num', N2, 'G'); run; data rename; set test; %if &num. ne %then %do; rename nad_dx_1-nad_dx_&num=bd_dx_1-bd_dx_&num; rename nad_cert_1-nad_cert_&num=bd_dx_cert_1-bd_dx_cert_&num; %end; run; %mend doit; %doit;

Oligolas · ‎06-14-2017

Hi, I have no knowledge about hive. Nevertheless I would try different ways of deleting that table and see if it resolves the issue: proc sql; drop table hive_lib.ext_test_default; quit; or proc datasets lib=hive_lib nolist; delete ext_test_default; run;quit; or: looks like you only need to empty your table and not deleting it entirely right? data hive_lib.ext_test_default; if 0 then set hive_lib.ext_test_default; delete; run;

Oligolas · ‎06-14-2017

Hi, what about regex? data test; infile cards truncover; input string $100.; cards; "total_payout_value" : { "asset" : "SBD", "amount" : 37.13 }, "pending_payout_value" : { "asset" : "total_payout_value" : {"amount" : 37.13, "asset" : "SBD"}, "pending_payout_value" : { "asset" : "total_payout_value" : {"amount" : 0, "asset" : "SBD"}, "pending_payout_value" : { "asset" : ; run; data test1; length var $200; set test; var=prxchange('s/.*total_payout_value\D*(\d*\.?\d*).*/$1/',-1,string); run;

Oligolas · ‎06-12-2017

Hi, if you already compared a with c, you do not need to compare c with a. Regarding the 100 variables... is this an issue you still require assistance on? if so, are these 100 variables always available in all datasets? if you have those 100 variables in a and let's say only 85 of them in b (15 variables of 'a' are'nt available in 'b'), what should be the result of the comparison? Can you perhaps provide 3-4 datasets for testing?

Oligolas · ‎05-24-2017

Hi, this improvement sounds good to me. You will need to evaluate the programming effort with the running time of your program and the gain in robustness or efficacy. I would say, give it a try and come back to the community (with sample program & test data) if you need optimization feedback. Cheers,

Oligolas · ‎05-23-2017

Hi, I hope you realize you will have to make a comparison of all datasets together. For 1000 datasets that means C(1000,2)=1000!/(2!(1000-2)!)=499500 comparisons... Not sure how long it will take. Try this: %* Create test data; data a b c d e; do i=1 to 10; uid=i; if i > 4 then output a b c d e; else if i > 3 then output a b c d; else if i>2 then output a b c; else if i>1 then output a b; else output a; end; drop i; run; %* Determine Datasets to check; PROC SQL; CREATE VIEW v_all as SELECT memname,nobs from sashelp.vtable where libname eq 'WORK' /*adapt*/ and memname not in ('__COMPARISON', 'V_ALL') ; CREATE TABLE __COMPARISON AS select a.memname as current,a.nobs as currentnobs, b.memname as next, . as similarity format=percent7.1 from v_all a,v_all b where b.memname>a.memname order by a.memname,b.memname ; DROP VIEW v_all; QUIT; %* Prepare comparison macro; %MACRO compare(uniqueIDVar,current,next); PROC SQL; update __COMPARISON SET similarity=( (select count(*) from &current. where &uniqueIDVar. in (select &uniqueIDVar. from &next.))/currentnobs ) where current eq "&current." and next eq "&next." ; QUIT; %MEND compare; %* Run comparisons; DATA _NULL_; set __COMPARISON; call execute('%nrstr(%compare(uid,'||strip(current)||','||strip(next)||'))'); RUN; Cheers,

Oligolas · ‎05-23-2017

Hi, I'm not sure appending the dataset n-times is the most performant way to achieve what you want, but without further explanations or test data it's difficult to say. Anyway one way to loop would be with a macro, something like this: %macro repeat(n); %do i=1 %to &n.; data Sim_WD; set Sim_WD; Sim_value=COL1 + rand("Normal", 0, _RMSE_); run; PROC SQL; CREATE TABLE MD_append AS SELECT Site AS Site, YEAR AS YEAR, Season AS Season, TYPE AS TYPE, RegYear AS RegYear, max(Sim_value) AS MD FROM Sim_WD GROUP BY Site, YEAR, Season, TYPE, RegYear ; QUIT; proc transpose Data=MD_Append out=MD_Append (Drop=_Name_) prefix=MD; var MD; id RegYear; by Site Year Season Type; run; %if &i. eq 1 %then %do; data Work.md_hist_wd; set work.md_append; run; %end; %else %do; proc append base=Work.md_hist_wd data=work.md_append; run; %end; %end; %mend repeat; %repeat(100); Cheers

Oligolas · ‎04-24-2017

Hi, my solution isn't that elegant 😉 filename location "C:\TEMP"; data files; length name $250 nbRec 8; drop rc did i; did=dopen("location"); if did > 0 then do; do i=1 to dnum(did); name=pathname('location')||'\'||dread(did,i); if scan(name,-1,'.') eq 'txt' then output; end; rc=dclose(did); end; else put 'Could not open directory'; run; data _NULL_; set files; call execute(' data _null_; infile "'||strip(name)||'" end=eof; input; if eof then call execute(" proc sql; update files set nbRec="||put(_N_,best32.)||" where name eq ""'||strip(name)||'""; quit; "); run; '); run; Cheers

Oligolas · ‎04-19-2017

@SASKiwi wrote: According to the documentation: http://support.sas.com/documentation/cdl/en/acpcref/67382/HTML/default/viewer.htm#n0msy4hy1so0ren1ac... if you specify a range and GETNAMES=YES then the first row of the range is used to construct the column names and the second row is where the data starts. So NAMEROW and DATAROW become redundant in this case. Alternatively, you could specify a "named range" in Excel (open your workbook and hold CTRL+F3, then click new... select a region and enter a name) /*import the whole sheet*/ PROC IMPORT datafile="C:\TEMP\workbook.xlsx" OUT=want DBMS=XLSX REPLACE ; SHEET="...yourSheetName..."; GETNAMES=YES; RUN; /*import a named range*/ PROC IMPORT datafile="C:\TEMP\workbook.xlsx" OUT=want DBMS=XLSX REPLACE ; RANGE="...yourNamedRange..."; GETNAMES=NO; RUN; Cheers

Oligolas · ‎04-19-2017

...and remember btw that merge is not able to proceed a many-to-many relationship Cheers

Oligolas · ‎04-19-2017

http://www.lexjansen.com/nesug/nesug11/ds/ds03.pdf https://www.youtube.com/watch?v=zlDMwF3kQ6s

Oligolas · ‎04-19-2017

Hi, Choose the join you need from the graphic below. If you still need help consult http://www.listendata.com/2014/04/proc-sql-select-statement.html https://www.youtube.com/watch?v=rF5DC3CPORc https://www.youtube.com/watch?v=rIbdmdVdAgQ or post a specific question with sample data. Cheers

Oligolas · ‎04-12-2017

Hi, if your xls isn't a native excel file proc import won't work. Try saving your XLS as XLSX and use DBMS=xlsx If it works and you need to automate the conversion try this Cheers

Online Status	Offline
Date Last Visited	‎03-21-2025 04:50 PM

Re: Simplify proc sql multiple joins

Re: Use of colon after a comparison operator

Re: Use of colon after a comparison operator

Use of colon after a comparison operator

Re: IF-THEN statement not working for all observations

Re: Notation scientifique dans une variable de type alfanumerique $cha...

Re: Resolve macro variable name in intnx function to generate SQL code

Re: Resolve macro variable name in intnx function to generate SQL code

Re: nested macro definition causes :Open code statement recursion dete...

Re: nested macro definition causes :Open code statement recursion dete...

Re: Count number of value 1 using do loop

Re: cmiss(of _all_)

Re: How to create an if then statement to drop dates for the same obse...

Re: how do we apply where condition when there are multiple values in ...

Re: How dhms is calculating hours, minutes and seconds?

Re: Creating flag and additional record

Re: IF-THEN statement not working for all observations

Re: Count number of value 1 using do loop

Re: Using ODS Layout for customizing PDF Output

Re: Convert UTC timestamp string to date

Re: Isdate

Re: Export executed code to external file

Re: Call Symputx doesn't work?

Re: How to delete hive tables using datastep

Re: Pulling number from complex string and creating new numeric field ...

Re: Identifying Duplicates datasets

Re: Repeat a section of code (multiple steps) for n iterations

Re: Identifying Duplicates datasets

Re: Repeat a section of code (multiple steps) for n iterations

Re: count number of records of multiple text file seperately ??

Re: Importing Excel Files with namerow and range options

Re: please help in inner join

Re: please help in inner join

Re: please help in inner join

Re: Issues with importing data from another language

SAS Analytics Explorers