About brulard

brulard · ‎04-24-2018

Appreciate the explanation, thank you

brulard · ‎04-24-2018

Hi, I have multiple numeric variables that I want to concatenate into one new character variable. I can do it using PUT. Is there another way of doing it (preferably one that would also work in SQL)? Thanks in advance.

data have;
  input Id Rank Count Date;
  datalines;
001 01 01 2017
001 02 01 9999
002 01 03 9999
003 01 02 2018
003 02 02 9999
004 01 02 9999
;
run;

data want;
  /* format a $3. b $2. c $2. d $4.; */
  set have;
  a=PUT(id,best.);
  b=PUT(rank,best.);
  c=PUT(count,best.);
  d=PUT(date,best.);
  fin=catx('|',a||b||c||d);
  keep fin;
run;
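
One possible alternative, offered as a sketch rather than a definitive answer: CATX accepts numeric arguments directly and converts them itself, so the intermediate PUT calls may not be needed, and the same call can be used in PROC SQL. The two-row sample data, the output names `want` and `want_sql`, and the length of 40 are assumptions made for illustration. Because Id is read as numeric, leading zeros are not kept; a PUT with a Z format would still be needed if they matter.

```sas
/* Assumed sample data, same layout as HAVE in the post */
data have;
  input Id Rank Count Date;
  datalines;
001 01 01 2017
001 02 01 9999
;
run;

/* DATA step: CATX converts the numeric arguments and trims blanks */
data want;
  set have;
  length fin $40;                          /* assumed length */
  fin = catx('|', Id, Rank, Count, Date);
  keep fin;
run;

/* PROC SQL: the same function call */
proc sql;
  create table want_sql as
  select catx('|', Id, Rank, Count, Date) as fin length=40
  from have;
quit;
```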

brulard · ‎04-11-2018

Hi, I'm writing a query involving multiple left joins, using several months' worth of data for the same group of customers. To help the end user understand an arbitrary value, I add a CASE expression to include the value description. Can someone suggest a way for me to write the case description only ONCE, so that it need not be rewritten for each subsequent month? Should this be done via a MACRO, and then referred to using the INCLUDE statement?

The end result would look like:

Acct  Status_mon01  Desc_mon01  Status_mon02  Desc_mon02  /* and so on */
001   S             No mail     S             No Mail
002   C             Vacation    N             Pending

Sample code:

proc sql;
  create table base as
  select t1.*,
         t2.Status as Status_mon01,
         case t2.status
           when 'A' then 'MAIL'
           when 'S' then 'No mail'
           /* and so on until end of when-then */
         end as Desc_mon01,
         t3.status as status_mon02,
         case <...>
  from table1 as t1
  left join <...> on t1.id=t2.id
  left join <...>

Thank you
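
A common way to write such a lookup once is a user-defined format applied with PUT, instead of repeating the CASE expression for every month. The sketch below is only one possible approach. The format name `$statdesc`, the monthly tables `mon01` and `mon02`, and their contents are assumptions for illustration; the four code descriptions are taken from the example in the post.

```sas
/* Define the code-to-description mapping once */
proc format;
  value $statdesc
    'A'   = 'MAIL'
    'S'   = 'No mail'
    'C'   = 'Vacation'
    'N'   = 'Pending'
    other = ' ';
run;

/* Assumed sample tables: a base table plus one table per month */
data table1;
  input id $;
  datalines;
001
002
;
run;

data mon01;
  input id $ Status $;
  datalines;
001 S
002 C
;
run;

data mon02;
  input id $ Status $;
  datalines;
001 S
002 N
;
run;

/* Each month reuses the same format instead of a new CASE expression */
proc sql;
  create table base as
  select t1.*,
         t2.Status                  as Status_mon01,
         put(t2.Status, $statdesc.) as Desc_mon01,
         t3.Status                  as Status_mon02,
         put(t3.Status, $statdesc.) as Desc_mon02
  from table1 as t1
  left join mon01 as t2 on t1.id = t2.id
  left join mon02 as t3 on t1.id = t3.id;
quit;
```

If a description changes, only the PROC FORMAT step needs editing.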

brulard · ‎04-09-2018

Hi Ksharp, it is 1 in my example (a zero could also work, but it should never be 2 or 3). I've struggled in Perl with making the count for example (ii) identical to the count for (i), which I believe is related to how \b operates.

brulard · ‎04-06-2018

Hi, I'm looking to count how many words contain one or more numeric characters (for an address variable). The following code almost does it: prxparse('/\d{1}\b/'), derived from a very useful suggestion from Ksharp in a prior post. But it needs another enhancement: in the example below, all words are counted properly except in record (i), which should give 3. Note that the 101A in (i) is the reverse of A101 in (ii).

data have;
  input;
  have=_infile_;
  pid=prxparse('/\d{1}\b/');
  s=1; e=length(_infile_);
  n=0;
  call prxnext(pid,s,e,_infile_,p,l);
  do while(p>0);
    n+1;
    call prxnext(pid,s,e,_infile_,p,l);
  end;
  keep have n;
  cards;
() 189 ELIOT STREET | UNIT 112 | ROCKLAND | ON | K4K0G4 | CAN
(i) 1769 101A AVE | SURREY | BC | V4N5V8 | CAN
(ii) 1769 A101 AVE | SURREY | BC | V4N5V8 | CAN
(iii) 204 ALGONQUIN RD UNIT114
(iv) BUREAU N2P1G1 106
(v) 29-549 RGE RD 232 STURGEON COUNTY AB T8L5E9 CAN
;
run;

A word boundary:
- can be preceded or ended by 0 characters (starting or ending characters within the variable)
- can be preceded or ended by 1 or more spaces
- a word containing a hyphen is to be considered two words

Thank you
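
The pattern \d{1}\b only matches a digit that is immediately followed by a word boundary, so 101A is missed: its last digit is followed by a letter. One possible adjustment, sketched here under the boundary rules listed in the post, is to match a whole token (a run of characters that are neither whitespace nor a hyphen) that contains at least one digit. The variable name `addr`, its length, and the two-row sample are assumptions for illustration.

```sas
data want;
  infile datalines truncover;
  input addr $char100.;
  retain pid;
  /* A token is a run of non-space, non-hyphen characters; require one digit in it */
  if _n_ = 1 then pid = prxparse('/[^\s-]*\d[^\s-]*/');
  n = 0;
  s = 1;
  e = length(addr);
  call prxnext(pid, s, e, addr, p, l);
  do while (p > 0);
    n + 1;                                /* one more token containing a digit */
    call prxnext(pid, s, e, addr, p, l);
  end;
  keep addr n;
  datalines;
(i) 1769 101A AVE | SURREY | BC | V4N5V8 | CAN
(ii) 1769 A101 AVE | SURREY | BC | V4N5V8 | CAN
;
run;
```

Under this rule both (i) and (ii) should count 1769, the unit token and V4N5V8, giving the same total for the two records.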

brulard · ‎04-04-2018

thank you for the tip! very helpful

brulard · ‎04-04-2018

hi, thank you for the tip, + explanation... as an aside, if there are any key books or articles you have to recommend, please let me know!

brulard · ‎04-04-2018

it is a number formatted 1.

brulard · ‎04-03-2018

Hi, I have a dataset containing IDs with different ranks, counts, and dates. I would appreciate a tip on how to meet the following: if a condition is true, then flag 1 for all rows with the same ID.

Example HAVE:

Id   Rank  Count  Date
001  01    01     2017
001  02    01     9999
002  01    03     9999
003  01    02     2018
003  02    02     9999
004  01    02     9999

Example WANT, new variable conditions:
midflag: where year 9999 and count<3 /*optional variable*/
endflag: where midflag=1 and ID (appears once or is duplicate from different row)

Id   Rank  Count  Date  MIDflag  Endflag
001  01    01     2017  0        1
001  02    01     9999  1        1
002  01    03     9999  0        0
003  01    02     2018  0        1
003  02    02     9999  1        1
004  01    02     9999  1        1

Thanks in advance
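
One way this could be approached, as a sketch only: compute MIDflag as a 0/1 Boolean expression and let PROC SQL remerge its group maximum back onto every row of the same Id. Reading Id as character (to keep the leading zeros) and the alias `h` are assumptions made here. SAS writes a note about remerging summary statistics when this runs, which is expected.

```sas
data have;
  input Id $ Rank Count Date;
  datalines;
001 01 01 2017
001 02 01 9999
002 01 03 9999
003 01 02 2018
003 02 02 9999
004 01 02 9999
;
run;

proc sql;
  create table want as
  select h.*,
         /* row-level condition: evaluates to 1 when true, 0 otherwise */
         (h.Date = 9999 and h.Count < 3)    as MIDflag,
         /* 1 for every row of an Id where any row meets the condition */
         max(h.Date = 9999 and h.Count < 3) as Endflag
  from have as h
  group by h.Id
  order by h.Id, h.Rank;
quit;
```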

brulard · ‎03-28-2018

hi Ksharp, thanks this works too

brulard · ‎03-27-2018

hello Mr Ksharp, a quick thank you for your tip! Also, you and others like R9, Astounding, etc.. are a real inspiration to study and practice SAS & coding!

brulard · ‎03-27-2018

Hi, can someone suggest code or a function to scan a string and return a numeric value as follows: treating one or more spaces as the delimiter, scan each word in the string, and if the word contains or is made up of numeric characters, flag 1. This is to scan strings related to addresses.

Examples for variable Address1:
(i) 145 SAINT-GEORGES #1107
(ii) 12 MIDDLEPORT CRESEN
(iii) 2040 ALGONQUIN RD UNIT114
(iv) BUREAU 106

Output:
(i) 2 [flags '145' as 1, and flags '#1107' as 1]
(ii) 1 [flags '12' as 1]
(iii) 2
(iv) 1

Thanks... I'm going to review Perl and see what I can get.
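
Apart from a Perl regular expression, the rule described above can also be sketched with plain DATA step functions: COUNTW and SCAN split the string on spaces, and ANYDIGIT tests each word for a digit. The variable length of 60 and the output names `want` and `n` are assumptions for illustration.

```sas
data want;
  infile datalines truncover;
  input Address1 $char60.;
  n = 0;
  /* Walk the space-delimited words and count those containing a digit */
  do i = 1 to countw(Address1, ' ');
    if anydigit(scan(Address1, i, ' ')) > 0 then n + 1;
  end;
  drop i;
  datalines;
145 SAINT-GEORGES #1107
12 MIDDLEPORT CRESEN
2040 ALGONQUIN RD UNIT114
BUREAU 106
;
run;
```

For the four example addresses this should reproduce the counts listed in the post (2, 1, 2, 1).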

brulard · ‎02-22-2018

Thanks. I appreciate your feedback and will open a ticket to our Hadoop administrators. (I did manage to produce my desired output but had to split my query into two timeframes.)

brulard · ‎02-21-2018

Thanks, I appreciate your tips. I will remove the ORDER BY. I am currently trying a colleague's suggestion to use GROUP BY instead of DISTINCT. As for creating a table, which would help, I think I first need to create what is referred to as a Kerberos ticket in my environment.

brulard · ‎02-21-2018

Hi, I'm fairly new to querying in Hadoop. When running a query using SQL pass-through, I get an out-of-memory error. Does someone know of an option I could use to bypass this, or have a suggestion other than breaking my query into different parts? There was no error when I limited the query time frame to 4 years (from date, to date). However, when broadening it to 7 years, I get the out-of-memory error. Below is the part of the code that scans through hundreds of millions of records; it is resource intensive, and I believe it causes the error.

LEFT JOIN (SELECT CRNT_BAL_AMT, eff_from_dt, eff_TO_dt, ACCT_ID, PRD_CD
           FROM TRANS_HIST
           WHERE eff_to_dt >= '2010-12-31' AND eff_to_dt < '9999-12-31') g
  ON A.ACCT_ID = g.ACCT_ID AND date_sub(A.txn_dt,1) = g.EFF_TO_DT
LEFT JOIN (SELECT CRNT_BAL_AMT, eff_from_dt, eff_TO_dt, ACCT_ID, PRD_CD
           FROM TRANS_HIST
           WHERE eff_to_dt >= '2011-01-01' AND eff_to_dt < '9999-12-31') h
  ON A.ACCT_ID = h.ACCT_ID AND date_sub(g.eff_from_dt,1) = h.EFF_TO_DT
ORDER BY a.ACCT_ID) ;
DISCONNECT FROM hadcon;
quit;

Error message:

ERROR: Prepare error: Error while processing statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Reducer 2, vertexId=vertex_1517179256891_507352_1_09, diagnostics=[Task failed, taskId=task_1517179256891_507352_1_09_000003, diagnostics=[TaskAttempt 0 failed, info=[Error: Failure while running task:java.lang.RuntimeException: java.lang.OutOfMemoryError: Java heap space at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:173)

Thanks in advance

Online Status	Offline
Date Last Visited	‎03-01-2021 10:19 PM

Re: Vertical align text and graphic in banner

Vertical align text and graphic in banner

Re: Macro to export each table in a library to an Excel file

Macro to export each table in a library to an Excel file

Re: guidance to produce donut with percent outer, count inner

guidance to produce donut with percent outer, count inner

Re: Concat by maintaining space or adding between strings

Re: Excluding rows when two variables have specific values on the same...

Re: ODS, EXCELXP, PROC REPORT, VJUST COLUMN

Re: how to pad character variable with leading zeroes?

Re: Get a list of dataset names in a directory/library

Re: Concatenating Date and Time

Re: Error running script in Hadoop (Apache)

best practice for daily query fetch date

Re: Hadoop connectivity issue

Re: Flag random data based on criteria

Re: concat multiple num var into 1 char

concat multiple num var into 1 char

Best practice or short cut on multiple SQL CASE statements

Re: Help with PERL string count for address var

Help with PERL string count for address var

Re: Add flag=1 when condition true and ID in same group

Add flag=1 when condition true and ID in same group

Re: Flag words containing or made of numeric

Flag words containing or made of numeric

Re: Hadoop out of memory error

Hadoop out of memory error