About Patrick

Patrick · ‎02-11-2025

After a brief Google search: https://bayer-group.github.io/sas2r/external-data.html https://github.com/phuse-org/TestDataFactory/tree/main/Updated/TDF_SDTM

Patrick · ‎02-10-2025

I copy/pasted your question into Copilot just for fun, and it made me laugh! I don't have the skills to judge whether the Copilot response is correct. Below is the answer I got.

I understand your frustration with the documentation. Let's break it down step-by-step to create an example.

Run PROC MIXED and Output Base Densities: First, you need to run PROC MIXED and output the "Base Densities" table to a data set. This will give you the format for the input data set you need for the PRIOR statement.

proc mixed data=your_dataset;
  class factor1 factor2;
  model response = factor1 factor2 / solution;
  random intercept / subject=subject;
  ods output BaseDensities=base_densities;
run;

Create the Input Data Set: The base_densities data set will contain the necessary columns (Type, Parm1, Parm2, etc.). You can use this data set directly or modify it if needed.

Run PROC MIXED with PRIOR Statement: Now, use the base_densities data set in the PRIOR statement.

proc mixed data=your_dataset;
  class factor1 factor2;
  model response = factor1 factor2 / solution;
  random intercept / subject=subject;
  prior (intercept) / data=base_densities;
run;

This should give you a concrete example of how to use the PRIOR statement with PROC MIXED. Does this help clarify things?

Patrick · ‎02-10-2025

Querying SAS Metadata quickly gets involved. What helped me a lot in the past to figure out where things are is Metabrowse; see "Two methods for editing SAS metadata".

Having said that: With both SAS DIS and SAS Metadata no longer existing under SAS Viya, I'd think twice about how much effort I'd still invest in reporting on SAS metadata.

Patrick · ‎02-10-2025

Data in CAS won't be a csv. The csv can be the physical leg of the in-memory CAS table.

What I was getting at was more this: In Viya there is SAS Compute and CAS. SAS Compute is "traditional SAS". CAS is in-memory SAS with a physical source defined to save data (so it can, for example, be reloaded when the CAS server needs a restart that wipes out memory). You don't need to load data into CAS to write it out as a csv file. If the data isn't already in CAS then I'd use SAS Compute.

SAS Studio is a client that connects to a SAS server. Your SAS Viya server (that's where things actually happen) is likely a cloud-based environment. "SAS Desktop" likely refers to a SAS 9.4 version that's locally installed on the users' PCs.

For you, the first thing to solve before any coding is how you can deliver a csv file (or any file) from your environment to a location your customer can read from. Is there any location your SAS environment (server side, where SAS executes) can access that's also accessible by your customer (and if it's a locally installed SAS, then a path directly accessible from their PC)? If there is no such shared location then you will need to look into another delivery mechanism like writing to SharePoint, sending an email with an attachment, ...or whatever else works, is technically available and meets the requirements.

Once you know how/where to deliver the csv (the data exchange mechanism), the rest is very solvable.

Btw: If this is a one-off then you can just download the .csv to your local environment via SAS Studio.
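To illustrate the point about SAS Compute: once a shared location is known, writing the csv could look roughly like the sketch below. The path /shared/exchange is purely hypothetical and sashelp.class only stands in for your real table.

```sas
/* Assumption: /shared/exchange is a placeholder for a directory that both */
/* the SAS Compute server and the customer can reach                       */
%let outpath=/shared/exchange;

/* write the table as csv directly from SAS Compute, no CAS load needed */
proc export data=sashelp.class
  outfile="&outpath./class.csv"
  dbms=csv
  replace;
run;
```

If the data only exists in CAS, the same step should work against a libref that points to the caslib, but that is something to verify in your environment.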

Patrick · ‎02-10-2025

We both have SAS environments although I'm using SAS Studio Viya LTS and they're using "SAS desktop" (i'm honestly not sure what they're meaning though with that term).

SAS Studio is a client with which you can access a SAS server environment. SAS desktop could mean that they are running a locally installed SAS (="server" on a PC), which would be a totally different environment from what you've got. The first thing you need to figure out is a location (path) that's accessible both to you (your SAS server) and your customer.

From what you describe, you're creating the file successfully under /public/, which appears to be the physical leg of your caslib public. /public/ is not the actual physical path, which is why you need to access the file via the filesrvc filename engine. Again based on your description, the actual filename works and the only surprising thing is that %sysfunc(fexist(jobout)) returns 0 and not 1. If all of the above is true then it's possible that fexist() isn't working as expected with the filesrvc filename engine. If no one else comes up with a better explanation then I'd contact SAS Tech Support.

...but of course, to solve your actual problem you need to figure out where to deliver the data so it's accessible to your customer.

And given that you are very new to SAS: Is your source data actually loaded into CAS (memory), or is that just something you're doing to create the .csv the way you know how?
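A small test along the lines below might help to narrow down whether the problem is fexist() or the fileref itself. It is only a sketch: the folder path and file name are placeholders, and the %if in open code assumes a reasonably recent SAS release (which Viya is).

```sas
/* Assumption: folder path and file name are placeholders */
filename jobout filesrvc folderpath='/Public' filename='extract.csv';

/* write a tiny test file through the fileref */
data _null_;
  file jobout;
  put 'id,value';
  put '1,42';
run;

/* check 1: fexist() against the fileref */
%put NOTE: fexist returns %sysfunc(fexist(jobout));

/* check 2: try to open the file; 0 means it could not be opened */
%let fid=%sysfunc(fopen(jobout));
%put NOTE: fopen returns &fid;
%if &fid > 0 %then %do;
  %let rc=%sysfunc(fclose(&fid));
%end;

filename jobout clear;
```

If fopen() succeeds while fexist() still returns 0, that would be a concrete finding to pass on to SAS Tech Support.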

Patrick · ‎02-09-2025

@Tom Sure, that will work as well, but I can't see the harm in an additional simple data _null_ step that won't iterate through the data.

Patrick · ‎02-09-2025

Something like below should do. I've modified your RegEx, adding the word boundary metacharacter \b so your 2nd regex does not match Maple Street.

data have;
  input street $80.;
  datalines;
Bldg A 153 First Street
6789 64th Ave
4 Moritz Road
7493 Wilkes Place
711 Maple Street
;
run;

data patterns;
  input regex :$100.;
  datalines;
m/\d+\s[a-z]+\s[a-z]+/i
m/\b(Pl|place)\b/i
m/\b(rd|road)\b/i
m/\b(ave|avenue)\b/i
;
run;

/* store the number of patterns in a macro variable without reading any rows */
data _null_;
  call symputx('n_patterns',nobs);
  stop;
  set patterns nobs=nobs;
run;

data want;
  set have;
  if _n_=1 then
    do;
      array expr_id {&n_patterns} _temporary_;
      do i=1 by 1 until(last);
        set patterns end=last;
        expr_id[i]=prxparse(strip(regex));
      end;
      /* create variable match with same length as variable street */
      if 0 then match=street;
      length matchtype $8;
    end;
  do i=1 to dim(expr_id);
    call prxsubstr(expr_id[i], street, position, length);
    if position> 0 then
      do;
        match=substr(street, position, length);
        matchtype=cats('pattern', i);
        output;
      end;
  end;
  drop regex i;
run;

proc print data=want;
run;

Patrick · ‎02-09-2025

Don't use macro language if not necessary. It only makes debugging harder.

data have;
  input street $80.;
  datalines;
Bldg A 153 First Street
6789 64th Ave
4 Moritz Road
7493 Wilkes Place
;
run;

data want;
  set have;
  if _n_=1 then
    do;
      pattern1="m/\d+\s[a-z]+\s[a-z]+/i";
      pattern2="m/Pl|place/i";
      pattern3="m/rd|road/i";
      pattern4="m/ave|avenue/i";
      array patterns{4} pattern1 - pattern4;
      array expr_id {4} _temporary_;
      do i=1 to dim(patterns);
        expr_id[i]=prxparse(patterns[i]);
      end;
      length matchtype $8;
      /* create variable match with same length as variable street */
      if 0 then match=street;
    end;
  do i=1 to dim(patterns);
    call prxsubstr(expr_id[i], street, position, length);
    if position> 0 then
      do;
        match=substr(street, position, length);
        matchtype=cats('pattern', i);
        output;
      end;
  end;
  drop Pattern: i;
run;

proc print data=want;
run;

Or even shorter:

data want;
  set have;
  if _n_=1 then
    do;
      array expr_id {4} _temporary_;
      expr_id[1]=prxparse("m/\d+\s[a-z]+\s[a-z]+/i");
      expr_id[2]=prxparse("m/Pl|place/i");
      expr_id[3]=prxparse("m/rd|road/i");
      expr_id[4]=prxparse("m/ave|avenue/i");
      /* create variable match with same length as variable street */
      if 0 then match=street;
      length matchtype $8;
    end;
  do i=1 to dim(expr_id);
    call prxsubstr(expr_id[i], street, position, length);
    if position> 0 then
      do;
        match=substr(street, position, length);
        matchtype=cats('pattern', i);
        output;
      end;
  end;
  drop i;
run;

Patrick · ‎02-07-2025

@JKHess I initially made an edit but then reverted it. The private message you sent:

Hi Patrick, I just tried to respond to your most recent response to my post but it looks like it's been closed. I separated the files by year, re-ran the deciles, etc, then ran the code you provided. It ran fine with the smaller samples (10k records), but when I used the entire file, i got an error "Array subscript out of range at line 417 column 7", which is referring to this line in the code:

if a_itemcum[midpt-1] >= ran_val then hbound=midpt-1;

Any idea what might be causing this error? thank you..

The error was due to a wrong assignment of the upper array boundary. It's corrected in the code below. Instead of hbound=a_itemcum_n_elements; it was hbound=a_itemcum[a_itemcum_n_elements];

The binary search algorithm itself is based on the excellent paper Array Lookup Techniques by @hashman.

/*************** create sample data ************************/
/* file 1 */
data dec;
  input ID Date :mmddyy10. Decile;
  format Date mmddyy10.;
  datalines;
1 1/1/2017 1
22 1/1/2017 1
41 1/1/2017 1
56 1/1/2017 2
79 1/1/2017 2
85 1/1/2017 2
100 1/2/2017 1
118 1/2/2017 1
125 1/2/2017 2
167 1/2/2017 2
178 1/2/2017 3
;
run;

/* file 2 - not really a bridge because relationship bridge:no_dec is many:many */
data bridge;
  input Date :mmddyy10. Decile Zipcode $5.;
  format Date mmddyy10.;
  datalines;
1/1/2017 1 88123
1/1/2017 1 03867
1/1/2017 1 04001
1/1/2017 2 03304
1/1/2017 2 98765
1/1/2017 2 96224
1/1/2017 2 00001
1/2/2017 1 98801
1/2/2017 2 88123
1/2/2017 2 12345
1/2/2017 2 83356
1/2/2017 2 98765
1/2/2017 3 03304
1/2/2017 3 04945
;
run;

/* file 3 */
data no_dec;
  input ID Zipcode $5.;
  datalines;
2 88123
21 88123
22 88123
23 88123
24 88123
3 12345
4 03304
5 03867
6 04945
7 04001
8 98765
9 98801
10 96224
11 00001
12 83356
13 83356
;
run;

/************ data prep *************************************/
/* assign a random value to each entry in no_dec (file 3) and output as table no_dec_ranno sorted by zipcode and random value */
data _null_;
  dcl hash h1(ordered:'y', multidata:'y');
  h1.defineKey('Zipcode','ran_no');
  h1.defineData('id','zipcode','ran_no');
  h1.defineDone();
  call streaminit(10);
  do until(_last);
    set no_dec end=_last;
    ran_no=rand('uniform');
    _rc=h1.add();
  end;
  _rc=h1.output(dataset:'no_dec_ranno');
  stop;
run;

/************ draw control *************************************/
data control(keep=id_dec id zipcode date Decile select_cnt)
     control_insufficient_data(keep=id_dec id zipcode date Decile select_cnt)
     ;
  length id_dec 8;
  if _n_=1 then
    do;
      call streaminit(10);

      /* define hash to collect number of rows (items) per zipcode */
      /* - used for weighted random selection of zipcode from which to draw control from */
      n_items=0;
      dcl hash h_nodec_nperzip();
      h_nodec_nperzip.defineKey('zipcode');
      h_nodec_nperzip.defineData('n_items');
      h_nodec_nperzip.defineDone();

      /* load no_dec_ranno into hash replacing the random values by a sequence number (by zipcode) */
      /* - the order is still random but the sequence number instead of a random value will allow to address specific items later on */
      /* - memory consumption of this hash is around 88bytes * number of items plus some overhead. For 34.6M rows close to 3GB */
      dcl hash h_nodec(ordered:'y');
      h_nodec.defineKey('Zipcode','seq_no');
      h_nodec.defineData('id','zipcode','seq_no');
      h_nodec.defineDone();
      do until(_last);
        set no_dec_ranno(drop=ran_no) end=_last;
        by Zipcode;
        /* populate hash h_nodec */
        if first.zipcode then seq_no=1;
        else seq_no+1;
        _rc=h_nodec.add();
        /* populate hash h_nodec_nperzip */
        n_items=sum(n_items,1);
        if last.zipcode then
          do;
            _rc=h_nodec_nperzip.add();
            n_items=0;
          end;
      end;

      /* load the bridge data into a hash */
      dcl hash h_brdg(dataset:'bridge', multidata:'y', ordered:'y');
      h_brdg.defineKey('date','decile');
      h_brdg.defineData('zipcode');
      h_brdg.defineDone();

      /* hash to store per zipcode the last sequence number used to populate the table with control data */
      /* - to ensure a record gets only drawn once */
      dcl hash h_last_ranno();
      h_last_ranno.defineKey('zipcode');
      h_last_ranno.defineData('seq_no');
      h_last_ranno.defineDone();

      /* arrays to store zipcode and cumulative sum of number of items under a zipcode */
      array a_zipcode{50000} $5 _temporary_;
      array a_itemcum{0:50000} 8 _temporary_;
      a_itemcum[0]=0;
    end;

  call missing(of _all_);
  set dec(rename=(id=id_dec));

  /*** draw two controls for case ***/
  /** 1. select zipcode from bridge for lookup of rows in no_dec (file 3) */
  /* load all zipcodes from the bridge that match with the current row from dec (file 1) into array a_zipcode */
  _rc=h_brdg.reset_dup();
  do _i=1 by 1 while(h_brdg.do_over() = 0);
    _rc=h_nodec_nperzip.find()=0;
    a_zipcode[_i]=zipcode;
    a_itemcum[_i]=sum(a_itemcum[_i-1],n_items);
  end;
  a_itemcum_n_elements=sum(_i,-1);

  select_cnt=0;
  do _i=1 to 99 until(select_cnt=2); /* if sufficient data to draw control from, loop will only iterate twice */

    /** random selection of one of the matching zipcodes from array a_zipcode, weighted by number of items per zipcode **/
    /* create random integer in the range of 1 to the total number of items under the matching zipcodes */
    ran_val=rand('integer',1,a_itemcum[a_itemcum_n_elements]);
    /* binary search through array a_itemcum to find the element that stores the higher boundary */
    /* - when found use the index of this element to derive the zipcode from which to draw control record */
    lbound=1;
    hbound=a_itemcum_n_elements;
    do while(lbound <= hbound);
      midpt=floor(sum(lbound,hbound)/2);
      if a_itemcum[midpt-1] >= ran_val then hbound=midpt-1;
      else if a_itemcum[midpt] < ran_val then lbound=midpt+1;
      else /* if a_itemcum[midpt-1] < ran_val <= a_itemcum[midpt] then */
        do;
          zipcode=a_zipcode[midpt];
          leave;
        end;
    end;

    /** 2. draw control from population under selected zip code **/
    /* for the chosen zipcode derive the row with the lowest seq_no that hasn't been drawn previously */
    if h_last_ranno.find() ne 0 then
      do;
        seq_no=1;
        _rc=h_last_ranno.add();
      end;
    /* draw control record */
    if h_nodec.find()=0 then
      do;
        /* count how many control records selected for the current record from dec */
        select_cnt=sum(select_cnt,1);
        output control;
        /* remove selected record from hash as we won't select it again */
        _rc=h_nodec.remove();
        /* increase seq_no by 1 for this zipcode as prep of selection of another row for the table with controls */
        seq_no=sum(seq_no,1);
        _rc=h_last_ranno.replace();
      end;
  end;
  if select_cnt<2 then output control_insufficient_data;
run;

/* title 'control'; */
/* proc print data=control; */
/* run; */
/* title 'Decedent with insufficient matching data to create control'; */
/* proc sql; */
/*   select * */
/*   from control_insufficient_data; */
/* quit; */
/* title; */

Going forward I suggest you create a new question once you've accepted a response as the solution. Just mention and link the previous discussion in the new follow-up question. Not only will this help to not "overload" discussions, it will also increase the likelihood of more people looking into your new question.

Regarding the logic used to draw the control, just some more thoughts for your consideration, if relevant at all.

I would assume that compared to your control population (file 3) your deceased population (file 1) has a higher average age and a higher percentage of members living under an urban postcode, not least because of better availability of medical facilities. You could consider also adding age-group information to your file1 and file3 to further segment the population from which to draw the control. And for rural/urban: If the distributions between file1 and file3 differ significantly then you might also want to add this info to your data to further subset the control population to draw from.

....and then there is of course the change of zipcodes. I would imagine that a change to work with actual date ranges that doesn't impact performance too much could be hard; a change to yearly snapshots of data would be rather simple.

@JKHess Update: I further tested the binary search logic (makes my brain hurt!). I believe I've got it right now.
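For anyone who wants to test the weighted selection in isolation, below is a minimal stand-alone sketch of the same binary search over cumulative counts. The three zipcodes and their item counts (5, 1 and 2) are made up for illustration, and rand('integer',...) assumes a SAS release that supports this distribution.

```sas
data _null_;
  /* hypothetical zipcodes with 5, 1 and 2 items; a_cum holds the running totals */
  array a_zip{3} $5 _temporary_ ('88123' '12345' '83356');
  array a_cum{0:3} 8 _temporary_ (0 5 6 8);
  call streaminit(10);
  do draw=1 to 5;
    /* pick one item number out of all items */
    ran_val=rand('integer',1,a_cum[3]);
    /* find the array element whose range a_cum[i-1] < ran_val <= a_cum[i] contains it */
    lbound=1;
    hbound=3; /* number of array elements, not the cumulative total */
    do while(lbound <= hbound);
      midpt=floor(sum(lbound,hbound)/2);
      if a_cum[midpt-1] >= ran_val then hbound=midpt-1;
      else if a_cum[midpt] < ran_val then lbound=midpt+1;
      else
        do;
          zipcode=a_zip[midpt];
          leave;
        end;
    end;
    put draw= ran_val= zipcode=;
  end;
run;
```

Because every item number is equally likely, a zipcode with five items gets picked five times as often as a zipcode with one item, which is the weighting asked for.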

Patrick · ‎02-07-2025

@JKHess wrote: ...I agree this is a problem. How can I weight by the number of records for each zipcode?

Below is the amended code that should now choose ids from any of the zipcodes in scope with the same probability. The hard bit was to come up with a change for a weighted random selection of a zipcode where performance doesn't degrade too much. I couldn't avoid implementing some additional looping. Let me know the runtime (and if it's still manageable).

And once again: Please review the code and validate the result.

/*************** create sample data ************************/
/* file 1 */
data dec;
  input ID Date :mmddyy10. Decile;
  format Date mmddyy10.;
  datalines;
1 1/1/2017 1
22 1/1/2017 1
41 1/1/2017 1
56 1/1/2017 2
79 1/1/2017 2
85 1/1/2017 2
100 1/2/2017 1
118 1/2/2017 1
125 1/2/2017 2
167 1/2/2017 2
178 1/2/2017 3
;
run;

/* file 2 - not really a bridge because relationship bridge:no_dec is many:many */
data bridge;
  input Date :mmddyy10. Decile Zipcode $5.;
  format Date mmddyy10.;
  datalines;
1/1/2017 1 88123
1/1/2017 1 03867
1/1/2017 1 04001
1/1/2017 2 03304
1/1/2017 2 98765
1/1/2017 2 96224
1/1/2017 2 00001
1/2/2017 1 98801
1/2/2017 2 88123
1/2/2017 2 12345
1/2/2017 2 83356
1/2/2017 2 98765
1/2/2017 3 03304
1/2/2017 3 04945
;
run;

/* file 3 */
data no_dec;
  input ID Zipcode $5.;
  datalines;
2 88123
21 88123
22 88123
23 88123
24 88123
3 12345
4 03304
5 03867
6 04945
7 04001
8 98765
9 98801
10 96224
11 00001
12 83356
13 83356
;
run;

/************ data prep *************************************/
/* assign a random value to each entry in no_dec (file 3) and output as table no_dec_ranno sorted by zipcode and random value */
data _null_;
  dcl hash h1(ordered:'y', multidata:'y');
  h1.defineKey('Zipcode','ran_no');
  h1.defineData('id','zipcode','ran_no');
  h1.defineDone();
  call streaminit(10);
  do until(_last);
    set no_dec end=_last;
    ran_no=rand('uniform');
    _rc=h1.add();
  end;
  _rc=h1.output(dataset:'no_dec_ranno');
  stop;
run;

/************ draw control *************************************/
data control(keep=id_dec id zipcode date Decile select_cnt)
     control_insufficient_data(keep=id_dec id zipcode date Decile select_cnt)
     ;
  length id_dec 8;
  if _n_=1 then
    do;
      call streaminit(10);

      /* define hash to collect number of rows (items) per zipcode */
      /* - used for weighted random selection of zipcode from which to draw control from */
      n_items=0;
      dcl hash h_nodec_nperzip();
      h_nodec_nperzip.defineKey('zipcode');
      h_nodec_nperzip.defineData('n_items');
      h_nodec_nperzip.defineDone();

      /* load no_dec_ranno into hash replacing the random values by a sequence number (by zipcode) */
      /* - the order is still random but the sequence number instead of a random value will allow to address specific items later on */
      /* - memory consumption of this hash is around 88bytes * number of items plus some overhead. For 34.6M rows close to 3GB */
      dcl hash h_nodec(ordered:'y');
      h_nodec.defineKey('Zipcode','seq_no');
      h_nodec.defineData('id','zipcode','seq_no');
      h_nodec.defineDone();
      do until(_last);
        set no_dec_ranno(drop=ran_no) end=_last;
        by Zipcode;
        /* populate hash h_nodec */
        if first.zipcode then seq_no=1;
        else seq_no+1;
        _rc=h_nodec.add();
        /* populate hash h_nodec_nperzip */
        n_items=sum(n_items,1);
        if last.zipcode then
          do;
            _rc=h_nodec_nperzip.add();
            n_items=0;
          end;
      end;

      /* load the bridge data into a hash */
      dcl hash h_brdg(dataset:'bridge', multidata:'y', ordered:'y');
      h_brdg.defineKey('date','decile');
      h_brdg.defineData('zipcode');
      h_brdg.defineDone();

      /* hash to store per zipcode the last sequence number used to populate the table with control data */
      /* - to ensure a record gets only drawn once */
      dcl hash h_last_ranno();
      h_last_ranno.defineKey('zipcode');
      h_last_ranno.defineData('seq_no');
      h_last_ranno.defineDone();

      /* arrays to store zipcode and cumulative sum of number of items under a zipcode */
      array a_zipcode{10000} $5 _temporary_;
      array a_itemcum{0:10000} 8 _temporary_;
      a_itemcum[0]=0;
    end;

  call missing(of _all_);
  set dec(rename=(id=id_dec));

  /*** draw two controls for case ***/
  /** 1. select zipcode from bridge for lookup of rows in no_dec (file 3) */
  /* load all zipcodes from the bridge that match with the current row from dec (file 1) into array a_zipcode */
  _rc=h_brdg.reset_dup();
  do _i=1 by 1 while(h_brdg.do_over() = 0);
    _rc=h_nodec_nperzip.find()=0;
    a_zipcode[_i]=zipcode;
    a_itemcum[_i]=sum(a_itemcum[_i-1],n_items);
  end;
  a_itemcum_n_elements=sum(_i,-1);

  select_cnt=0;
  do _i=1 to 99 until(select_cnt=2); /* if sufficient data to draw control from, loop will only iterate twice */

    /** random selection of one of the matching zipcodes from array a_zipcode, weighted by number of items per zipcode **/
    /* create random integer in the range of 1 to the total number of items under the matching zipcodes */
    ran_val=rand('integer',1,a_itemcum[a_itemcum_n_elements]);
    /* binary search through array a_itemcum to find the element that stores the higher boundary */
    /* - when found use the index of this element to derive the zipcode from which to draw control record */
    lbound=1;
    /* upper search boundary is the number of populated array elements, not the cumulative item total */
    hbound=a_itemcum_n_elements;
    do while(lbound <= hbound);
      midpt=floor(sum(lbound,hbound)/2);
      if a_itemcum[midpt-1] >= ran_val then hbound=midpt-1;
      else if a_itemcum[midpt] < ran_val then lbound=midpt+1;
      else if a_itemcum[midpt-1] < ran_val <= a_itemcum[midpt] then
        do;
          zipcode=a_zipcode[midpt];
          leave;
        end;
    end;

    /** 2. draw control from population under selected zip code **/
    /* for the chosen zipcode derive the row with the lowest seq_no that hasn't been drawn previously */
    if h_last_ranno.find() ne 0 then
      do;
        seq_no=1;
        _rc=h_last_ranno.add();
      end;
    /* draw control record */
    if h_nodec.find()=0 then
      do;
        /* count how many control records selected for the current record from dec */
        select_cnt=sum(select_cnt,1);
        output control;
        /* remove selected record from hash as we won't select it again */
        _rc=h_nodec.remove();
        /* increase seq_no by 1 for this zipcode as prep of selection of another row for the table with controls */
        seq_no=sum(seq_no,1);
        _rc=h_last_ranno.replace();
      end;
  end;
  if select_cnt<2 then output control_insufficient_data;
run;

/* title 'control'; */
/* proc print data=control; */
/* run; */
/* title 'Decedent with insufficient matching data to create control'; */
/* proc sql; */
/*   select * */
/*   from control_insufficient_data; */
/* quit; */
/* title; */

I do have another design idea for how to approach your problem which would likely perform quite a bit better, but I would need to know more about your data and environment (to not waste memory) to decide if it's feasible. Let's see how the above code performs before we go there.

About the address changes:

The dataset had zipcode assigned to each beneficiary annually. About 5% changed zipcode from one calendar year to the next

If you've got annual snapshots then why not run the process for each year separately using the matching yearly snapshot? That shouldn't take much effort but should return better data, which must be in your interest.

Patrick · ‎02-07-2025

Does below return what you're after?

data have;
  input ID :$20. Admission :date09. Discharge :date09. Age_class Age_end Index Value Total Age_class1 Age_class2 Age_class3 Age_class4 Age_class5;
  format Admission date9. Discharge date9.;
  cards;
0001 01JUL2014 16AUG2014 1 4 1 2.3 11.9 2.3 1.4 5 3.2 .
0001 13MAY2018 22JUN2018 3 4 0 1.4 . . . . . .
0001 23JAN2019 25JAN2019 4 4 0 3.2 . . . . . .
0002 13MAY2016 22SEP2016 1 5 1 2 7.9 2 0.3 0.2 5 0.4
0002 09JUL2023 10JUL2023 2 5 0 0.3 . . . . . .
0002 12SEP2024 15SEP2024 3 5 0 0.2 . . . . . .
0003 01JUL2014 18AUG2014 1 3 1 12 17.3 12 0.3 5 . .
0003 07DEC2023 16DEC2023 2 3 0 0.3 . . . . . .
0004 12JAN2014 15JAN2014 1 2 1 2 2.1 2 0.1 . . .
0004 30MAY2019 13JUL2019 2 2 0 0.1 . . . . . .
0005 30JUN2019 13OCT2019 5 5 1 4.1 4.1 . . . . 4.1
0006 30JUN2019 13OCT2019 5 5 1 0 0 . . . . .
;
run;

data want(drop=_:);
  if _n_=1 then
    do;
      _val=0;
      dcl hash h1(dataset:'have(rename=(value=_val))', ordered:'y');
      h1.defineKey('id','age_class');
      h1.defineData('_val');
      h1.defineDone();
    end;
  set have;
  array age_class_derived{5} 8;
  if index=1 then
    do;
      /* distribute existing values */
      do _i=1 to dim(age_class_derived);
        if h1.find(key:id,key:_i)=0 then age_class_derived[_i]=_val;
      end;
      /* fill-up with max 5 until total reached */
      _total=round(sum(of age_class_derived[*]),.0000001);
      do _i=1 to dim(age_class_derived) while( _total<total );
        if missing(age_class_derived[_i]) then
          do;
            age_class_derived[_i]=min(5,abs(sum(total,-_total)));
          end;
        _total=round(sum(of age_class_derived[*]),.0000001);
      end;
    end;
run;

proc print data=want;
run;

I've made the assumption that the marked cells in your sample data were typos.

Patrick · ‎02-05-2025

@JKHess wrote: ...I agree this is a problem. How can I weight by the number of records for each zipcode?

I believe I have an approach for implementing this change with acceptable impact on performance. I'll give it a go once I have time.

Patrick · ‎02-05-2025

The likely cause is that your SAS session runs single-byte with an encoding like wlatin1 (Windows-1252) or latin1 (ISO-8859-1), neither of which has a character for ≥. The following will show you under which encoding you're running SAS.

proc options GROUP=LANGUAGECONTROL;
run;

And this wiki page lets you check if you can map a multibyte character to your current character set. If my assumption is correct then the only way around this is to run your SAS session in multi-byte mode, preferably using UTF-8.
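If the full LANGUAGECONTROL group is more output than you need, a shorter check of just the session encoding could look like this:

```sas
/* write the session encoding to the log, e.g. wlatin1 or utf-8 */
%put &=sysencoding;

/* show only the ENCODING system option */
proc options option=encoding;
run;
```

The encoding can't be changed in a running session; it has to be set when SAS starts (for example via the ENCODING option in the configuration file or on the command line), so how to do that depends on how your SAS is installed and launched.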

Patrick · ‎02-05-2025

@Visiting wrote: It is close to get it work, thank you! But I got an error: "The RLANG system option must be specified in the SAS configuration file or on the SAS invocation command line to enable the submission of R language statements". How can I solve this issue?

I suggest you read the info under the links provided. For example under the one already shared earlier it clearly states:
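As a quick check, the step below shows whether the option is currently active in your session. It is only a sketch for verifying the setting; as the error message says, enabling it has to happen in the SAS configuration file or on the invocation command line, not in code.

```sas
/* prints RLANG if R submission is enabled, NORLANG if not */
proc options option=rlang;
run;
```

If the log shows NORLANG, whoever administers your SAS installation needs to add the option at startup.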

Patrick · ‎02-05-2025

The connect statement in your 2nd Proc SQL is missing connection=global and therefore creates a new session on Teradata where the volatile tables don't exist.

Either have all your SQLs under a single Proc SQL with a single connection, or make sure you use exactly the same connection string, including connection=global, in all of the SQLs, or (my preference) define the connection once via a Libname statement (with connection=global) and then use the syntax connect using <libref>.
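The libname variant could look roughly like the sketch below. Server name, credentials and the table definition are placeholders, and dbmstemp=yes plus the explicit commit are assumptions that fit a typical ANSI-mode setup; adapt them to your site.

```sas
/* Assumption: server, user and password are placeholders */
libname tdlib teradata server="tdserver" user=myuser password="XXXXXXXX"
        connection=global dbmstemp=yes;

/* first step: create and load the volatile table */
proc sql;
  connect using tdlib;
  execute (
    create volatile table vt_demo (id integer)
    on commit preserve rows
  ) by tdlib;
  execute (commit work) by tdlib;
  disconnect from tdlib;
quit;

/* second step: same global connection, so the volatile table is still there */
proc sql;
  connect using tdlib;
  select * from connection to tdlib
    (select * from vt_demo);
  disconnect from tdlib;
quit;
```

As long as the libref stays assigned, both steps share the one Teradata session and the volatile table remains visible.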

Re: publically available Sample SDTM sas datasets?

Re: How to specify input data set for PRIOR statement in PROC MIXED

Re: SAS Metadata: list of all attributes you can use in METADATA_GETAT...

Re: How to return a physical .csv file from sas studio as a job?

Re: How to return a physical .csv file from sas studio as a job?

Re: Extract text using Prxsubstr

Re: Extract text using Prxsubstr

Re: Extract text using Prxsubstr

Re: How do I randomly select controls from within a large administrati...

Re: How do I randomly select controls from within a large administrati...

Re: Transform a dataset from long to wide format and fill values

Re: How do I randomly select controls from within a large administrati...

Re: XLSX cell containing a "hovered" >= symbol

Re: Is it possible to directly import a RData file into SAS?

Re: SAS--TERA

CoDe SAS German