About Wolverine

Wolverine · ‎03-06-2023

I'm using the following code to save the log after running my program. The problem is that, if the log file already exists from a previous run, a dialog box pops up and asked if I want to Replace, Append, or Cancel. I want it to replace by default. This code runs without error, but the 'replace' command doesn't seem to do anything. dm log 'file "P:\logs\ReportPrint..log"' replace;run; Also, I know that I can use proc printto to save the log, but I prefer the dm statement so I can watch the log update in SAS as it runs.

Wolverine · ‎02-20-2023

@Tom wrote:Remember that the format attached to a variable just determines how the value is DISPLAYED, not what it stored. Excellent point -- that is the source of my confusion! The variables are displayed as 02/20/2022, but that is not how SAS is actually storing them.

Wolverine · ‎02-20-2023

This should be simple, but I can't get it to work 😡 Here are the variable formats from proc contents: birth_infantdob Num 8 MMDDYY. last_reg_dob Num 8 MMDDYY10. I've tried several variations of this approach, but I keep getting errors data want; set have; where ('01/01/2022'<=birth_infantdob<='09/30/2022' or '01/01/2022'<=reg_infantdob<='09/30/2022'); run; This give me ERROR: WHERE clause operator requires compatible variables. I also tried removing the single quotes from around the dates, but then that says it's an "obvious false WHERE clause".

Wolverine · ‎12-09-2022

My biggest concern with this approach is that the coverage variable is not human-readable -- I can't easily tell which month/year is covered and which are not. If there was some issue where everything was shifted by a month, I probably wouldn't be able to see it.

Wolverine · ‎12-09-2022

@mkeintz wrote: And what do you want your output dataset to look like? The final file should have a single record for each member_ID and a flag variable (1=yes, 0=no) for continuous coverage around the timeframe of the index event.

Wolverine · ‎12-07-2022

I've updated the data file so that it look like this: memberid monthid service_date service_date_d 1234 201811 20190215 21595 1234 201812 20190215 21595 1234 201901 20190215 21595 1234 201902 20190215 21595 1234 201903 20190215 21595 1234 201904 20190215 21595 5678 202110 20220122 22665 5678 202111 20220122 22665 5678 202201 20220122 22665 5678 202202 20220122 22665 5678 202203 20220122 22665

Wolverine · ‎12-05-2022

Sorry, I should have clarified. Right now the index date is in another file, and I plan to link them by memberid. There's a character version called service_date in the format 20191015 (Oct 15 2019), as well as a numeric version called service_date_d that contains the number of days since Jan 1 1960 (22,568 represents Oct 15 2019). The monthid is character. The months of coverage are independent of each other, so a case may be enrolled in 201910, skip 201911, and be re-enrolled in 201912.

Wolverine · ‎12-05-2022

I have insurance claims data, and I've identified cases that have had a particular medical event. Now I'm trying to make sure that these cases were continuously enrolled 3 months prior to and 2 months after that event. In the following example data, let's say the index date for 1234 was Feb 15 2019. That case would qualify. But if the index date for 5678 was Jan 20 2022, that case would not qualify. memberid monthid 1234 201811 1234 201812 1234 201901 1234 201902 1234 201903 1234 201904 5678 202110 5678 202111 5678 202201 5678 202202 5678 202203 I searched and found some potentially useful code, but this is set up to find cases that have not had a gap in the last 12 months. So how do I modify this to work with the index date and to search 3 months back and 2 months after? data want; set have; by memberID monthID; if first.memberID then counter=0; if dif(monthID)>1 and mod(monthID,100) ne 1 then counter=0; if mod(monthID,100) eq 1 and dif(monthID) ne 89 then counter=0; counter+1; if counter ge 12 then output; run;

Wolverine · ‎10-25-2022

@Quentin wrote: I like the idea of coding up the same algorithm using multiple approaches to compare efficiency. That said, I think you should work to get all the approaches to match in their output. This should not happen. If SAS runs out of memory, you should get an error in the log. You definitely should not get the wrong result (with no error). if you really have a case where SQL is giving you the wrong result, I would send it in to tech support. Same for your statement that hash approach had some discrepancies. I think it's likely that there are some edge cases in your data that are falling through some cracks in your code. But if you have a repeatable example of discrepancy (especially one where the results of the code vary with the amount of memory available to the SAS session), please send it in to tech support. Also confused by your statement that the output dataset from the hash approach was double the size of other approaches. If the output datasets from each approach are identical (e.g. judged via PROC COMPARE to compare the metadata and data), this shouldn't happen. Unless maybe you changed compression options. Before this project, I had virtually no experience w/ arrays or hash objects. I don't know why the flag counts were slightly different. The only clue I had that RAM could be an issue was that Firefox crashed and presented a dialog box indicating that the crash was due to insufficient memory. When I manually reviewed the discrepant cases, the codes that matched for those cases had been successfully matched on many other cases. And after increasing the memory, the discrepancies disappeared. It wasn't just the output filesize that was different among the various approaches, it was also the number of records they contained. So data compression wouldn't explain the differences. Perhaps Proc SQL does not continue to search for matches on a given flag after it has already matched that flag, whereas the hash version DOES continue to search? In that case, there could be duplicate records for a given case that match on the same flag, and that could explain the differences in filesize. I could run Proc SQL w/ select distinct on the hash output file to see if it finds and eliminates any duplicate records. I will review all of this in an attempt to make sure I haven't made any errors. If I can't find any, I'll submit it to tech support. However, I'm getting busy with other projects, so I don't have as much time to dedicate to this right now. So it may take a while😐

Wolverine · ‎10-24-2022

Here is the report I wrote based on the various approaches discussed in this thread. The intended audience is the other programmers I work with. Benchmarking Comparison of SAS Procedures Various SAS procedures were tested to determine the best method to match the diagnosis and procedure codes in patient data files to lists of codes to be flagged for various medical conditions of interest. There were 3 main approaches: Proc SQL merge, data step arrays, and data step hash. There were also some minor variants of these approaches that were tested as well. The input patient data file was about 78.5GB. The output file varies by approach. Proc SQL merge (see Appendix A for syntax) The lists of codes to be flagged were imported from Excel tab-by-tab, and a flag variable was added corresponding to each code. The imported lists were then combined into a single SAS file. This file was matched on all 27 proc variables and 26 DX variables in the data file using JOIN. Variants were tested with various Where statements (for example, requiring that the proc/DX variables in the data file must not be blank, in an effort to avoid the time needed to match blanks to the flag file). The Where statements had little to no effect. The resulting output file was approximately 133GB. IMPORTANT NOTE: During testing, some slight discrepancies in flag frequencies (< 0.5%) were discovered between the Proc SQL and Array versions. In each occurrence, the Proc SQL version failed to flag a small number of cases that were flagged in the array version. Manual review confirmed these cases should have been flagged. These discrepancies were apparently due to SAS running out of memory during the merge, even though less than half of the computer’s RAM was in use. SAS did not provide any error messages or even warnings about this in the log. By default, SAS only uses 2GB of RAM. Increasing the RAM corrected the frequency issue, but did not have a notable impact on processing speed. See Appendix D for discussion of this issue and how to correct it. Here are typical results: real time 28:08:32.61 cpu time 7:24:39.17 Data step Array (see Appendix B for syntax) The lists of codes to be flagged are imported into list variables. Each tab has 2 list variables, one for proc and one for DX codes. An array is then used to compare all proc variables in the data file to the proc list variables, and a similar array is used for DX variables. The resulting output file is approximately 159GB. real time 13:07:03.42 cpu time 12:53:12.39 A variant was tested that included a command to delete records for which all flags = 0. This variant saved over 2 hours of processing time. The resulting output file was approximately 142GB. real time 10:47:45.35 cpu time 10:43:20.82 Data step Hash (see Appendix C for syntax) The lists of codes to be flagged are imported from Excel tab-by-tab, and a flag variable is added corresponding to each code. The imported lists are then combined into a single SAS file. A hash is created for proc codes and another is created for DX codes. An array then uses the proc hash to compare all proc variables in the data file to the proc list variables, and a similar array is used for DX variables. The resulting output file was approximately 320GB. real time 4:06:06.53 cpu time 39:04.64 NOTE: There were some discrepancies in the frequency output (hash vs array), similar to (though less severe than) the discrepancies in the original Proc SQL output. Again, this seems to be related to SAS running out of memory. However, in this case the memory issues probably occurred during the step to find the max of each flag, rather than during the merge itself. DISCUSSION The clear winner in terms of real processing time was the hash approach. However, there are some drawbacks to this approach as well. The output file size was more than twice as large as the other two approaches. This not only requires more hard drive space, but it also results in longer processing times for subsequent steps. For example, the next step in the testing SAS program was a Proc SQL designed to find the max of each flag variable grouped by patient ID. This step took much longer than it did with the other approaches – so much so that it not only negated the increased processing speed for the flag merge, it actually took longer overall than the Proc SQL approach! On the other hand, this may not be an issue in other programs with less-complicated downstream steps. While the array approach was nearly three times faster than the Proc SQL approach in real time, consideration must be given to CPU time as well. In this metric, Proc SQL was more than 30% faster. The Proc SQL approach requires extensive reading and writing of data, and the very long overall processing time is due to the relatively slow I/O capabilities of our virtual machines. At my previous position, I used a computer where the I/O speeds were in the range of 8-10 times higher. On a machine with similar transfer speeds, the Proc SQL version would have been faster by more than three hours. In summary, there is no “one size fits all” answer. Each approach has advantages and disadvantages. Familiarity with the data, an understanding of the capabilities of the computer being used, and knowledge of SAS procedures and their inner workings are the keys to choosing the best approach for large processing tasks. APPENDIX A – Syntax for Proc SQL approach /*Section 3.2 -- Merge proc codes and DX codes from hyst_prepost_ICD_CPT with flags from HystProlapseCode_comb_mx, among index cases.*/ PROC SQL; Create table temp.hyst_prepost_ICD_CPT_flags_CPU as Select distinct a.member_id, a.episode_id, a.claim_id, a.dt_svc_from, a.dt_svc_end, a.svc_from_dt, a.svc_end_dt, a.svc_from_dt_ICD, a.icd_dx1_prof, a.comp_icd_dx1, a.comp_icd_dx2, a.comp_icd_dx3, a.comp_icd_dx4, a.comp_icd_dx5, a.comp_icd_dx6, a.comp_icd_dx7, a.comp_icd_dx8, a.comp_icd_dx9, a.comp_icd_dx10, a.comp_icd_dx11, a.comp_icd_dx12, a.comp_icd_dx13, a.comp_icd_dx14, a.comp_icd_dx15, a.comp_icd_dx16, a.comp_icd_dx17, a.comp_icd_dx18, a.comp_icd_dx19, a.comp_icd_dx20, a.comp_icd_dx21, a.comp_icd_dx22, a.comp_icd_dx23, a.comp_icd_dx24, a.comp_icd_dx25, a.cpt, a.cpt_prof, a.icd_pr1, a.icd_pr2, a.icd_pr3, a.icd_pr4, a.icd_pr5, a.icd_pr6, a.icd_pr7, a.icd_pr8, a.icd_pr9, a.icd_pr10, a.icd_pr11, a.icd_pr12, a.icd_pr13, a.icd_pr14, a.icd_pr15, a.icd_pr16, a.icd_pr17, a.icd_pr18, a.icd_pr19, a.icd_pr20, a.icd_pr21, a.icd_pr22, a.icd_pr23, a.icd_pr24, a.icd_pr25, b.HEDIS_proc_code, b.HEDIS_DX_code, b.Code_Description, b.mFLAG_cpt_hyst_ab, b.mFLAG_cpt_hyst_all, b.mFLAG_Dxs_Prolapse, [FLAG VARIABLE LIST TRUNCATED] /*These variables are needed later*/ a.year, a.flg_age_0_17, a.member_birth_dt, a.val_age, a.zip_cd, a.flg_cond_hysterectomy, a.hyst_ep_dt_beg, a.healthcare_vis_dt, a.healthcare_vis_days, a.drg, a.clm_type, a.e_clm_type, a.payer, a.provider_npi, a.provider_splty, a.provider_type_cd, a.facility_npi From temp.hyst_prepost_ICD_CPT a JOIN temp.HystProlapseCode_comb_mx b On a.cpt = b.HEDIS_proc_code OR a.cpt_prof = b.HEDIS_proc_code OR a.icd_pr1 = b.HEDIS_proc_code OR a.icd_pr2 = b.HEDIS_proc_code OR a.icd_pr3 = b.HEDIS_proc_code OR a.icd_pr4 = b.HEDIS_proc_code OR a.icd_pr5 = b.HEDIS_proc_code OR a.icd_pr6 = b.HEDIS_proc_code OR a.icd_pr7 = b.HEDIS_proc_code OR a.icd_pr8 = b.HEDIS_proc_code OR a.icd_pr9 = b.HEDIS_proc_code OR a.icd_pr10 = b.HEDIS_proc_code OR a.icd_pr11 = b.HEDIS_proc_code OR a.icd_pr12 = b.HEDIS_proc_code OR a.icd_pr13 = b.HEDIS_proc_code OR a.icd_pr14 = b.HEDIS_proc_code OR a.icd_pr15 = b.HEDIS_proc_code OR a.icd_pr16 = b.HEDIS_proc_code OR a.icd_pr17 = b.HEDIS_proc_code OR a.icd_pr18 = b.HEDIS_proc_code OR a.icd_pr19 = b.HEDIS_proc_code OR a.icd_pr20 = b.HEDIS_proc_code OR a.icd_pr21 = b.HEDIS_proc_code OR a.icd_pr22 = b.HEDIS_proc_code OR a.icd_pr23 = b.HEDIS_proc_code OR a.icd_pr24 = b.HEDIS_proc_code OR a.icd_pr25 = b.HEDIS_proc_code OR a.icd_dx1_prof = b.HEDIS_DX_code OR a.comp_icd_DX1 = b.HEDIS_DX_code OR a.comp_icd_DX2 = b.HEDIS_DX_code OR a.comp_icd_DX3 = b.HEDIS_DX_code OR a.comp_icd_DX4 = b.HEDIS_DX_code OR a.comp_icd_DX5 = b.HEDIS_DX_code OR a.comp_icd_DX6 = b.HEDIS_DX_code OR a.comp_icd_DX7 = b.HEDIS_DX_code OR a.comp_icd_DX8 = b.HEDIS_DX_code OR a.comp_icd_DX9 = b.HEDIS_DX_code OR a.comp_icd_DX10 = b.HEDIS_DX_code OR a.comp_icd_DX11 = b.HEDIS_DX_code OR a.comp_icd_DX12 = b.HEDIS_DX_code OR a.comp_icd_DX13 = b.HEDIS_DX_code OR a.comp_icd_DX14 = b.HEDIS_DX_code OR a.comp_icd_DX15 = b.HEDIS_DX_code OR a.comp_icd_DX16 = b.HEDIS_DX_code OR a.comp_icd_DX17 = b.HEDIS_DX_code OR a.comp_icd_DX18 = b.HEDIS_DX_code OR a.comp_icd_DX19 = b.HEDIS_DX_code OR a.comp_icd_DX20 = b.HEDIS_DX_code OR a.comp_icd_DX21 = b.HEDIS_DX_code OR a.comp_icd_DX22 = b.HEDIS_DX_code OR a.comp_icd_DX23 = b.HEDIS_DX_code OR a.comp_icd_DX24 = b.HEDIS_DX_code OR a.comp_icd_DX25 = b.HEDIS_DX_code ; QUIT; APPENDIX B – Syntax for Array approach /*Section 3.2 -- Create the flag file using array method*/ DATA temp.hyst_prepost_ICD_CPT_flags_TEST; SET temp.hyst_prepost_ICD_CPT; /*Set up an array to recode missing values to 0*/ Array _arr(*) FLAG_cpt_hyst_ab FLAG_cpt_hyst_all [FLAG VARIABLE LIST TRUNCATED] FLAG_Dxs_Prolapse; Do i=1 to dim(_arr); If _arr(i)=. then _arr(i)=0; End; Drop i; Array dx_codes $ ICD_DX1 ICD_DX1_PROF COMP_ICD_DX1-COMP_ICD_DX25; Array proc_codes $ CPT CPT_PROF ICD_PR1-ICD_PR25; do index=1 to dim(dx_codes ); if dx_codes[index] in: (&cpt_hyst_ab_DX.) THEN FLAG_cpt_hyst_ab=1; if dx_codes[index] in: (&cpt_hyst_all_DX.) THEN FLAG_cpt_hyst_all=1; [FLAG VARIABLE LIST TRUNCATED] if dx_codes[index] in: (&Dxs_Prolapse_DX.) THEN FLAG_Dxs_Prolapse=1; END; do index=1 to dim(proc_codes ); if proc_codes[index] in: (&cpt_hyst_ab_pr.) THEN FLAG_cpt_hyst_ab=1; if proc_codes[index] in: (&cpt_hyst_all_pr.) THEN FLAG_cpt_hyst_all=1; [FLAG VARIABLE LIST TRUNCATED] if proc_codes[index] in: (&Dxs_Prolapse_pr.) THEN FLAG_Dxs_Prolapse=1; END; /*Delete records that don't have any flags*/ IF (FLAG_cpt_hyst_ab ^=1 AND FLAG_cpt_hyst_all ^=1 AND FLAG_cpt_hyst_lap_all ^=1 AND [FLAG VARIABLE LIST TRUNCATED] FLAG_Dxs_Prolapse ^=1) THEN delete; APPENDIX C – Syntax for Hash approach /*Section 3.2 -- Create the flag file using hash method*/ data temp.hyst_prepost_ICD_CPT_flags_HASH(drop=_:); LENGTH HEDIS_proc_code HEDIS_DX_code $ 8; /*Testing to see if this fixes error, seems to work*/ if _n_=1 then do; if 0 then set temp.HystProlapseCode_comb_mx; dcl hash h_proc (dataset:"temp.HystProlapseCode_comb_mx"); h_proc.defineKey('HEDIS_proc_code'); h_proc.defineData(all:'y'); h_proc.defineDone(); dcl hash h_dx(dataset:"temp.HystProlapseCode_comb_mx"); h_dx.defineKey('HEDIS_DX_code'); h_dx.defineData(all:'y'); h_dx.defineDone(); end; call missing(of _all_); set temp.hyst_prepost_ICD_CPT; Array dx_codes $ ICD_DX1 ICD_DX1_PROF COMP_ICD_DX1-COMP_ICD_DX25; Array proc_codes $ CPT CPT_PROF ICD_PR1-ICD_PR25; array _a_proc $ CPT CPT_PROF ICD_PR1-ICD_PR25; do _i=1 to dim(_a_proc); if h_proc.find(key:_a_proc[_i])=0 then do; output; /*return;--The return statement writes a row to output as soon as it finds a matching flag and then stops further checks. But since there can be multiple matches per record, it should be commented out*/ end; end; array _a_dx $ ICD_DX1 ICD_DX1_PROF COMP_ICD_DX1-COMP_ICD_DX25; do _i=1 to dim(_a_dx); if h_dx.find(key:_a_dx[_i])=0 then do; output; /*return;*/ end; end; /*Set up an array to recode missing values to 0*/ Array _arr(*) mFLAG_cpt_hyst_ab mFLAG_cpt_hyst_all mFLAG_cpt_hyst_lap_all [FLAG VARIABLE LIST TRUNCATED] mFLAG_Dxs_Prolapse; Do i=1 to dim(_arr); If _arr(i)=. then _arr(i)=0; End; Drop i; RUN; APPENDIX D – Setting memory usage for SAS By default, SAS only uses 2GB of RAM. Most computers produced in the last five years have at least 8GB of RAM, and most high-end machines have 32-64GB. Our virtual machines have 16GB. Taking advantage of this additional RAM may improve SAS’ performance. In most situations, increasing RAM usage is accomplished by editing the sasv9.cfg file. However, we do not editing permission for this file on the VM. As work-around, use the following steps: Click the magnifying glass icon in the bottom left of the screen (next to the Start menu icon). Type “run” in the search box, and then right-click on the “Run” app. Select Open file location. In the resulting folder, right-click on the Run icon and copy it to your desktop. Double-click on the Run icon you just created on your desktop. Type “sas.exe -memsize 16g” in the dialog box and click OK. SAS will open, and will have access to additional RAM. Use this desktop shortcut each time you open SAS. However, some RAM is reserved for Windows and other programs, so SAS will not be able to access all 16GB. Use the following syntax in SAS to see how much RAM is available: /*This shows available RAM*/ data _null_; mem = input(getoption('xmrlmem'),20.2)/10e6; format mem 20.2; put "You have " mem "GB memory available"; run;

Wolverine · ‎09-27-2022

@Reeza wrote: You only have 2GB of RAM assigned to SAS, which is possibly an issue in your speed/response. The VM has 16GB of RAM, and SAS is really the only software I run on it. To increase the RAM available to SAS, I tried updating the sasv9.cfg file. but I can't save my changes because I don't have admin rights on the VM. I was able to go the Run dialog in SAS and enter "SAS.exe -memsize 12g". Then I ran /*This shows available RAM*/ data _null_; mem = input(getoption('xmrlmem'),20.2)/10e6; format mem 20.2; put "You have " mem "GB memory available"; run; This shows I have ~10GB (not sure why there's a discrepancy between the 12g I entered the 10g available, but I suspect it has to do with using 1000KB per MB rather than 1024). Is there a way I can make the change via syntax? It would be easier to have a line or 2 at the beginning of every program than it would be to open SAS via the Run dialog every time. I'm rerunning the Proc SQL merge just to test how much faster it is with the increased RAM.

Wolverine · ‎09-26-2022

I'm starting to look at the hash approach, and I have a couple of questions for @mkeintz and @Patrick : Now, in a context more familiar to me: If the temp.preport_FLAGS dataset has no duplicate HEDIS_proc_code values or duplicate HEDIS_dx_code values, then this data step could be a lot faster. Not sure what this means. Under my original Proc SQL approach, the HEDIS variables are in the lookup table (HystProlapseCode_comb_mx) not in hyst_prepost_ICD_CPT. Under the array approach, the HEDIS variables are turned into macro variables. It is possible for a record in hyst_prepost_ICD_CPT to have the same code in multiple DX/proc variables, because that file includes both hospital and doctors' billing records. There are no identical variable names in the two datasets. That's because this code would allow the preport_flags variable values to overwrite the same name variables found in the prepost_ICD_CPT In which 2 datasets? hyst_prepost_ICD_CPT versus HystProlapseCode_comb_mx? I don't think they have any of the same variable names, but I will confirm that before running. The code above writes a row to output as soon as it finds a matching flag and then stops further checks (return statement). If for a single row there can be multiple flags that match and you want to write out the row multiple times then you would have to remove the RETURN; statements. As I mentioned above, it is possible for records in hyst_prepost_ICD_CPT to match on multiple flags. Furthermore, there are some duplicate codes in the HEDIS lists. For example, FLAG_proc_bladder_all contains the combination of all codes included in FLAG_proc_bladder_open and FLAG_proc_bladder_perc. I know could write a simple IF-THEN to flag _all whenever either _open and _perc are flagged, but there are quite a few of these types of combinations and I'd prefer not to have to track them all down.

Wolverine · ‎09-26-2022

@Reeza wrote: proc setinit;run; proc options option=cpucount; proc options option=memsize; run; Out of curiousity what is the output from the following code above - output will be in the log I believe. Here is the log: NOTE: SAS (r) Proprietary Software 9.4 (TS1M6) Licensed to [redacted] - T&R - SFA, Site 70080722. NOTE: This session is executing on the X64_SRV16 platform. NOTE: Analytical products: SAS/STAT 15.1 SAS/ETS 15.1 SAS/OR 15.1 SAS/IML 15.1 SAS/QC 15.1 NOTE: Additional host information: X64_SRV16 WIN 10.0.14393 Server NOTE: SAS initialization used: real time 5.53 seconds cpu time 3.54 seconds 1 proc setinit;run; NOTE: PROCEDURE SETINIT used (Total process time): real time 0.06 seconds cpu time 0.07 seconds Original site validation data Current version: 9.04.01M6P111518 Site name: '[redacted] - T&R - SFA'. Site number: 70080722. CPU A: Model name='' model number='' serial=''. Expiration: 30JUN2023. Grace Period: 45 days (ending 14AUG2023). Warning Period: 45 days (ending 28SEP2023). System birthday: 06JUN2022. Operating System: WX64_SV . Product expiration dates: ---Base SAS Software 30JUN2023 (CPU A) ---SAS/STAT 30JUN2023 (CPU A) ---SAS/GRAPH 30JUN2023 (CPU A) ---SAS/ETS 30JUN2023 (CPU A) ---SAS/FSP 30JUN2023 (CPU A) ---SAS/OR 30JUN2023 (CPU A) ---SAS/AF 30JUN2023 (CPU A) ---SAS/IML 30JUN2023 (CPU A) ---SAS/QC 30JUN2023 (CPU A) ---SAS/SHARE 30JUN2023 (CPU A) ---SAS/ASSIST 30JUN2023 (CPU A) ---SAS/CONNECT 30JUN2023 (CPU A) ---SAS/EIS 30JUN2023 (CPU A) ---SAS/SHARE*NET 30JUN2023 (CPU A) ---MDDB Server common products 30JUN2023 (CPU A) ---SAS Integration Technologies 30JUN2023 (CPU A) ---SAS/Secure 168-bit 30JUN2023 (CPU A) ---SAS/Secure Windows 30JUN2023 (CPU A) ---SAS Enterprise Guide 30JUN2023 (CPU A) ---SAS Bridge for ESRI 30JUN2023 (CPU A) ---OR OPT 30JUN2023 (CPU A) ---OR PRS 30JUN2023 (CPU A) ---OR IVS 30JUN2023 (CPU A) ---OR LSO 30JUN2023 (CPU A) ---SAS/ACCESS Interface to DB2 30JUN2023 (CPU A) ---SAS/ACCESS Interface to Oracle 30JUN2023 (CPU A) ---SAS/ACCESS Interface to SAP ASE 30JUN2023 (CPU A) ---SAS/ACCESS Interface to PC Files 30JUN2023 (CPU A) ---SAS/ACCESS Interface to ODBC 30JUN2023 (CPU A) ---SAS/ACCESS Interface to OLE DB 30JUN2023 (CPU A) ---SAS/ACCESS Interface to R/3 30JUN2023 (CPU A) ---SAS/ACCESS Interface to Teradata 30JUN2023 (CPU A) ---SAS/ACCESS Interface to Microsoft SQL Server 30JUN2023 (CPU A) ---SAS/ACCESS Interface to MySQL 30JUN2023 (CPU A) ---SAS/IML Studio 30JUN2023 (CPU A) ---SAS Workspace Server for Local Access 30JUN2023 (CPU A) ---SAS Workspace Server for Enterprise Access 30JUN2023 (CPU A) ---SAS/ACCESS Interface to Netezza 30JUN2023 (CPU A) ---SAS/ACCESS Interface to Aster nCluster 30JUN2023 (CPU A) ---SAS/ACCESS Interface to Greenplum 30JUN2023 (CPU A) ---SAS/ACCESS Interface to SAP IQ 30JUN2023 (CPU A) ---SAS/ACCESS to Hadoop 30JUN2023 (CPU A) ---SAS/ACCESS to Vertica 30JUN2023 (CPU A) ---SAS/ACCESS to Postgres 30JUN2023 (CPU A) ---SAS/ACCESS to Impala 30JUN2023 (CPU A) ---SAS/ACCESS to Salesforce 30JUN2023 (CPU A) ---SAS/ACCESS to HAWQ 30JUN2023 (CPU A) ---SAS/ACCESS to Amazon Redshift 30JUN2023 (CPU A) ---High Performance Suite 30JUN2023 (CPU A) ---SAS/ACCESS to SAP HANA 30JUN2023 (CPU A) ---SAS/ACCESS Interface to the PI System 30JUN2023 (CPU A) ---SAS/ACCESS to JDBC 30JUN2023 (CPU A) ---prod 1312 30JUN2023 (CPU A) 2 proc options option=cpucount; SAS (r) Proprietary Software Release 9.4 TS1M6 CPUCOUNT=4 Specifies the number of processors that thread-enabled applications should assume are available for concurrent processing. NOTE: PROCEDURE OPTIONS used (Total process time): real time 0.03 seconds cpu time 0.01 seconds 3 proc options option=memsize; 4 run; SAS (r) Proprietary Software Release 9.4 TS1M6 MEMSIZE=2147483648 Specifies the limit on the amount of virtual memory that can be used during a SAS session. NOTE: PROCEDURE OPTIONS used (Total process time): real time 0.03 seconds cpu time 0.03 seconds

Wolverine · ‎09-24-2022

@Wolverine wrote: I suspect that SAS ran out of RAM during the big SQL merge and dropped some matches. Of course it would have been nice if there had been an error or warning msg about that... I forgot to mention that while SAS did not give me an error msg, Firefox crashed and there was a Windows dialog saying that it closed Firefox due to lack of RAM. On the other hand, Task Manager never showed RAM usage going above 50%, at least not while I was watching it.

Wolverine · ‎09-23-2022

@SASKiwi wrote: How about comparing the inputs for the cases where there are discrepancies and deciding which method is correct? If the new method is correct and the old one isn't then no problem. If the new method is incorrect for some cases then post these examples here if you can't fix it yourself. That's more complicated than it sounds due to the number of cases flagged, the number of records for discrepant cases, and the number of variables that could contain potential matches. But I did find an example and it does indeed have the correct proc code for that particular flag. It is present in the input data file (hyst_prepost_ICD_CPT) and in the flag table (HystProlapseCode_comb_mx), and it is correctly entered (ie, no blanks or extra characters) in each file. And the "On" statement in the Proc SQL includes the correct variables. I couldn't see any reason why they weren't matched, so I began to wonder about some kind of computer issue. I ran the Proc SQL with a version of the data file with only this case, and it was flagged! That suggests to me that there is nothing wrong with the syntax or even the data, but rather I suspect that SAS ran out of RAM during the big SQL merge and dropped some matches. Of course it would have been nice if there had been an error or warning msg about that... Also, if you know that certain codes are notably more likely than others, then list them at the start of their respective arrays. And put the more commonly matched array first. You could get fancier, but I won't go into it. That's good to know. But luckily the order I used in the array statements is also the order in which they are most likely to be matched.

Online Status	Offline
Date Last Visited	Friday

Re: Macro's don't "reset" after pressing Cancel button

Re: Macro's don't "reset" after pressing Cancel button

Re: Macro's don't "reset" after pressing Cancel button

Macro's don't "reset" after pressing Cancel button

How to check if a macro variable has a specified value

Re: Concatenating means and standard deviations with trailing zeros wh...

Concatenating means and standard deviations with trailing zeros when n...

Re: PROC SQL match missing about 3% of cases even though they are in b...

Re: PROC SQL match missing about 3% of cases even though they are in b...

Re: PROC SQL match missing about 3% of cases even though they are in b...

Re: How to check if a macro variable has a specified value

Re: Do loop iterations based on the value of a variable, with 0-paddin...

Re: Using TableN macro, having trouble with "COLBY" column by option

Re: Searching through multiple variables with array not giving same re...

Re: Continuous enrollment across years following a birth event pt2

Re: PROC SQL match missing about 3% of cases even though they are in b...

Re: Logic problem with IF-THEN?

Why doesn't this RETAIN statement work??

Re: Using dm statement to save log, how to replace old log automatical...

Re: Basic data quality check -- how to determine percent missing for k...

Using dm statement to save log, how to replace old log automatically w...

Re: Restricting to a range of dates in MM/DD/YYY format

Restricting to a range of dates in MM/DD/YYY format

Re: Continuous enrollment 3 months prior to and 2 months after index m...

Re: Continuous enrollment 3 months prior to and 2 months after index m...

Re: Continuous enrollment 3 months prior to and 2 months after index m...

Re: Continuous enrollment 3 months prior to and 2 months after index m...

Continuous enrollment 3 months prior to and 2 months after index medic...

Re: SAS generating a large file, but barely using any computer resourc...

Re: SAS generating a large file, but barely using any computer resourc...

Re: SAS generating a large file, but barely using any computer resourc...

Re: SAS generating a large file, but barely using any computer resourc...

Re: SAS generating a large file, but barely using any computer resourc...

Re: SAS generating a large file, but barely using any computer resourc...

Re: SAS generating a large file, but barely using any computer resourc...

SAS Inner Circle Panel

SAS Analytics Explorers