About Patrick

Patrick · ‎04-08-2024

Can't think of anything "shorthand". Below should work. data new; set old; array vars{*} IR00 - IR40; do i=1 to dim(vars); if vars[i]>0 then min_ir=min(min_ir,vars[i]); end; MAX_IR = max(of IR00 - IR40); output; keep ID IR00 IR10 IR20 IR30 IR40 MAX_IR min_ir; run;

Patrick · ‎04-08-2024

The in= option lets you capture which source table contributes data to the target table. data raw3; merge raw1(in=in1) raw2(in=in2); by patient_local_id; if in1=0 or in2=0 then do; in_raw1=in1; in_raw2=in2; output; end; run;

Patrick · ‎04-08-2024

@Mazi If I understand the problem then account A and C could have no common attributes but are still linked via account B that shares attributes with both A and C ...which a single SQL self-join couldn't detect.

Patrick · ‎04-08-2024

Is there a reason that it must be SQL and can't be a data step? Is a single query just getting a single account as input and you then want all the related accounts OR is this actually about creating networks for all of your data? How many rows do you have in your actual data? I believe what you really will have to do is something along the line of How to find all connected components in a graph

Patrick · ‎04-08-2024

And as another update: When running the same script against source data with the same number of rows and variables but less duplicates then the elapsed run-time for deduplication.deduplicate remains the same but the run-time for simple.groupBy deteriorates significantly.

Patrick · ‎04-07-2024

Based on your picture something like below should work (not tested). data example2; set example; by subject term start; lag_cflg=lag(cflg); if first.term then r_newseq=cflg; else if cflg=1 and missing(lag(cflg)) then r_newseq+1; if cflg=1 then newseq=r_newseq; drop lag_cflg r_newseq; run;

Patrick · ‎04-07-2024

For implicit pass-through: Which functions SAS can convert to database syntax is fully documented per access engine (like for SAS Access to Oracle). Look it up. For explicit pass-through: Because SAS sends the syntax as-is to the database you need to use database functions in your code. The SAS functions are obviously not available.

Patrick · ‎04-07-2024

@sbxkoenk In my environment with a recent Viya 4 version and 4 worker nodes the parquet file gets created in chunks (=multiple files) all stored under a folder with the name of the parquet file that had been provided as value to parameter casout. I couldn't find a way to only create a single file using Proc Casutil. I do believe that chunks are required for full support of parallelism. I could create a single parquet file via client side (compute) processing using a data step. %let sessref=MySess; %if %sysfunc(sessfound(&sessref)) %then %do; cas mySess terminate; %end; cas &sessref cassessopts=(caslib="casuser" /*metrics=True*/); libname casuser cas; options fullstimer msglevel=i ps=max; data casuser.class; set sashelp.class; run; libname comp_pq parquet "&_userhome"; data comp_pq.class_datastep; set casuser.class; run; caslib cas_pq path="&_userhome" datasource=(srctype="path"); proc casutil; save casdata="class" incaslib="casuser" casout="class_casutil.parquet" replace; quit; /* cas mySess terminate; */

Patrick · ‎04-06-2024

@SAS0005 I still recommend that you register with Prams and get the proper data for analysis. But just for fun and to improve my Python skills a bit... When downloading and looking into these prams Excel reports found here I realised that for each table a named ranges had been defined (Table1, Table2, etc.) and that all tables in all sheets have the same structure. In the following an approach how to read such Excel data into a SAS table. I've run the Python script separately but there are also various methods how to run a Python script directly out of a SAS session (available methods depend on SAS version) or alternatively you can also call SAS out of Python. Step 1: Using Python read each table (range) into a data frame and then export to a .csv import pandas as pd from openpyxl import load_workbook from openpyxl.utils.cell import range_boundaries import re workbook_path = r"C:\temp\prams_test\\" workbook_name = "PRAMS-MCH-Indicators-2020-508.xlsx" outfile_root = r"C:\temp\prams_test\source_csv\\" wb = load_workbook(workbook_path + workbook_name) def replace_non_alphanumeric(string, replacement=''): return re.sub(r'[^0-9a-zA-Z_]+', replacement, string.strip()) def extract_tables(ws, sheet_name): dfs_tmp = {} for name, table_range in ws.tables.items(): # Get position of data table defined by named range min_col, min_row, max_col, max_row = range_boundaries(table_range) # read value of cell relative to top left corner of named range reference_cell = ws[table_range.split(':')[0]] table_name = reference_cell.offset(row=-1).value table_name = table_name.replace(',',' ') # Convert table to DataFrame table = ws.iter_rows(min_row, max_row, min_col, max_col, values_only=True) header = next(table) df = pd.DataFrame(table, columns=header) # add columns workbook, sheet_name and table_name to data frame df.insert(0, "workbook_name", workbook_name) df.insert(1, "sheet_name", sheet_name) df.insert(2, "table_name", table_name) # write data frame to .csv without header row and index column df.iloc[1:].to_csv(outfile_root + replace_non_alphanumeric(workbook_name.split('.')[0]).lower() + "_" + replace_non_alphanumeric(sheet_name).lower() + "_" + replace_non_alphanumeric(table_name).lower() + ".csv" , index=False, header=False) dfs_tmp[name] = df return dfs_tmp # Dictionary to store all the dfs in. # Format: {table_name1: df, table_name2: df, ...} dfs = {} for ws in wb.worksheets: dfs.update(extract_tables(ws, ws.title)) Above creates per table a separate .csv with a naming convention: <workbook name>_<sheet name>_<table name (derived from the table title)>.csv Step 2: Read the .csv's into SAS You can use a wildcard in the csv name to for example read all the tables belonging to a workbook or to only read the tables belonging to a single worksheet. And you can of course also already sub-set your data as part of reading it. data want; infile "C:\temp\prams_test\source_csv\pramsmchindicators2020508_breastfeedingpractices_*.csv" truncover dsd dlm=","; attrib workbook informat=$200. label='Name of Excel Workbook' sheet informat=$31. label='Name of Excel Sheet' table_name informat=$200. label='Name of table in Excel Sheet' site_name informat=$100. label='State' denominator informat=best32. label='N (Denominator) - Unweighted Sample Size' numerator informat=best32. label='N (Numerator) - Unweighted Frequency' weight informat=best32. label='Weighted %' lower95 informat=best32. label='Lower 95% - Confidence Interval' upper95 informat=best32. label='Upper 95% - Confidence Interval' ; input workbook sheet table_name site_name denominator numerator weight lower95 upper95 ; if site_name in ('Alabama','Florida'); run; proc print data=want; run; Btw: The value for table name is sourced from the title above the table And because this cell was not part of the named range for the table I needed below code to derive it # read value of cell relative to top left corner of named range reference_cell = ws[table_range.split(':')[0]] table_name = reference_cell.offset(row=-1).value table_name = table_name.replace(',',' ')

Patrick · ‎04-06-2024

MUCH more detail please! Which SAS and VA version are you working with? Are you after unstructured text or do you want to extract tables in a pdf? Is the pdf textual or is it picture? Do you have SAS Visual Text Analytics licensed? Are you only using VA or do you also know how to write code? Is XCMD enabled in your environment? .... Also show us a screenshot of your pdf so we can better understand what you've got. And last but not least: Do first some searching on your own as this will help you to narrow down the bits where you still need support. If you search a bit you will find info like here.

Patrick · ‎04-06-2024

We can help you best if you share your code with the data not as a screenshot but as text that we then can copy/paste as a starting point. Use the running man icon to post such code and data. At least one of the issues with your current code: You define a length for variable location which instructs SAS how to create the variable BUT it doesn't instruct SAS how to read source data. You need to define/use an INFORMAT for this. In your input statement you're using INPUT ... location $ With this syntax SAS will only read the first 8 characters. Try and use INPUT ... location :$20. instead.

Patrick · ‎04-06-2024

@soujik wrote: Thank you, but I have all sas codes ( around 20 sas programs) in below directory. i would like to get all table names from that all sas programs. /cloud/pvxxx-123/mktprf/89009321/code/ sample sas program name /cloud/pvxxx-123/mktprf/89009321/code/ext_xpo_map.sas @soujik Best you can get is to actually execute the code and use Proc Scaproc to get some logging information that's then easy to parse for the information you're after. Parsing actual SAS code to get this information is not only cumbersome but it will potentially also not return what you're after because there are many ways where table names get only determined during run time and they might differ between runs. Consider for example code as below: data demo; set mylib.monthly_tables_:; run; Which tables you actually pick will depend on what's stored under mylib at the time your code executes.

Patrick · ‎04-05-2024

@vorkady wrote: Hi Requirement is to connect SAS VIYA 4.0 hosted on cloud to connect on premises data sources , no direct connectivity allowed for security reasons, but connection NEED TO comply with network layer 7 termination. Can I assume SAS Cloud Data Exchange is similar to Power BI gateway which comply with layer 7 termination. Given this is an official SAS product I'd say you can assume that it will comply with the usual security requirements. https://documentation.sas.com/doc/en/pgmsascdc/v_049/dataagentag/p0h72wof7so3rhn0zci6j3o1168h.htm To get further security details confirmed is something your architecture needs to ask SAS directly as part of contract negotiation and/or prep for install & config of the agent.

Patrick · ‎04-05-2024

Are you looking for any row where the comparison date is from the same or previous month, or should the two dates only be max one month apart? Below code should give you an idea of the options. data base2; format "Date de réception"n ddmmyy10.; "Date de réception"n='20mar2024'd; output; "Date de réception"n='20may2024'd; output; "Date de réception"n='12jan2023'd; output; run; run; data base_finale; set base2; /* if intck('month',"Date de réception"n,"01APR2024"d) in (0,1); */ interval_1=intck('month',"Date de réception"n,"01APR2024"d); interval_2=intck('month',"Date de réception"n,"01APR2024"d,'c'); run; proc print data=base_finale; run;

Patrick · ‎04-05-2024

Feels like you are looking for SAS Cloud Data Exchange. Here the SAS docu: Cloud Data Exchange for the SAS® Viya® Platform Here a recent SAS Communities article: SAS Cloud Data Exchange for the SAS Viya Platform

Online Status	Offline
Date Last Visited	Friday

Re: Difference between dates in SAS doesn't match results

Re: Capture SQL View Definition for Programmatic Change

Capture SQL View Definition for Programmatic Change

Re: Question from a DBA on idle database session

Re: SAS online on mainframe - Is there a way to prevent execution of...

Re: Bulk loading

How to improve performance copying a huge table while also creating ad...

Re: send email +attach log file only in case of error or warning

Re: send email +attach log file only in case of error or warning

Re: How to switch between different projects using EG?

Re: Discussion，AI [Agent] with SAS?!

Question from a DBA on idle database session

Re: Dates should be displayed in the local time zone when exported fro...

Re: How to improve performance copying a huge table while also creatin...

Re: How to improve performance copying a huge table while also creatin...

Capture SQL View Definition for Programmatic Change

Re: Difference between dates in SAS doesn't match results

Re: Question from a DBA on idle database session

Re: Download Dataset From SAS Server to Local Folder

Re: How to switch between different projects using EG?

How do I add a row number to a table in SAS code?

Re: You like me, you really like me!

Re: Find Min and Max of five variables with min above zero for each ro...

Re: Missing observations

Re: Want to write sql query to get related party account details

Re: Want to write sql query to get related party account details

Re: CAS answers to 4 common data manipulation tasks – Part 3 – DE-DUPL...

Re: How to have two different sequence ID's within the same grouping o...

Re: functions in pass thru query

Re: Saving an In-Memory CAS Table as a Single Parquet File

Re: How do I split up a large table from an imported xlsx file?

Re: Visual analytics to Read PDFdata

Re: How to create a dataset ?

Re: How to find all table name in sas program ( prod sql)

Re: SAS hosted on cloud connectivity with on prem data sources

Re: Comparing the dates

Re: SAS hosted on cloud connectivity with on prem data sources

CoDe SAS German