About RichardDeVen

RichardDeVen · ‎05-06-2020

Windows filenames can contain ampersands. You will want to be sure SAS does not try to use the & as a resolution directive. Example: Wrap path in %NRSTR for explicit assignment. Use %SUPERQ to resolve the symbol only. %let path = %nrstr(c:\temp\tom&jerry videos); filename dir pipe "dir /b /s ""%superq(path)"""; data _null_; infile dir; input; put _infile_; run;

RichardDeVen · ‎05-06-2020

In this answer the PANELBY is TIMEFRAME, VBAR is TREATMENT, and / GROUP= is THERAPY The core issue is that /GROUP THERAPY values are driving the patterning. These values and their patterns appear in the legend. The categorical data set has the fully expected situation that THERAPY values encountered in one TREATMENT will be the same as encountered in a different TREATMENT. This means the THERAPY based patterning can not accommodate the TREATMENT it is under and thus can not drive color differentiation. Let's simulate some data so others can play along. The therapy value is coded 0 to 7 and the timeframe is code 0 to 3: proc format; value timeframe 0 = 'Baseline iron use' 1 = 'Day 1 : Week 12' 2 = 'Week 12 : Week 24' 3 = 'Week 24 : Week 36' ; value therapy 0 = 'None' 1 = 'Only IV' 2 = 'Only Oral' 3 = 'Only Other' 4 = 'Both IV + Oral' 5 = 'Both IV + Other' 6 = 'Both Oral + Other' 7 = 'All IV + Oral + Other' ; run; data have(keep=patid--timeframe); top = 400; call streaminit(1234); length patid 8 treatment $8 therapy 8 timeframe 8 ; do treatment = 'Dapro', 'rhEPO'; * 400 Dapro, 600 rhEPO; do i = 1 to top; patid + 1; p = rand('uniform', 100); select; when (p < 40.0) therapy = 0; when (p < 85.0) therapy = 1; when (p < 92.0) therapy = 2; when (p < 93.5) therapy = 3; when (p < 95.0) therapy = 4; when (p < 96.5) therapy = 5; when (p < 98.0) therapy = 6; otherwise therapy = 7; end; do timeframe = 0 to 3; * simulate a 3% patient drop out at any timeframe; if rand('uniform') < 0.03 then leave; * simulate an entire bar segment with no patients; if treatment = 'rhEPO' and timeframe = 3 and therapy = 1 then leave; output; end; end; top + 200; end; format timeframe timeframe. therapy therapy. ; run; Take a look at the simplest first try that uses STYLE=JOURNAL2 for its grayscale theme: ods html path='.' file='vbar.html' style=Journal2; proc sgpanel data=have pctlevel=group; title 'Simple Starter Plot'; panelby timeframe / layout=columnlattice onepanel ; vbar treatment / group=therapy stat=percent ; run; The attached code shows the steps and some intermediate plots demonstrating the transition to a final plot with desired features. The pre plot processing includes: Creating a custom template named StackedVbarGroupCrossVar based on Journal2 The first eight GraphData styles elements (for the 8 therapies) specify the cross hatch patterns to use Another eight GraphData style elements specify the same hatching with ContrastColor=LIGHTRED Creating a custom format THERAPY_CROSS_TREATMENT to deal with a transformed value of THERAPY that accounts for TREATMENT Computing the counts for each TREATMENT & TIMEFRAME combination Create a PLOT data set that has New column BAR_COUNT that contain the computed counts New column THERAPY_CROSS_TREATMENT that segregates therapy according to TREATMENT (Dapro nochange, rhEPO therapy code+10) New column TREATMENT_WITH_COUNT that has "/(N=<combination-count>)" concatenated to TREATMENT The tweaked data now has everything needed tricking out the plot into a final presentation created using the custom template StackedVbarGroupCrossVar: Compute a quoted list of the rhEPO specific THERAPY_CROSS_TREATMENT formatted values so they can be EXCLUDEd from the KEYLEGEND Add PANELBY options: novarname noborder colheaderpos=bottom headerbackcolor=white noheaderborder uniscale=row (not having uniscale col will let proc remove 'nobar' ticks) Change the VBAR statement to use tweaked values vbar treatment_with_count / group=therapy_cross_treatment ... Update the KEYLEGEND to exclude the segregating group values beyond the first eight keylegend / ... exclude=(&keyexcludes) Add a rowaxis to display 10% intervals rowaxis values = (0 to 1 by 0.1) ... ; Add a colaxis with splitalways so the combination count (the bar total counts) appears below the TREATMENT proc sgpanel data=plot pctlevel=group; title 'Presentation Plot Final'; panelby timeframe / layout=columnlattice onepanel novarname noborder colheaderpos=bottom headerbackcolor=white noheaderborder uniscale=row ; vbar treatment_with_count / group=therapy_cross_treatment stat=percent ; label therapy_cross_treatment = 'Iron therapy'; keylegend / across=4 outerpad=14px noborder exclude=(&keyexcludes) ; rowaxis values = (0 to 1 by 0.1) display=(noline noticks) grid ; colaxis display=(nolabel) fitpolicy=splitalways splitchar='/' valueattrs=(size=8pt) ; run;

RichardDeVen · ‎05-06-2020

You will want to track the prior SP500 value so that you can identify the entry criteria. RETAIN the tracking variable so that it is not reset at the top of the step. Explicitly reset the tracking value to zero at the first row in the group. This will allow the first row to be an 'entry' condition. Update the tracking value after checking the criteria. data want(drop=prior_SP500); set have; by id; retain prior_SP500; if first.id then prior_SP500 = 0; SP500Entrant = (prior_SP500 = 0 and sp500 = 1); prior_SP500 = sp500; run;

RichardDeVen · ‎05-04-2020

Just create a new question. It sounds like an outer join of TABLE1 and TABLE2 would be needed, so in the new question specify information like how many names there are in each table, and what criteria are to be used to 'match' names which would then be your 'links' of this question. Once you have that joined table the answers in this question can be used to create the 'super-sets' from the outer join. The presumption would be that a name in either table can be matched to more than one row in the other table.

RichardDeVen · ‎05-01-2020

How many rows does the data have ? Up to a certain point, one simple approach is to combine N groups of sizes 1 to N over the data and process the combined data using PROC MEANS with a BY statement. Example - 100 rows, creates a 'triangle' of grouped data with 5,050 rows ( N (N+1) / 2 ) data have; call streaminit(123); do row = 1 to 100; x = ceil(500*rand('normal', 10, 4)); output; end; run; data triangle; set have nobs=nobs; do group = _n_ to nobs; output; end; run; proc sort data=triangle; by group; run; proc means noprint data=triangle; by group; var x; output out=accum_quartiles q1=q1 p50=p50 q3=q3; run;

RichardDeVen · ‎04-30-2020

You can use DOW loop processing to capture the final code, and a second loop to apply it. data want; do _n_ = 1 by 1 until (last.customer_id); /* repurpose _n_ */ set have; by customer_id; last_code = code; /* capture code */ end; * at end of above loop last capture is last code in group; * use repurposed _n_ to iterate over the rows of the group; do _n_ = 1 to _n_; set have; * second read buffer; code = last_code; * apply captured value, overwriting original; OUTPUT; * output one row per row of original data; end; run;

RichardDeVen · ‎04-29-2020

Not seeing the macro source code, so I would recommend stacking all the CALL EXECUTES by wrapping the macro invocation in %NRSTR DATA _NULL_; SET MonitorsAndDates; URL=CATS("'https://api.....com/api/monitor?auth=",SYMGET('Auth'),%NRSTR('&id='),Monitor,%NRSTR('&start='),PUT(StartDate,E8601DA10.),%NRSTR('&end='),PUT(EndDate,E8601DA10.),"'"); length statement $2000; statement = '%nrstr(' || '%GetVolume(MonitorLabel='||MonitorLabel||',N='||_N_||',URL='||URL||');' || ');'; CALL EXECUTE(statement); RUN;

RichardDeVen · ‎04-29-2020

The sample code at https://www.devenezia.com/downloads/sas/samples/#groupbyeither shows how find the groups within data having dual keys and linked by key1 OR key2. Written for a 2004 SAS-L question "How to group people by their first name OR last name". Here is the same processing rewritten for Proc DS2. The core concept is using a HASH to map names to groups, and an multidata anti-map HASH of group to names. The anti-map must be used when combining separated groups that become linked. In particular, combining groups requires a traversal one of the groups in the anti-map using has_next/find_next. * Given: * There is only one records number for each name, no matter how many records contain that name; data have; input NAME1: $10. RECORDS1 NAME2: $10. RECORDS2; format records: 6.; datalines; JONATHAN 500 JOHNNY 905 JOHNO 750 JOHNNY 905 JONNO 415 JOHNO 750 JOHHN 675 JOHNO 750 JOHNNY 905 JOHN 1017 JOHN 1017 JOHNNY 905 TOM 5243 TOMMY 4 BRAD 873 BRADLEY 219 BRADLEY 219 BRAD 873 JON 875 AJONO 775 JOHHNO 904 JON 875 ZIP 250 ZIPP 175 JOHNNY 905 AJONO 775 ; proc datasets nolist lib=work; delete want; run; proc ds2; data _null_; declare package hash name_group_map(); * a mapping from name to group. a name can belong to only one group; declare package hash group_name_map(); * an anti mapping of group to names. a group can have many names; declare char(25) name name1 name2; declare double records records1 records2 group group1 group2; method init(); name_group_map.ordered('ASCENDING'); name_group_map.keys([name]); name_group_map.data([name records group]); name_group_map.defineDone(); group_name_map.multidata('yes'); group_name_map.keys([group]); group_name_map.data([group name]); group_name_map.defineDone(); group = 0; end; method run(); declare double found1 found2 hold_group; declare double rc; set have; * each row represents a link. both ends are a name with a count.; * a name can appear in other rows, but its count will not be different; found1 = name_group_map.find([name1], [name1 records1 group1]) = 0; found2 = name_group_map.find([name2], [name2 records2 group2]) = 0; select; when ( ~found1 and ~found2) do; group + 1; *put 'NOTE: both new' group=; name_group_map.add([name1], [name1 records1 group]); name_group_map.add([name2], [name2 records2 group]); group_name_map.add([group], [group name1]); group_name_map.add([group], [group name2]); end; when ( found1 and ~found2) do; *put 'NOTE: add name2 to name1' group1=; name_group_map.add([name2], [name2 records2 group1]); group_name_map.add([group1], [group1 name2]); end; when ( ~found1 and found2) do; *put 'NOTE: add name1 to name2' group2=; name_group_map.add([name1], [name1 records1 group2]); group_name_map.add([group2], [group2 name1]); end; when ( found1 and found2) do; if group1 = group2 then return; * traverse the multidata of group2 key and migrate each data to the group1 key; hold_group = group; group = group2; rc = group_name_map.find(); do while (rc = 0); name_group_map.find(); name_group_map.replace([name], [name records group1]); group_name_map.add([group1], [group1 name]); if group_name_map.has_next() ne 0 then leave; rc = group_name_map.find_next(); end; group_name_map.removeall(); * remove the anti-map key; group = hold_group; end; otherwise; end; end; method term(); * DS2 does not overwrite existing tables; * DS2 does allow a table option (OVERWRITE=YES), or /OVERWRITE=YES; * However, DS2 Package HASH method OUTPUT does NOT provisio for such, * and will log an error if used, example: * ERROR: Malformed hash data source name <table-name>(overwrite=yes). * * Thus, the coder MUST pre-delete the target output table in an earlier step; name_group_map.output('work.want'); end; enddata; run; quit; options nosource nonotes; proc sort data=want; by group descending records name; run; ods html file='want.html' style=plateau; proc print data=have; proc print noobs data=want; run; ods _all_ close; options source notes; Sample output

RichardDeVen · ‎04-26-2020

Should I see the transformation from JOHN 61017 JOHNNY 905 JOHNNY 905 JOHN 61017 JONATHAN 500 JOHNNY 905 to (conceptually) a count ordered chain JOHN(61017) > JOHNNY(905) > JONATHAN(500) to a root grouped name list JOHN JOHNNY JOHN JONATHAN or more generally root(N 0 ) > node-1(N 1 ) > node-2(N 2 ) > ... > node-m(N m ) to root node-1 root node-2 root ... root node-m

RichardDeVen · ‎04-24-2020

Can you explain what happened to the JOHNATHAN JOHNNY pair ? Why is the pair excluded from WANT ? JOHN 61017 JOHNNY 905 JOHNNY 905 JOHN 61017 JONATHAN 500 JOHNNY 905 What should happen if the BRAD data was BRAD 873 BRADLEY 219 BRADLEY 2500 BRAD 873 Should a NAME1 value be excluded from output because it appears in a different pair as NAME2 with a lower count of the pair ? @triley This edit is a new question about the problem. Would you ever have a single row like the TOM row, but the higher count is in RECORDS2. So, would the data ever have a case such as the following? RICKY 215 RICHARD 1618

RichardDeVen · ‎04-24-2020

Do you want a missing table_name and table_count because none of the 'insurance' tables had run_id=12345 and company_code="ABF" ? If you were to process only one 'insurance' table should all the other rows of details be set to have missing values for their run_id and company_code ? In other words, should the table names and counts in details be set to missing when their runs and companies don't match the 'set' of tables names in the in the filelist? What should happen if a filelisted table contains a new run/company combination ?

RichardDeVen · ‎04-24-2020

The question subject is misstated as "Long to wide". The question is really one of "Max over Group", or more generically "Result over Group" The result over group for this question "presence within group". A "DOW" loop is very effective for processing data sorted into BY groups, and outputting one row per group. (The technique can also be used to apply the computed result to each row in the group). Example: data want(keep=pt_id diabetes hypertension); do until (last.pt_id); SET have (rename=(diabetes=_dia hypertension=_htn)); BY pt_id; diabetes = _dia OR diabetes; hypertension = _htn OR hypertension; end; run; The secret to the DOW loop is placing the SET and BY statements inside a DO LOOP. An implicit OUTPUT occurs at the end of the step with the desired computed results. The rename= is necessary because the programmer wants the aggregate result variables (diabetes/hypertension) to be the same name as the individual flag variables.

RichardDeVen · ‎04-24-2020

A data set index can be created in different ways As specified by an output data set with option INDEX= Simple indices (single columns) can not be named something else Compound indices are specified <index-name>=(<column1> ... <column-n>) PROC DATASETS; MODIFY PROC SQL; CREATE INDEX Index management is done with either DATASETS or SQL. Read the documentation for more information about uniqueness and non-null indices and other data index topics such as integrity constraints and foreign keys. Examples: data cars(index=(Make Model type_drive=(Type DriveTrain))); set sashelp.cars; run; proc datasets nolist lib=work; modify cars; index delete Make Model; index create Origin; run; proc sql; create index Make on work.cars; drop index type_drive from work.cars; create index type_cyl on work.cars(Type, Cylinders) ; NOTE: SAS data sets do not have RDBMS features such as triggers or sequence numbers.

RichardDeVen · ‎04-24-2020

The following SQL construct can be used to insert (i.e. append) a query result into an existing table INSERT INTO <TABLE> SELECT ... ; Example: * blank base table for results; proc sql; create table work.results (Question CHAR(20), Result NUM); * Dynamic 1 .... ; data dynamic1; set sashelp.class; run; * Append query results for Q1 to results table; proc sql; insert into results select 'Q1', count(*) from dynamic1; * Dynamic 2 .... ; data dynamic2; set sashelp.class; where age < 14; run; * Append query results for Q2 to results table; proc sql; insert into results select 'Q2', count(*) from dynamic2; %let syslast = results;

RichardDeVen · ‎04-22-2020

The shown code %let YYMMDD2='200420; is just incorrect and will put your SAS session into a open parsing state because of the lone single quote. Presume for sake of argument, the initial macro variables could contain garbage characters that are not part of a proper yymmdd representation of a date. You really don't need macro looping for this. Your coding sanity is far safer using other statements. I will presume the second macro assignment actually has a stray single quote. So, on the premise that the digits within the macro value are date representation in the construct of yymmdd, the INPUTN() function can parse those digits in to a SAS date value. COMPRESS can process a string and keep only the digits. You might want to convert the date value to it's date literal representation (maybe if you are writing a source code file for review and later submittal), or to a representation for reporting (such as a TITLE that has to show dd-MON-yyyy) Example options nosource; %let YYMMDD1=200421; %let YYMMDD2=%str(%'200420); %put NOTE: YYMMDD1=%superq(YYMMDD1); %put NOTE: YYMMDD2=%superq(YYMMDD2); %put --; %let DATEVAL1 = %sysfunc(INPUTN(%sysfunc(COMPRESS(%superq(YYMMDD1),,KD)),YYMMDD8.)); %let DATEVAL2 = %sysfunc(INPUTN(%sysfunc(COMPRESS(%superq(YYMMDD2),,KD)),YYMMDD8.)); %put NOTE: &=DATEVAL1; %put NOTE: &=DATEVAL2; %put --; %let DATE_LITERAL1 = "%sysfunc(PUTN(&DATEVAL1,DATE11.))"D; %let DATE_LITERAL2 = "%sysfunc(PUTN(&DATEVAL2,DATE11.))"D; %put NOTE: &=DATE_LITERAL1; %put NOTE: &=DATE_LITERAL2; %put --; %let DATE1_FOR_TITLE = %sysfunc(PUTN(&DATEVAL1,DATE11.)); %let DATE2_FOR_TITLE = %sysfunc(PUTN(&DATEVAL2,DATE11.)); %put NOTE: &=DATE1_FOR_TITLE; %put NOTE: &=DATE1_FOR_TITLE; %put --; options source; Log NOTE: YYMMDD1=200421 NOTE: YYMMDD2='200420 -- NOTE: DATEVAL1=22026 NOTE: DATEVAL2=22025 -- NOTE: DATE_LITERAL1="21-APR-2020"D NOTE: DATE_LITERAL2="20-APR-2020"D -- NOTE: DATE1_FOR_TITLE=21-APR-2020 NOTE: DATE1_FOR_TITLE=21-APR-2020 --

Online Status	Offline
Date Last Visited	‎08-01-2023 12:32 PM

Re: Dynamic code to convert char to numeric

Re: ODS EXCEL using Calibri on Unix/Linux Server

Re: Recursive find files

Recursive find files

Is there a way to get compressed file sizes of entries in zip archive ...

Applying Include Exclude criteria in control data set

Is there a way to test if SASFILE will fail before trying to open data...

Re: OPEN Function with (keep= specifying absent variables is logging m...

OPEN Function with (keep= specifying absent variables is logging messa...

Re: open() function unquotes values?

Re: Proc Import a Date Column in EXCEL as a Date

ODS package path option: path longer than 68 produces Warning

Re: XPT: Not a SAS data set

SAS Export dataset as csv or excel preserving line break

Re: SAS Export dataset as csv or excel preserving line break

Re: Recursive find files

Re: PROC sql: insert into

Re: Why is SAS 9.4 converting variable to VAR8?

Re: How to flip the order of an array

Re: Is there a macro of function can do any base conversion of numbers...

Examples: DATA Step Functions for Reading Metadata

Re: Using Command Prompt Syntax with Spaces

Re: Stacked ,clustered and grouped bar chart with patterns

Re: How do I indicate based on a previous observation (t -1)?

Re: Creating a table based on prior records

Re: How to get the accumulative quantiles

Re: Retain to keep latest record

Re: Understanding macro compiling

Re: Creating a table based on prior records

Re: Creating a table based on prior records

Re: Creating a table based on prior records

Re: Update base table for every iteration

Re: Long to wide format

Re: INDEX creation using DATA step

Re: How to summarize a bunch of queries

Re: Error in creating macro vars