About FreelanceReinh

FreelanceReinh · ‎07-03-2024

Hello @aazzarello, Shift the start of the week to Wednesday by using 'week.4' instead of 'week' in the first argument of the INTNX function, then the weeks will end on Tuesday as desired (see documentation of the shift index).

FreelanceReinh · ‎07-01-2024

Hi @ChrisNZ, It's the default value 1E-12 of the FUZZ= option which causes this. Use FUZZ=0 (or, e.g., FUZZ=1E-23 in this example) to avoid the unwanted formatting. The numbers 8.400000000008E-11 and 8.400000000000E-11 differ by only 8E-23 < 1E-12. @PaigeMiller wrote: SAS cannot represent numbers exactly that are more than about 15 significant digits. This is true, but the numbers involved here have only up to 13 significant digits, so SAS can handle them fairly well. Only fairly well, though, because the case of 8.400000000012E+11 is an example where the internal representation depends on whether scientific notation is used in the literal: 430 data _null_; 431 x=840000000001.2; 432 y=8.400000000012E+11; 433 if x ne y then put 'Surprise!'; 434 put (x y) (=binary64./); 435 run; Surprise! x=0100001001101000011100100111110011011010000000000010011001100110 y=0100001001101000011100100111110011011010000000000010011001100111 (using Windows SAS 9.4M5). Here, the internal representation of x is mathematically correct, i.e., closer to the theoretical exact representation, which repeats the 4-digit pattern "0011" (occurring three times in the representation of x, followed by the last zero) infinitely often. The last bit (1) of the internal representation of y is actually the result of incorrectly rounding up. Translated back to the decimal system, the two internal representations look like this: x=840000000001.199951171875 y=840000000001.2000732421875 So we can see that the precision is clearly sufficient to distinguish either of these numbers from, say, 840000000001.0. The situation with the numbers close to 8.4E-11 is quite similar because of the number of significant digits. The eleven leading zeros in the decimal representation don't really matter (only a little bit if you try and enter them in a numeric literal, where, again, the internal representation may differ from that of the literal in scientific notation).

FreelanceReinh · ‎06-29-2024

@TomHsiung: Sorry for the delayed reply, I was out of the office for a week. @TomHsiung wrote: I guess if we have more than 3 time-varying variables, this approach would be very laborious. I don't think I've ever had that many time-varying variables in a Cox model. But since the counting process style of input follows a general pattern -- a change in one of the time-varying variables calls for a new observation in the input dataset -- it should be possible to use DATA step programming logic to create all those observations. See the recent post Re: Counting process time dependent cox model for an example (for discrete times). @TomHsiung wrote: I addition, the PROC TRANSPOSE might experience difficulty when transferring a wide dataset to a narrow dataset, given there is more than one time-varying variables (e.g., A_wk1, A_wk2, ... A_wkm, and B_wk1, B_wk2, ... B_wkn). Data transformations from wide to long (and vice versa) have been discussed many times in the SAS Support Communities: please see the search results https://communities.sas.com/t5/forums/searchpage/tab/message?q=%22wide%20to%20long%22&noSynonym=false&sort_by=score&collapse_discussion=true or open a new thread describing your specific problem.

FreelanceReinh · ‎06-29-2024

Hello @ANKH1, Missing quartile point estimates and confidence limits are quite common if the Kaplan-Meier curve and related curves for the confidence limits don't drop far enough -- a data-dependent issue: see explanations in Re: Why did I get confidence intervals without a estimate? (and the PROC LIFETEST documentation page linked there) and Re: Kaplan Meier stats. Also, there was a long standing bug impacting some of those calculations of PROC LIFETEST, which has been fixed only in SAS 9.4M7 (and later releases): see Problem Note 64617: The LIFETEST procedure produces incorrect upper confidence limits for the quartiles for certain data.

FreelanceReinh · ‎06-20-2024

You can also simplify the code by using SAS date values, the INTNX function and a date format: %macro newtest; %do i=0 %to 7; %put bef%sysfunc(intnx(qtr,'01JAN2008'd,&i,e),yymmn.); %end; %mend; %newtest

FreelanceReinh · ‎06-20-2024

Hello @mgrasmussen, Just insert the missing ampersand: %if &l<12 %then %do;

FreelanceReinh · ‎06-19-2024

Hello @Cornelis, The outer single quotes prevent the resolution of the macro variable reference. Use double quotes instead and duplicate the existing inner double quotes: filename DIRLIST2 pipe "dir ""\\msnlgoudcp3102\nlgouddata\GLC\LZ-analyses\to be printed\&thisyear\vetzuur"" /s";

FreelanceReinh · ‎06-14-2024

Hello @MParthasarathy, This was discussed in the 2021 thread Proc sql giving result when expected not to. Please see the solution there. Does it answer your question?

FreelanceReinh · ‎06-14-2024

Hello @bgb, Glad to see that SASKiwi's solution should work for you. Then it would be fair and help later readers if you marked his helpful reply as the accepted solution, not your own "thank you" post. Could you please change that? It's very easy: Select his post as the solution after clicking "Not the Solution" in the option menu (see icon below) of the current solution.

FreelanceReinh · ‎06-13-2024

@TomHsiung wrote: If the two time-varying variables change on the same day (tie), how do we count them? Only two rows with a same start and stop time? The general pattern is always the same: Each row represents a semiclosed time interval (start, stop] in which the time-varying variables are constant. If in the previous example not only B changed on day 4 from 1 to 0, but also A from 0 to 1 (and remained constant thereafter), we would specify: start stop A B 0 3 0 1 3 14 1 0 So, up to and including day 3 the "vector" (A, B)=(0, 1), whereas after day 3, i.e., on days 4, 5, ..., 14, (A, B)=(1, 0).

FreelanceReinh · ‎06-11-2024

@sasmhe1 wrote: I tried: "C:\Program Files\SASHome\SASFoundation\9.4\core\sasexe\sasoact.exe" action=Browse datatype=Data filename="f:\ds2.sas7bdat" but it didn't work. What happened when you submitted that command? Did it open a new SAS session or did a running SAS session react? Did you get an error message from cmd.exe (e.g., saying that the path was misspelled or not found) or from SAS (e.g., saying that f:\ds2.sas7bdat does not exist)? What is the content of the Windows registry keys HKEY_CLASSES_ROOT\SAS.DataSet.701\shell\Browse\command and HKEY_CLASSES_ROOT\SAS.DataSet.701\shell\BrowsewithSAS940\command (see the SAS GF 2013 paper I linked to for step-by-step instructions)? Does the command work with .sas7bdat files on other drives? Have you tried the SAS Universal Viewer?

FreelanceReinh · ‎06-11-2024

Hello @sasmhe1, On my computer, using Windows SAS 9.4M5, the below command opens dataset WANT (from folder C:\Temp) in a VIEWTABLE window of an open SAS session (or a new SAS session if none is already open): "C:\Program Files\SASHome\SASFoundation\9.4\core\sasexe\sasoact.exe" action=Browse datatype=Data filename="C:\Temp\want.sas7bdat" See the SAS GF 2013 paper "Double-Clicking a SAS® File: What Happens Next?" for more details, in particular the difference between sas.exe and sasoact.exe. Personally, I prefer the SAS Universal Viewer (see https://support.sas.com/downloads/browse.htm?fil=&cat=74 -- it is not installed by default) in those rare situations when I want to scroll through a dataset. I'm using version 1.42 (which is not the most recent one) and the corresponding command looks like this: "C:\Program Files\SASHome\SASUniversalViewer\1.4\SAS.UniViewer.exe" "C:\Temp\want.sas7bdat"

FreelanceReinh · ‎06-10-2024

@stellapersis7 wrote: Hi all, I have datasets called cases and controls. I need to match 1:1 from cases and controls using the following variables: age and gender should be exactly matched for duration, duration of controls should be more than duration of cases. Hi @stellapersis7, Are the above two bullet points the only requirements? Consider this simple example with only three cases and three controls (all with the same age and gender): Obviously, there are several solutions satisfying your requirements: One is the set {(1, 3), (3, 5)} of (case, control) pairs, highlighted in green in the graph (where the subjects are represented by their "duration" values for simplicity). Other solutions are {(1, 2), (3, 5)}, {(1, 2), (4, 5)} and [(1, 3), (4, 5)} -- but also {(1, 5)}. The latter set contains only one (case, control) pair, as there is no eligible control left for the cases with durations 3 and 4, once the control with the large duration 5 has been "wastefully" assigned to the case with duration 1. Mathematically, your goal is to find a matching in a bipartite graph. If you want to obtain a set with as many eligible (case, control) pairs as possible, this would be called a maximum (cardinality) matching. The maximum possible cardinality in the above example is 2, so the singleton matching {(1, 5)} is not a maximum matching. I think (but haven't proved mathematically; I don't know much about graph theory) that the DATA step suggested below (creating dataset WANT) finds a maximum matching. It uses case and control datasets sorted by age, gender and descending duration. Starting with the maximum duration in each age-gender BY-group of the CASES dataset, it randomly selects one of the eligible controls in the CONTROLS dataset (if any). Technically, it temporarily stores the ENROLIDs and durations of one BY-group of the controls in a hash object (using a sequential number _c as the key), which is convenient because a control that has been assigned to a case can be easily deleted in order to avoid duplicate assignments. Output dataset WANT contains all observations from dataset CASES plus the ENROLID of the assigned control, named ENROLID_CONTROL, and the corresponding DURATION_CONTROL. The latter two variables have missing values if no matching control was found (anymore). Let me first create sample datasets CASES and CONTROLS with about 1000 cases and 3000 controls. (The purpose of the exclusions via WHERE= dataset options is to include non-matching cases and controls.) /* Create sample data for demonstration */ data cases(rename=(d=duration_case) where=(age ne 21)) controls(rename=(d=duration_control) where=(age ne 42)); call streaminit(27182818); do enrolid=1 to 4000; age=rand('integer',18,80); gender=char('MF',rand('integer',1,2)); d=rand('integer',1,2000); if enrolid<1000 then output cases; else output controls; end; run; proc sort data=cases; by age gender descending duration_case; run; proc sort data=controls; by age gender descending duration_control; run; /* Match controls to cases */ data want(drop=_:); call streaminit(3141592); if _n_=1 then do; if 0 then set cases; dcl hash h(ordered:'a'); h.definekey('_c'); h.definedata('_c','enrolid_control','duration_control'); h.definedone(); dcl hiter hi('h'); end; set controls(in=ctrl rename=(enrolid=enrolid_control)) cases(in=case); by age gender; if ctrl then do; if first.gender then _c=1; else _c+1; h.add(); end; if case then do; _i=0; _rc=hi.first(); do while(_rc=0 & duration_control>duration_case); _i+1; _rc=hi.next(); end; if _i then do; _r=rand('integer',_i); do _j=1 to _r; hi.prev(); end; end; else call missing(enrolid_control, duration_control); output; if _i then do; _d=_c; _rc=hi.prev(); _rc=h.remove(key:_d); end; end; if last.gender then do; _rc=hi.first(); _rc=hi.prev(); h.clear(); end; run; I have also written a "reverse" variant of the above program (not shown here), i.e., assigning cases to controls, using input datasets sorted by age, gender and ascending duration, starting the assignments with the smallest DURATION_CONTROL in each age-gender BY-group. With all input datasets I tested, it obtained the exact same number of matches as the above program -- indicating that those numbers might be the maximum possible "cardinalities". Please note, however, that results of both versions of the program are somewhat "biased" in a sense: The above version "favors" large case durations (within each age-gender BY-group). Cases with smaller durations may be left unassigned because eligible controls have already been assigned earlier. Similarly, the reverse program version "favors" small control durations. You would have to decide if such "biases" are acceptable for whatever statistical analysis you are planning to perform with the matched case-control pairs. In the small example above, the program would always assign control "5" to case "4" and hence leave case "3" unassigned. The "reverse" version of the program would always assign case "1" to control "2" and hence leave control "3" unassigned. Therefore, neither of the two program versions could ever obtain the "green" solution {(1, 3), (3, 5)}. If that is a problem and you want to avoid the "biases" mentioned above and your SAS license (unlike mine) includes SAS/OR or similar modules for optimization, I think you should post your question in the Mathematical Optimization, Discrete-Event Simulation, and OR forum. SAS/OR contains advanced procedures that are suitable for such "graph theoretic" problems. EDIT: Unlike my test datasets, your sample data contain a duplicate case ENROLID (27264303). Depending on the rules to be applied to duplicates, the code above may need to be modified a bit in order to handle those cases correctly.

FreelanceReinh · ‎06-04-2024

Hi @cosmid, So the first, second, third, ... ID in the first dataset (let's call it HAVE1) is replaced by the first, second, third, ... ID in the second dataset (HAVE2), respectively? If so, try this: data want(drop=_); retain id; /* redundant if order of variables is not important */ set have1(rename=(id=_)); if value='01' then set have2; run;

FreelanceReinh · ‎06-04-2024

@TomHsiung wrote: Thank you for your suggestion. (...) If there is an individual who was followed for 14 days, for whom the A changed on day 7 from 0 to 1. In addition, the B changed on day 4 from 1 to 0. According to my understanding of your idea, there should be three rows for this individual in the overall table and they are: row one: A=0, B=1, start=0, stop=4 row two: A=0, B=0, start=4, stop=7 row three A=1, B=0, start=7, stop=14 You're welcome. In a situation with discrete (integer) times t 1 , t 2 , ..., as in your example, one has to make sure that the time-varying variables at time t i have the values that are relevant for the case that t i is the event time of the individual they describe. This is illustrated in Example 92.7 Time-Dependent Repeated Measurements of a Covariate of the PROC PHREG documentation: A value measured at time t i is assumed to be valid in the entire semiclosed interval (t i-1 , t i ] where t i-1 is the time of the previous measurement (or zero if i=1). Time t i-1 must not be equal to t i in this situation. Otherwise, PROC PHREG would discard the observation and issue a note about this in the log. So, if you know that "B changed on day 4 from 1 to 0," (and not: "it may have changed earlier, but the first measurement detecting the change happened to be that on day 4") I think it would be more appropriate to have a time interval with stop time 3 and B=1, followed by an interval with start time 3 and B=0. Similarly, knowing that "A changed on day 7 from 0 to 1," the latter interval would rather have stop time 6 and A=0. start stop A B 0 3 0 1 3 6 0 0 6 14 1 0 If the event or censoring time of that individual was day 7, the third observation would have stop=7 (and the corresponding value of the variable indicating event or censoring, not shown above). Thus, the model would take the potential impact of A=1 on the occurrence probability of the event into account since start=6<7=stop.

Re: How to Reg on each row?! with Slope/Intercept saved out?!

Re: How to use a macro variable in a if else condition

Re: How to use a macro variable in a if else condition

Re: problem with where clause on numeric

Re: INTCK Question

Re: How to tell macro variable created or not inside PROC SQL?!

Re: INPUT not converting character to numeric

Re: modify xaxis with different ranges (some very close to 1.xx and ot...

Re: modify xaxis with different ranges (some very close to 1.xx and ot...

Re: Proc Optmodel - output

Re: VALIDVARNAME=V7

Re: problem with ODS in SAS EG 8.3

Re: is there a minimum file size for .sas7bdat files?

Re: is there a minimum file size for .sas7bdat files?

Re: ods pdf and gmap: PDF output different than EG

Re: How to use a macro variable in a if else condition

Re: INTCK Question

Re: How to tell macro variable created or not inside PROC SQL?!

Re: modify xaxis with different ranges (some very close to 1.xx and ot...

Re: IF statement not working consistently

Re: Output week ending Tuesday from a Date colum

Re: numeric format does not validate intervals properly

Re: More than one time-dependent variable in a time-dependent Cox regr...

Re: Median and confidence intervals missing when using proc lifetest

Re: How to include a number immediately after one macro and immediatel...

Re: How to include a number immediately after one macro and immediatel...

Re: Macro variable in filename not resolved

Re: Code should result in error but it does not

Re: How to compare 2 string and return a number of letter difference b...

Re: More than one time-dependent variable in a time-dependent Cox regr...

Re: how to open a .sas7bdat file from the Windows cmd.exe command line...

Re: how to open a .sas7bdat file from the Windows cmd.exe command line...

Re: matching cases and controls

Re: How do I replace one dataset's ID column with a second dataset's I...

Re: More than one time-dependent variable in a time-dependent Cox regr...

SAS Analytics Explorers

CoDe SAS German