About Malthe

Malthe · ‎05-26-2021

Thank you for all your replies and helpful tips. So, first off I think an apology is called for. I am not an internet troll and reviewing my initial post, I recognize the content as both provocative and inconsiderate towards users of the forum. Please know that it was a product of frustration. I am often in company with other students and university staff, who favorize R and praise its functionality a lot. Also i see, that my search on graphical editing in SAS was not extensive enough. It seems there are plenty of possibilites within SAS containing the exact features I was looking for. I do think though, that a lot of the inputs you gave me here are harder to come by without prior knowledge. Searching on Google, YouTube and Stack Overflow you are quickly met with a lot of easily accesible R-content, the same is not true for SAS, unfortunately. The SAS documentation is very thorough and extensive, but its not exactly user-friendly, especially if you are new to the SAS-ecosystem, which I definetely still regard myself as. So, to sum it up, thanks for pointing me in the right direction, and for having a forum with a friendly community and quick replies. (Ps. And even though it hurt a bit, a mild internet scolding is sometimes needed 🤗 )

Malthe · ‎05-26-2021

I am a Ph.D. in medicine and have used SAS for a year and a half now. When starting my studies, I had to choose between SAS and R. The research-community in medicine in Denmark is split roughly 50/50 in SAS and R-users, with young, new users tending to choose R due to it being Open Source. I chose SAS because most people at my department used it and it seemed like an easy way to get going. I have previous experience from Python, but wanted an easy start from the get-go. Now, SAS is very intuitive for data-management and most statistics (although the syntax for a statement like CONTRAST in Phreg is very unintuitive compared to the R-alternative). I have grown to like SAS a lot, and therefore it both baffles and hurts me, that the graphical capabilities are so inferior to R. I hate having to export my datasets to be able to do what in R takes half an hour, but in SAS takes several hours and obscure code. I am sure that this is one of the reasons that SAS is loosing foothold in this battle. How come a company with so big economic muscles and market-dominance doesn't prioritize this aspect of the program? Why can't you output vector-formats easily for editing in other programs? Why isn't there a wizard or some easy code, that recognizes all the typical objects you want to be able to manipulate, when having created a graph? I mean most of these functions are from a coding-perspective not that advanced compared to all the other mechanisms in SAS. I hope that this side of the program will get some more attention - and a fresh new Googly/Apply-approach in the future!

Malthe · ‎03-02-2021

1) Noted! Thanks for mentioning that 🙂 2) In my real data sets those values are true dates. For this example it didn't matter, since it was only about getting the coding right - with some numbers representing dates. I could have left out the formatting. 3) Ah, yeah, i see the underlying problem. Thanks 🙂

Malthe · ‎03-02-2021

Thanks a lot! Worked like a charm 🙂

Malthe · ‎03-02-2021

Thanks for your fast reply! 1) That's weird. I took the code directly form SAS Enterprise Guide, something must have happened with the formatting that didn't show, when i was editing the post. It's fixed now! 2) The treatment-dates are used to ensure, that only the scores from before the startdate (from patients), are outputted. It's in this step of the hash-do-loop: "if TreatmentDate < Startdate then do;" i can definitely use your method, but i think the array one in the other reply is easier to apply in this case 🙂

Malthe · ‎03-01-2021

Hi there! I have a table of patients (one line per patient) for whom i want to find a score in another table of treatments (several lines per patient, one score for each date). In my real search it is several thousands, but here it's boiled down to a simple example. Patient-table: Data patients; Input PatientID StartDate; cards; 1 50 2 70 3 20 4 10 run; Treatments-table: Data Treatments; Input PatientID TreatmentDate Score; format TreatmentDate date9.; cards; 1 20 1 1 40 3 1 60 4 2 30 1 run; Then my goal is then to search for the treatments by using a hash-object. For each treatment-score that fulfills the criteria, i want to populate/create a new variable: Look-up code: data lookup; if _n_ = 1 then do; if 0 then set Treatments; dcl hash treat (dataset: 'Treatments', multidata: 'y'); treat.definekey ('PatientID'); treat.definedata ('TreatmentDate','Score'); treat.definedone (); end; set patients; by patientid; if first.patientid then count = .; do _iorc_ = treat.find() by 0 while (_iorc_ = 0); if TreatmentDate < Startdate then do; Count + 1; call symput('counter',count); Number&Counter = Score; end; _iorc_ = treat.find_next(); end; run; So that i end up with the final table looking like: PatientID | StartDate | Number1 | Number2 | Number3 ... Number X 1 | 50 | 1 | 3 | . 2 | 70 | . | . | . Thanks for your time and help 🙂

Malthe · ‎11-09-2020

Thanks for all your help, all of you! It was hard choosing a 'right solution', but this one worked flawlessly for me. The Proc Format/Proc Tabulate/Freq didn't really work as i have 10.000+ patients, and that methodology was simply too slow. One little note though; If i made multiple arrays, and made a do-loop for each of them (i.e. ugly coding with a lot of lines), it was 60% faster than the solution above.

Malthe · ‎11-06-2020

I have tried to add yet another description to clarify what i mean (I'm sorry, but i'm really trying to phrase it as easily understandable as i can, thanks for all your help) I will look into the formatting method and see if i can apply that 🙂

Malthe · ‎11-06-2020

Unfortunately with my real-world data I have a lot more diseasecategories. I have elaborated in the original post 🙂 I hope this helps!

Malthe · ‎11-06-2020

Thanks for answering! I think an example is best for me to try and explain it: The way I intended for it to work: 1) Value of Categoryindex{1} = 'diseasecategory1' (which is the name of another array) 2) The dim would then be a function of 'diseasecategory1'. DIM(Diseasecategory1) 3) Then i would have the number of elements in Diseasecategory1. In this instance 2 ('DG45' 'DG46'). So my new counting variable C would then go from 1 to 2. 4) I would then be able to use these values in the findstring. So i start by using step 1 | Categoryindex{1} = 'diseasecategory1' Then i would use my new C-variable in the do loop | C = 1 I could then put these together in the cats-function so i ended up having the value | diseasecategory1{1} 5) I would then be able to do a look-up of the first value in diseasecategory1{1} = 'DG45' in my original dataset. Then i would be able to repeat the process for diseasecategory1{2} (with C = 2). After this was done i would then in the loop go to diseasecategory2 .. and so o

Malthe · ‎11-06-2020

Hi forum! I have to create a medical score depending on several categories; i.e. cerebrovascular disease, cardiac disease, neurological disease. Each of these diseases have several ICD10-codes connected to them. So for instance 'Cerebrovascular Disease' could have 'DG45', 'DG46' etc. I have a huge look-up table and i would like to look this through using two arrays. One array containing the categories and several arrays containing the ICD-10 codes. I'm having trouble using this two-layered array-approach, can you help me? This is a much simplified version of my dataset and categories. data have; input id disease $; cards; 1 DG451 2 DG313 3 DG462 4 DG461A 5 DI14 6 DI13B 7 DI69 8 DI141 ; run; data lookup; set have; array diseasecategory1 {2} $4 _temporary_ ('DG45' 'DG46'); array diseasecategory2 {1} $4 _temporary_ ('DI14'); /* IN THE REAL WORLD I HAVE CATEGORIES UP TO 13 AND THEY CAN EACH CONTAIN UP TO 30 VALUES */ array categoryindex {2} $25 _temporary_ ('diseasecategory1' 'diseasecategory2'); /*THIS WOULD THEN NEED TO GO ON TO 13 */ do i = 1 to dim(categoryindex); do c = 1 to dim(categoryindex{i}); if find(disease,strip(cat(categoryindex{i},'{',c,'}'))) > 0 then counter +1; end; end; run; The problems are as follows: - In the second DO-loop i can't use the DIM-function on the value of the categoryindex-array - In the FIND-function i can't combine the value of categoryindex-array and the counting variable of the do loop. I hope you can help me 🙂 ***EDIT*** Thanks for replying Paige Miller, it seems i need to elaborate a bit more. I'll try my best. So with my real world data i have 13 disease categories. Each category can have up to 30 different values. So that would be the equivalent of: array diseasecategory1...13 {4...30} $12 _temporary_ ('DGX1' 'DGX2' 'DIX4' ..... DZ30'); And then i would use the DO-loop to add 1 to a counting variable. So for instance if a patient had one disease in three diseasecategories, their final score would be 3. *** EDIT 2 *** What i want: So I want to calculate a comorbidity score based on the diagnoses the patient has in the history. So for each patient i look through their entire history of diagnoses, if they match one in one of the categories, the score goes up by 1. I thought it would confuse more than clarify: data patientlist; input patientid; cards; 1 2 3 4 5 ; run; data diagnosishistory; input patientid icdcode $; cards; 1 DG451 1 DG313 1 DG462 1 DG461A 2 DG45 2 DI13B 2 DI49 3 DI151 ; run; data lookup; if _N_ = 1 then do; if 0 then set diagnosishistory; dcl hash pat (dataset:'diagnosishistory',multidata:'y'); pat.definekey('patientid'); pat.definedata('ICDCode'); pat.definedone(); end; set patientlist; by patientid; retain SCORE lock; if first.patientid then do; call missing(score, lock); end; array diseasecategory1 {2} $4 _temporary_ ('DG45' 'DG46'); array diseasecategory2 {2} $4 _temporary_ ('DI14' 'DI49'); array categoryindex {2} $25 _temporary_ ('diseasecategory1' 'diseasecategory2'); do _iorc_ = pat.find() by 0 while (_iorc_ = 0); do i = 1 to dim(categoryindex); do c = 1 to dim(categoryindex{i}); if find(disease,strip(cat(categoryindex{i},'{',c,'}'))) > 0 then SCORE + 1; end; end; _iorc_ = pat.find_next(); end; run; So in this example, my desired output would be like: Patient 1, score = 3 (Because he has one diagnosis with DG45 and two with DG46 in his history) Patient 2, score = 2 (Because he has one DG45 and one DI49 in his history) Patient 3, score = 0 (Because she hasn't got any of the diagnoses in history)

Malthe · ‎07-29-2020

Thanks for all your answers 🙂 I hoped for a simple solution other than running the macro in each program-instance. The SAS-autocall might be a solution, but seems a bit too technical for me to grapple with at the moment.

Malthe · ‎07-08-2020

Hi all, I am part of a research-group where we look at data from a specific patient-group. Often we have to calculate scores, measures etc based on the same type raw-data. Therefore macros are essential to us and we use them frequently. But it could be made a lot easier: When you invoke a function/macro that is defined by SAS-itself, you get a tooltip/help box. If you define a custom macro, in that same program, you will get a tooltip as well: But this doesn't happen if you load the macros with the %include-function (which we often do). Is there a way to make this help-text/tooltip visible to make input easier? Thanks in advance for your help!

Malthe · ‎07-03-2020

Great solution, and quick as well! Thanks! I actually solved it myself before seeing your answer using a 'call execute - datastep-macro' instead, but i think i like your solution better. It must be quicker than to read and write to the same data-set many times 🙂

Malthe · ‎07-02-2020

Hi there and in advance thanks a lot for your help! I'm using SAS Enterprise Guide 7.15. I have a dataset with patients, baseline-dates and scoring-dates. For each patient i have the same baseline-date and multiple scoring-dates (>20). I want to see whether the scoring-dates are within certain periods of time from the baseline-date: 1 year, 2 years, 3 years ... - 100 years; Instead of writing multiple lines of codes, i am trying to boil it down to a more manageable size. Here is a data-example with five scoring dates: data one; length patient_id $ 8; format baseline_date score_date date9.; input patient_id baseline_date score_date; cards; Patient1 0 200 Patient1 0 400 Patient1 0 600 Patient1 0 800 Patient1 0 1000 ; run; Here is my code: data scoring_within_year_x; set one; array a_withinyear [5] (1 2 3 5 10 100); /* For simplicity boiled down to smaller fixed array, these are the values i need the most */ do i = 1 to dim(a_withinyear); if intnx('year',baseline_date,a_withinyear[i],'same' >= score_date then /* Here i have my first problem, i want to name the new variable as a concatenation of a string and the value of the array - something like this */ cats('Score_within_',a_withinyear[i],'_year') = 1 /* Here i have my second problem. I want to output multiple variables to the same observation as shown underneath */ end; run; So ideally my output for the 4th line of the original data would look like this: Patient_id | Baseline_date | Score_date | Score_within_1_year | Score_within_2_year | Score_within_5_year | etc... Patient1 0 800 1 1 0 I hope you can help me 🙂

Online Status	Offline
Date Last Visited	‎12-12-2022 01:47 PM

Re: What is the explanation behind the limited graphical capabilities ...

What is the explanation behind the limited graphical capabilities of S...

Re: Creating a new variable for each found entry in hash object

Re: Creating a new variable for each found entry in hash object

Re: Creating a new variable for each found entry in hash object

Creating a new variable for each found entry in hash object

Re: Using two layers of arrays in DO-loops

Re: Using two layers of arrays in DO-loops

Re: Using two layers of arrays in DO-loops

Re: Using two layers of arrays in DO-loops

Re: Creating a new variable for each found entry in hash object

Re: Creating a new variable for each found entry in hash object

Re: Creating variable name from concatenation of a character string an...

Re: What is the explanation behind the limited graphical capabilities ...

Display tooltip for custom macros outside of active program

Re: What is the explanation behind the limited graphical capabilities ...

What is the explanation behind the limited graphical capabilities of S...

Re: Creating a new variable for each found entry in hash object

Re: Creating a new variable for each found entry in hash object

Re: Creating a new variable for each found entry in hash object

Creating a new variable for each found entry in hash object

Re: Using two layers of arrays in DO-loops

Re: Using two layers of arrays in DO-loops

Re: Using two layers of arrays in DO-loops

Re: Using two layers of arrays in DO-loops

Using two layers of arrays in DO-loops

Re: Display tooltip for custom macros outside of active program

Display tooltip for custom macros outside of active program

Re: Creating variable name from concatenation of a character string an...

Creating variable name from concatenation of a character string and a ...