Manipulating Data in SAS Studio Flows Part 3: Deduplicating Data
Recent Library Articles
Recently in the SAS Community Library: Ever need to cut down a large data set, but aren’t sure how? Chances are you’ll need to remove duplicate rows or values in your data, whether that’s to eliminate redundancy or to simply analyze rows with unique field values. Much like other common data manipulation tasks, SAS Studio Flows has multiple steps that can remove duplicates in a few mouse clicks! SAS' @GraceBarnhill shows you how.
Hi everyone, I need your help because I'm going crazy. I need to create a program that, starting from an imported dataset, searches for the first possible combination of amounts, which when added together give the searched amount. Let me explain better with an example. Suppose we have a dataset consisting of two columns. In the first column there is a list of amounts, which represent the invoices received, in the second column the amount to search for (which is repeated the same for the entire extension of the dataset). I need the program to find the combination of amounts in the first column that result in the amount found in the second column. Example: Column 1: 100 200 50 400 1000 Column 2: 1700 1700 1700 1700 1700 I need the program to find the combination that gives the result 1700. So it will be line 1 + line 2 + line 4 + line 5. If there are multiple combinations, he will have to stop at the first find, without continuing further. Finally, I would need this combination to be reported in a new column called "combination". I have made many attempts, but I can only use "nested cycles" which however allow limited management of the amounts. In my project, the invoices to be added to arrive at the amount to be searched for can even be hundreds. Thanks for your help
... View more
Hello,
How to find the position of each backlash and dot in:
DATA new_dataset;
INPUT text :$100.;
DATALINES;
/dwh_actuariat/sasdata/sas1999/nx/ingnovex.rd016y.prm.jun1999.dat.Z
;
RUN;
... View more
Discover how Aleksandra Kruchinina turned her SAS Spring Campus internship into a launchpad for success in data science! From mastering SAS programming to honing crucial soft skills, her story is a blueprint for aspiring analysts.
Curious about kickstarting your data career? Learn how passion and curiosity can fuel your growth!
... View more
Hello everyone, I need your urgent help. I have 16 plants treated with 16 different fertilizers and a control plant (untreated), for a total of 17 units. The pots were randomly arranged without replication. However, these plants were fertilized for 27 weeks. What would be the repeated measure code for this? I want to see the effects of these fertilizers on plant height for 27 weeks. This is the coding I came up with. Thanks for your valuable time. frt= fertilizers ph= plant height Proc glimmix Data =first; class frt ph week; model ph = fert week fert*week; random week/subject=frt*rep type=ar(1) residual; lsmeans fert*week / pdiff Adjust = Tukey lines slicediff= day; output out=second predicted=pred residual=resid residual(noblup)=mresid student=studentresid student(noblup)=smresid; Run;
... View more
Hello Dears, When using sql query file in sas and write a select query as this example in sas documentation SELECT debtRatio AS {:debtinc:decimal}, cause AS {:reason:string:8} timestamp AS {:ts:datetime}
FROM hmeq_test WHERE badloan = {?:bad:decimal} this query on loaded table in sas memory or query database direct ? Thanks
... View more