About ed_sas_member

ed_sas_member · ‎12-03-2019

Hi @rkarr5

You can try this. It generates one dataset per ordercode, named after that ordercode and containing two columns: year (one observation per year from 2012 to 2018) and code. I am not sure this is what you want, as your input data are unusual: there are many duplicate records, and there is no rule to link the codes "north" or "south" with Internet_1a.

    /* List of tables to be created, stored in macro variables */
    proc sql noprint;
      select distinct ordercode into :ordercode_1-:ordercode_999 from table1;
      select count(distinct ordercode) into :nb_ordercode from table1;
    quit;

    /* Retrieve each ordercode-code pair */
    proc sort data=table1 out=table1_nodupkey (keep=ordercode code) nodupkey;
      by ordertype ordercode code name;
    run;

    /* Output 1 row / year for each ordercode-code pair - Method 1 */
    data table2;
      set table1_nodupkey;
      do year=2012 to 2018;
        output;
      end;
    run;

    %macro ordercode_tables();
      %do i=1 %to &nb_ordercode;
        data &&ordercode_&i (drop=ordercode);
          set table2;
          where ordercode = "&&ordercode_&i";
        run;
        /* Optional check of each table:
           title "&&ordercode_&i";
           proc print data=&&ordercode_&i;
           run;
           title;
        */
      %end;
    %mend;

    %ordercode_tables;

ed_sas_member · ‎12-02-2019

Hi @Lekhnath

I have updated the code to take this new requirement into account. In my opinion, the easiest way to handle "overlapping" departments is to use a dedicated format. In the code below, I have defined the format Dept. as a multilabel format (for example, 1 can be formatted either as "Tim" or as "Chuck"). You can then apply this format in the proc means and specify the "MLF" option to get summary statistics for each format category.

Best,

    data have;
      infile datalines dlm=" ";
      input Year Month $ Office $ Sales_type $ Sales;
      datalines;
    2018 Jan Dallas A 10
    2018 Jan Dallas B 13
    2018 Jan Dallas C 15
    2018 Jan Dallas D 20
    2018 Jan NY A 5
    2018 Jan NY B 9
    2018 Jan NY C 7
    2018 Jan NY D 17
    2018 Jan DC A 15
    2018 Jan DC B 19
    2018 Jan DC C 17
    2018 Jan DC D 19
    2018 Feb Dallas A 11
    2018 Feb Dallas B 14
    2018 Feb Dallas C 16
    2018 Feb Dallas D 21
    2018 Feb NY A 6
    2018 Feb NY B 10
    2018 Feb NY C 8
    2018 Feb NY D 18
    2018 Feb DC A 16
    2018 Feb DC B 20
    2018 Feb DC C 18
    2018 Feb DC D 20
    ;
    run;

    /* Assign a numeric department code to each office / sales type */
    data have2;
      set have;
      if Office in ("Dallas","NY") and Sales_type in ("A","B") then Dept=1;
      else if Office = "Dallas" and Sales_type in ("C","D") then Dept=2;
      else if Office in ("NY", "DC") and Sales_type in ("C","D") then Dept=3;
      else if Office="DC" and Sales_type in ("A","B") then Dept=4;
    run;

    proc sort data=have2 (drop=Office Sales_type);
      by Year Month Dept;
    run;

    /* Multilabel format: codes 1 and 4 also roll up to "Chuck" */
    proc format fmtlib;
      value Dept (multilabel)
        1 = "Tim"
        2 = "Sam"
        3 = "Henry"
        4 = "Rick"
        1,4 = "Chuck";
    run;

    proc means data=have2 sum maxdec=0;
      var Sales;
      class Year Month Dept / mlf;
      ways 3;
      output out=want (drop=_type_ _freq_) sum=Sales;
      format Dept Dept.;
    run;

ed_sas_member · ‎12-02-2019

Hi @Lekhnath

You can try the following code, which uses a proc means to get the summary statistics (sum of sales). If you want to display your table in chronological order (years and months), I recommend storing the month as a number and defining a format for it (e.g. 1 = "Jan", etc.).

Best,

    data have;
      infile datalines dlm=" ";
      input Year Month $ Office $ Sales_type $ Sales;
      datalines;
    2018 Jan Dallas A 10
    2018 Jan Dallas B 13
    2018 Jan Dallas C 15
    2018 Jan Dallas D 20
    2018 Jan NY A 5
    2018 Jan NY B 9
    2018 Jan NY C 7
    2018 Jan NY D 17
    2018 Jan DC A 15
    2018 Jan DC B 19
    2018 Jan DC C 17
    2018 Jan DC D 19
    2018 Feb Dallas A 11
    2018 Feb Dallas B 14
    2018 Feb Dallas C 16
    2018 Feb Dallas D 21
    2018 Feb NY A 6
    2018 Feb NY B 10
    2018 Feb NY C 8
    2018 Feb NY D 18
    2018 Feb DC A 16
    2018 Feb DC B 20
    2018 Feb DC C 18
    2018 Feb DC D 20
    ;
    run;

    /* Assign a department name to each office / sales type */
    data have2;
      set have;
      length Dept $10;
      if Office in ("Dallas","NY") and Sales_type in ("A","B") then Dept="Tim";
      else if Office = "Dallas" and Sales_type in ("C","D") then Dept="Sam";
      else if Office in ("NY", "DC") and Sales_type in ("C","D") then Dept="Henry";
      else if Office="DC" and Sales_type in ("A","B") then Dept="Rick";
    run;

    proc sort data=have2 (drop=Office Sales_type);
      by Year Month Dept;
    run;

    proc means data=have2 sum maxdec=0;
      var Sales;
      class Year Month Dept;
      ways 3;
      output out=want (drop=_type_ _freq_) sum=Sales;
    run;
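The month format suggested in this reply could look like the sketch below. The names monthfmt, Month_num and have_num are assumptions made for illustration; they do not appear in the original thread. The idea is to derive a numeric month from the character Month variable and attach a format to it, so that the values are ordered as 1, 2, 3... rather than alphabetically.

```sas
/* Hypothetical format mapping a month number to its label */
proc format;
  value monthfmt
    1 = "Jan"  2 = "Feb"  3 = "Mar"  4 = "Apr"
    5 = "May"  6 = "Jun"  7 = "Jul"  8 = "Aug"
    9 = "Sep" 10 = "Oct" 11 = "Nov" 12 = "Dec";
run;

data have_num;
  set have;
  /* Build a date such as 01Jan2018 and extract its month number */
  Month_num = month(input(cats("01", Month, Year), date9.));
  format Month_num monthfmt.;
run;
```

Under these assumptions, Month_num would replace Month in the class statement of the proc means. By default, proc means orders class levels by their unformatted values, so the months should come out in calendar order while still being displayed as "Jan", "Feb", etc.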

ed_sas_member · ‎12-02-2019

Hi @kz134

You can try this, using array statements:

    data have;
      /* truncover: short records leave the trailing variables missing */
      infile datalines truncover;
      input Customer $ Product_1 $ Cost_1 Product_2 $ Cost_2 Product_3 $ Cost_3;
      datalines;
    AAA Book 10 Pen 2 Computer 1000
    BBB Pen 2 Phone 500
    CCC TV 500 Phone 500
    ;
    run;

    data want;
      set have;
      array product_(3) $;
      array cost_(3);
      /* One output row per non-missing product */
      do i=1 to dim(product_);
        if product_(i) ne " " then do;
          product = product_(i);
          cost = cost_(i);
          output;
        end;
      end;
      keep customer product cost;
    run;

ed_sas_member · ‎12-02-2019

Hi @SivaKizildag

You face this issue because the grouping variable has the same name as the subgrouping variable, so the IF statements for subgrouping overwrite the value set by the first one for grouping. As "06" is the only value not referenced for subgrouping, it is the only one for which the grouping value is kept (not overwritten). You should create a separate variable for subgrouping. I also encourage you to use "else if" in your statements to make the code more efficient.

    data temp;
      set in.data;
      /* Group */
      if sect in ('01' '02' '03' '04' '05' '06') then my_sector='Industry';
      /* Subgroup, stored in its own variable */
      if sect in ('01' '02') then my_subsector='Textile Industry';
      else if sect in ('03' '04') then my_subsector='Food Industry';
      else if sect in ('05') then my_subsector='Mining Industry';
    run;

Best,

ed_sas_member · ‎12-02-2019

Hi @vnreddy Could you please share some sample data? Thank you!

ed_sas_member · ‎12-02-2019

Hi @Dumi1,

If the date in your Excel file is stored as a character expression (e.g. "12022019"), you can use a formula like this:

    =CONCATENATE(MID(A1;5;4);MID(A1;1;2);MID(A1;3;2))

It returns a character expression like "20191202". MID() is the equivalent of substr() in SAS.

Best,
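Since MID() corresponds to substr(), the same rearrangement can be sketched on the SAS side. The variable names date_txt, date_iso, date_num and date_iso2 are hypothetical, and the sketch assumes the text is always 8 characters in month-day-year order, as in the example above.

```sas
data _null_;
  /* Hypothetical text date in MMDDYYYY order */
  date_txt = "12022019";

  /* Same rearrangement as the Excel formula: year, month, day */
  date_iso = cats(substr(date_txt, 5, 4),
                  substr(date_txt, 1, 2),
                  substr(date_txt, 3, 2));

  /* Alternative: read the text as a SAS date, then write it as YYYYMMDD */
  date_num  = input(date_txt, mmddyy8.);
  date_iso2 = put(date_num, yymmddn8.);

  put date_iso= date_iso2=;
run;
```

The second approach has the advantage of producing a real SAS date value (date_num) that can be used in date calculations, whereas the substr() version only reorders the characters.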

ed_sas_member · ‎12-02-2019

Hi @Ronein,

I believe you can create data-driven macro calls using the DOSUBL function as follows. To use it, you need a dataset containing the different month values in the variable Month.

    data have;
      input Month;
      cards;
    1904
    1905
    1906
    ;
    run;

    /* Call %RRR once per row, passing the month as argument */
    data _null_;
      set have;
      rc=dosubl(cats('%RRR(',month,')'));
    run;

Let me know!

Best,

ed_sas_member · ‎12-02-2019

Hi @Bounce

You can try this code (proc transpose + proc sort to remove duplicate records):

    proc transpose data=example out=want (rename=(col1=New_Var) drop=_name_);
      by id;
      var var:;
    run;

    proc sort data=want nodupkey;
      by id New_Var;
    run;

ed_sas_member · ‎12-02-2019

Hi @farshidowrang

Assuming the word in parentheses is at the end of the character expression, you can try this code:

    data want;
      set have;
      /* Keep everything before the opening parenthesis */
      var2 = trim(substr(var1, 1, index(var1,"(")-1));
    run;

ed_sas_member · ‎12-01-2019

Hi @Xinhui

The following code performs a t-test (comparison of the means of the two groups defined by the CLASS statement):

    proc ttest data=have;
      var csrp;
      class d;
    run;

Are you sure you want to perform a paired t-test? If so, you need to define how observations are paired between the two groups. Do you have an ID variable to identify the pairs? In that case, you should have two variables (the csrp value for each group, one pair per row) and the syntax would be:

    proc ttest data=have;
      paired csrp1*csrp2;
    run;
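If the data are stored with one row per observation, the "one pair per row" layout could be obtained with a transpose first. This is only a sketch: pair_id is a hypothetical ID variable that is not mentioned in the thread, and it assumes each pair has exactly one observation in each of the two groups defined by d.

```sas
/* Assumption: pair_id (hypothetical) identifies each pair,
   and d takes exactly two values within each pair */
proc sort data=have out=have_sorted;
  by pair_id d;
run;

/* One row per pair: csrp1 = first group, csrp2 = second group */
proc transpose data=have_sorted out=wide (drop=_name_) prefix=csrp;
  by pair_id;
  var csrp;
run;

proc ttest data=wide;
  paired csrp1*csrp2;
run;
```

Sorting by d within pair_id keeps the group order consistent, so that csrp1 always holds the value from the same group.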

ed_sas_member · ‎12-01-2019

You're welcome @Amy0223! Could you please set the topic as answered so that it can be accessible to the community? Thank you

ed_sas_member · ‎12-01-2019

Hi @Amy0223

I don't understand why you need to use the different functions separately, because in my opinion what defines the zipcode pattern is the conjunction of 4 conditions:
- length of the zipcode = 10
- characters 1 to 5 = a number
- characters 7 to 10 = a number
- 6th character = a hyphen

The prxmatch function is a more efficient way to do this, but you can also combine the traditional length(), index() and substr() functions to create the flag variable. It doesn't make sense to use them separately.

    data zipcode_check;
      set zipcode;
      if length(zipcode) = 10
         and 0 < substr(zipcode, 1, 5) < 99999
         and 0 < substr(zipcode, 7, 4) < 9999
         and index(zipcode,"-") = 6
        then variable_9digit=1;
      else variable_9digit=0;
    run;

ed_sas_member · ‎12-01-2019

Hi @Amy0223

This is a typical use case for regular expressions. The prxmatch() function as written below checks whether the zipcode variable matches the following pattern: 5 digits (\d), 1 hyphen, 4 digits.

    data zipcode_flag;
      set zipcode;
      if prxmatch('/\d{5}\-\d{4}/',zipcode) then variable_9digit = 1;
      else variable_9digit = 0;
    run;

The issue with your 3 tests is that your code depends on one particular zip code rather than on the general pattern.

Best,
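Note that the pattern in this reply matches anywhere in the value, so a longer string that merely contains 5 digits, a hyphen and 4 digits would also be flagged. If the whole value must have this shape, a stricter variant could anchor the pattern as sketched below. The dataset name zipcode_flag_strict is made up for illustration; the \s* part is there on the assumption that zipcode is a fixed-length character variable padded with trailing blanks.

```sas
data zipcode_flag_strict;
  set zipcode;
  /* ^ and $ anchor the pattern to the whole value;
     \s* tolerates the trailing blanks of a padded character variable */
  if prxmatch('/^\d{5}-\d{4}\s*$/', zipcode) then variable_9digit = 1;
  else variable_9digit = 0;
run;
```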

ed_sas_member · ‎12-01-2019

Hi @Ronein

Thank you for your feedback! Have you tried using a filename statement like the one below?

    /* Release the library before deleting the file */
    libname RRR clear;

    filename fileref "/path/shoes.xlsx";

    data _NULL_;
      rc = fdelete("fileref");
    run;

