About sustagens

sustagens · ‎11-27-2019

If all the shortened names have a period in your data and your actual scenario is as simple as your sample, then you can just eliminate all rows that contain '.' PROC SQL; CREATE TABLE WANT AS SELECT FirmID, Name FROM HAVE WHERE (Name NOT CONTAINS '.'); QUIT;

sustagens · ‎11-27-2019

data have; input A:8. B:8. C:8.; datalines; 2 3 5 2 3 5 2 4 5 2 5 4 3 1 2 3 2 1 3 1 2 ; run; proc sort data=have nodupkey out=want; by a b c; run;

sustagens · ‎11-27-2019

Should the update be made whenever the value of action is "paid up"? or is it dependent on the account?

sustagens · ‎11-24-2019

Figured it out. data Ccy_Ref; input Ccy_Order:8. Ccy:$10.; datalines; 3 JPY 4 OTHERS 2 PHP 1 USD ; run; DATA want; MERGE have ccy_ref; BY Ccy Ccy_Order; RUN;

sustagens · ‎11-22-2019

The initial data is filtered, for just MNCs. My data has transactions, and sometimes not all the currencies (USD/PHP/JPY/OTHERS) are transacted on, but the report format should be retained for all four whether there were any for the period or not. PROC SQL; CREATE TABLE want AS SELECT DISTINCT Ccy, Base_YM, (SUM(EXPOSURE)) FORMAT=COMMA23.2 AS EXPOSURE, Ccy_Order FROM have WHERE Trans="MNC" GROUP BY Ccy_Order, Ccy, Base_YM ORDER BY Ccy_Order, Base_YM; QUIT; Here is sample data for my have table: data have; input Ccy:$6. Base_YM:$6. EXPOSURE:COMMA23.2 Ccy_Order:32.; format EXPOSURE COMMA23.2 ; datalines; USD 201504 30000000 1 USD 201505 30000000 1 USD 201506 30000000 1 USD 201507 30000000 1 USD 201508 30000000 1 USD 201509 30000000 1 USD 201510 30000000 1 USD 201511 30000000 1 PHP 201510 65000000000 2 PHP 201511 50000000000 2 PHP 201512 50000000000 2 PHP 201601 55000000000 2 PHP 201602 60000000000 2 PHP 201910 250600000000 2 OTHERS 201910 8500000000 4 ; I then transposed that so that I have the dates as the columns. PROC TRANSPOSE DATA=want OUT=TRNS_want (drop=_NAME_) PREFIX=EXP LABEL=Label ; BY Ccy_Order Ccy; ID Base_YM; VAR EXPOSURE; RUN; QUIT;

sustagens · ‎11-22-2019

Hi, just struggling how to best cope with summarising data with null variants? One of my summarised datasets look like this: I need to be able to insert a dummy row, with a value of 3 for Ccy_Order and value of JPY for Ccy, and all the rest of the "EXPYYYYMM" columns to the right should be null. Sometimes the row for Ccy_Order 2 will be missing, sometimes for 1 or 4 - I just need all four types to be present. Any help is appreciated.

sustagens · ‎11-14-2019

proc sql; create table want as select a * (select a from ds2) as product_a, b * (select b from ds2) as product_b, c * (select c from ds2) as product_c, d * (select d from ds2) as product_d from ds1; quit;

sustagens · ‎11-13-2019

/*a*/ proc sql; create table item_a as select Gender, count (Gender) as count /*count the gender from the result of the sub query*/ from (select distinct ID, Gender from record) /*sub query to get distinct list of IDs and their gender, since there are multiple entries per ID*/ group by Gender; quit; /*b*/ proc sql; create table item_b as select ID, count (ID) as count_visits from record where score>50 group by ID; quit; /*c*/ proc sort data=record; by ID Score; /*sort first*/ run; proc means data=record; var Score; /*we want to analyse score*/ by ID; /*for every value of ID*/ output out=item_c; run; /*d*/ proc sql; create table item_d as select ID, Score as Mean from item_c /*let's take from the output in item c*/ where _STAT_ = 'Mean' and Score > 45; quit;

sustagens · ‎11-12-2019

Check if the variable IDA is of character- or numeric- type. Null values in character variables are assigned a blank ('') while in numeric variables it is a period (.) Hence if IDA is numeric, your condition will just be: if IDA = . But if IDA is character, your condition will be: if IDA = '' The only time you will need to use: if IDA = '.' is when a character variable intentionally has a period for a value.

sustagens · ‎11-12-2019

Check out this link: PROC REG: Simple Linear Regression "The table also contains the statistics and the corresponding -values for testing whether each parameter is significantly different from zero."

sustagens · ‎11-11-2019

"Second dataset contains three variables (without city variable)" Both input datasets you posted have less than three variables. Please post the source tables as is.

sustagens · ‎11-07-2019

Assuming the fin yr is just the current year plus 1: %let start_date = "01Jul2018:00:00:00"DT; %let start_date_text = %sysfunc(datepart(&start_date),ddmmyyn8.); %put &=start_date_text; %let fin_yr=%sysfunc(datepart(&start_date),year2.)%eval(%sysfunc(datepart(&start_date),year2.) + 1); %put &=fin_yr;

sustagens · ‎11-07-2019

This is a more novice approach but does it: data result_table; input stat $ result $; datalines; yes High yes Medium yes Low ; proc sql; create table want as select t1.id, t2.result from new t1 left join result_table t2 on t1.Stat=t2.Stat order by t1.id, t2.result ; quit;

sustagens · ‎11-06-2019

I would take out and segregate all 18 years and over data in one dataset, and another for 16 years and over. When I have that I can join them on matching date and gender and get their difference. Once I have the difference I can add it to the "18 to 24 years" value. My code can probably be optimised to make it shorter. /*Take all '18 years and over' rows*/ proc sql; create table data_18 as( select * from have where Age_group = '18 years and over' ); quit; /*Take all '16 years and over' rows*/ proc sql; create table data_16 as( select * from have where Age_group = '16 years and over' ); quit; /*Compute difference*/ proc sql; create table diff_of_16_and_18 as( select t1.Date, t1.Gender, '16 to 24 years' as Age_Group, sum(t1.Population_in_thousands,-t2.Population_in_thousands) as Population_in_thousands from data_18 t1 left join data_16 t2 on t1.Date=t2.Date and t1.Gender=t2.Gender ); quit; /*Combine the two tables: (1) original table, without the "16- and 18- years"*/ /* (2) difference table, ready for joining to "18-24 years"*/ data want; set have (where=(Age_group not in ('16 years and over' '18 years and over'))) diff_of_16_and_18 ; if Age_group in ('18 to 24 years') then new_age_group='16 to 24 years'; /*Rename "18 to 24 years" to correct grouping so they can be summed together in next step*/ else New_age_group = Age_group; run; /*Sum by new age group*/ proc sql; create table sum_by_new_age_group as( select Date, Gender, New_age_group, sum(Population_in_Thousands) as Population_in_Thousands from want group by Date, Gender, New_age_group ); quit;

sustagens · ‎11-06-2019

%let day=%sysfunc(intnx(week,%sysfunc(today()),-2,s),date9.); %put &=day;

Re: How can I import a SAS dataset to Access?

Re: How can I import a SAS dataset to Access?

Re: How can I import a SAS dataset to Access?

Re: How can I import a SAS dataset to Access?

How can I import a SAS dataset to Access?

Re: Programming Question

Bar chart with a group and subgroup?

Re: Dynamic month and year selection

Re: Nested DO loop in Macro

Re: Solving the error: A lock is not available for <dataset>

Re: How to fill the sequential alphabet letter to survey questions

Re: order by in subquery??

The Pursuit of Happyness - World Happiness Report 2020

Re: IF ELSE STATEMENT is not working as expected

A Giraffe Taught me Kubernetes!

Re: Nested DO loop in Macro

Re: SAS EG Computing new column with advanced expression while using F...

Re: one to many merge does not work as expected

Re: Creating Sum variables with different sorting

Re: How to bring all data in one row

Re: same person with different name

Re: Filtering data by two variables

Re: Updating/Overwriting Data

Re: Enter zero filled row

Re: Enter zero filled row

Enter zero filled row

Re: Multiply variables across datasets based on condition

Re: Newbie with proc sql help with basic statements (mean, max, min, e...

Re: Filling in Missing Values

Re: Only keep variables that are significantly different from 0

Re: How to merge two datasets?

Re: Macro to converting Datetime into text

Re: How to add new rows

Re: Finding the difference between two numeric values, then summing th...

Re: today () - 14)

SAS Analytics Explorers