About rogerjdeangelis

rogerjdeangelis · ‎12-11-2016

Flag earliest test date when a test has at least one test date after treatment end date Flag earliest date when a test has at least one test date after treatment end date I sorted the original have datset by test testdate HAVE ==== Up to 40 obs from havsrt total obs=10 Obs TEST TESTDATE TREATMENDATE 1 a 01AUG2015 15AUG2015 2 a 20AUG2015 15AUG2015 3 b 02AUG2015 15AUG2015 4 b 21AUG2015 15AUG2015 5 c 03AUG2015 15AUG2015 6 c 23AUG2015 15AUG2015 7 d 04AUG2015 15AUG2015 8 d 22AUG2015 15AUG2015 9 e 11AUG2015 15AUG2015 10 f 12AUG2015 15AUG2015 WANT ==== Up to 40 obs WORK.WANT total obs=10 Obs TEST TESTDATE TREATMENDATE FLAG 1 a 20301 20315 Y Flag eariest date when test 'a' has at 2 a 20320 20315 least one testdate after treatment end date 3 b 20302 20315 Y 4 b 20321 20315 5 c 20303 20315 Y 6 c 20323 20315 7 d 20304 20315 Y 8 d 20322 20315 9 e 20311 20315 10 f 20312 20315 WORKING CODE =============== do until (last.test); if testdate > treatmendate then flg='Y'; end; do until (last.test); if first.test and flg='Y' then flag='Y'; else flag=' '; end; FULL SOLUTION ============= data have; input test $ testdate date10. treatmendate date10.; cards4; a 01AUG2015 15AUG2015 b 02AUG2015 15AUG2015 c 03AUG2015 15AUG2015 d 04AUG2015 15AUG2015 e 11AUG2015 15AUG2015 f 12AUG2015 15AUG2015 a 20AUG2015 15AUG2015 b 21AUG2015 15AUG2015 c 23AUG2015 15AUG2015 d 22AUG2015 15AUG2015 ;;;; run;quit; libname wrk "%sysfunc(pathname(work))"; proc sort data=wrk.have out=havsrt; by test testdate; run;quit; data wrk.want; retain flg "N"; do until (last.test); set havsrt; by test; if testdate > treatmendate then flg="Y"; end; do until (last.test); set havsrt; by test; if first.test and flg="Y" then flag="Y"; else flag=" "; output; end; flg="N"; drop flg; run;quit;

rogerjdeangelis · ‎12-11-2016

Flag earliest test date when a test has at least one test date after treatment end date Flag earliest date when a test has at least one test date after treatment end date I sorted the original have datset by test testdate HAVE ==== Up to 40 obs from havsrt total obs=10 Obs TEST TESTDATE TREATMENDATE 1 a 01AUG2015 15AUG2015 2 a 20AUG2015 15AUG2015 3 b 02AUG2015 15AUG2015 4 b 21AUG2015 15AUG2015 5 c 03AUG2015 15AUG2015 6 c 23AUG2015 15AUG2015 7 d 04AUG2015 15AUG2015 8 d 22AUG2015 15AUG2015 9 e 11AUG2015 15AUG2015 10 f 12AUG2015 15AUG2015 WANT ==== Up to 40 obs WORK.WANT total obs=10 Obs TEST TESTDATE TREATMENDATE FLAG 1 a 20301 20315 Y Flag eariest date when test 'a' has at 2 a 20320 20315 least one testdate after treatment end date 3 b 20302 20315 Y 4 b 20321 20315 5 c 20303 20315 Y 6 c 20323 20315 7 d 20304 20315 Y 8 d 20322 20315 9 e 20311 20315 10 f 20312 20315 WORKING CODE =============== do until (last.test); if testdate > treatmendate then flg='Y'; end; do until (last.test); if first.test and flg='Y' then flag='Y'; else flag=' '; end; FULL SOLUTION ============= data have; input test $ testdate date10. treatmendate date10.; cards4; a 01AUG2015 15AUG2015 b 02AUG2015 15AUG2015 c 03AUG2015 15AUG2015 d 04AUG2015 15AUG2015 e 11AUG2015 15AUG2015 f 12AUG2015 15AUG2015 a 20AUG2015 15AUG2015 b 21AUG2015 15AUG2015 c 23AUG2015 15AUG2015 d 22AUG2015 15AUG2015 ;;;; run;quit; libname wrk "%sysfunc(pathname(work))"; proc sort data=wrk.have out=havsrt; by test testdate; run;quit; data wrk.want; retain flg "N"; do until (last.test); set havsrt; by test; if testdate > treatmendate then flg="Y"; end; do until (last.test); set havsrt; by test; if first.test and flg="Y" then flag="Y"; else flag=" "; output; end; flg="N"; drop flg; run;quit;

rogerjdeangelis · ‎12-09-2016

Noticed an error which will just make the algoritm faster car = cats('E0',byte(mod(ran,10)+50),byte(mod(ran1,10)+50)); car_name = cats('F0',byte(mod(ran,10)+50),byte(mod(ran1,50)+50)); should be car = cats('E0',byte(mod(ran,10)+50),byte(mod(ran1,10)+50)); car_name = cats('F0',byte(mod(ran,10)+50),byte(mod(ran1,10)+50)); Car and vcar_name are not 1:1

rogerjdeangelis · ‎12-09-2016

This does not solve your problem but I am having trouble coming with example data. I was able to join a 160,000,000 11 variable table with a 6,000,000 2 variable table in under 2 minutes. I have a very slow old 2008 computer with DDR2 ram. Newer computers should cut the time in half. I was unable to create a 14m result? Do not see a need to parallelize the code(yet) /* 60k result is a far cry from 14m but I don't see how to get 14m My resulting table Result of join of 160,000,000 and 6,000,000 NOTE: Table SD1.RES created, with 60000 rows and 10 columns. 261 ! quit; NOTE: PROCEDURE SQL used (Total process time): real time 2:03.27 cpu time 3:19.24 */ Maybe you only need one variable in the potential 1:1 relationships? Here are my assumed cardinalities 3 Years 4 Countries 10,000 erg_ids ( algorithm is relatively insensitive to this variable) 1,000 different offices (assume office_name is 1:1 with office) 1,000 different office names (assume office_name is 1:1 with office) 100 different cars 1:1 car names 100 different car names 400 redidues 1:1 with residue name 400 residue name INPUTS Up to 40 obs from spde.dmperg160m total obs=160,000,000 OFFICE_ STA_ID OFFICE COUNTRY NAME CAR CAR_NAME RESIDUE RESIDUENAME ERG_ID YEAR 1 A049; C01 D049; E049 F049 G04C H04C 2 2005 1 A0324 C02 D0324 E032 F03F G03< H03< 61 2004 1 A0348 C02 D0348 E034 F03> G03> H03> 81 2003 1 A0;32 C04 D0;32 E0;3 F0;3 G0;= H0;= 49 2004 1 A0252 C01 D0252 E025 F025 G025 H025 80 2005 1 A035: C02 D035: E035 F03S G03? H03? 21 2003 1 A053: C02 D053: E053 F05= G05= H05= 83 2005 1 A069: C04 D069: E069 F069 G0@9 H0@9 54 2003 ..... Up to 40 obs from spde.dmpsta17m total obs=6,000,000 CARTYPE_ Obs STA_ID ID 1 1 33 2 2 44 3 3 55 4 4 66 5 5 77 6 6 88 7 7 99 8 8 :: 9 9 ;; 10 10 << WANT ( Cannot get 14m which may imply exact duplicate records) Up to 40 obs SD1.RES total obs=60,000 OFFICE_ CARTYPE_ Obs OFFICE COUNTRY NAME CAR CAR_NAME RESIDUE RESIDUENAME STA_IDS ERG_IDS IDS 1 A0222 C01 D0222 E022 F022 G022 H022 280 2 43 2 A0222 C01 D0222 E022 F022 G02< H02< 282 2 42 3 A0222 C01 D0222 E022 F022 G0<2 H0<2 152 1 38 4 A0222 C01 D0222 E022 F022 G0<< H0<< 170 1 37 5 A0222 C01 D0222 E022 F02< G022 H022 318 2 42 6 A0222 C01 D0222 E022 F02< G02< H02< 306 2 46 7 A0222 C01 D0222 E022 F02< G0<2 H0<2 159 1 39 8 A0222 C01 D0222 E022 F02< G0<< H0<< 140 1 31 9 A0222 C01 D0222 E022 F02F G022 H022 275 2 45 10 A0222 C01 D0222 E022 F02F G02< H02< 274 2 42 11 A0222 C01 D0222 E022 F02F G0<2 H0<2 156 1 35 12 A0222 C01 D0222 E022 F02F G0<< H0<< 145 1 35 13 A0222 C01 D0222 E022 F02P G022 H022 303 2 43 14 A0222 C01 D0222 E022 F02P G02< H02< 301 2 46 15 A0222 C01 D0222 E022 F02P G0<2 H0<2 146 1 36 16 A0222 C01 D0222 E022 F02P G0<< H0<< 173 1 40 WORKING CODE ============ group by r.office ,r.country ,r.office_name ,r.car ,r.car_name ,r.residue ,r.residuename FULL SOLUTION ============= libname spde spde ('c:\wrk\spde_c','d:\wrk\spde_d','e:\wrk\spde_e','g:\wrk\spde_g','h:\wrk\spde_h') metapath =('c:\wrk\spde_c\metadata') indexpath=( 'c:\wrk\spde_c' ,'d:\wrk\spde_d' ,'e:\wrk\spde_e' ,'g:\wrk\spde_g' ,'h:\wrk\spde_h') datapath =( 'c:\wrk\spde_c' ,'d:\wrk\spde_d' ,'e:\wrk\spde_e' ,'g:\wrk\spde_g' ,'h:\wrk\spde_h') partsize=500m ; proc datasets lib=spde kill; run;quit; * CREATE INPUT; data spde.dmperg160m (drop=ran: rec sortedby=country index=(country)); retain sta_id 0; length office $5 country $3 office_name $5 car $4 car_name $4 residue $4 residuename $4 erg_id $4; do rec=1 to 40000000; do country='C01','C02','C02','C04'; if mod(rec,10) = 0 then sta_id=sta_id+1; ran=int(100*uniform(5731)); ran1=int(100*uniform(5731)); ran2=int(100*uniform(5731)); year=put(2003+mod(ran,3),4.); office = cats('A0',byte(mod(ran,10)+50),byte(mod(ran1,10)+50),byte(mod(ran2,10)+50)); office_name = cats('D0',byte(mod(ran,10)+50),byte(mod(ran1,10)+50),byte(mod(ran2,10)+50)); car = cats('E0',byte(mod(ran,10)+50),byte(mod(ran1,10)+50)); car_name = cats('F0',byte(mod(ran,10)+50),byte(mod(ran1,50)+50)); residue = cats('G0',byte(mod(ran,20)+50),byte(mod(ran1,20)+50)); residuename = cats('H0',byte(mod(ran,20)+50),byte(mod(ran1,20)+50)); erg_id = put(mod(ran,9999),4.); output; end; end; run;quit; /* NOTE: The data set SPDE.DMPERG160M has 160000000 observations and 11 variables. NOTE: DATA statement used (Total process time): real time 4:47.34 user cpu time 7:01.88 system cpu time 4:11.58 memory 63209.31k OS Memory 89692.00k Timestamp 12/09/2016 10:24:40 AM Step Count 145 Switch Count 9766 */ data spde.dmpsta17m(index=(sta_id)); retain sta_id 0; length cartype_id $2; do sta_id=1 to 6000000; cartype_id = cats(byte(mod(sta_id,50)+50),byte(mod(sta_id,50)+50)); output; end; run;quit; /* NOTE: The data set SPDE.DMPSTA17M has 6000000 observations and 2 variables. NOTE: DATA statement used (Total process time): real time 3.54 seconds cpu time 5.67 seconds 1280! quit; */ proc sql; create table sd1.res as select r.office ,r.country ,r.office_name ,r.car ,r.car_name ,r.residue ,r.residuename ,count(distinct l.sta_id) as sta_ids ,count (distinct r.erg_id) as erg_ids ,count(distinct l.cartype_ID) as cartype_IDs from spde.dmpsta17m as l ,spde.dmperg160m as r where r.year='2005' and l.sta_id = r.sta_id group by r.office ,r.country ,r.office_name ,r.car ,r.car_name ,r.residue ,r.residuename ;quit; NOTE: Table SD1.RES created, with 60000 rows and 10 columns. 261 ! quit; NOTE: PROCEDURE SQL used (Total process time): real time 2:03.27 cpu time 3:19.24

rogerjdeangelis · ‎12-09-2016

Fullstimer statistics would also be usefull

rogerjdeangelis · ‎12-09-2016

There may be issues with your data structure. Could you provide the two proc contents and the join key. A 17gb dataset with only 6 million rows means that the record length is almost 3,000 bytes. It is a commmon practice to use codes for long text and even 8 byte bymerics to reduce the width of records. Both datasets seem very fat? How many rows and what is the width of the resultant dataset.

rogerjdeangelis · ‎12-01-2016

Note if you use the more powerful old text editor you can type 'cols' in the prefix area to get a ruler or use fslist with nums on and hex on. You can even use ths command file to find position of tabs or any other non-printable characters, ie f '09'x. I find these and the many other old text editor type functions that are not available in any of the newer somewhat crippled editors.

rogerjdeangelis · ‎12-01-2016

Looks like you have overlapping ranges. Also note that 'list;' statement can help with ranges data tst; *input name $ 1-14 sci_name $ 15-35 sale_qty 36-38 remnant 39-40 code$ 41-42 color $; input name $ 1-14 sci_name $ 15-40 sale_qty 41-45 remnant 46-50 code$ 51-53 color $; list; cards4; M. grandiflora Southern Magnolia 80 15 E White M. Campbellii 80 20 D Rose M. Liliflora Lily Magnolia 12 4 D Purple M. soulangiana Saucer Magnolia 25 3 D Pink M. Stellata Star Magnolia 10 3 D White ;;;; run;quit; ----+----1----+----2----+----3----+----4----+----5----+----6--- M. grandiflora Southern Magnolia 80 15 E White M. Campbellii 80 20 D Rose M. Liliflora Lily Magnolia 12 4 D Purple M. soulangiana Saucer Magnolia 25 3 D Pink M. Stellata Star Magnolia 10 3 D White

rogerjdeangelis · ‎11-30-2016

Left out code and manual operation to create xlsb workbook * create an xlsx dataset; libname xel "d:/xls/class.xlsx"; data xel.classxlb; set sashelp.class; run;quit; libname xel clear; * open and save as xlsb;

rogerjdeangelis · ‎11-30-2016

Here are three solutions ___ __ __ _ |_ _| \/ | | | || |\/| | | | || | | | |___ |___|_| |_|_____| proc iml; submit / R; library(RODBC); wb <- "d:/xls/class.xlsb"; con2 <- odbcConnectExcel2007(wb); data <- sqlFetch(con2, "class"); data; endsubmit; call importdatasetfromr('data','data'); quit; ____ | _ \ | |_) | | _ < |_| \_\ %utl_submit_r64(' library(RODBC); wb <- "d:/xls/class.xlsb"; con2 <- odbcConnectExcel2007(wb); data <- sqlFetch(con2, "classxlb"); data; '); > library(RODBC);wb <- "d:/xls/class.xlsb"; con2 <- odbcConnectExcel2007(wb); data <- sqlFetch(con2, "classxlb"); data; NAME SEX AGE HEIGHT WEIGHT 1 Alfred M 14 69.0 112.5 2 Alice F 13 56.5 84.0 3 Barbara F 13 65.3 98.0 4 Carol F 14 62.8 102.5 5 Henry M 14 63.5 102.5 6 James M 12 57.3 83.0 7 Jane F 12 59.8 84.5 8 Janet F 15 62.5 112.5 9 Jeffrey M 13 62.5 84.0 10 John M 12 59.0 99.5 11 Joyce F 11 51.3 50.5 12 Judy F 14 64.3 90.0 13 Louise F 12 56.3 77.0 14 Mary F 15 66.5 112.0 15 Philip M 16 72.0 150.0 16 Robert M 12 64.8 128.0 17 Ronald M 15 67.0 133.0 18 Thomas M 11 57.5 85.0 19 William M 15 66.5 112.0 > __ ______ ____ \ \ / / _ \/ ___| \ \ /\ / /| |_) \___ \ \ V V / | __/ ___) | \_/\_/ |_| |____/ %utl_submit_wps64(' options set=R_HOME "C:/Program Files/R/R-3.2.4"; libname saswork "%sysfunc(pathname(work))"; proc r; submit; library(RODBC); wb <- "d:/xls/class.xlsb"; con2 <- odbcConnectExcel2007(wb); classxlb <- sqlFetch(con2, "classxlb"); classxlb; endsubmit; import r=classxlb data=saswork.classxlb; run;quit; '); proc print data=classxlb; run;quit; Up to 40 obs from classxlb total obs=19 Obs NAME SEX AGE HEIGHT WEIGHT 1 Alfred M 14 69.0 112.5 2 Alice F 13 56.5 84.0 3 Barbara F 13 65.3 98.0 4 Carol F 14 62.8 102.5 5 Henry M 14 63.5 102.5 6 James M 12 57.3 83.0 7 Jane F 12 59.8 84.5 8 Janet F 15 62.5 112.5 9 Jeffrey M 13 62.5 84.0 10 John M 12 59.0 99.5 11 Joyce F 11 51.3 50.5 12 Judy F 14 64.3 90.0 13 Louise F 12 56.3 77.0 14 Mary F 15 66.5 112.0 15 Philip M 16 72.0 150.0 16 Robert M 12 64.8 128.0 17 Ronald M 15 67.0 133.0 18 Thomas M 11 57.5 85.0 19 William M 15 66.5 112.0

rogerjdeangelis · ‎11-30-2016

Here are three solutions ___ __ __ _ |_ _| \/ | | | || |\/| | | | || | | | |___ |___|_| |_|_____| proc iml; submit / R; library(RODBC); wb <- "d:/xls/class.xlsb"; con2 <- odbcConnectExcel2007(wb); data <- sqlFetch(con2, "class"); data; endsubmit; call importdatasetfromr('data','data'); quit; ____ | _ \ | |_) | | _ < |_| \_\ %utl_submit_r64(' library(RODBC); wb <- "d:/xls/class.xlsb"; con2 <- odbcConnectExcel2007(wb); data <- sqlFetch(con2, "classxlb"); data; '); > library(RODBC);wb <- "d:/xls/class.xlsb"; con2 <- odbcConnectExcel2007(wb); data <- sqlFetch(con2, "classxlb"); data; NAME SEX AGE HEIGHT WEIGHT 1 Alfred M 14 69.0 112.5 2 Alice F 13 56.5 84.0 3 Barbara F 13 65.3 98.0 4 Carol F 14 62.8 102.5 5 Henry M 14 63.5 102.5 6 James M 12 57.3 83.0 7 Jane F 12 59.8 84.5 8 Janet F 15 62.5 112.5 9 Jeffrey M 13 62.5 84.0 10 John M 12 59.0 99.5 11 Joyce F 11 51.3 50.5 12 Judy F 14 64.3 90.0 13 Louise F 12 56.3 77.0 14 Mary F 15 66.5 112.0 15 Philip M 16 72.0 150.0 16 Robert M 12 64.8 128.0 17 Ronald M 15 67.0 133.0 18 Thomas M 11 57.5 85.0 19 William M 15 66.5 112.0 > __ ______ ____ \ \ / / _ \/ ___| \ \ /\ / /| |_) \___ \ \ V V / | __/ ___) | \_/\_/ |_| |____/ %utl_submit_wps64(' options set=R_HOME "C:/Program Files/R/R-3.2.4"; libname saswork "%sysfunc(pathname(work))"; proc r; submit; library(RODBC); wb <- "d:/xls/class.xlsb"; con2 <- odbcConnectExcel2007(wb); classxlb <- sqlFetch(con2, "classxlb"); classxlb; endsubmit; import r=classxlb data=saswork.classxlb; run;quit; '); proc print data=classxlb; run;quit; Up to 40 obs from classxlb total obs=19 Obs NAME SEX AGE HEIGHT WEIGHT 1 Alfred M 14 69.0 112.5 2 Alice F 13 56.5 84.0 3 Barbara F 13 65.3 98.0 4 Carol F 14 62.8 102.5 5 Henry M 14 63.5 102.5 6 James M 12 57.3 83.0 7 Jane F 12 59.8 84.5 8 Janet F 15 62.5 112.5 9 Jeffrey M 13 62.5 84.0 10 John M 12 59.0 99.5 11 Joyce F 11 51.3 50.5 12 Judy F 14 64.3 90.0 13 Louise F 12 56.3 77.0 14 Mary F 15 66.5 112.0 15 Philip M 16 72.0 150.0 16 Robert M 12 64.8 128.0 17 Ronald M 15 67.0 133.0 18 Thomas M 11 57.5 85.0 19 William M 15 66.5 112.0

rogerjdeangelis · ‎11-30-2016

This shoud work regardless of the format(appearance) of the number in excel. You will need to change to numeric in SAS, which you can do in the outer SQL using input(ChrNum,best15.) as ChrNum /* T0100580 Casting 15 digit excel numbers to character using passthru Casting 15 digit excel numbers to character using passthru inspired by https://goo.gl/jnnQqa https://communities.sas.com/t5/Base-SAS-Programming/Excel-import-Exponential-value-into-SAS-as-character-field/m-p/312116 HAVE ( Where X is numeric) +------------------+ | A | --+------------------+ 1 | X | --|------------------+ 2 | 1202220022121120| ---------------------+ 3 | 1202220022121120| ---------------------+ 4 | 1202220022121120| --+------------------+ num18 WANT ==== WANT (note the sheet name is num18 could be the default) ==== Up to 40 obs WORK.XLS_CAST total obs=3 Obs CHRNUM 1 1202220022121120 2 1202220022121120 3 1202220022121120 WORKING CODE format(X,'################') as ChrNum FULL SOLUTION ============= * create a sheet with the 15 digit numbers; %utlfkil(d:/xls/utl_excel_cast.xlsx); libname xel "d:/xls/utl_excel_cast.xlsx"; data xel.num18; do x=1202220022121121,1202220022121121,1202220022121121; output; end; run;quit; libname xel clear; * cast the numbers to char using passthru; proc sql dquote=ansi; connect to excel (Path="d:\xls\utl_excel_cast.xlsx" mixed=yes); create table xls_cast as select ChrNum length=16 from connection to Excel ( Select format(X,'################') as ChrNum from num18 ); disconnect from Excel; Quit;

rogerjdeangelis · ‎11-29-2016

Hits #41 Creating an empty sas datasets (from SAS-L) Not exactly a reversal of proc contents, but related /* T000180 CREATING AN EMPTY SAS DATASETS */ data shell; length x 8 name sex $16; call missing(of _all_); stop; run;quit; /* call missing elims warniing */ data shell(drop=age); stop; set sashelp.class sashelp.shoes; AgeChar=put(age,z2.); run; proc sql; create table t1( id int ,ic varchar(10) ,icd varchar(500) ,Idca varchar(500) ) ; quit; proc sql; create table new like sashelp.class; quit; Also you can create a SAS dataset with data (bug introduced after 9.2 - char lengths missing) filename tagset http "http://support.sas.com/rnd/base/ods/odsmarkup/sql.sas"; %include tagset; ods tagsets.sql file="class.sql"; proc print data=sashelp.class ; run; ods _all_ close; ods listing; Create table CLASS (Name varchar(7), Sex varchar(1), Age float, Height float, Weight float); Insert into CLASS(Name, Sex, Age, Height, Weight) Values ('Alfred', 'M', 14, 69.0, 112.5); Insert into CLASS(Name, Sex, Age, Height, Weight) Values ('Alice', 'F', 13, 56.5, 84.0); /* Can be simplified */ Create table CLASS (Name varchar(7), Sex varchar(1), Age float, Height float, Weight float); Insert into CLASS(Name, Sex, Age, Height, Weight) Values ('Alfred', 'M', 14, 69.0, 112.5) Values ('Alice', 'F', 13, 56.5, 84.0);

rogerjdeangelis · ‎11-29-2016

SAS forum: Adding average MPG city by country and cartype to each observation HAVE Up to 40 obs WORK.CARSRT total obs=35 Obs ORIGIN TYPE MPG_CITY 1 Asia Hybrid 46 2 Asia SUV 17 3 Asia SUV 20 4 Asia Sedan 18 5 Asia Sedan 24 6 Asia Sedan 18 7 Asia Sports 20 8 Asia Sports 19 9 Asia Sports 17 10 Asia Truck 15 11 Asia Truck 24 12 Asia Wagon 15 13 Asia Wagon 26 14 Asia Wagon 16 15 Europe SUV 16 16 Europe Sedan 17 17 Europe Sedan 22 18 Europe Sedan 20 19 Europe Sports 20 20 Europe Sports 15 21 Europe Sports 16 22 Europe Wagon 18 23 Europe Wagon 19 24 Europe Wagon 19 25 USA SUV15 15 26 USA SUV19 19 27 USA Sedan 14 28 USA Sedan 20 29 USA Sedan 18 30 USA Sports 17 31 USA Sports 17 32 USA Truck 13 33 USA Truck 15 34 USA Wagon 22 35 USA Wagon 17 WANT Up to 40 obs from CarSrtAvg total obs=35 Obs ORIGIN TYPE MPG_CITY MPGAVG 1 Asia Hybrid 46 46.0000 2 Asia SUV 17 18.5000 3 Asia SUV 20 18.5000 4 Asia Sedan 18 20.0000 5 Asia Sedan 24 20.0000 6 Asia Sedan 18 20.0000 7 Asia Sports 20 18.6667 8 Asia Sports 19 18.6667 9 Asia Sports 17 18.6667 10 Asia Truck 15 19.5000 11 Asia Truck 24 19.5000 12 Asia Wagon 15 19.0000 13 Asia Wagon 26 19.0000 14 Asia Wagon 16 19.0000 15 Europe SUV 16 16.0000 16 Europe Sedan 17 19.6667 17 Europe Sedan 22 19.6667 18 Europe Sedan 20 19.6667 19 Europe Sports 20 17.0000 20 Europe Sports 15 17.0000 21 Europe Sports 16 17.0000 22 Europe Wagon 18 18.6667 23 Europe Wagon 19 18.6667 24 Europe Wagon 19 18.6667 25 USA SUV 15 17.0000 26 USA SUV 19 17.0000 27 USA Sedan 14 17.3333 28 USA Sedan 20 17.3333 29 USA Sedan 18 17.3333 30 USA Sports 17 17.0000 31 USA Sports 17 17.0000 32 USA Truck 13 14.0000 33 USA Truck 15 14.0000 34 USA Wagon 22 19.5000 35 USA Wagon 17 19.5000 SOLUTION * create some data; proc sort data=sashelp.cars(keep=origin type drivetrain mpg_city) out=carsrt(drop=drivetrain) nodupkey; by origin type drivetrain; run;quit; * use the DOW loop; data CarSrtAvg(keep=origin type mpg_city mpg_city MpgAvg); retain origin type; retain mpg_city MpgAvg MpgCnt .; do until (last.type); set carsrt; by origin type; MpgSum=sum(MpgSum,mpg_city); MpgCnt=sum(MpgCnt,1); end; MpgAvg=MpgSum/MpgCnt; do until (last.type); set carsrt; by origin type; output; end; MpgSum=0; MpgCnt=0; run;quit;

rogerjdeangelis · ‎11-29-2016

SAS forum: Adding average MPG city by country and cartype to each observation HAVE Up to 40 obs WORK.CARSRT total obs=35 Obs ORIGIN TYPE MPG_CITY 1 Asia Hybrid 46 2 Asia SUV 17 3 Asia SUV 20 4 Asia Sedan 18 5 Asia Sedan 24 6 Asia Sedan 18 7 Asia Sports 20 8 Asia Sports 19 9 Asia Sports 17 10 Asia Truck 15 11 Asia Truck 24 12 Asia Wagon 15 13 Asia Wagon 26 14 Asia Wagon 16 15 Europe SUV 16 16 Europe Sedan 17 17 Europe Sedan 22 18 Europe Sedan 20 19 Europe Sports 20 20 Europe Sports 15 21 Europe Sports 16 22 Europe Wagon 18 23 Europe Wagon 19 24 Europe Wagon 19 25 USA SUV15 15 26 USA SUV19 19 27 USA Sedan 14 28 USA Sedan 20 29 USA Sedan 18 30 USA Sports 17 31 USA Sports 17 32 USA Truck 13 33 USA Truck 15 34 USA Wagon 22 35 USA Wagon 17 WANT Up to 40 obs from CarSrtAvg total obs=35 Obs ORIGIN TYPE MPG_CITY MPGAVG 1 Asia Hybrid 46 46.0000 2 Asia SUV 17 18.5000 3 Asia SUV 20 18.5000 4 Asia Sedan 18 20.0000 5 Asia Sedan 24 20.0000 6 Asia Sedan 18 20.0000 7 Asia Sports 20 18.6667 8 Asia Sports 19 18.6667 9 Asia Sports 17 18.6667 10 Asia Truck 15 19.5000 11 Asia Truck 24 19.5000 12 Asia Wagon 15 19.0000 13 Asia Wagon 26 19.0000 14 Asia Wagon 16 19.0000 15 Europe SUV 16 16.0000 16 Europe Sedan 17 19.6667 17 Europe Sedan 22 19.6667 18 Europe Sedan 20 19.6667 19 Europe Sports 20 17.0000 20 Europe Sports 15 17.0000 21 Europe Sports 16 17.0000 22 Europe Wagon 18 18.6667 23 Europe Wagon 19 18.6667 24 Europe Wagon 19 18.6667 25 USA SUV 15 17.0000 26 USA SUV 19 17.0000 27 USA Sedan 14 17.3333 28 USA Sedan 20 17.3333 29 USA Sedan 18 17.3333 30 USA Sports 17 17.0000 31 USA Sports 17 17.0000 32 USA Truck 13 14.0000 33 USA Truck 15 14.0000 34 USA Wagon 22 19.5000 35 USA Wagon 17 19.5000 SOLUTION * create some data; proc sort data=sashelp.cars(keep=origin type drivetrain mpg_city) out=carsrt(drop=drivetrain) nodupkey; by origin type drivetrain; run;quit; * use the DOW loop; data CarSrtAvg(keep=origin type mpg_city mpg_city MpgAvg); retain origin type; retain mpg_city MpgAvg MpgCnt .; do until (last.type); set carsrt; by origin type; MpgSum=sum(MpgSum,mpg_city); MpgCnt=sum(MpgCnt,1); end; MpgAvg=MpgSum/MpgCnt; do until (last.type); set carsrt; by origin type; output; end; MpgSum=0; MpgCnt=0; run;quit;

Online Status	Offline
Date Last Visited	‎12-04-2021 02:20 PM

Youtube Point and Shoot Programming with the Original Display Manger E...

Re: Fun With SAS ODS Graphics: YAAEB (Yet another animated Easter Bunn...

Re: SAS Macros

Re: Deleting datasets older than 24 months

Re: Macro for using data step to several dataset

Re: Large integer field format in SAS

Re: SAS Grid Pros and Cons

Re: Paradox Table Import into SAS EG Problems

Re: Export to .csv value comes in a separate line

Re: Splitting columns

Fun With SAS ODS Graphics: YAAEB (Yet another animated Easter Bunny)

Re: Splitting up dataset based on unique variable

Youtube Point and Shoot Programming with the Original Display Manger E...

Re: Convert a column to a vector

Re: change rows order

Re: Proc Import multiple CSV files

Re: how to assign a flag to a test based on OBS present after treatmen...

Re: how to assign a flag to a test based on OBS present after treatmen...

Re: Performance, joining two tables 14m rows

Re: Performance, joining two tables 14m rows

Re: Performance, joining two tables 14m rows

Re: Performance, joining two tables 14m rows

Re: request for a solution

Re: request for a solution

Re: Importing issues - .Xlsb file(Excel Binary format) into SAS

Re: Importing issues - .Xlsb file(Excel Binary format) into SAS

Re: Importing issues - .Xlsb file(Excel Binary format) into SAS

Re: Reading decimals from xlsx/xls

Re: Reverse of PROC CONTENTS to create an Empty Dataset

Re: how to calculate average

Re: how to calculate average

SAS Analytics Explorers