About afsand

afsand · ‎07-05-2022

Hi, I have a SAS table column that contains SAS file names extracted from a directory (see the examples below). I want to keep only one row for each group of similar names, but I am not sure how to do that. For example, the first three rows must be replaced by a single row, which can be the first one, 0-2004-editcacacon.sas.

0-2004-editcacacon.sas
0-2005-editcacacon.sas
0-2112-editcacacon.sas
1.1 sald-etudettsald_2018.sas
1.1 sald-etudettsald_2020.sas
1.1 sald-etudettsald_2020-base 2018.sas
1.1 vie-etudettvie_2018.sas
1.1 vie-etudettvie_2020.sas
1.1 vie-etudettvie_2020-qx_anc.sas
1.2 sald-kkldmajtab_2018.sas
1.2 sald-kkldmajtab_2020.sas
1.2 vie- kkviemajtab_2018.sas
1.2 vie-kkviemajtab_2020.sas
1.2 vie-kkviemajtab_2020-qx_anc.sas
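One possible way to approach this, sketched under a loudly stated assumption: two names count as "similar" when they match after taking the part before the first underscore and removing all digits and blanks. That rule is a guess based on the examples, not something the post defines, and the table name `names`, the column `file_name`, and the helper column `key` are hypothetical.

```sas
/* Hypothetical input table holding a few of the names from the post */
data work.names;
    infile datalines truncover;
    input file_name $60.;
    datalines;
0-2004-editcacacon.sas
0-2005-editcacacon.sas
0-2112-editcacacon.sas
1.1 sald-etudettsald_2018.sas
1.1 sald-etudettsald_2020.sas
1.1 sald-etudettsald_2020-base 2018.sas
1.2 vie- kkviemajtab_2018.sas
1.2 vie-kkviemajtab_2020.sas
;
run;

/* Build a grouping key (assumed similarity rule, see above) */
data work.names_keyed;
    set work.names;
    length key $60;
    /* part before the first underscore, with digits (d) and spaces (s) removed */
    key = compress(scan(file_name, 1, '_'), , 'ds');
run;

/* Sort so that the first row of each key group is the one to keep */
proc sort data=work.names_keyed;
    by key file_name;
run;

/* Keep only the first row of each group */
data work.names_unique;
    set work.names_keyed;
    by key;
    if first.key;
    drop key;
run;
```

If the real rule for "similar" is different, only the `key = ...` line should need to change.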

afsand · ‎06-06-2022

Hi Tom, I have a big table called Adherants that has 60 million records. I have attached the table. I want to join some variables from 6 other tables to the Adherants table; I have attached those tables as well. Calculating the new variables in the Adherants table and adding the variables from the six other tables takes a very long time. The complete code is below. Please let me know if you need anything else.

data adherant ;
    set adherant;
    age_adh = age_adh_ori + int((time-1)/12) ;
    Evaluation_dt = INTNX('month', &date_eval., (time-1), 'END');
    duree_IE = min(max(int(YRDIF(date_acq_moyenne_IE,Evaluation_dt,'AGE'))+1,1),6);
    lapse_duree = min(max(int(YRDIF(Issue_Date,Evaluation_dt,'AGE'))+1,1),10);
run;

/* Add table1 */
Proc sql;
    create table adherant as
    select A.* , B.facteur_act as fct_tx_fwrd
    from adherant as A
    left join ses_id.Scn_tx_forward as B
    ON (A.TaskNum = B.TaskNum and A.Time = B.Time) ;
quit;

/* Add table2 */
Proc sql;
    create table adherant as
    select A.* , B.M as annuit_fct_M, B.F as annuit_fct_F
    from adherant as A
    left join ses_id.Annuitization as B
    ON (A.age_adh = B.age) ;
quit;

/* Add table3 */
Proc sql;
    create table adherant as
    select A.* , B.reset, B.indReset
    from adherant as A
    left join ses_id.Renouv_reset as B
    ON (A.no_garantie = B.no_garantie and A.age_adh = B.age) ;
quit;

/* Add table4 */
Proc sql;
    create table adherant as
    select A.* , B.Min_ferr
    from adherant as A
    left join ses_id.Min_FERR as B
    ON (A.age_adh = B.age) ;
quit;

/* Add table5 */
Proc sql;
    create table adherant as
    select A.* , B.penalite as rachat_penalite
    from adherant as A
    left join ses_id.Penalite_rachat_bl as B
    ON (A.duree_IE = B.annee) ;
quit;

/* Add table6 */
Proc sql;
    create table adherant as
    select A.* , B.tx_lapse_base
    from adherant as A
    left join ses_id.Lapse_tx_base as B
    ON (A.no_garantie = B.no_garantie and A.lapse_duree = B.duree) ;
quit;
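Each PROC SQL step above rewrites the whole 60-million-row table. When the lookup tables are small, one commonly suggested alternative is to do the calculation and the lookup in a single data step with a hash object. The sketch below shows the idea for the "Add table4" lookup only. It is a minimal illustration, not a tuned solution: the two `_demo` tables and their values are invented stand-ins, and the helper name `_age_key` is hypothetical.

```sas
/* Invented stand-in for ses_id.Min_FERR (values are placeholders) */
data work.min_ferr_demo;
    input age Min_ferr;
    datalines;
71 0.1
72 0.2
;
run;

/* Invented stand-in for the adherant table */
data work.adherant_demo;
    input no_garantie age_adh_ori time;
    datalines;
1 71 1
1 71 13
2 90 1
;
run;

data work.adherant_joined;
    /* Never executed: only places the lookup columns in the program data vector */
    if 0 then set work.min_ferr_demo(rename=(age=_age_key));

    /* Load the small lookup table into memory once */
    if _n_ = 1 then do;
        declare hash h(dataset: 'work.min_ferr_demo(rename=(age=_age_key))');
        h.defineKey('_age_key');
        h.defineData('Min_ferr');
        h.defineDone();
    end;

    set work.adherant_demo;

    /* Same calculation as in the original first data step */
    age_adh = age_adh_ori + int((time - 1) / 12);

    /* Left-join behaviour: keep the row and blank the value when no match */
    if h.find(key: age_adh) ne 0 then call missing(Min_ferr);

    drop _age_key;
run;
```

The other five lookups could be added to the same step as additional hash objects, so the big table is read and written once instead of seven times. This only helps if the lookup tables fit in memory.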

afsand · ‎06-06-2022

Hi Everyone, I have a big table with 60,000,000 rows. The code is very slow; I tried indexing some columns, but it is still very slow. Any advice? Should I use PROC SQL or a data step for big tables?

Proc sql;
    create table adherant as
    select A.*, B.Fr_var
    from ses_id.INFORCE_AJUST_1 as A
    left join ses_id.FR_VARIABLE as B
    ON A.no_garantie = B.no_garantie;
quit;

proc sql;
    create index no_garantie on adherant (no_garantie);
Quit;

/* Add the fixed administrative fees */
Proc sql;
    create table adherant as
    select *
    from adherant as A
    left join ses_id.FR_Fixe as B
    ON A.no_garantie=B.no_garantie;
quit;

/* Add the economic scenarios table */
/* First add an id from 1 to 1000, which corresponds to the scenario IDs */
data adherant ;
    set adherant;
    do scn_id=1 to 1000;
        output;
    end;
run;

proc sql;
    create index scn_id on adherant (scn_id);
Quit;

/* Add the economic scenarios */
Proc sql;
    create table adherant (drop=scn_id) as
    select A.*
        ,B.RendDEX as rend1
        ,B.RendMM as rend2
        ,B.RendTSX as rend3
        ,B.RendSP500 as rend4
        ,B.RendEAFE as rend5
        ,B.TaskNum
        ,B.Time
    from adherant as A
    left join ses_id.Scenarios_ECN as B
    ON A.scn_id = B.TaskNum
    where Time>0 ;
quit;

/* Compute the age for each economic scenario month; the age must be an integer and increase by 1 only at the 12th month */
data adherant2 ;
    set adherant;
    age_adh = age_adh_ori + int((time-1)/12) ;
    Evaluation_dt = INTNX('month', &date_eval., (time-1), 'END');
    duree_IE = min(max(int(YRDIF(date_acq_moyenne_IE,Evaluation_dt,'AGE'))+1,1),6);
    lapse_duree = min(max(int(YRDIF(Issue_Date,Evaluation_dt,'AGE'))+1,1),10);
run;

/* Add the forward rate table */
Proc sql;
    create table adherant as
    select A.* , B.facteur_act as fct_tx_fwrd
    from adherant as A
    left join ses_id.Scn_tx_forward as B
    ON (A.TaskNum = B.TaskNum and A.Time = B.Time) ;
quit;

afsand · ‎06-05-2022

Hi Kurt, will the GROUP BY statement create one table per group, or will it just group the rows within the table? I need to have a table for each group.

afsand · ‎06-05-2022

Actually, I want to create a list of tables, not variables. I need to use only the data step: I read information in a data step, do some calculations, and want to save the results in a series of tables by group, like this:

data results_group1-results_group50;
    set adherants ;
    some calculations in between ;
    save results in the group tables;
run;
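A data step can write to several tables if they are all named on the DATA statement and each row is routed with an explicit OUTPUT statement. One way to avoid typing fifty names is to let a macro loop generate them. The sketch below assumes a numeric variable `group` holding values 1, 2, 3, and so on; that variable, the `adherants_demo` table, and the macro name `split_by_group` are all hypothetical.

```sas
/* Invented demo input: 3 groups with 2 rows each */
data work.adherants_demo;
    do group = 1 to 3;
        do i = 1 to 2;
            value = group * 10 + i;
            output;
        end;
    end;
    drop i;
run;

%macro split_by_group(n_groups=);
    %local i;
    data
        /* Generates: work.results_group1 work.results_group2 ... */
        %do i = 1 %to &n_groups.;
            work.results_group&i.
        %end;
        ;
        set work.adherants_demo;

        /* calculations would go here */

        /* Route each row to the table that matches its group */
        select (group);
            %do i = 1 %to &n_groups.;
                when (&i.) output work.results_group&i.;
            %end;
            otherwise;  /* rows with any other group value are not written */
        end;
    run;
%mend split_by_group;

%split_by_group(n_groups=3)
```

Calling the macro with `n_groups=50` would produce results_group1 through results_group50 in one pass over the input.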

afsand · ‎06-05-2022

Hi Everyone, I want to create a list of SAS tables with the same prefix. I am not sure how to do it, or whether it is possible, but I need to do it with a data step. The table names will look like this: results_group1, results_group2, ..., results_group50. Thanks

afsand · ‎06-04-2022

Thank you very much, it works with the temporary option. I am running 700,000 scenarios for my work and I need to save the results of my calculations in a new output table. It seems easier in my head to use an array, but I understand that it is not that efficient. How can I output some calculated new variables to a new SAS table? You can see my code below. I want to create a table called output that will be populated with the calculated new variables.

Data _null_ ;
    set adherants ;
    array p_reserve_adh_path_ac_FR[&nombre_obs_IA.,&nb_scn.];
    array p_reserve_adh_path_ss_FR[&nombre_obs_IA.,&nb_scn.];
    /* ---- some calculations in between ---- */
    /* ---- some calculations in between ---- */
    /* ---- some calculations in between ---- */
    /* ---- some calculations in between ---- */
    p_reserve_adh_path_ac_FR[x,y] = sum(p_reserve_adh_path_ac_FR[x,y],0) + (A_ECH + A_DECES + A_RETRA + A_gross_up - A_FRAIS_GAR + (A_FRAIS_ADM - A_MERDISPO)) * fct_tx_fwrd;
    p_reserve_adh_path_ss_FR[x,y] = sum(p_reserve_adh_path_ss_FR[x,y],0) + (A_ECH + A_gross_up + A_DECES + A_RETRA - A_FRAIS_GAR) * fct_tx_fwrd;
run;

afsand · ‎06-04-2022

Thank you for your reply, I understand more now. I think an array is not what I need for what I am trying to do. I am doing some calculations in a data step and I also want to create an output table containing the results. I tried to create an array like the one below and output its results, but it does not seem to work. Do you suggest that I create a data step table for the outputs instead of an array? If yes, how do I write the results only to the output table? I have never worked on two SAS tables at the same time; how do I output to only one table?

Data _null_ ;
    set adherants ;
    array p_reserve_adh_path_ac_FR[&nombre_obs_IA.,&nb_scn.];
    array p_reserve_adh_path_ss_FR[&nombre_obs_IA.,&nb_scn.];
    /* ---- some calculations in between ---- */
    /* ---- some calculations in between ---- */
    /* ---- some calculations in between ---- */
    /* ---- some calculations in between ---- */
    p_reserve_adh_path_ac_FR[x,y] = sum(p_reserve_adh_path_ac_FR[x,y],0) + (A_ECH + A_DECES + A_RETRA + A_gross_up - A_FRAIS_GAR + (A_FRAIS_ADM - A_MERDISPO)) * fct_tx_fwrd;
    p_reserve_adh_path_ss_FR[x,y] = sum(p_reserve_adh_path_ss_FR[x,y],0) + (A_ECH + A_gross_up + A_DECES + A_RETRA - A_FRAIS_GAR) * fct_tx_fwrd;
run;
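A step that starts with `data _null_` writes no table at all, so naming a real output table on the DATA statement is the first change needed. The running total that the code above keeps in `array[x,y]` can then often be expressed as a BY-group accumulation that writes one row per group. The sketch below assumes the input is sorted by a policy identifier and a scenario identifier; `id`, `scn_id`, `reserve_ss_FR`, the demo table, and its values are all hypothetical, and the cash-flow formula is shortened to two terms for brevity.

```sas
/* Invented demo input: one row per id, scenario, and time step */
data work.adherants_demo;
    input id scn_id A_ECH A_DECES fct_tx_fwrd;
    datalines;
1 1 100 5 1.0
1 1 100 5 0.5
1 2 200 0 1.0
2 1  50 1 1.0
;
run;

/* Only the listed columns are kept in the output table */
data work.results (keep=id scn_id reserve_ss_FR);
    set work.adherants_demo;
    by id scn_id;                 /* requires the input to be sorted this way */

    /* Reset the running total at the start of each (id, scenario) group */
    if first.scn_id then reserve_ss_FR = 0;

    /* Sum statement: the variable is retained across rows automatically */
    reserve_ss_FR + (A_ECH + A_DECES) * fct_tx_fwrd;

    /* Write one row per group, after its last time step */
    if last.scn_id then output;
run;
```

When a step names two tables, for example `data work.detail work.results;`, a statement such as `output work.results;` writes to that table only, while a bare `output;` writes to both.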

afsand · ‎06-04-2022

The code just runs forever; I cannot see any errors or the log. Here are the values of the two macro variables.

33 %put &nombre_obs_IA.;
13415
34 %put &nb_scn. ;
1000

afsand · ‎06-04-2022

Hi Everyone, I want to create a dynamic array with variable dimensions, but the code is not working. Can you please help with this?

Data _null_ ;
    set adherants ;
    if _n_=1 then do;
        array p_reserve_adh_path_ac_FR[&nombre_obs_IA.,&nb_scn.];
        array p_reserve_adh_path_ss_FR[&nombre_obs_IA.,&nb_scn.];
    end;
run;

Thanks
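Two points may explain the behaviour. ARRAY is a declarative statement that takes effect when the step is compiled, so wrapping it in `if _n_=1` changes nothing. And without `_temporary_`, an array creates one data step variable per element, which with the dimensions reported elsewhere in this thread (13415 by 1000) means 13,415,000 variables per array. A follow-up post in this thread reports that the temporary option works. A minimal sketch of that form is below; the shortened array name and `first_cell` are hypothetical, and the array needs roughly 107 MB of memory at 8 bytes per element.

```sas
/* Values taken from the %put output posted in this thread */
%let nombre_obs_IA = 13415;
%let nb_scn = 1000;

data _null_;
    /* Macro variables resolve before compilation, so they can size the array.  */
    /* _temporary_ keeps the elements in memory only: no variables are created, */
    /* and the values are retained across iterations of the step.               */
    array p_reserve_ac[&nombre_obs_IA., &nb_scn.] _temporary_;

    /* Same accumulation pattern as in the original code */
    p_reserve_ac[1, 1] = sum(p_reserve_ac[1, 1], 0) + 100;

    first_cell = p_reserve_ac[1, 1];
    put first_cell=;
run;
```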

afsand · ‎06-03-2022

OMG you saved my life. it works very well!!!!!!!!!!!!!!!! Thank you so much for your help!

afsand · ‎06-03-2022

Hi Ballardw, thanks for your reply. I attached the two tables. I do not want to merge them, because I calculate some variables in a data step and use them to extract information from the QX_WB_AF table; that is why I use a function that reads the mortality rate at each iteration over the Adherant table. My only concern is that I currently call read_array at each iteration, when I need to read the table only on the first iteration. The read_array function is only available within PROC FCMP, which is why I created a function for it. Do you know whether the data step has a function that can do the same thing: read a data table and load it into an array?

afsand · ‎06-03-2022

Hi Everyone, I have created a PROC FCMP function called getmortq that reads the mortality rate from a SAS table (SASDATA.QX_WB_F). I am using the read_array function to read the mortality table and save it in an array. My issue is that the code is too slow: I have 700k observations in the ADHERANT table, and at each iteration the function reads the SASDATA.QX_WB_F table again, although I need to read it only for the first observation and reuse the array for the other observations. It seems that SAS does not keep the array in memory: at each iteration the QX_WB_F array is deleted, I no longer have access to it, and I need to read the table again. How can I keep the array in memory after the first observation and use it for the other observations?

proc fcmp outlib=work.funcs.sql;
    function getmortq(mort_type $, sexe, indProduit, age, dt_nais, Eval_dt2, AnEval2, obs);
        if MORT_TYPE = 'LB' then age_lookup = age;
        if MORT_TYPE = 'NB' then do;
            if month(dt_nais) >= month(Eval_dt2) then do;
                age_lookup = age + round(1-( month(dt_nais)/12 - month(Eval_dt2)/12),1);
            end ;
            else do ;
                age_lookup = age + round(( month(Eval_dt2)/12 - month(dt_nais)/12),1);
            end ;
        end;
        if obs=1 then do ;
            array QX_WB_F[1] / nosymbols;
            rc = read_array('sasdata.QX_WB_F', QX_WB_F);
        end;
        qx = QX_WB_F[min(121,age_lookup+1),2+year(Eval_dt2)-AnEval2];
        return(qx);
    endsub;

options cmplib = work.funcs;

data _null_;
    set adherant;
    MORT_TYPE = "&MORT_TYPE.";
    if _n_=1 then obs=1;
    qx = getmortq(mort_type, sexe, indProduit, age_adh, dt_nais_adh,&date_eval., &AnEval., obs);
run;

Online Status	Offline
Date Last Visited	‎07-06-2022 12:46 PM

Remove similar names in a column

Re: SAS big data how to make the code faster

SAS big data how to make the code faster

Re: create a list of SAS tables with the same prefix

Re: create a list of SAS tables with the same prefix

create a list of SAS tables with the same prefix

Re: Create a SAS array with variable dimension

Re: Create a SAS array with variable dimension

Re: Create a SAS array with variable dimension

Create a SAS array with variable dimension

Re: Proc FCMP Array

Re: Proc FCMP Array

Proc FCMP Array