Hi, I'm writing a program which outputs (at the end) one huge customer dataset with 350 columns and 20 million rows. During its execution, this program creates:
- a "mother table", say T1, with a unique primary key ID and some fields. These IDs are the whole universe of customer IDs (20 million), so this table is roughly 10 columns by 20 million rows;
- several "child tables", say T2...T30, each with a unique primary key ID and some fields. Every child table contains only a subset of the IDs in T1, varying from, e.g., 1% to 99% of the whole universe; for example, T2 might be 10 columns by 10,000 rows, or 10 columns by 19 million rows. This is not predictable. In any case, the primary key ID is always unique and always present in T1.
At the end, these tables must be joined to generate one output dataset: T1 is the "base table", and every T2...T30 is LEFT JOINed to T1 on the unique primary key ID. So:
proc sql;
CREATE TABLE FINAL AS
SELECT
T1.ID,
T1.FIELD1,
...
T1.FIELD10,
T2.FIELD1,
...
T2.FIELD10,
...
...
T30.FIELD1,
...
T30.FIELD10
FROM
T1
LEFT JOIN T2 ON T1.ID = T2.ID
...
LEFT JOIN T30 ON T1.ID = T30.ID
;
quit;
This works and produces a 15 GB table. However, this final PROC SQL takes 5 hours, and it creates a temporary sas7butl utility file in the WORK directory bigger than 500 GB! Some things to note:
- ID is a numeric field with 9 digits;
- all the left joins are one-to-one (ID is a unique primary key);
- every dataset T1, T2, ..., T30 is already sorted by ID before entering the final PROC SQL;
- every dataset T1, T2, ..., T30 is indexed on ID.
Using the "_method" option on PROC SQL, it shows this:
NOTE: SQL execution methods chosen are:
sqxcrta
sqxfil
sqxjm
sqxsrc( T30 )
sqxsort
sqxjm
sqxsrc( T29 )
sqxsort
sqxjm
.....
sqxsort
sqxjm
sqxsrc( T2 )
sqxsrc( T1 )
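For context, since every dataset is already sorted by ID and every join is one-to-one with T1 covering the whole ID universe, the same output could in principle be built with a single DATA step MERGE instead of PROC SQL, reading each table once in sequence with no re-sorting and no utility file. This is only a sketch: it assumes the real column names are distinct across tables (as the SELECT list above implies), and the dataset names T2-T30 are taken literally from the example.

```sas
/* One sequential pass over all 30 pre-sorted tables: no re-sorting,
   no WORK utility file. IN=a keeps the full T1 universe, mimicking
   the LEFT JOINs. Assumes column names are distinct across tables. */
data FINAL;
  merge T1 (in=a)
        T2 - T30;     /* numbered range list for T2 through T30 */
  by ID;
  if a;               /* keep only IDs present in the base table T1 */
run;
```

Since every child table's IDs are a subset of T1, the "if a;" filter is technically redundant here, but it makes the LEFT JOIN semantics explicit.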
The question is: why does it appear to be so slow and so resource-consuming? How could it be optimized? Are there any "hints" you can suggest? Thank you very much.