About wenzli25

wenzli25 · ‎05-23-2018

Sorry for the late reply. I tried @Astounding's method, but it failed. I'm not sure why.... still working on it. Could you tell me where should I modify "sasfile aaa open;" ?

wenzli25 · ‎03-23-2018

I have a SAS code question, and I'd like to ask if there is a more efficient way to solve my problem. Target: Through the customer's product holding and the product combination datasets, generated the recommended product based on the product holding of each customer. It's like association model. Method : I have two datasets: 1st, product combination records data AAA; infile datalines missover; input SET_SIZE COUNT ITEM1 $ ITEM2 $ ITEM3 $ ITEM4 $ ITEM5 $ ; datalines; 2 50 DP1 DP2 2 40 DP2 DP3 2 39 AC1 AC2 2 30 AB1 AB2 3 30 DP1 DP2 DP3 3 20 AB1 AB2 DP1 3 20 AC1 AC2 AB1 4 10 DP1 DP2 DP3 AC1 5 10 AB1 AB2 AC1 DP1 ; 2nd, the products held by the customer data BBB; infile datalines missover; input ID $ count_of_product HOLD1 $ HOLD2 $ HOLD3 $ HOLD4 $ HOLD5 $ HOLD6 $ HOLD7 $ HOLD8 $ HOLD9 $ datalines; ID01 3 DP1 DP2 AB1 ID02 4 DP1 DP2 DP3 DP4 AC1 ID03 4 DP1 AB1 AB2 AC1 AC2 ID04 3 AB1 AB2 AC1 ID05 2 AB1 DP1 ID06 8 AB1 AB2 DP1 DP2 DP3 AC1 AC2 GB1 ; I want to generate a dataset which record 1st, For each product combination, products held by customers and products not held by customers. 2nd, Calculate the number of products held by the customer in the product combination and the number of differentiation. After the calculation, I sort by the number of differences between the customer and the product combination, and the count of the combination.Then, I delete duplicate records by ID and the recommended product. Process of Implementation: 1. cartesian product (cross join) PROC SQL ; CREATE TABLE CCC AS SELECT t1.*, t2.* FROM AAA t1 , BBB t2; QUIT; 2. Comparison and calculation DATA DDD; SET CCC; ARRAY HOLD[9] $ HOLD1-HOLD9; ARRAY ITEM[5] $ ITEM1-ITEM5; ARRAY FLAG[5] FLAG1-FLAG5(0,0,0,0,0); ARRAY PRODUCT[5] $34 PRODUCT1-PRODUCT5; ARRAY INCLUD[5] $34 INCLUD1-INCLUD5; INCLUDED = 0; A=1; B=1; DO I=1 TO DIM(ITEM); DO J=1 TO DIM(HOLD); IF (ITEM[I]^="" AND ITEM[I]=HOLD[J]) THEN DO; INCLUDED=INCLUDED+1; FLAG[I]=1; LEAVE;LEAVE; END; ELSE FLAG[I]=0; END; END; DO K=1 TO DIM(FLAG); IF FLAG[K]=0 THEN DO; PRODUCT[A]=ITEM[K]; A=A+1; END; ELSE IF FLAG[K]=1 THEN DO; INCLUD[B]=ITEM[K]; B=B+1; END; END; OVERLAP = COUNT_OF_PRODUCT - INCLUDED; GAP = SET_SIZE - INCLUDED; RUN; 3. Sort and delete duplicates PROC SQL ; CREATE TABLE EEE AS SELECT DISTINCT t1.ID, t1.COUNT, t1.PRODUCT1, t1.INCLUD1, t1.INCLUD2, t1.INCLUD3, t1.INCLUD4, t1.INCLUD5 FROM DDD t1 WHERE t1.GAP = 1 ORDER BY t1.ID, t1.OVERLAP, t1.COUNT DESC; QUIT; proc sort data=EEE out=FFF nodupkey; by ID product1; run; Question: Cuz I need to do "cross join" and then "combination match". It takes lots of time. I'd like to ask if there is anyway (like hash table?but I don't know how to do) to shorten my process time. If you have any idea, please advise me. Many thanks. ^_^

wenzli25 · ‎01-23-2018

Thank you for your quick reply. ^_^ I have two problems: 1. I don't understand the meaning of the sentence below: "But why not move your loop into the macro instead?" 2. So I just take the same dataset to another iteration, then the problem solved, right? Data work.ccc; SET work.ccc; Much Thanks.

wenzli25 · ‎01-23-2018

Hi all, I am trying to create a macro to dynamically compute cumulative returns across variables in my dataset. And my problem is: the macro below is overwriting the results obtained for the prior variable instead of creating separate columns in each iteration. Here is my sample dataset: data work.AAA;  input id $ GRP $ month txn ;  datalines; ID1 A 1 100 ID1 A 1 200 ID1 A 2 300 ID2 A 1 200 ID2 B 2 300 ID2 A 3 400; My SAS code is as follows: %macro multmn(startmonth,stopmonth); %do mvalue=&startmonth %to &stopmonth; Data WORK.CCC; SET work.AAA; BY ID; IF FIRST.ID = 1 THEN DO; Trans_CNT_&mvalue=0; END; Retain Trans_CNT_&mvalue; IF (GRP='A' and MONTH=&mvalue) then Trans_CNT_&mvalue = SUM(Trans_CNT_&mvalue+1) ; run; %end; %mend multmn; %multmn(1,3) In fact, I want to convert the code below into macro method. Data WORK.BBB; SET work.AAA; BY ID; IF FIRST.ID = 1 THEN DO; Trans_CNT_1=0;Trans_CNT_2=0;Trans_CNT_3=0; END; Retain Trans_CNT_1 Trans_CNT_2 Trans_CNT_3; IF (GRP='A' and MONTH=1) THEN Trans_CNT_1 = SUM(Trans_CNT_1+1); IF (GRP='A' and MONTH=2) THEN Trans_CNT_2 = SUM(Trans_CNT_2+1); IF (GRP='A' and MONTH=3) THEN Trans_CNT_3 = SUM(Trans_CNT_3+1); run; If you have any ideas, please advise me. Much thanks. ^_^

wenzli25 · ‎01-23-2018

Thank you very much. Sorry for the late reply. It helps me a lot ^_^

wenzli25 · ‎12-22-2017

Hello, I have three datasets, and what I want to do is to get 3 records for each ID on specific conditions. My conditions are 1. Top 3 records for each ID. AND 2. Of the 3 records, there must be one that meets the criteria which is ID’s group = product’s group. If not, then look down to find the first match, replacing the third record of that ID. I do know that how to get the top3 records, but don’t know how to meet the second condition… If you have any ideas or solutions, please advise me. Much Thanks. Here is my example: data report;  input id $ product $ score ;  datalines; 001 a1 20 001 a2 10 001 a4 9 001 a5 8 001 a7 7 002 a1 99 002 a3 10 002 a4 8 002 a5 3 002 a7 1 003 a7 10 ; data ID_group; input ID $ group $; datalines; 001 x 002 y 003 x 004 y 005 y ; data product_group; input product $ group $; a1 x a2 y a3 x a4 x a5 y a6 x a7 y ; So the output result is as follows: ID product 001 a1 001 a2 001 a4 002 a1 002 a3 002 a5 003 a7 ; I tried to get the top 3 records, and also, I tried to get the first match record. But I do not know how to meet these two conditions at once or any efficient solutions for my question. Thanks for your reading. ^_^

wenzli25 · ‎12-12-2017

Hi novinosrin, Sorry for the late reply, because we're in different time zones. The format of my original data is like dataset two, but I think that data transposition can be done to facilitate data processing and understanding. The converted format is as follows: data two; input PRIORITY EVENT $ EVENT_CATEGORY $ ; datalines; 1 A A1 1 B A1 2 B A1 2 D B1 3 C B1 3 A A1 4 D B1 4 E B1 5 E B1 5 G A1 ; Explanation: For ID 01, ID's category is A1 (show in dataset one), and the recommended item1 is 'G', because 'G' is the first event in category 'A1' (show in dataset two). A. What look up key from dataset one fetches 'G' which is in EVENT2 of dataset two? I am assuming it is ID_category of one<--> Event2_category? For recommended event1 -> ID_category of dataset one = Event_category of dataset two -> and ID_event of one ^= Event of two -> The order of priority is from top to bottom of dataset two Then, the recommended item2 is 'D', because of the priority and the order. (Priority1 is 'A' and 'B' item in dataset two, but ID 01 has already held two events in dataset one. Priority2 is 'B' and 'D', so 'D' is selected.) and What look up key from dataset one fetches 'D' from EVENT2 dataset two? I am assuming it is ID_category of one<--> For recommended event2 -> ID_event of dataset one ^= Event of dataset two -> recommended event2 ^= recommended event1 -> The order of priority is from top to bottom of dataset two In essence, I need you to explain me the look up operation of the logic with "keys" and priority clearly like what fetches what and why(priority). Thank you for your help, I think I have some ideas, I'll continue to try some methods. ^_^

wenzli25 · ‎12-11-2017

Thank you ^ _ ^ I revised the description of the content, I hope this easier to understand. Thank you very much for your reply and help. Have a good night !!!

wenzli25 · ‎12-10-2017

Hi novinosrin, Thank you for your help. My old topic has been recovered, but I haven't got the solution. Do not know if I didn't explain my question well... I'll keep waiting and trying...

wenzli25 · ‎12-09-2017

(The topic I posted was missing, I don't know why..., so I reposted the topic. T____T) Hello everyone, I have couple questions in SAS…I want to retrieve data from multiple datasets in some conditions. In brief, there are two datasets. The first one is the events owned by customers, another one is the order of priority for the events. What I want to do is to recommend two events that not yet owned by the customer. However, there are two conditions, the first one is the category of recommended event is the same as the category of customer. The second one is based on the priority and the order, recommend the event not owned by the customer, and different to the first recommended event. I tried to use cross join on two tables, then, used the loop function to match the criteria. But I'm wondering if there is any more efficient way, or my way is wrong... The following is a simple example. Dataset one Description: ID only belongs one category, but one ID has multiple events, and event belongs one event category. data one; input ID $ ID_CATEGORY $ EVENT $ EVENT_CATEGORY $; datalines; 01 A1 A A1 01 A1 B A1 01 A1 C B1 02 B1 A A1 02 B1 D B1 03 B1 E B1 04 A1 A A1 ; Dataset two Description: Based on the priority, each row has two events, and each belongs their categories. data two; input PRIORITY EVENT1 $ EVENT1_CATEGORY $ EVENT2 $ EVENT2_CATEGORY $; datalines; 1 A A1 B A1 2 B A1 D B1 3 C B1 A A1 4 D B1 E B1 5 E B1 G A1 ; Dataset three (which I want to create) ID ID _CATEGORY ITEM1 ITEM1_CATEGORY ITEM2 ITEM2_CATEGORY 01 A1 G A1 D B1 02 B1 C B1 B A1 03 B1 D B1 A A1 04 A1 B A1 D B1 Explanation: For ID 01, ID's category is A1 (show in dataset one), and the recommended item1 is 'G', because 'G' is the first event in category 'A1' (show in dataset two). Then, the recommended item2 is 'D', because of the priority and the order. (Priority1 is 'A' and 'B' item in dataset two, but ID 01 has already held two events in dataset one. Priority2 is 'B' and 'D', so 'D' is selected.) Could you help me to figure out this question how to solve... or any recommended functions?? Thanks > <

wenzli25 · ‎12-09-2017

No problem, I corrected the code. I apologize for any inconvenience. > <

wenzli25 · ‎12-09-2017

Yes, 'D' is selected because of the priority and the sequence. (In Table TWO, the first priority is 'A' and 'B', but ID 01 has events already. Next, the second priority is 'B' and 'D', that's why 'D' is selected) Thank you for your reply. 🙂

wenzli25 · ‎12-08-2017

My apologies for the confusion. Because ID 01 which ID-category is A1, and my first condition is "Output1: The same event category as this ID" So ID 01 need to be recommended an A1 event, which is item G. (In table one, ID 01 already hold A &B). And the second recommendation is D because of the priority and the sequence in table TWO. Hope you understand what I mean... thanks 🙂

wenzli25 · ‎12-07-2017

Hello everyone, I have couple questions in SAS… I want to create a table from multiple tables in some conditions… Here is my example: Table ONE: Description: ID only belongs one category, but one ID has multiple events, and event belongs one event category. data one; input ID $ ID_CATEGORY $ EVENT $ EVENT_CATEGORY $; datalines; 01 A1 A A1 01 A1 B A1 01 A1 C B1 02 B1 A A1 02 B1 D B1 03 B1 E B1 04 A1 A A1 ; Table TWO: Description: Based on the priority, each row has two events, and each belongs their categories. date two; input PRIORITY EVENT1 $ EVENT1_CATEGORY $ EVENT2 $ EVENT2_CATEGORY $; datalines; 1 A A1 B A1 2 B A1 D B1 3 C B1 A A1 4 D B1 E B1 5 E B1 G A1 ; Table three: (which I want to create) ID ID _CATEGORY RECOMMEND1 RECOMMEND1_CATEGORY RECOMMEND2 RECOMMEND2_CATEGORY 01 A1 G A1 D B1 02 B1 C B1 B A1 03 B1 D B1 A A1 04 A1 B A1 D B1 For each ID, first of all, observe the events owned by ID in table ONE. Second, according to table TWO, based on the priority, and from the left to the right, to get the events that ID does not hold. Finally, generate two recommended events. And there are two conditions for the recommendation: 1. Generate the same event category as this ID (For Recommend1 filed) 2. For the priority and the order, regardless of the ID's category, generated the second recommendation. But the result is not the same as recommend1. (For Recommend 2 field) Could you help me to figure out this question how to solve... or any recommended functions?? Thanks > < Note: In fact, I tried to use a cross join on two tables, then, used the loop function to match the criteria. But I'm wondering if there is any more efficient way, or my way is wrong...

Online Status	Offline
Date Last Visited	‎05-24-2018 01:24 AM

Re: Another efficient method instead of cartesian product (cross join)

Another efficient method instead of cartesian product (cross join)

Re: Assign and Retain Values to Variable inside a macro

Assign and Retain Values to Variable inside a macro

Re: Top 3 rows meet certain criteria for each ID

Top 3 rows meet certain criteria for each ID

Re: Retrieve data from multiple datasets in some conditions

Re: Retrieve data from multiple datasets in some conditions

Re: Retrieve data from multiple datasets in some conditions

Retrieve data from multiple datasets in some conditions

Re: Another efficient method instead of cartesian product (cross join)

Another efficient method instead of cartesian product (cross join)

Re: Assign and Retain Values to Variable inside a macro

Assign and Retain Values to Variable inside a macro

Re: Top 3 rows meet certain criteria for each ID

Top 3 rows meet certain criteria for each ID

Re: Retrieve data from multiple datasets in some conditions

Re: Retrieve data from multiple datasets in some conditions

Re: Retrieve data from multiple datasets in some conditions

Retrieve data from multiple datasets in some conditions

Re: Create a table from multiple tables in some conditions

Re: Create a table from multiple tables in some conditions

Re: Create a table from multiple tables in some conditions

Create a table from multiple tables in some conditions