About darklord

darklord · ‎03-19-2024

Hey! Yeah I see the point, I was having hard time explaining it myself, figured explaining the background of the task itself would have taken some time. But I feel like there a logic in works now which I feel would do a good job for this. For that reason I would close this post so that more people don't get confused:P But yeah next questions the question gonna be more clear:P Thank you tho!

darklord · ‎03-19-2024

Hello! Yeah a lot of steps ended up being involved to make a logic for this. I think I have a clear idea now, I been working on creating flags for different scenarios. It would be save to say that this question can be closed for now. As again coming up with the current logic itself took some time (extra 4 datasets) 😛 Thanks again tho! I mean I was having trouble myself trying to phase this and explain it.

darklord · ‎03-17-2024

Hmm, okay I see. I guess let me rectify this and make it more clear. It's a bunch of lot of steps being involved to create the final table. I will update the explanation to make it more clear!

darklord · ‎03-16-2024

The code below: Suppose there's this combination here giving us duplicate users using the DOB, Email & Visa matching SELECT "C9" AS COMB, A. NEW _CIF_NO, A. CUST_NAME, B. NEW CIF_ NO AS NEW CIF NO 1, B. CUST_ NAME AS CUST_FULL NAME__1, COMPGED (A. CUST_NAME, B. CUST_NAME) AS SCORE FROM ALL_ENT_ CUST_IND BASE A JOIN ALI_ENT_ CUST_ INDV_BASE B ON COMPGED (A. CUST_NAME, B. CUST_NAME) <= 1000 AND A. CUST DOB = B. CUST DOB AND A. MOBILE NUM = B. MOBILE NUM AND A. DOC VISA = B. DOC VISA AND A. CUST_ NAME IS NOT NULL gives us the result below, CIF Name Cif_new Name2 Score EIB~72—775 EASO PHILIP K A EASO ENB~48--88 EASO PHILIP 330 EIB~72—445 JUDE GRAP ENB~48--98 JUDE GRAP 0 EIB~71—775 EASO PHILIP EASO K A ENB~43--88 PHILIP EASO 500 Score 0 is exact name match along with the doc matching, the higher the score the less the name match. After a bunch of combinations they all get merged, and a final dataset is created with all the user names, documents like this CIF Name Cif_new Doc DOB Email EIB~72—775 EASO PHILIP K A EASO ENB~48--88 A32742 12-JUL-20 AHSDA@GMAIL.COM EIB~72—445 JUDE GRAP ENB~48---98 B42823 13-AUG-20 GFA@GMAIL.COM EIB~72—775 EASO PHILIP K A EASO ENB~436-88 C91742 17-FEB-21 UADFN@GMAIL.COM This dataset here contains score 0, score 330 & score 500 etc all together. What Im trying to do is find a way to create a subset-dataset which would have only those users that even if the names don't match exactly but their documents match. Basically no false positive (like names that match but no matching documents) but just true positive. Let me know if more clarification needed!

darklord · ‎03-15-2024

working on that, will upload some soon!

darklord · ‎03-15-2024

Hello all! In a pickle here, I have these SQL queries that I run to pull users that are exactly the same or similar. Suppose the table name: usertable SQL: Combination 1 Name1, name2, compared(name1,name2) from usertable a join usertable b where a.passport=b.passport Combination 2 Name1, name2, compared(name1,name2) from usertable a join usertable b where a.email=b.email Results: name1 name2 0 name1 name2 200 name1 name2 50 Merge combination 1 & combination 2 This creates a dataset with field id, name1, name2, passport, email. Here all the names will be there with matching docs or not (true positive or false positive). From here how do I make sure that only true positive ids stay? i.e those with similar name or same name but the documents match exactly. Thanks!

darklord · ‎09-12-2023

Hello, Thanks for the help, I did end up using what @ballardw suggested and it worked well! Should keep summary in mind for next time use!

darklord · ‎09-12-2023

Hello, Thanks for this. It did help in the frequency of the users. Which helped in giving a nice picture of the repeat ones. I decided not to put the date in since it wasn't very much needed 🙂

darklord · ‎09-11-2023

Hello! I want to find repeated users who always do cash withdrawals between set weeks. Right now I have it set up in a way where I have created tables for each week containing userid, transaction type and dates. Tables: W1,W2,W3 Now to find the same users present every week I figure an inner join between the tables would do the job PROC SQL; Select a.id FROM W1 A JOIN W2 B ON.. JOIN W3 C ON.. WHERE transaction type = ; QUIT; I assume this would give the common users each week? But what if I have created one big table with all the dates included. How would I go about in finding the repeated users each week? Would keeping the tables separate make more sense?

darklord · ‎01-30-2023

Appreciate the link:)

darklord · ‎01-29-2023

Hello, Thank you for the help! I suppose should have searched harder online, didn't come across this DTDAY function. The output looks good now 🙂

darklord · ‎01-29-2023

Hello all! Pretty lost here, im working on creating a dataset using proc sql with DATETIME() field being the start_date, and trying to make end_date using intnx function using the query below, intnx('day', date time(), -1) as end_date format=datetime20. DATETIME() field is already in DATETIME20. format, but the results show '.' for end_date when it finishes running. Any idea what could be happening? The proc sql is built using a prebuilt proc sql table and one data table. Thank you for the help!

Online Status	Offline
Date Last Visited	‎03-19-2024 01:40 PM

Re: Duplicates in the table, but keep them based on other conditions

Re: Duplicates in the table, but keep them based on other conditions

Re: Duplicates in the table, but keep them based on other conditions

Re: Duplicates in the table, but keep them based on other conditions

Re: Duplicates in the table, but keep them based on other conditions

Duplicates in the table, but keep them based on other conditions

Re: Find repeating users between set weeks

Re: Find repeating users between set weeks

Find repeating users between set weeks

Re: Intnx minus 1 day

Re: Duplicates in the table, but keep them based on other conditions

Re: Find repeating users between set weeks

Re: Find repeating users between set weeks

Re: Duplicates in the table, but keep them based on other conditions

Re: Duplicates in the table, but keep them based on other conditions

Re: Duplicates in the table, but keep them based on other conditions

Re: Duplicates in the table, but keep them based on other conditions

Re: Duplicates in the table, but keep them based on other conditions

Re: Duplicates in the table, but keep them based on other conditions

Duplicates in the table, but keep them based on other conditions

Re: Find repeating users between set weeks

Re: Find repeating users between set weeks

Find repeating users between set weeks

Re: Intnx minus 1 day

Re: Intnx minus 1 day

Intnx minus 1 day