About munitech4u

munitech4u · ‎03-29-2016

oh, maybe thats why. It was so much confusing me

munitech4u · ‎03-29-2016

A1 A2 1 A1 A9 1 A2 A3 2 A2 A1 2 A3 A2 1 A4 A5 4 A5 A4 5 A6 A7 6 A7 A8 7 A7 A6 7 A8 A7 6 A9 A1 2 But in the dataset g is like above. Why the 4th and 5th obs are 1 and 4?

munitech4u · ‎03-29-2016

Can you please explain? How does this work? data t1; set &dsin.2; by &v1; g+first.&v1; run; And the purpose of it?

munitech4u · ‎03-29-2016

Cool, I am looking forward to it too. I have posted a logic solution, but not sure, as of now, how to implement it!

munitech4u · ‎03-29-2016

Yea, Seem like I need to go through it. The solution is based on the lag value which, is not the case with me. It does not necessarily have a relationship with lag.

munitech4u · ‎03-29-2016

I have an idea, but not sure how to execute this: 1. Get a dataset with all distinct IDs. 2. Now Pick the first ID, and look for all the possible IDs in ID1 or ID2 that occurs with this ID, put them in cluster 1. 3. Now Get all other distinct IDs that occur above and repeat the step above. Keep updating the cluster. 4. Loop until we reach at a point, when we have iterated through all the IDs in cluster 1. 5. Eliminate the cluster 1 IDs from the dataset with distinct IDs and the dataset we are looking into. 6. Repeat step 2-5 until we are finished with all IDs. Example: Distinct IDs A1 A2 A3 A4 A5 A6 A7 A8 A9 First ID: A1 All possible combination from dataset: A1 A2 A1 A9 Cluster 1: A1,A2,A9 Loop Through cluster 1 except A1: For A2, possible combination: A2 A3 Update cluster 1: A1,A2,A3,A9 Loop Through cluster 1 except A1,A2: For A9, possible combination: A9 A1 Update cluster 1: A1,A2,A3,A9 Eliminate A1,A2,A3,A9 from distinct IDs and observations wherever they occur in dataset Repeat above steps, keep incrementing the cluster.

munitech4u · ‎03-29-2016

clus is nothing but a category we need to assing for related IDs

munitech4u · ‎03-29-2016

Mine, is even more complex, as the relationship is not just one way, it can be two way. So like in the example give in other solution, the player id can occur in assignment id and vice versa.

munitech4u · ‎03-29-2016

I have a dataset like this: ID1 ID2 A1 A2 A2 A3 A4 A5 A6 A7 A7 A8 A1 A9 I want an output dataset like: ID Clus A1 1 A2 1 A3 1 A9 1 A4 2 A5 2 A6 3 A7 3 A8 3 Basically I want to cluster all the mapped IDs into one cluster. I tried unsuccessfully with self join. Any ideas?

munitech4u · ‎03-24-2016

Are you suggesting that, I should try by removing the surrogate rules? Well, that was what I had tried initiallly Or are you suggesting that Gradient boosting should not be used with high dimensionality data which might have lot of noise. The only time it worked on my data was, when I had around 30-40 variables and I had oversampled, as my event rate was about 1.22%. But Some people say, that boosting should work.

munitech4u · ‎03-17-2016

proc freq to run ctable?, Can you please explain that?

munitech4u · ‎03-17-2016

Thanks, but do you recommend running it on a dataset as large as 4 million?

munitech4u · ‎03-17-2016

No, I am talking about the section of logistic model, which tells about, concordant, dis-concordant and ties. Which are calculated from pairs of scored probabilities of target 1 and 0.

munitech4u · ‎03-17-2016

Hi, I built a logistic model and the number of ties are about 24%. How can I identify the observations which have ties, so that I can analyse them?

munitech4u · ‎03-14-2016

I have a dataset for a specific time period(say May 2015). and I have another dataset(say Aug 2015). I built a logistic regression model on the May data and tested on out of time. The performance is decreasing slightly. Is there a way to supply posterior probabilities from May to aug data, using bayes implementation in SAS, to adjust the probabilities of Aug data?

Online Status	Offline
Date Last Visited	‎04-04-2019 08:32 AM

Re: How to check duplicate customer IDs using SAS

Re: How to check duplicate customer IDs using SAS

Re: How to check duplicate customer IDs using SAS

Re: How to check duplicate customer IDs using SAS

Re: How to check duplicate customer IDs using SAS

How to check duplicate customer IDs using SAS

Re: Calculating GEODIST for various combinations, cross looping

Calculating GEODIST for various combinations, cross looping

Ensemble of Random Forest and Neural network node in Enterprise Miner

Re: Looping over a dataset to check values in another dataset

Re: How to check duplicate customer IDs using SAS

Re: How to check duplicate customer IDs using SAS

Re: How to check duplicate customer IDs using SAS

Re: Looping over a dataset to check values in another dataset

Re: Looping over a dataset to check values in another dataset

Re: How to store proc freq output of n variables in one dataset?

Re: How to perform this in sas?

Re: How to perform this in sas?

Re: How to perform this in sas?

Re: How to perform this in sas?

Re: How to perform this in sas?

Re: How to perform this in sas?

Re: How to perform this in sas?

Re: How to perform this in sas?

Recursive lookup for ID's

Re: EM Gradient Boosting node, not producing any output

Re: Proc logistic, how to get the observations with ties?

Re: Proc logistic, how to get the observations with ties?

Re: Proc logistic, how to get the observations with ties?

Proc logistic, how to get the observations with ties?

Using bayesian analysis in SAS.