turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

Find a Community

- Home
- /
- Analytics
- /
- Stat Procs
- /
- How do I calculate a Distance Matrix using Yule's ...

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

01-26-2017 05:55 AM

Hello,

I'm looking at clustering customers based on their transactional data. I have about 3000+ Products (items) in total that belong to various categories (6+) and I need to sort/group the products based on purchasing behavior of the customers. i.e., which products are bought together by the same customer.

I need to use Yule's Q measure of association (between all prooduct combinations) to prepare the input for the cluster analysis. I can pick one category at a time in order to reduce the product cominations.

Yule's Q is (ad - bc)/(ad+bc).

Conceptually, this is the number of pairs in agreement (ad) - the number in disagreement (bc) over the total number of paired observations.

How can I go about doing that using SAS. Proc Distance does not provide the option of Yule's Q

Any help will be more than appreciated.

Regards

Mari

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Posted in reply to mszommer

01-26-2017 07:28 AM

It looks like PROC FREQ can, with Gamma being equivalent to Yule's Q.

It may may help if present some sample data and expected output. Although not all of us understand the specific terminology, most can code many standard formulas.

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content

Posted in reply to Reeza

01-27-2017 06:40 AM

Hello Reeza,

Thank you for the very prompt response.

Attached is an excel file with the requested details of the input data and desired output data.

Lookign forward to hearign from you

Regards

Mari