<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Clarifications on Associations Rules in SAS Academy for Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Clarifications-on-Associations-Rules/m-p/647327#M770</link>
    <description>&lt;P&gt;&lt;FONT&gt;&lt;STRONG&gt;1. Data structure for Market Basket Analysis (MBA) and Sequence Analysis:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;just want to make sure I clearly understand how the data should be prepared for those analyses and the differences between the two.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;F&lt;FONT&gt;or MBA, each physical record represents an item and the ID represents a "Transaction". For instance, in the Bank dataset (page 8-54 of course notes), all entries with the same Account Number are considered to be part of the same "Transaction"; therefore, the total number of transactions in the dataset is equal to the number of distinct values in Account Number (which is the number used to calculate Support).&lt;BR /&gt;On the other hand, for Sequence Analysis, in general, the ID represents a specific "Customer" and "Transactions" are identified by the sequence number which is derived by using a time variable. In other words, items bought by a given Customer at the same time, are grouped together to form a "visit/transaction". Therefore, Customer and Transaction are two different concepts when it comes to Sequence Analysis. In the specific example of the Bank dataset, for a given Account Number, each transaction is represented by a different value in "Order of Service Addition" (in the example, it is assumed only 1 product can be purchased in a single transaction).&lt;BR /&gt;Finally, a "Sequence" is a set of transactions/visits all related to the same Customer, which are temporally related (i.e. they all happened within a certain timeframe in a specific order).&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;Is the above correct?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;MY Answer:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;Please see the screenshot of the Metadata:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;For market basket analysis:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;DIV id="tinyMceEditorgcjfernandez_gmail_com_0" class="mceNonEditable lia-copypaste-placeholder"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV id="tinyMceEditorgcjfernandez_gmail_com_2" class="mceNonEditable lia-copypaste-placeholder"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="MBA metadata.jpg" style="width: 783px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/39358i9BEA264F87703076/image-size/large?v=v2&amp;amp;px=999" role="button" title="MBA metadata.jpg" alt="MBA metadata.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;The data role should be transaction&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="MBA data table.jpg" style="width: 999px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/39359iEAAD5DEDFD9EECE0/image-size/large?v=v2&amp;amp;px=999" role="button" title="MBA data table.jpg" alt="MBA data table.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;For sequence analysis:&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;The role of variable Visit will be changed to sequence&amp;nbsp;&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;DIV id="tinyMceEditorgcjfernandez_gmail_com_1" class="mceNonEditable lia-copypaste-placeholder"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;2.&lt;/STRONG&gt;&lt;FONT&gt;&lt;STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;Rare occurrences:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;in Market Basket Analysis, is it correct to say that if we were interested in "rare/unusual" combinations of items, then we would look at rules with a low Support and/or possibly a Lift below 1 (see page 8.57 of course notes)?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;My Answer:&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;Yes you are correct if you are after rare combination of items in MBA&lt;/FONT&gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;3. Property "Support Percentage" of Association Node:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;the "Enterprise Miner 15.1: Reference Help" (at page 410) reports the following: "[...] The support percentage figure that you specify refers to the proportion of the largest single item frequency, and not the end support".&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;Would it be possible to clarify the meaning and practical implications of that sentence?&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;My Answer:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;In order to consider an association rule useful what will be the minimum support % (Default% is 5. But You can modify based on your need)&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;&lt;STRONG&gt;4. Structure of rules in Sequence Analysis:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;is it correct to say that rules derived in a Sequence Analysis can only contain single items in the identified chain; i.e. A =&amp;gt; B =&amp;gt; C and not A =&amp;gt; (B and C) =&amp;gt; D?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;My Answer:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;Yes you are correct. Also in a rule you have Left and right side of the rule. You can have two items&amp;nbsp; appeared sequentially in the left side of the rule. But in the right side of the rule you can have only single item. Refer the Rule table in the Sequence analysis output.&lt;/STRONG&gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;STRONG&gt;5. Calculation of Support in Sequence Analysis:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;would it be possible to clarify how support is calculated&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;(see statement at the bottom of page 8-68 of course notes)?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;First of all, in Sequence Analysis, is it correct to say that the left hand side of association rules must be related to a transaction/visit which precedes, from a temporal point of view, the item(s) on the right hand side? &lt;FONT color="#0000FF"&gt;Yes&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;Secondly, for the calculation of Support, what is the "unit of count"? In other words, what are the numbers used in the denominator and numerator? Are they based on the concept of "sequence" or "individual transaction"?&lt;FONT color="#0000FF"&gt; Frequency Count&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;My Answer: (From the course notes)&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;The percent support is the transaction count divided by the total number of customers, which &lt;/FONT&gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;would be the maximum transaction count. The percent confidence is the transaction count &lt;/FONT&gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;divided by the transaction count for the left side of the sequence. &lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;&lt;STRONG&gt;6. "Association Node Rules Selector":&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;I am not sure I understand the purpose of the Rules Selection (accessed via property "Rules"); why and how should be used?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;My Answer:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;I am also not sure of the purpose of rule selection. There is another node (Link analysis) available in SAS EM which also perform&amp;nbsp; recommendation analysis. Please check the EM help.&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 13 May 2020 06:56:07 GMT</pubDate>
    <dc:creator>gcjfernandez</dc:creator>
    <dc:date>2020-05-13T06:56:07Z</dc:date>
    <item>
      <title>Clarifications on Associations Rules</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Clarifications-on-Associations-Rules/m-p/646968#M766</link>
      <description>&lt;P&gt;Re:&amp;nbsp;&lt;FONT style="background-color: #ffffff;"&gt;Applied Analytics Using SAS Enterprise Miner&amp;nbsp;&lt;FONT style="background-color: #ffffff; box-sizing: border-box; color: #333333; font-family: Arial,Helvetica,sans-serif; font-size: 16px; font-style: normal; font-variant: normal; font-weight: 300; letter-spacing: normal; orphans: 2; text-align: left; text-decoration: none; text-indent: 0px; text-transform: none; -webkit-text-stroke-width: 0px; white-space: normal; word-spacing: 0px;"&gt;(chapter 8 of course notes / Lesson 2 online)&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT style="background-color: #ffffff;"&gt;&lt;BR /&gt;&lt;STRONG&gt;1. Data structure for Market Basket Analysis (MBA) and Sequence Analysis:&lt;/STRONG&gt; just want to make sure I clearly understand how the data should be prepared for those analyses and the differences between the two.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;F&lt;FONT style="background-color: #ffffff;"&gt;or MBA, each physical record represents an item and the ID represents a "Transaction". For instance, in the Bank dataset (page 8-54 of course notes), all entries with the same Account Number are considered to be part of the same "Transaction"; therefore, the total number of transactions in the dataset is equal to the number of distinct values in Account Number (which is the number used to calculate Support).&lt;BR /&gt;On the other hand, for Sequence Analysis, in general, the ID represents a specific "Customer" and "Transactions" are identified by the sequence number which is derived by using a time variable. In other words, items bought by a given Customer at the same time, are grouped together to form a "visit/transaction". Therefore, Customer and Transaction are two different concepts when it comes to Sequence Analysis. In the specific example of the Bank dataset, for a given Account Number, each transaction is represented by a different value in "Order of Service Addition" (in the example, it is assumed only 1 product can be purchased in a single transaction).&lt;BR /&gt;Finally, a "Sequence" is a set of transactions/visits all related to the same Customer, which are temporally related (i.e. they all happened within a certain timeframe in a specific order).&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT style="background-color: #ffffff;"&gt;Is the above correct?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;2.&lt;/STRONG&gt;&lt;FONT style="background-color: #ffffff;"&gt;&lt;STRONG&gt; Rare occurrences:&lt;/STRONG&gt; in Market Basket Analysis, is it correct to say that if we were interested in "rare/unusual" combinations of items, then we would look at rules with a low Support and/or possibly a Lift below 1 (see page 8.57 of course notes)?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT style="background-color: #ffffff;"&gt;&lt;BR /&gt;&lt;STRONG&gt;3. Property "Support Percentage" of Association Node:&lt;/STRONG&gt; the "Enterprise Miner 15.1: Reference Help" (at page 410) reports the following: "[...] The support percentage figure that you specify refers to the proportion of the largest single item frequency, and not the end support".&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;Would it be possible to clarify the meaning and practical implications of that sentence?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT style="background-color: #ffffff;"&gt;&lt;STRONG&gt;4. Structure of rules in Sequence Analysis:&lt;/STRONG&gt; &lt;FONT style="background-color: #ffffff; box-sizing: border-box; color: #333333; font-family: Arial,Helvetica,sans-serif; font-size: 16px; font-style: normal; font-variant: normal; font-weight: 300; letter-spacing: normal; orphans: 2; text-align: left; text-decoration: none; text-indent: 0px; text-transform: none; -webkit-text-stroke-width: 0px; white-space: normal; word-spacing: 0px;"&gt;is it correct to say that rules derived in a Sequence Analysis can only contain single items in the identified chain; i.e. A =&amp;gt; B =&amp;gt; C and not A =&amp;gt; (B and C) =&amp;gt; D?&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT style="background-color: #ffffff;"&gt;&lt;BR /&gt;&lt;STRONG&gt;5. Calculation of Support in Sequence Analysis:&lt;/STRONG&gt; would it be possible to clarify how support is calculated &lt;FONT style="background-color: #ffffff; box-sizing: border-box; color: #333333; font-family: Arial,Helvetica,sans-serif; font-size: 16px; font-style: normal; font-variant: normal; font-weight: 300; letter-spacing: normal; orphans: 2; text-align: left; text-decoration: none; text-indent: 0px; text-transform: none; -webkit-text-stroke-width: 0px; white-space: normal; word-spacing: 0px;"&gt;(see statement at the bottom of page 8-68 of course notes)&lt;/FONT&gt;?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT style="background-color: #ffffff;"&gt;First of all, in Sequence Analysis, is it correct to say that the left hand side of association rules must be related to a transaction/visit which preceeds, from a temporal point of view, the item(s) on the right hand side?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT style="background-color: #ffffff;"&gt;Secondly, for the calculation of Support, what is the "unit of count"? In other words, what are the numbers used in the denominator and numerator? Are they based on the concept of "sequence" or "individual transaction"?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT style="background-color: #ffffff;"&gt;&lt;STRONG&gt;6. "Association Node Rules Selector":&lt;/STRONG&gt; I am not sure I understand the purpose of the Rules Selection (accessed via property "Rules"); why and how should be used?&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 12 May 2020 05:41:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Clarifications-on-Associations-Rules/m-p/646968#M766</guid>
      <dc:creator>pvareschi</dc:creator>
      <dc:date>2020-05-12T05:41:42Z</dc:date>
    </item>
    <item>
      <title>Re: Clarifications on Associations Rules</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Clarifications-on-Associations-Rules/m-p/647327#M770</link>
      <description>&lt;P&gt;&lt;FONT&gt;&lt;STRONG&gt;1. Data structure for Market Basket Analysis (MBA) and Sequence Analysis:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;just want to make sure I clearly understand how the data should be prepared for those analyses and the differences between the two.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;F&lt;FONT&gt;or MBA, each physical record represents an item and the ID represents a "Transaction". For instance, in the Bank dataset (page 8-54 of course notes), all entries with the same Account Number are considered to be part of the same "Transaction"; therefore, the total number of transactions in the dataset is equal to the number of distinct values in Account Number (which is the number used to calculate Support).&lt;BR /&gt;On the other hand, for Sequence Analysis, in general, the ID represents a specific "Customer" and "Transactions" are identified by the sequence number which is derived by using a time variable. In other words, items bought by a given Customer at the same time, are grouped together to form a "visit/transaction". Therefore, Customer and Transaction are two different concepts when it comes to Sequence Analysis. In the specific example of the Bank dataset, for a given Account Number, each transaction is represented by a different value in "Order of Service Addition" (in the example, it is assumed only 1 product can be purchased in a single transaction).&lt;BR /&gt;Finally, a "Sequence" is a set of transactions/visits all related to the same Customer, which are temporally related (i.e. they all happened within a certain timeframe in a specific order).&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;Is the above correct?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;MY Answer:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;Please see the screenshot of the Metadata:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;For market basket analysis:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;DIV id="tinyMceEditorgcjfernandez_gmail_com_0" class="mceNonEditable lia-copypaste-placeholder"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV id="tinyMceEditorgcjfernandez_gmail_com_2" class="mceNonEditable lia-copypaste-placeholder"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="MBA metadata.jpg" style="width: 783px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/39358i9BEA264F87703076/image-size/large?v=v2&amp;amp;px=999" role="button" title="MBA metadata.jpg" alt="MBA metadata.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;The data role should be transaction&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="MBA data table.jpg" style="width: 999px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/39359iEAAD5DEDFD9EECE0/image-size/large?v=v2&amp;amp;px=999" role="button" title="MBA data table.jpg" alt="MBA data table.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;For sequence analysis:&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;The role of variable Visit will be changed to sequence&amp;nbsp;&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;DIV id="tinyMceEditorgcjfernandez_gmail_com_1" class="mceNonEditable lia-copypaste-placeholder"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;2.&lt;/STRONG&gt;&lt;FONT&gt;&lt;STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;Rare occurrences:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;in Market Basket Analysis, is it correct to say that if we were interested in "rare/unusual" combinations of items, then we would look at rules with a low Support and/or possibly a Lift below 1 (see page 8.57 of course notes)?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;My Answer:&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;Yes you are correct if you are after rare combination of items in MBA&lt;/FONT&gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;3. Property "Support Percentage" of Association Node:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;the "Enterprise Miner 15.1: Reference Help" (at page 410) reports the following: "[...] The support percentage figure that you specify refers to the proportion of the largest single item frequency, and not the end support".&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;Would it be possible to clarify the meaning and practical implications of that sentence?&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;My Answer:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;In order to consider an association rule useful what will be the minimum support % (Default% is 5. But You can modify based on your need)&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;&lt;STRONG&gt;4. Structure of rules in Sequence Analysis:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;is it correct to say that rules derived in a Sequence Analysis can only contain single items in the identified chain; i.e. A =&amp;gt; B =&amp;gt; C and not A =&amp;gt; (B and C) =&amp;gt; D?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;My Answer:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;Yes you are correct. Also in a rule you have Left and right side of the rule. You can have two items&amp;nbsp; appeared sequentially in the left side of the rule. But in the right side of the rule you can have only single item. Refer the Rule table in the Sequence analysis output.&lt;/STRONG&gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;STRONG&gt;5. Calculation of Support in Sequence Analysis:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;would it be possible to clarify how support is calculated&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;(see statement at the bottom of page 8-68 of course notes)?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;First of all, in Sequence Analysis, is it correct to say that the left hand side of association rules must be related to a transaction/visit which precedes, from a temporal point of view, the item(s) on the right hand side? &lt;FONT color="#0000FF"&gt;Yes&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;Secondly, for the calculation of Support, what is the "unit of count"? In other words, what are the numbers used in the denominator and numerator? Are they based on the concept of "sequence" or "individual transaction"?&lt;FONT color="#0000FF"&gt; Frequency Count&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;My Answer: (From the course notes)&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;The percent support is the transaction count divided by the total number of customers, which &lt;/FONT&gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;would be the maximum transaction count. The percent confidence is the transaction count &lt;/FONT&gt;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;&lt;FONT color="#0000FF"&gt;divided by the transaction count for the left side of the sequence. &lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT&gt;&lt;STRONG&gt;6. "Association Node Rules Selector":&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;I am not sure I understand the purpose of the Rules Selection (accessed via property "Rules"); why and how should be used?&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;My Answer:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;I am also not sure of the purpose of rule selection. There is another node (Link analysis) available in SAS EM which also perform&amp;nbsp; recommendation analysis. Please check the EM help.&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 13 May 2020 06:56:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Clarifications-on-Associations-Rules/m-p/647327#M770</guid>
      <dc:creator>gcjfernandez</dc:creator>
      <dc:date>2020-05-13T06:56:07Z</dc:date>
    </item>
  </channel>
</rss>

