<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Create a binary variable from several categorical variables in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743564#M232830</link>
    <description>&lt;P&gt;Hi everyone,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am having trouble with if/then statements.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am trying to create several new variables from two variables: race and hispanic.&lt;/P&gt;&lt;P&gt;Race:&amp;nbsp;&lt;/P&gt;&lt;P&gt;0=MR&lt;BR /&gt;1= White&lt;BR /&gt;2= Black&lt;BR /&gt;3= American Indian&lt;BR /&gt;4=Asian&lt;BR /&gt;5=Pacific Islander&lt;BR /&gt;8= Other&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hispanic:&lt;/P&gt;&lt;P&gt;1= Spanish/Hispanic Origin&lt;/P&gt;&lt;P&gt;2= Not of Spanish/Hispanic Origin&lt;/P&gt;&lt;P&gt;9=Unknown&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I need to create several binary variables:&lt;/P&gt;&lt;P&gt;White/non Hispanic&lt;/P&gt;&lt;P&gt;Black/non Hispanic&lt;/P&gt;&lt;P&gt;Hispanic&lt;/P&gt;&lt;P&gt;Other&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So far, I have tried this code:&lt;/P&gt;&lt;P&gt;data h.data2;&lt;BR /&gt;set h.data;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;if (nrace=1) and (hispanic=2) then whtnh=1;&lt;BR /&gt;else whtnh=0;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;if (nrace=2) and (hispanic=2) then blknh=1;&lt;BR /&gt;else blknh=0;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;if nrace not in ('1','2','0') and (hispanic=1) then hisp=1;&lt;BR /&gt;else hisp=0;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;if nrace not in ('1', '2', '0', '8') then else=1;&lt;BR /&gt;else else=0;&lt;BR /&gt;run;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The code works and produces results, but the total of each '1' does not equal the total sample, 550274:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="bazingarollcall_0-1621952294458.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/59724i5C0C34921568559D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="bazingarollcall_0-1621952294458.png" alt="bazingarollcall_0-1621952294458.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#FF0000"&gt;=551202&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Am I not thinking this through logically, or is it a coding issue? Thanks in advance. I've included the log below.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Log:&lt;/STRONG&gt;&lt;/P&gt;&lt;DIV class="dijitBorderContainer dijitContainer row-fluid dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane dijitAlignCenter dijitContentPaneSingleChild"&gt;&lt;DIV class="dijitBorderContainer dijitContainer row-fluid dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitAlignCenter dijitContentPaneSingleChild dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane"&gt;&lt;DIV class="tabs dijitBorderContainer dijitContainer dojoDndTarget sasStudioTabsParentContainer dijitLayoutContainer dojoDndContainerOver"&gt;&lt;DIV class="dijitTabContainer dijitTabContainerTop dijitContainer dijitLayoutContainer tabStrip-disabled sasStudioTabsTabContainer sasStudioTabsTabContainerVertical sasStudioTabsTop dijitBorderContainer-child dijitBorderContainer-dijitTabContainerTop dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV class="dijitTabPaneWrapper dijitTabContainerTop-container dijitAlignCenter"&gt;&lt;DIV class="dijitTabContainerTopChildWrapper dijitVisible"&gt;&lt;DIV class="dijitBorderContainer dijitContainer sasStudioTabsTabContainerChild dijitTabPane dijitTabContainerTop-child dijitTabContainerTop-dijitBorderContainer dijitLayoutContainer"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dojoDndTarget dijitBorderContainer-child dijitBorderContainer-dijitBorderContainer dijitBorderContainerPane dijitAlignCenter dijitLayoutContainer dojoDndContainerOver"&gt;&lt;DIV class="dijitTabContainer dijitTabContainerTop dijitContainer dijitLayoutContainer tabStrip-disabled sasSuiteTabs dijitBorderContainer-child dijitBorderContainer-dijitTabContainerTop dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV class="dijitTabPaneWrapper dijitTabContainerTop-container dijitAlignCenter"&gt;&lt;DIV class="dijitTabContainerTopChildWrapper dijitVisible"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dijitTabPane dijitTabContainerTop-child dijitTabContainerTop-dijitBorderContainer dijitLayoutContainer"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dijitBorderContainer-child dijitBorderContainer-dijitBorderContainer dijitBorderContainerPane dijitAlignCenter dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV class="dijitBorderContainer dijitContainer row-fluid dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane dijitAlignCenter dijitContentPaneSingleChild"&gt;&lt;DIV class="dijitBorderContainer dijitContainer row-fluid dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitAlignCenter dijitContentPaneSingleChild dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane"&gt;&lt;DIV class="tabs dijitBorderContainer dijitContainer dojoDndTarget sasStudioTabsParentContainer dijitLayoutContainer dojoDndContainerOver"&gt;&lt;DIV class="dijitTabContainer dijitTabContainerTop dijitContainer dijitLayoutContainer tabStrip-disabled sasStudioTabsTabContainer sasStudioTabsTabContainerVertical sasStudioTabsTop dijitBorderContainer-child dijitBorderContainer-dijitTabContainerTop dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV class="dijitTabPaneWrapper dijitTabContainerTop-container dijitAlignCenter"&gt;&lt;DIV class="dijitTabContainerTopChildWrapper dijitVisible"&gt;&lt;DIV class="dijitBorderContainer dijitContainer sasStudioTabsTabContainerChild dijitTabPane dijitTabContainerTop-child dijitTabContainerTop-dijitBorderContainer dijitLayoutContainer"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dojoDndTarget dijitBorderContainer-child dijitBorderContainer-dijitBorderContainer dijitBorderContainerPane dijitAlignCenter dijitLayoutContainer dojoDndContainerOver"&gt;&lt;DIV class="dijitTabContainer dijitTabContainerTop dijitContainer dijitLayoutContainer tabStrip-disabled sasSuiteTabs dijitBorderContainer-child dijitBorderContainer-dijitTabContainerTop dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV class="dijitTabPaneWrapper dijitTabContainerTop-container dijitAlignCenter"&gt;&lt;DIV class="dijitTabContainerTopChildWrapper dijitVisible"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dijitTabPane dijitTabContainerTop-child dijitTabContainerTop-dijitBorderContainer dijitLayoutContainer"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dijitBorderContainer-child dijitBorderContainer-dijitBorderContainer dijitBorderContainerPane dijitAlignCenter dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV&gt;&lt;DIV class="logTimeStampHeader"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;PRE class="sasLog"&gt;&amp;nbsp;&lt;/PRE&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasSource"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;1 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;72&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasSource"&gt;73 data h.mhmatch2;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;74 set h.mhmatch;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: Data file H.MHMATCH.DATA is in a format that is native to another host, or the file encoding does not match the session&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;encoding. Cross Environment Data Access will be used, which might require additional CPU resources and might reduce&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;performance.&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;75&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;76&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;77 if (nrace=1) and (hispanic=2) then whtnh=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;78 else whtnh=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;79&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;80&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;81 if (nrace=2) and (hispanic=2) then blknh=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;82 else blknh=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;83&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;84&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;85 if nrace not in ('1','2','0') and (hispanic=1) then hisp=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;86 else hisp=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;87&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;88&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;89 if nrace not in ('1', '2', '0', '8') then else=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;90 else else=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;91 run;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: Character values have been converted to numeric values at the places given by: (Line):(Column).&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;77:20 81:20 85:5 85:37 89:5&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: There were 550274 observations read from the data set H.MHMATCH.&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: The data set H.MHMATCH2 has 550274 observations and 63 variables.&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: DATA statement used (Total process time):&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;real time 7.08 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;cpu time 3.96 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;92&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;93&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;94 *check to make sure they were created correctly;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;95 proc freq data=h.mhmatch2;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;96 tables whtnh blknh hisp else;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;97 run;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: There were 550274 observations read from the data set H.MHMATCH2.&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: PROCEDURE FREQ used (Total process time):&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;real time 3.99 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;cpu time 0.59 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;98&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;99&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;100 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;112&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasSource"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;1 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;72&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasSource"&gt;73 data h.mhmatch2;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;74 set h.mhmatch;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: Data file H.MHMATCH.DATA is in a format that is native to another host, or the file encoding does not match the session&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;encoding. Cross Environment Data Access will be used, which might require additional CPU resources and might reduce&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;performance.&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;75&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;76&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;77 if (nrace=1) and (hispanic=2) then whtnh=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;78 else whtnh=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;79&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;80&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;81 if (nrace=2) and (hispanic=2) then blknh=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;82 else blknh=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;83&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;84&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;85 if nrace not in ('1','2','0') and (hispanic=1) then hisp=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;86 else hisp=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;87&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;88&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;89 if nrace not in ('1', '2', '0', '8') then else=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;90 else else=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;91 run;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: Character values have been converted to numeric values at the places given by: (Line):(Column).&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;77:20 81:20 85:5 85:37 89:5&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: There were 550274 observations read from the data set H.MHMATCH.&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: The data set H.MHMATCH2 has 550274 observations and 63 variables.&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: DATA statement used (Total process time):&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;real time 7.08 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;cpu time 3.96 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;92&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;93&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;94 *check to make sure they were created correctly;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;95 proc freq data=h.mhmatch2;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;96 tables whtnh blknh hisp else;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;97 run;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: There were 550274 observations read from the data set H.MHMATCH2.&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: PROCEDURE FREQ used (Total process time):&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;real time 3.99 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;cpu time 0.59 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;98&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;99&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;100 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;112&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
    <pubDate>Tue, 25 May 2021 14:18:31 GMT</pubDate>
    <dc:creator>bazingarollcall</dc:creator>
    <dc:date>2021-05-25T14:18:31Z</dc:date>
    <item>
      <title>Create a binary variable from several categorical variables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743564#M232830</link>
      <description>&lt;P&gt;Hi everyone,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am having trouble with if/then statements.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am trying to create several new variables from two variables: race and hispanic.&lt;/P&gt;&lt;P&gt;Race:&amp;nbsp;&lt;/P&gt;&lt;P&gt;0=MR&lt;BR /&gt;1= White&lt;BR /&gt;2= Black&lt;BR /&gt;3= American Indian&lt;BR /&gt;4=Asian&lt;BR /&gt;5=Pacific Islander&lt;BR /&gt;8= Other&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hispanic:&lt;/P&gt;&lt;P&gt;1= Spanish/Hispanic Origin&lt;/P&gt;&lt;P&gt;2= Not of Spanish/Hispanic Origin&lt;/P&gt;&lt;P&gt;9=Unknown&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I need to create several binary variables:&lt;/P&gt;&lt;P&gt;White/non Hispanic&lt;/P&gt;&lt;P&gt;Black/non Hispanic&lt;/P&gt;&lt;P&gt;Hispanic&lt;/P&gt;&lt;P&gt;Other&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So far, I have tried this code:&lt;/P&gt;&lt;P&gt;data h.data2;&lt;BR /&gt;set h.data;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;if (nrace=1) and (hispanic=2) then whtnh=1;&lt;BR /&gt;else whtnh=0;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;if (nrace=2) and (hispanic=2) then blknh=1;&lt;BR /&gt;else blknh=0;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;if nrace not in ('1','2','0') and (hispanic=1) then hisp=1;&lt;BR /&gt;else hisp=0;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;if nrace not in ('1', '2', '0', '8') then else=1;&lt;BR /&gt;else else=0;&lt;BR /&gt;run;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The code works and produces results, but the total of each '1' does not equal the total sample, 550274:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="bazingarollcall_0-1621952294458.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/59724i5C0C34921568559D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="bazingarollcall_0-1621952294458.png" alt="bazingarollcall_0-1621952294458.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#FF0000"&gt;=551202&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Am I not thinking this through logically, or is it a coding issue? Thanks in advance. I've included the log below.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Log:&lt;/STRONG&gt;&lt;/P&gt;&lt;DIV class="dijitBorderContainer dijitContainer row-fluid dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane dijitAlignCenter dijitContentPaneSingleChild"&gt;&lt;DIV class="dijitBorderContainer dijitContainer row-fluid dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitAlignCenter dijitContentPaneSingleChild dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane"&gt;&lt;DIV class="tabs dijitBorderContainer dijitContainer dojoDndTarget sasStudioTabsParentContainer dijitLayoutContainer dojoDndContainerOver"&gt;&lt;DIV class="dijitTabContainer dijitTabContainerTop dijitContainer dijitLayoutContainer tabStrip-disabled sasStudioTabsTabContainer sasStudioTabsTabContainerVertical sasStudioTabsTop dijitBorderContainer-child dijitBorderContainer-dijitTabContainerTop dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV class="dijitTabPaneWrapper dijitTabContainerTop-container dijitAlignCenter"&gt;&lt;DIV class="dijitTabContainerTopChildWrapper dijitVisible"&gt;&lt;DIV class="dijitBorderContainer dijitContainer sasStudioTabsTabContainerChild dijitTabPane dijitTabContainerTop-child dijitTabContainerTop-dijitBorderContainer dijitLayoutContainer"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dojoDndTarget dijitBorderContainer-child dijitBorderContainer-dijitBorderContainer dijitBorderContainerPane dijitAlignCenter dijitLayoutContainer dojoDndContainerOver"&gt;&lt;DIV class="dijitTabContainer dijitTabContainerTop dijitContainer dijitLayoutContainer tabStrip-disabled sasSuiteTabs dijitBorderContainer-child dijitBorderContainer-dijitTabContainerTop dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV class="dijitTabPaneWrapper dijitTabContainerTop-container dijitAlignCenter"&gt;&lt;DIV class="dijitTabContainerTopChildWrapper dijitVisible"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dijitTabPane dijitTabContainerTop-child dijitTabContainerTop-dijitBorderContainer dijitLayoutContainer"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dijitBorderContainer-child dijitBorderContainer-dijitBorderContainer dijitBorderContainerPane dijitAlignCenter dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV class="dijitBorderContainer dijitContainer row-fluid dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane dijitAlignCenter dijitContentPaneSingleChild"&gt;&lt;DIV class="dijitBorderContainer dijitContainer row-fluid dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitAlignCenter dijitContentPaneSingleChild dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane"&gt;&lt;DIV class="tabs dijitBorderContainer dijitContainer dojoDndTarget sasStudioTabsParentContainer dijitLayoutContainer dojoDndContainerOver"&gt;&lt;DIV class="dijitTabContainer dijitTabContainerTop dijitContainer dijitLayoutContainer tabStrip-disabled sasStudioTabsTabContainer sasStudioTabsTabContainerVertical sasStudioTabsTop dijitBorderContainer-child dijitBorderContainer-dijitTabContainerTop dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV class="dijitTabPaneWrapper dijitTabContainerTop-container dijitAlignCenter"&gt;&lt;DIV class="dijitTabContainerTopChildWrapper dijitVisible"&gt;&lt;DIV class="dijitBorderContainer dijitContainer sasStudioTabsTabContainerChild dijitTabPane dijitTabContainerTop-child dijitTabContainerTop-dijitBorderContainer dijitLayoutContainer"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dojoDndTarget dijitBorderContainer-child dijitBorderContainer-dijitBorderContainer dijitBorderContainerPane dijitAlignCenter dijitLayoutContainer dojoDndContainerOver"&gt;&lt;DIV class="dijitTabContainer dijitTabContainerTop dijitContainer dijitLayoutContainer tabStrip-disabled sasSuiteTabs dijitBorderContainer-child dijitBorderContainer-dijitTabContainerTop dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV class="dijitTabPaneWrapper dijitTabContainerTop-container dijitAlignCenter"&gt;&lt;DIV class="dijitTabContainerTopChildWrapper dijitVisible"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dijitTabPane dijitTabContainerTop-child dijitTabContainerTop-dijitBorderContainer dijitLayoutContainer"&gt;&lt;DIV class="dijitBorderContainer dijitContainer dijitBorderContainer-child dijitBorderContainer-dijitBorderContainer dijitBorderContainerPane dijitAlignCenter dijitLayoutContainer"&gt;&lt;DIV class="dijitContentPane dijitBorderContainer-child dijitBorderContainer-dijitContentPane dijitBorderContainerPane dijitAlignCenter"&gt;&lt;DIV&gt;&lt;DIV class="logTimeStampHeader"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;PRE class="sasLog"&gt;&amp;nbsp;&lt;/PRE&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasSource"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;1 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;72&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasSource"&gt;73 data h.mhmatch2;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;74 set h.mhmatch;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: Data file H.MHMATCH.DATA is in a format that is native to another host, or the file encoding does not match the session&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;encoding. Cross Environment Data Access will be used, which might require additional CPU resources and might reduce&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;performance.&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;75&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;76&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;77 if (nrace=1) and (hispanic=2) then whtnh=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;78 else whtnh=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;79&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;80&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;81 if (nrace=2) and (hispanic=2) then blknh=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;82 else blknh=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;83&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;84&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;85 if nrace not in ('1','2','0') and (hispanic=1) then hisp=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;86 else hisp=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;87&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;88&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;89 if nrace not in ('1', '2', '0', '8') then else=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;90 else else=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;91 run;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: Character values have been converted to numeric values at the places given by: (Line):(Column).&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;77:20 81:20 85:5 85:37 89:5&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: There were 550274 observations read from the data set H.MHMATCH.&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: The data set H.MHMATCH2 has 550274 observations and 63 variables.&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: DATA statement used (Total process time):&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;real time 7.08 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;cpu time 3.96 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;92&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;93&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;94 *check to make sure they were created correctly;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;95 proc freq data=h.mhmatch2;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;96 tables whtnh blknh hisp else;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;97 run;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: There were 550274 observations read from the data set H.MHMATCH2.&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: PROCEDURE FREQ used (Total process time):&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;real time 3.99 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;cpu time 0.59 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;98&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;99&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;100 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;112&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasSource"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;1 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;72&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasSource"&gt;73 data h.mhmatch2;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;74 set h.mhmatch;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: Data file H.MHMATCH.DATA is in a format that is native to another host, or the file encoding does not match the session&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;encoding. Cross Environment Data Access will be used, which might require additional CPU resources and might reduce&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;performance.&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;75&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;76&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;77 if (nrace=1) and (hispanic=2) then whtnh=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;78 else whtnh=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;79&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;80&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;81 if (nrace=2) and (hispanic=2) then blknh=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;82 else blknh=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;83&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;84&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;85 if nrace not in ('1','2','0') and (hispanic=1) then hisp=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;86 else hisp=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;87&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;88&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;89 if nrace not in ('1', '2', '0', '8') then else=1;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;90 else else=0;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;91 run;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: Character values have been converted to numeric values at the places given by: (Line):(Column).&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;77:20 81:20 85:5 85:37 89:5&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: There were 550274 observations read from the data set H.MHMATCH.&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: The data set H.MHMATCH2 has 550274 observations and 63 variables.&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: DATA statement used (Total process time):&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;real time 7.08 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;cpu time 3.96 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;92&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;93&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;94 *check to make sure they were created correctly;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;95 proc freq data=h.mhmatch2;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;96 tables whtnh blknh hisp else;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;97 run;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: There were 550274 observations read from the data set H.MHMATCH2.&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;NOTE: PROCEDURE FREQ used (Total process time):&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;real time 3.99 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;cpu time 0.59 seconds&lt;/DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV class="sasNote"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;98&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;99&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;100 OPTIONS NONOTES NOSTIMER NOSOURCE NOSYNTAXCHECK;&lt;/DIV&gt;&lt;DIV class="sasSource"&gt;112&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Tue, 25 May 2021 14:18:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743564#M232830</guid>
      <dc:creator>bazingarollcall</dc:creator>
      <dc:date>2021-05-25T14:18:31Z</dc:date>
    </item>
    <item>
      <title>Re: Create a binary variable from several categorical variables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743566#M232832</link>
      <description>&lt;P&gt;So you can debug this yourself. You go into data set h.data2 and find records where none of these new variables is 1, these are not getting assigned to a combined group, and from there you can determine where your IF statements are failing.&lt;/P&gt;</description>
      <pubDate>Tue, 25 May 2021 14:30:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743566#M232832</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2021-05-25T14:30:25Z</dc:date>
    </item>
    <item>
      <title>Re: Create a binary variable from several categorical variables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743569#M232833</link>
      <description>&lt;P&gt;Your may notice that the total cumulative frequency for all of your variables is 550274. So the exact same number of records are getting assigned something.&lt;/P&gt;
&lt;P&gt;So, how many records have a missing value for the race or ethnicity variable?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A very easy piece of code to test your recoding results is to use Proc freq with list.&lt;/P&gt;
&lt;P&gt;Try this : (or what ever the actual data set name is, your "log" does not match code shown for data set names) any you LOG only shows the output data set with 550274 records:&amp;nbsp;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;NOTE: The data set &lt;FONT color="#FF0000"&gt;H.MHMATCH2 has 550274&lt;/FONT&gt; observations and 63 variables.&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;Proc freq data=h2.data2;
  tables nrace*hispanic*whtnh / list missing;
run;&lt;/PRE&gt;
&lt;P&gt;The list option places the values on a single line so is easy to read. The Missing means the variables with missing values will be shown. If you have 551202 records in the data set this table will total to 551202 and will pretty clearly show where/why your recoding comes in with fewer records.&lt;/P&gt;</description>
      <pubDate>Tue, 25 May 2021 14:42:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743569#M232833</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2021-05-25T14:42:29Z</dc:date>
    </item>
    <item>
      <title>Re: Create a binary variable from several categorical variables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743628#M232857</link>
      <description>Thanks for your reply, PaigeMiller. My sample size is actually 550274 and the race/ethnicity variables total to 551202.&lt;BR /&gt;&lt;BR /&gt;Some observations are being counted twice. How should I go about checking which ones are being counted more than once?</description>
      <pubDate>Tue, 25 May 2021 16:58:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743628#M232857</guid>
      <dc:creator>bazingarollcall</dc:creator>
      <dc:date>2021-05-25T16:58:35Z</dc:date>
    </item>
    <item>
      <title>Re: Create a binary variable from several categorical variables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743629#M232858</link>
      <description>Thanks for your reply, ballardw. My sample size is actually 550274 and the race/ethnicity variables total to 551202, so some observations are presumably being counted more than once. How do I go about checking this?</description>
      <pubDate>Tue, 25 May 2021 16:59:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743629#M232858</guid>
      <dc:creator>bazingarollcall</dc:creator>
      <dc:date>2021-05-25T16:59:42Z</dc:date>
    </item>
    <item>
      <title>Re: Create a binary variable from several categorical variables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743637#M232865</link>
      <description>&lt;P&gt;Now I'm lost. I don't see double-counting, and I don't see 551202. When I add up the numbers next to category 1, I don't get 551202, I get 526233.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;To resolve this, you need to &lt;FONT color="#FF0000"&gt;LOOK AT the data set h.data2 with your own eyes&lt;/FONT&gt; and see where none of the new variables is 1, they will all be zero, that's where your logic error is. (I guess it's also possible that two or more of the new variables are 1, that would also be a problem, which can be identified the same way)&lt;/P&gt;</description>
      <pubDate>Tue, 25 May 2021 17:17:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743637#M232865</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2021-05-25T17:17:49Z</dc:date>
    </item>
    <item>
      <title>Re: Create a binary variable from several categorical variables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743688#M232889</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/370041"&gt;@bazingarollcall&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;Thanks for your reply, ballardw. My sample size is actually 550274 and the race/ethnicity variables total to 551202, so some observations are presumably being counted more than once. How do I go about checking this?&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;I think that you need to write out some very explicit numbers stating where it comes from and how you adt them to get 551202. None of the data you showed in the first post does that.&lt;/P&gt;
&lt;P&gt;If you are totaling all of the 1's across 4 variables then you have misunderstood what your code is doing. There is nothing in your code that made any of the assignments of the variables whtnh, blknh, hisp and else exclusive. Nothing. If you expect the 1 values to total to 550274 then your assignment code is wrong.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Use the Proc Freq code I showed before only add all 4 of your dependent variables in the tables statement:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;tables nrace*hispanic*whtnh* blknh* hisp * else / list missing;&lt;/P&gt;
&lt;P&gt;You will see that you have some values with 1 in multiple columns.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Note that not using the proper values can generate issues. When you use Nrace=2 in one place and Nrace not in ('0' '1' '2') then conversions of values become a possible issue. So, is NRACE numeric or Character?&lt;/P&gt;
&lt;P&gt;The following data step shows what happens when coding with each combination of Nrace and Hispanic as provided assuming NO records are missing either of the variables.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;data toofew toomany;
   do nrace=0 to 5,8;
      do hispanic= 1,2,9;
         if (nrace=1) and (hispanic=2) then whtnh=1;
         else whtnh=0;
         if (nrace=2) and (hispanic=2) then blknh=1;
         else blknh=0;
         if nrace not in (1,2,0) and (hispanic=1) then hisp=1;
         else hisp=0;
         if nrace not in (1, 2, 0, 8) then else=1;
         else else=0;
         if sum(whtnh,blknh,hisp,else)&amp;gt;1 then  output toomany;
         if sum(whtnh,blknh,hisp,else)=0 then  output toofew;
      end;
   end;
run;&lt;/PRE&gt;
&lt;P&gt;Note that the data set TOOMANY will have three records that show that HISP and ELSE are both set to one. That might be part of your issue.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I would suggest that you look at the TOOFEW data set as well. That shows 9 records where the values of all the variables are 0. Which considering that some of then have Hispanic=1 may well not be what you expect.&lt;/P&gt;
&lt;P&gt;I really do not understand why your race = MR (multiracial), white or black is excluded from HISP. Typically if you have a "non-Hispanic White" category then Hispanic White is counted in Hispanic, same with black or other races.&lt;/P&gt;</description>
      <pubDate>Tue, 25 May 2021 20:06:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Create-a-binary-variable-from-several-categorical-variables/m-p/743688#M232889</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2021-05-25T20:06:29Z</dc:date>
    </item>
  </channel>
</rss>

