<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Scarping Web Data on District Clerks Website in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Scarping-Web-Data-on-District-Clerks-Website/m-p/804572#M316851</link>
    <description>You'll probably have to study the javascript that runs when clicking on the buttons on the page.&lt;BR /&gt;That is what allows you to see the names on the page. The file is &lt;A href="https://www.cdcatexas.com/js/clerks.min.js" target="_blank"&gt;https://www.cdcatexas.com/js/clerks.min.js&lt;/A&gt; . &lt;BR /&gt;&lt;BR /&gt;Best regards, Jos</description>
    <pubDate>Mon, 28 Mar 2022 17:18:34 GMT</pubDate>
    <dc:creator>JosvanderVelden</dc:creator>
    <dc:date>2022-03-28T17:18:34Z</dc:date>
    <item>
      <title>Scarping Web Data on District Clerks Website</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Scarping-Web-Data-on-District-Clerks-Website/m-p/804559#M316844</link>
      <description>&lt;P&gt;Hello,&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am attempting to utilized SAS to created a code that obtain the data from the clerk directory to help make my co-workers job easier and help make it easier to inform them about this important new update regarding vital records. I review several articles and am stuck on the step involving the p&lt;SPAN&gt;rocessing/parse the web page contents to gather the data. I have made some progress but I am not sure how to create a data steps that allows me to actually read&amp;nbsp;formName,&amp;nbsp;formCounty,&amp;nbsp;formJobTitle, and&amp;nbsp;formEMail&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I am attempting to to pull Clerk Name, Job Title, County, and Email from this webpage:&amp;nbsp;&lt;A href="https://www.cdcatexas.com/Directory/Clerks" target="_blank"&gt;https://www.cdcatexas.com/Directory/Clerks&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;* Pull out Clerk Name, Job Title, County, and Email;&lt;BR /&gt;&lt;BR /&gt;filename src temp ; &lt;BR /&gt;proc http&lt;BR /&gt;method="GET"&lt;BR /&gt;url="https://www.cdcatexas.com/Directory/Clerks"&lt;BR /&gt;out=src;&lt;BR /&gt;run;&lt;BR /&gt;&lt;BR /&gt;/* Read the entire file and skip the blank lines */&lt;BR /&gt;/* the LEN indicator tells us the length of each line */&lt;BR /&gt;&lt;BR /&gt;data rep;&lt;BR /&gt;infile src length=len lrecl=32767;&lt;BR /&gt;input line $varying32767. len;&lt;BR /&gt;line = strip(line);&lt;BR /&gt;if len&amp;gt;0;&lt;BR /&gt;run;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Mar 2022 16:29:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Scarping-Web-Data-on-District-Clerks-Website/m-p/804559#M316844</guid>
      <dc:creator>watsoninvest</dc:creator>
      <dc:date>2022-03-28T16:29:15Z</dc:date>
    </item>
    <item>
      <title>Re: Scarping Web Data on District Clerks Website</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Scarping-Web-Data-on-District-Clerks-Website/m-p/804572#M316851</link>
      <description>You'll probably have to study the javascript that runs when clicking on the buttons on the page.&lt;BR /&gt;That is what allows you to see the names on the page. The file is &lt;A href="https://www.cdcatexas.com/js/clerks.min.js" target="_blank"&gt;https://www.cdcatexas.com/js/clerks.min.js&lt;/A&gt; . &lt;BR /&gt;&lt;BR /&gt;Best regards, Jos</description>
      <pubDate>Mon, 28 Mar 2022 17:18:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Scarping-Web-Data-on-District-Clerks-Website/m-p/804572#M316851</guid>
      <dc:creator>JosvanderVelden</dc:creator>
      <dc:date>2022-03-28T17:18:34Z</dc:date>
    </item>
    <item>
      <title>Re: Scarping Web Data on District Clerks Website</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Scarping-Web-Data-on-District-Clerks-Website/m-p/805506#M317320</link>
      <description>Thanks for your tip.</description>
      <pubDate>Fri, 01 Apr 2022 14:08:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Scarping-Web-Data-on-District-Clerks-Website/m-p/805506#M317320</guid>
      <dc:creator>watsoninvest</dc:creator>
      <dc:date>2022-04-01T14:08:44Z</dc:date>
    </item>
  </channel>
</rss>

