<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic PDF to SAS Dataset in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS-Dataset/m-p/401636#M97491</link>
    <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I want to prepare a&amp;nbsp;SAS data from pdf file.&lt;/P&gt;&lt;P&gt;For example, I have a some link where pdf file exists say : "&lt;A href="https://www.example.com/test/demo.pdf" target="_blank"&gt;https://www.example.com/test/demo.pdf&lt;/A&gt;".&lt;/P&gt;&lt;P&gt;How can I read whole pdf file (free text) in one SAS dataset. It would be fine even though whole pdf text data stored in one row only.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in Advance,&lt;/P&gt;&lt;P&gt;Tushar R. Sonawane&lt;/P&gt;</description>
    <pubDate>Fri, 06 Oct 2017 07:28:57 GMT</pubDate>
    <dc:creator>toshi</dc:creator>
    <dc:date>2017-10-06T07:28:57Z</dc:date>
    <item>
      <title>PDF to SAS Dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS-Dataset/m-p/401636#M97491</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I want to prepare a&amp;nbsp;SAS data from pdf file.&lt;/P&gt;&lt;P&gt;For example, I have a some link where pdf file exists say : "&lt;A href="https://www.example.com/test/demo.pdf" target="_blank"&gt;https://www.example.com/test/demo.pdf&lt;/A&gt;".&lt;/P&gt;&lt;P&gt;How can I read whole pdf file (free text) in one SAS dataset. It would be fine even though whole pdf text data stored in one row only.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in Advance,&lt;/P&gt;&lt;P&gt;Tushar R. Sonawane&lt;/P&gt;</description>
      <pubDate>Fri, 06 Oct 2017 07:28:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS-Dataset/m-p/401636#M97491</guid>
      <dc:creator>toshi</dc:creator>
      <dc:date>2017-10-06T07:28:57Z</dc:date>
    </item>
    <item>
      <title>Re: PDF to SAS Dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS-Dataset/m-p/401639#M97492</link>
      <description>&lt;P&gt;Reading pdf files are usually an awkward way to get information. It must be better to try to get hold on the data that was used to create the pdf.&lt;/P&gt;
&lt;P&gt;If you are fine with having the whole pdf in "one record", I suggets that you save the pdf as a txt file, and then read it from SAS.&lt;/P&gt;</description>
      <pubDate>Fri, 06 Oct 2017 07:40:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS-Dataset/m-p/401639#M97492</guid>
      <dc:creator>LinusH</dc:creator>
      <dc:date>2017-10-06T07:40:34Z</dc:date>
    </item>
    <item>
      <title>Re: PDF to SAS Dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS-Dataset/m-p/401641#M97493</link>
      <description>&lt;P&gt;The .pdf format is actually a container for all kinds of data. It could well be that your input is just a scanned picture.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;To find tools that can extract text from pdfs, google "extract text from pdf command line".&lt;/P&gt;</description>
      <pubDate>Fri, 06 Oct 2017 07:56:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS-Dataset/m-p/401641#M97493</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2017-10-06T07:56:54Z</dc:date>
    </item>
  </channel>
</rss>

