<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: how to convert many PDF to SAS dataset in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294240#M270202</link>
    <description>&lt;P&gt;To add to what&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13868"&gt;@AhmedAl_Attar&lt;/a&gt;&amp;nbsp;posted:&lt;/P&gt;
&lt;P&gt;PDF files as such are not "tabular", so there is no direct conversion path. Apache Tika can convert a PDF into a text-based document (I have done that myself; it works really well and is simple to use), which you can then read into SAS.&lt;/P&gt;
&lt;P&gt;There is also &lt;A href="http://pdfbox.apache.org/" rel="nofollow" target="_blank"&gt;Apache PDFBox&lt;/A&gt;&amp;nbsp;which apparently can do PDF to csv conversions - never used it though.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://tika.apache.org/" target="_blank"&gt;https://tika.apache.org/&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://pdfbox.apache.org/" target="_blank"&gt;http://pdfbox.apache.org/&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 26 Aug 2016 00:53:36 GMT</pubDate>
    <dc:creator>Patrick</dc:creator>
    <dc:date>2016-08-26T00:53:36Z</dc:date>
    <item>
      <title>PDF to SAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294194#M270198</link>
      <description>&lt;P&gt;I have 500 PDF files that I need to convert to SAS datasets. The code I have found is pretty complex and handles only one conversion at a time (one PDF file to one SAS dataset). Is there a simple proc that can convert all 500 PDF files quickly?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Thu, 25 Aug 2016 21:09:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294194#M270198</guid>
      <dc:creator>Defense</dc:creator>
      <dc:date>2016-08-25T21:09:09Z</dc:date>
    </item>
    <item>
      <title>Re: PDF to SAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294199#M270199</link>
      <description>&lt;P&gt;No.&lt;/P&gt;
&lt;P&gt;PDF files are not easily readable by any system &lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;EDIT:&lt;/P&gt;
&lt;P&gt;To clarify: there's no simple proc. Your best bet, as indicated, is to save the data to a text file or another machine-readable file.&lt;/P&gt;
&lt;P&gt;Personally, I would purchase a one-month subscription to Adobe and use Acrobat Pro to convert the files. If you have Acrobat Professional (most big corps do), you can batch process all 500 in a script. Adobe has an Automator feature that works well IMO.&lt;/P&gt;</description>
      <pubDate>Fri, 26 Aug 2016 02:55:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294199#M270199</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2016-08-26T02:55:12Z</dc:date>
    </item>
    <item>
      <title>how to convert many PDF to SAS dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294202#M270200</link>
      <description>&lt;P&gt;I have 500 PDF files that I need to convert to SAS datasets. The code I have found is pretty complex and handles only one conversion at a time (one PDF file to one SAS dataset). Is there a simple proc that can convert all 500 PDF files quickly?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Thu, 25 Aug 2016 21:24:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294202#M270200</guid>
      <dc:creator>Defense</dc:creator>
      <dc:date>2016-08-25T21:24:16Z</dc:date>
    </item>
    <item>
      <title>Re: how to convert many PDF to SAS dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294204#M270201</link>
      <description>&lt;P&gt;You may want to look into converting the PDFs into a file format that SAS can read, such as Excel.&lt;/P&gt;
&lt;P&gt;Here is a link describing one such option: &lt;A href="https://wagda.lib.washington.edu/gishelp/tutorial/excel.html" target="_blank"&gt;https://wagda.lib.washington.edu/gishelp/tutorial/excel.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hope this helps,&lt;/P&gt;
&lt;P&gt;Ahmed&lt;/P&gt;</description>
      <pubDate>Thu, 25 Aug 2016 21:35:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294204#M270201</guid>
      <dc:creator>AhmedAl_Attar</dc:creator>
      <dc:date>2016-08-25T21:35:47Z</dc:date>
    </item>
    <item>
      <title>Re: how to convert many PDF to SAS dataset</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294240#M270202</link>
      <description>&lt;P&gt;To add to what&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13868"&gt;@AhmedAl_Attar&lt;/a&gt;&amp;nbsp;posted:&lt;/P&gt;
&lt;P&gt;PDF files as such are not "tabular", so there is no direct conversion path. Apache Tika can convert a PDF into a text-based document (I have done that myself; it works really well and is simple to use), which you can then read into SAS.&lt;/P&gt;
&lt;P&gt;There is also &lt;A href="http://pdfbox.apache.org/" rel="nofollow" target="_blank"&gt;Apache PDFBox&lt;/A&gt;&amp;nbsp;which apparently can do PDF to csv conversions - never used it though.&lt;/P&gt;
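&lt;P&gt;A minimal sketch of the Tika route mentioned above, under stated assumptions rather than a tested recipe: it presumes Java is installed and that the Tika app jar has been downloaded as tika-app.jar (a hypothetical file name), and it relies on the Tika app's --text option to print plain text. The function names are made up for illustration. Each PDF in a folder becomes one text file that a SAS data step could then read; how well any tables survive depends on the PDFs themselves.&lt;/P&gt;
&lt;PRE&gt;
```python
import subprocess
from pathlib import Path

# Assumption: a locally downloaded Tika app jar (hypothetical file name).
TIKA_JAR = Path("tika-app.jar")


def tika_command(pdf_path, jar=TIKA_JAR):
    """Build the command line that asks the Tika app for plain text."""
    return ["java", "-jar", str(jar), "--text", str(pdf_path)]


def convert_folder(pdf_dir, txt_dir, jar=TIKA_JAR):
    """Write one .txt file per *.pdf file in pdf_dir; return the paths written."""
    out_dir = Path(txt_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for pdf in sorted(Path(pdf_dir).glob("*.pdf")):
        # Tika prints the extracted text on stdout; check=True raises on failure.
        result = subprocess.run(tika_command(pdf, jar), capture_output=True, check=True)
        target = out_dir / (pdf.stem + ".txt")
        target.write_bytes(result.stdout)
        written.append(target)
    return written
```
&lt;/PRE&gt;
&lt;P&gt;Calling convert_folder("pdf_in", "txt_out") (both folder names are placeholders) would then leave one .txt file per PDF in txt_out.&lt;/P&gt;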
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://tika.apache.org/" target="_blank"&gt;https://tika.apache.org/&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://pdfbox.apache.org/" target="_blank"&gt;http://pdfbox.apache.org/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 26 Aug 2016 00:53:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294240#M270202</guid>
      <dc:creator>Patrick</dc:creator>
      <dc:date>2016-08-26T00:53:36Z</dc:date>
    </item>
    <item>
      <title>Re: PDF to SAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294267#M270203</link>
      <description>&lt;P&gt;Such a conversion will only make sense if the PDFs in question contain usable data. Since a PDF could also be one big graphical image (like a scan), it is one of the least suited formats for business intelligence data transfer.&lt;/P&gt;
&lt;P&gt;I'd rather ask the originator to provide the data in a format that makes sense, along with metadata (column descriptions).&lt;/P&gt;</description>
      <pubDate>Fri, 26 Aug 2016 06:15:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294267#M270203</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2016-08-26T06:15:36Z</dc:date>
    </item>
    <item>
      <title>Re: PDF to SAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294392#M270204</link>
      <description>&lt;P&gt;If the issue has to do with fillable PDF forms and the data they contain, then use a proper PDF tool like Adobe Acrobat Pro to export the data. That will usually produce some form of data set that can be imported into SAS.&lt;/P&gt;</description>
      <pubDate>Fri, 26 Aug 2016 14:03:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294392#M270204</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2016-08-26T14:03:15Z</dc:date>
    </item>
    <item>
      <title>Re: PDF to SAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294652#M270205</link>
      <description>This is how I decided to handle my data.&lt;BR /&gt;&lt;BR /&gt;1. Convert the PDFs to Excel using Adobe Acrobat Professional, which lets me convert hundreds of PDFs to Excel with just one “click”.&lt;BR /&gt;&lt;BR /&gt;2. Read the Excel files into SAS using a macro.&lt;BR /&gt;</description>
      <pubDate>Sun, 28 Aug 2016 19:56:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/PDF-to-SAS/m-p/294652#M270205</guid>
      <dc:creator>Defense</dc:creator>
      <dc:date>2016-08-28T19:56:55Z</dc:date>
    </item>
  </channel>
</rss>

