How can I read *.pdf documents using SAS!

Reply
Occasional Contributor
Posts: 12

How can I read *.pdf documents using SAS!

How can I read *.pdf documents using SAS!
SAS Super FREQ
Posts: 8,721

Re: How can I read *.pdf documents using SAS!

Hi!
This paper describes a process whereby you must first take a PDF file and turn it into an ASCII text file before you can read it with SAS. Since PDF is a proprietary format, the process he describes, makes sense. SAS creates PDF format files, it does not read them in their native, binary, format:
http://www8.sas.com/scholars/05/SESUG_05/Proceedings/2005/Serendipity/SER10_05.PDF

One other possibility is that you want to read the data that was collected in a PDF form (an FDF file or an XFDF file), as described in this paper:
http://www2.sas.com/proceedings/sugi27/p032-27.pdf

A third possibility involves printing the PDF document and then scanning it into OCR format, saving the file from the OCR scan and then reading -that- file with SAS (this is a variation of the first possibility).

Good luck!
cynthia
New Contributor
Posts: 2

Re: How can I read *.pdf documents using SAS!

there a variety of online pdf viewer vb.net on the web you can find to read pdf in full version.  you can also have all the processing features: zoom crop scale. most importanly you can convert pdf to various image formats. so it won't be a problem to read pdf now.

Occasional Contributor
Posts: 9

Re: How can I read *.pdf documents using SAS!

So the process of reading PDF doucment file is, in essence, the process of decoding PDF document to bitmap? By the way, witout using Adobe Acrobat PDF document reader, is there any free source code for us to use in order to view document in web application?

Occasional Contributor
Posts: 9

Re: How can I read *.pdf documents using SAS!

Hi, Cathyhill.

I am using another PDF reader to help me read PDF documents instead of Adobe Acrobat PDF document reader. What's more, using code to deal with the related PDF documents reading problem is too complicated for me. So you can choose some manual toolkits which allows users to customize its features according to our own favors to help you with the related PDF documents reading problem. Remember to check its free trial package first if possible. I hope you success. Good luck.

Best regards,

Arron

Occasional Contributor
Posts: 12

Re: How can I read *.pdf documents using SAS!

Thank u very much!!
N/A
Posts: 1

Re: How can I read *.pdf documents using SAS!

The easiest and fastest way by far is to use the full version of Adobe Acrobat.  Yes, it's expensive around $800 for the license but most companies will find at some point they need to edit PDFs.  You can also try on-line PDF to Excel converters (google it) but most only do a small number of pages.  There might be other cheaper PDF editors around.

So basically open the PDF in the full verion of Adobe Acrobat and then   File, Save as, select Excel.  Then from there it is plain sailing.  All the other methods I've looked in to are mega complicated and require lots of messing around.

Occasional Contributor
Posts: 5

Re: How can I read *.pdf documents using SAS!

I don't know which environment you are working in but if it is Windows you might find the PDF-text-extractor useful. There is a client based free version and there is a command line based version for USD 35. I went all in and invested the 35 dollars and built a routine that creates txt copies of all pdfs in a directory structure, thereby enabling the users to perform text-search and link back to the original pdf. Maybe that can serve as a starting point for you? Take a look at http://www.a-pdf.com/text/index.htm.

Ask a Question
Discussion stats
  • 7 replies
  • 13342 views
  • 0 likes
  • 7 in conversation