Text mining and content categorization

How can I use OCR for an image in PDF

Accepted Solution Solved
Reply
Highlighted
Learner
Posts: 1
Accepted Solution

How can I use OCR for an image in PDF

Hello,

 

I would like to use OCR for the extraction of  passports, i.e. passport number in a PDF. 

 

Is this possible in enterprise guide BI 7.1 or enterprise miner client 14.1? And if so, is there a script or manual about how to do this?

 

Thanks in advance. 

 


Accepted Solutions
Solution
‎01-29-2018 05:46 AM
Super User
Posts: 9,923

Re: How can I use OCR for an image in PDF

Use external OCR software to convert the image to text before reading the resulting text into SAS.

SAS does not have in-built OCR, AFAIK.

See hints for using google tesseract from SAS here: https://communities.sas.com/t5/SAS-Text-and-Content-Analytics/How-to-import-pdf-and-jpg-files-in-SAS...

---------------------------------------------------------------------------------------------
Maxims of Maximally Efficient SAS Programmers
How to convert datasets to data steps
How to post code

View solution in original post


All Replies
Solution
‎01-29-2018 05:46 AM
Super User
Posts: 9,923

Re: How can I use OCR for an image in PDF

Use external OCR software to convert the image to text before reading the resulting text into SAS.

SAS does not have in-built OCR, AFAIK.

See hints for using google tesseract from SAS here: https://communities.sas.com/t5/SAS-Text-and-Content-Analytics/How-to-import-pdf-and-jpg-files-in-SAS...

---------------------------------------------------------------------------------------------
Maxims of Maximally Efficient SAS Programmers
How to convert datasets to data steps
How to post code
☑ This topic is solved.

Need further help from the community? Please ask a new question.

Discussion stats
  • 1 reply
  • 462 views
  • 1 like
  • 2 in conversation