Text mining and content categorization

How can I use OCR for an image in PDF

Learner
Posts: 1

Hello,

I would like to use OCR to extract passport details, i.e. the passport number, from an image in a PDF.

Is this possible in Enterprise Guide BI 7.1 or Enterprise Miner Client 14.1? If so, is there a script or manual that explains how to do this?

Thanks in advance.

Accepted Solutions
Solution
4 weeks ago
Super User
Posts: 8,590

Re: How can I use OCR for an image in PDF

Use external OCR software to convert the image to text before reading the resulting text into SAS.

As far as I know, SAS does not have built-in OCR.

See hints for using Google Tesseract from SAS here: https://communities.sas.com/t5/SAS-Text-and-Content-Analytics/How-to-import-pdf-and-jpg-files-in-SAS...
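
The "OCR first, then read the text into SAS" approach could look roughly like the Python sketch below. It is not taken from the linked thread. It assumes the `pdftoppm` (Poppler) and `tesseract` command-line tools are installed and on the PATH, and the function names and the passport number pattern (one or two uppercase letters followed by six to eight digits) are made up for illustration; real passport number formats differ by country.

```python
import re
import subprocess
import tempfile
from pathlib import Path
from typing import List

# ASSUMPTION: illustrative pattern only (1-2 uppercase letters + 6-8 digits).
# Adjust it to the passport formats you actually expect.
PASSPORT_RE = re.compile(r"\b[A-Z]{1,2}[0-9]{6,8}\b")


def extract_passport_numbers(text: str) -> List[str]:
    """Return every substring of `text` that matches the assumed pattern."""
    return PASSPORT_RE.findall(text)


def ocr_pdf(pdf_path: str, dpi: int = 300) -> str:
    """Render each PDF page to PNG with pdftoppm, then OCR it with tesseract."""
    with tempfile.TemporaryDirectory() as tmp:
        prefix = str(Path(tmp) / "page")
        # pdftoppm writes one PNG per page, named <prefix>-<page number>.png
        subprocess.run(
            ["pdftoppm", "-r", str(dpi), "-png", pdf_path, prefix],
            check=True,
        )
        pages = []
        for image in sorted(Path(tmp).glob("page*.png")):
            # "stdout" as the output base makes tesseract print the text
            result = subprocess.run(
                ["tesseract", str(image), "stdout"],
                check=True, capture_output=True, text=True,
            )
            pages.append(result.stdout)
    return "\n".join(pages)


def ocr_pdf_to_text_file(pdf_path: str, txt_path: str) -> List[str]:
    """OCR a PDF, save the raw text for SAS to read, return candidate numbers."""
    text = ocr_pdf(pdf_path)
    Path(txt_path).write_text(text, encoding="utf-8")
    return extract_passport_numbers(text)
```

Calling `ocr_pdf_to_text_file("scan.pdf", "scan.txt")` (both file names are placeholders) would leave a plain text file that a SAS DATA step can then read with an INFILE statement. OCR output is often noisy, so any matched numbers should be checked against the source document.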

---------------------------------------------------------------------------------------------
Maxims of Maximally Efficient SAS Programmers


☑ This topic is solved.
