How to Automatically OCR PDF Files in a Given Folder?

NeoDevin · May 6, 2013

Can anyone recommend a method to have all pdf files in a given folder automatically OCR?

My scanner saves files as pdf, but I would like them to be searchable.

Thanks in advance.

ChrisJA · May 9, 2013

It would help to know what operating system you are using. Mac OS X or Linux?

Adobe Acrobat will do what you want.

NeoDevin · May 11, 2013

I have computers running windows and linux, a method for either would be fine, preferably a free option.

Routaran · May 16, 2013

There's several OCR options available to you to use. I did a Google Search for 'linux ocr pdf' and this was the first hit on the list
http://ubuntuforums.org/showthread.php?t=1456756

you can write a small script with a for loop that will go through the contents of a directory and ocr all the pdf files if the program doesn't have flags that allow you to do multiple pdfs at the same time.

ChrisJA · May 16, 2013

Sorry, I +thought+ I had relied to this days ago. It seems the way to go is "tesseract" http://code.google.com/p/tesseract-ocr/
It has it's own GUI but there are other 3rd party GUIs or you can run it from the command line or script

How to Automatically OCR PDF Files in a Given Folder?

Similar threads

What Free Privacy-Focused AI Chatbots Don’t Use My Data for Training?

How far will we let AI control us?

If you think having a backup is too expensive, try not having one

Impersonation News

Cooling a processor chip

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers