Hello,
Is there a tool that can
1. OCR an "image" PDF, and
2. Include the text output as an additional layer in a PDF, so that the user can search, and possibly select+copy, and paste it elsewhere, like it were a "text" PDF?
Thank you.
Quote Shohreh
Is there a tool that can
1. OCR an "image" PDF, and
2. Include the text output as an additional layer in a PDF, so that the user can search, and possibly select+copy, and paste it elsewhere, like it were a "text" PDF?
Besides Adobe Acrobat, pretty much any commercial OCR tool, e.g. ABBYY FineReader, can do this.
There are also a couple of free Linux tools that can do this, e.g. pdfsandwich, but most of them are neither easy to install nor exactly user-friendly.
Thanks for the info.
I tried a couple of open-source apps (Naps2 and ocrmypdf), and the output is pretty good.
Thanks for the tips on naps2 and ocrmypdf. Great looking utilities. k2pdfopt will also do this and also uses Tesseract.
k2pdfopt -mode copy -n- -ocr t file.pdf
Quote willus
k2pdfopt -mode copy -n- -ocr t file.pdf
These commands go in "Additional options" box?
Quote charsee
These commands go in "Additional options" box?
With the MS Windows GUI you can set them as shown in the attached screen shot. The OCR option will automatically turn off native mode.