Mobileread
Split PDF into Multiple files
#1  anonymust 08-09-2015, 06:50 AM
Good day.


Is there a way for me to split a large PDF (490 MB), which my e-reader cannot handle, into multiple parts?

I plan to split it into at least 10 parts, or, if possible, to split it by defining a range from page X to page Y.

I believe the PDF is this large because it was scanned from the original source as images and then put together as a PDF.




P.S. I use a MacBook, not Windows, if you have software recommendations.

#2  markom 08-09-2015, 09:47 AM
Many PDF readers, editors, and tools can split and merge PDF files.

Since 490 MB is far too big for a PDF, I would also try to reduce the file size by reducing the page size (resolution), not just by splitting, because that will make things easier for the e-reader too.
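
To get a feel for how much lowering the resolution might help, here is a rough back-of-the-envelope sketch in Python. It assumes the file size of a scanned PDF is roughly proportional to the pixel count of its page images, so it scales with the square of the resolution ratio. The 300 and 150 DPI figures are illustrative assumptions, not values from this thread, and real results depend heavily on image compression.

```python
def estimate_scaled_size_mb(size_mb: float, old_dpi: float, new_dpi: float) -> float:
    """Rough size estimate after resampling page images from old_dpi to new_dpi.

    Assumes size is proportional to pixel count, which scales with the
    square of the resolution ratio. Compression can change this a lot.
    """
    if old_dpi <= 0 or new_dpi <= 0:
        raise ValueError("DPI values must be positive")
    scale = new_dpi / old_dpi
    return size_mb * scale * scale


if __name__ == "__main__":
    # 300 -> 150 DPI is a hypothetical example; the scan's real DPI is unknown.
    print(estimate_scaled_size_mb(490, 300, 150))
```

Under that assumption, halving the resolution cuts the size to about a quarter, which is a bigger win than splitting alone because each page also becomes lighter for the e-reader to render.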

You can use the k2pdfopt app (Mac version) both for reducing the page size (e.g. to exactly match your reader's screen size and resolution) and for splitting the PDF into several parts (from page X to page Y).

k2pdfopt automatically crops the white borders, reduces the page size, and cuts the PDF pages to fit the e-reader's screen perfectly in landscape or portrait.

http://www.willus.com/k2pdfopt/

http://www.mobileread.com/forums/sho...d.php?t=144711

Some other free PDF apps:

http://www.willus.com/k2pdfopt/pdf_conversion.shtml
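
If you are comfortable with a little scripting, splitting by page range can also be sketched in Python. This is not how k2pdfopt does it; it is a minimal sketch that assumes the third-party pypdf library is installed (`pip install pypdf`). The function names, the `part` file-name prefix, and the `book.pdf` name used afterwards are hypothetical.

```python
from typing import List, Tuple


def page_ranges(total_pages: int, parts: int) -> List[Tuple[int, int]]:
    """Return 1-based inclusive (first, last) page ranges of near-equal size."""
    if total_pages < 1 or parts < 1:
        raise ValueError("total_pages and parts must be positive")
    parts = min(parts, total_pages)  # never create empty parts
    base, extra = divmod(total_pages, parts)
    ranges = []
    start = 1
    for i in range(parts):
        # The first `extra` parts get one additional page each.
        length = base + (1 if i < extra else 0)
        ranges.append((start, start + length - 1))
        start += length
    return ranges


def split_pdf(src: str, parts: int, prefix: str = "part") -> List[str]:
    """Write each page range of src to its own PDF and return the file names."""
    # Imported here so page_ranges() works even without pypdf installed.
    from pypdf import PdfReader, PdfWriter

    reader = PdfReader(src)
    outputs = []
    for n, (first, last) in enumerate(page_ranges(len(reader.pages), parts), 1):
        writer = PdfWriter()
        for index in range(first - 1, last):  # pypdf page indexes are 0-based
            writer.add_page(reader.pages[index])
        name = f"{prefix}_{n:02d}_p{first}-{last}.pdf"
        with open(name, "wb") as handle:
            writer.write(handle)
        outputs.append(name)
    return outputs
```

Calling `split_pdf("book.pdf", 10)` would write ten files named after the pages they contain. Note that this only splits the file; each page stays as heavy as before, so it does not replace reducing the resolution.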

#3  willus 08-09-2015, 11:41 PM
PDFSAM and jPDFtweak are two tools that can also do what you want. They are Java-based, so they should be platform independent, but I have not used either on a Mac.

#4  anonymust 08-10-2015, 06:43 AM
Is it possible with any of these programs to run OCR on the files?

Currently the PDF is in high resolution, and the text is pretty crisp and clear as an image.

But I fear that if I shrink or split the PDF I will not be able to annotate/highlight certain text within the document (because it's an image, not text, if you know what I mean).

To make things more difficult, some of the text is in columns :/

#5  willus 08-10-2015, 09:15 AM
Quote anonymust
Is it possible with any of these programs to run OCR on the files?

Currently the PDF is in high resolution, and the text is pretty crisp and clear as an image.

But I fear that if I shrink or split the PDF I will not be able to annotate/highlight certain text within the document (because it's an image, not text, if you know what I mean).

To make things more difficult, some of the text is in columns :/
The software markom mentioned, k2pdfopt (open source, available for Mac), uses the Tesseract OCR engine if you turn on OCR. That does a pretty good job if the text is well defined, but it is slow, so you may have to run the conversion overnight. It should be okay with multiple columns, depending on the complexity of the layout. You can try things out on a few pages at a time. For an example of running the text menu version (like on the Mac) and turning on OCR, watch the 6-minute video on this help page.

I have read on these forums that the best OCR is done with ABBYY FineReader, which is a commercial program.
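
For trying OCR on a few pages at a time outside k2pdfopt, here is a minimal Python sketch that calls the Tesseract command-line tool directly. This is a different route from k2pdfopt's built-in OCR, not a description of it. It assumes `tesseract` is installed and on your PATH, and that the pages have already been exported as images, since Tesseract reads images rather than PDFs. The file names and function names are hypothetical.

```python
import shutil
import subprocess
from typing import List


def tesseract_pdf_command(image_path: str, output_base: str, lang: str = "eng") -> List[str]:
    """Build the command that writes output_base + '.pdf' with a text layer."""
    # Trailing "pdf" selects Tesseract's searchable-PDF output.
    return ["tesseract", image_path, output_base, "-l", lang, "pdf"]


def ocr_page(image_path: str, output_base: str, lang: str = "eng") -> str:
    """Run Tesseract on one page image and return the resulting PDF path."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    subprocess.run(tesseract_pdf_command(image_path, output_base, lang), check=True)
    return output_base + ".pdf"


if __name__ == "__main__":
    # "page_001.png" is a hypothetical page image exported from the scan.
    print(" ".join(tesseract_pdf_command("page_001.png", "page_001")))
```

The resulting PDF keeps the page image and adds a text layer behind it, which is what makes highlighting possible in readers that support it. How well multi-column pages come out depends on the layout, so test on a handful of pages first.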

#6  anonymust 05-13-2020, 05:21 AM
thank you
