Mobileread
KindleUnpack (MobiUnpack): Extracts text, images and metadata from Kindle/Mobi files
#1261  KevinH 07-19-2019, 07:39 PM
Good point. I have spent no time grokking the kfx codebase, but I would like a standalone Python version that does not require all of calibre and that will take a DRM-free kfx and convert it to something as epub-like as possible. If that already exists, great. If not, I would like to use whatever code we have to "unpack" a kfx into an epub, or a fixed-layout epub if that is what it takes.

#1262  DiapDealer 07-19-2019, 07:47 PM
Quote KevinH
Good point. I have spent no time grokking the kfx codebase, but I would like a standalone Python version that does not require all of calibre and that will take a DRM-free kfx and convert it to something as epub-like as possible.
Oh absolutely. Same here. I'm just completely ignorant as to whether any of the existing routines in KindleUnpack will be helpful in that regard. If it turns out they will, great.

#1263  jhowell 07-19-2019, 08:51 PM
I intended kfxlib to work outside of calibre, but it hasn't been fully tested that way. The basic method of converting KFX to EPUB would be something like:

Code
 import logging
 from kfxlib import YJ_Book
 book = YJ_Book(kfx_file, logging)
 epub_data = book.convert_to_epub()

#1264  KevinH 07-19-2019, 09:07 PM
Thanks! I will make time to play with it over the weekend.

Kevin


Quote jhowell
I intended kfxlib to work outside of calibre, but it hasn't been fully tested that way. The basic method of converting KFX to EPUB would be something like:

Code
 import logging
 from kfxlib import YJ_Book
 book = YJ_Book(kfx_file, logging)
 epub_data = book.convert_to_epub()

#1265  pdurrant 07-20-2019, 03:09 AM
I'm afraid I haven't had time to do much development of any kind, so I'm happy just to observe at this point.

#1266  AcidWeb 08-03-2019, 03:06 PM
I still need Python 3 code that can efficiently extract the AZW3 part of image-heavy dual MOBI files.
The current implementation in mobi_split.py needs minutes for a ~500 MB file.

Consider this post as code bounty. I can pay over PayPal. Contact me for details.
Any result of this bounty will be released under the GPLv3 license.

#1267  KevinH 08-03-2019, 04:51 PM
Don't extract them. Simply remove the old mobi parts leaving the remaining records and adjust the header.
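
A minimal sketch of the record-level part of that idea, not taken from KindleUnpack: parse the Palm database record table once, locate the separator record, and write the surviving records back out in a single pass instead of deleting them one at a time. The function names are hypothetical, and the sketch assumes the two halves of a combo file are separated by a record whose payload is exactly `BOUNDARY`. It does not perform the "adjust the header" step: the MOBI header and EXTH fields of the KF8 part, and the Palm unique ID seed, would still need fixing up as documented in the KindleUnpack and DumpMobiHeader sources.

```python
import struct

PDB_HEADER_LEN = 78    # fixed-size Palm database header
NUM_RECORDS_OFF = 76   # 2-byte big-endian record count inside that header
RECORD_INFO_LEN = 8    # per record: 4-byte offset, 1-byte attributes, 3-byte unique id


def read_records(data):
    """Return a list of (attributes, payload) pairs, one per Palm database record."""
    (count,) = struct.unpack_from(">H", data, NUM_RECORDS_OFF)
    infos = [
        struct.unpack_from(">LL", data, PDB_HEADER_LEN + i * RECORD_INFO_LEN)
        for i in range(count)
    ]
    # A record ends where the next one starts; the last one ends at end of file.
    offsets = [off for off, _ in infos] + [len(data)]
    view = memoryview(data)  # slices of a memoryview do not copy the payload
    return [(infos[i][1] >> 24, view[offsets[i]:offsets[i + 1]]) for i in range(count)]


def find_boundary(records):
    """Index of the first record whose payload is b'BOUNDARY' (assumed separator), or None."""
    for i, (_, payload) in enumerate(records):
        if bytes(payload) == b"BOUNDARY":
            return i
    return None


def build_palmdb(header, records):
    """Assemble a Palm database from a 78-byte header and (attributes, payload) pairs."""
    count = len(records)
    out = bytearray(header[:NUM_RECORDS_OFF])
    out += struct.pack(">H", count)
    # Payloads start after the record table plus two conventional padding bytes.
    offset = PDB_HEADER_LEN + count * RECORD_INFO_LEN + 2
    for uid, (attr, payload) in enumerate(records):
        # Unique ids are simply renumbered in order here.
        out += struct.pack(">LL", offset, (attr << 24) | uid)
        offset += len(payload)
    out += b"\0\0"
    for _, payload in records:
        out += payload
    return bytes(out)
```

With these helpers, dropping the old Mobi part amounts to `build_palmdb(data[:78], records[boundary + 1:])`, which copies each surviving record once regardless of how many records are discarded.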

#1268  AcidWeb 08-03-2019, 06:01 PM
I don't know enough about the MOBI structure to do it.

#1269  KevinH 08-03-2019, 07:25 PM
It is all documented in the KindleUnpack code and in the DumpMobiHeader code.

#1270  tkeo 08-09-2019, 10:12 PM
Quote AcidWeb
I still need Python 3 code that can efficiently extract the AZW3 part of image-heavy dual MOBI files.
The current implementation in mobi_split.py needs minutes for a ~500 MB file.

Consider this post as code bounty. I can pay over PayPal. Contact me for details.
Any result of this bounty will be released under the GPLv3 license.
Isn't it enough to comment out or delete lines 317 to 381 in mobi_split.py?
