Mobileread
Wiktionary for Pocketbook
#1  InMyPocket 02-05-2021, 10:00 AM
Hi,

I converted the French Wiktionary from this GitHub project: https://github.com/BoboTiG/ebook-reader-dict

This project extracts dictionaries from Wiktionary every day, in several languages, so the work I did could easily be repeated for other languages too.


The French version I made is based on the data of 28 January 2021. It contains more than 1.5 million words!

It is downloadable here:

https://gofile.io/d/a27MMk

The .dic file must be copied into the "system\dictionaries" folder of your PocketBook reader. (Note: the "System" folder is hidden.)
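
For those who prefer to script the copy, here is a minimal Python sketch. It assumes the reader is mounted as a drive or folder on your computer. The function name install_dictionary and the device_root argument are only illustrative, they are not part of any PocketBook tool. The sketch stops with an error if the dictionaries folder is not found, rather than creating it somewhere unexpected.

```python
import shutil
from pathlib import Path


def install_dictionary(dic_file, device_root):
    """Copy a .dic file into <device_root>/system/dictionaries.

    device_root is wherever the reader is mounted (an assumption:
    a drive letter on Windows, a mount point elsewhere).
    """
    # The folder is hidden on the device, but it can still be addressed by path.
    target_dir = Path(device_root) / "system" / "dictionaries"
    if not target_dir.is_dir():
        raise FileNotFoundError(f"No dictionaries folder at {target_dir}")
    # copy2 returns the path of the new file when the destination is a folder.
    return Path(shutil.copy2(dic_file, target_dir))
```

Example call, with a made-up drive letter: install_dictionary("Wiktionnaire.dic", "E:/")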

Any comments and suggestions are welcome!


------------------

If you want to do the conversion yourself, here are the steps to follow:


# 1 ## Download the Wiktionary in the StarDict format:

https://github.com/BoboTiG/ebook-reader-dict

I used the French version:
https://github.com/BoboTiG/ebook-reader-dict/releases/tag/fr

# 2 ## Convert the StarDict files to an XDXF file using Linguae 0.15.

Download Linguae here:
https://linguae.stalikez.info/

# 3 ## Clean up the XDXF file with the "sed" tool:

1) Change the dictionary name to "Wiktionnaire".
2) Remove the color codes, which are not needed on the reader: #([0-9a-fA-F]){6}
3) Add "\n" after each </li>: this breaks up long lines to avoid crashing the PocketBook converter (step 4).
4) Add "#" at the beginning of each definition: this is only a matter of look and readability. Change it to whatever you like...
5) For the French Wiktionary, remove <i>(Date à préciser)</i> ("date to be specified"): this text is useless for readers. Adapt the text to your language.


I used this lightweight, standalone sed for Windows:
https://raw.githubusercontent.com/mbuilov/sed-windows/master/sed-4.8-x64.exe

and used this command:
sed -e "s/<full_name>.*<\/full_name>/<full_name>Wiktionnaire<\/full_name>/" -e "s/\(<\/li>\)/\1\n/g" -e "s/#\([0-9a-fA-F]\)\{6\}//g" -e "s/<li>/<li># /g" -e "s/<i>(Date .* pr.*ciser)<\/i>//g" dict.xdxf >Wiktionnaire.xdxf
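
If you do not have sed at hand, the same five cleanup steps can be approximated in Python. This is only a sketch mirroring the sed command above, not the tool I actually used. The function name clean_xdxf is made up for illustration, and the "Date" pattern uses non-greedy matching, which is slightly stricter than the sed expression.

```python
import re


def clean_xdxf(text, dict_name="Wiktionnaire"):
    """Apply the five cleanup steps to the XDXF content given as a string."""
    # 1) Change the dictionary name (a function is used as the replacement
    #    so that backslashes in dict_name are not treated as escapes).
    text = re.sub(
        r"<full_name>.*</full_name>",
        lambda m: f"<full_name>{dict_name}</full_name>",
        text,
    )
    # 3) Add a newline after each </li> to break up long lines.
    text = text.replace("</li>", "</li>\n")
    # 2) Remove six-digit hexadecimal color codes such as #aa00FF.
    text = re.sub(r"#[0-9a-fA-F]{6}", "", text)
    # 4) Prefix each definition with "# ".
    text = text.replace("<li>", "<li># ")
    # 5) Remove the French "(Date à préciser)" marker; adapt to your language.
    text = re.sub(r"<i>\(Date .*? pr.*?ciser\)</i>", "", text)
    return text
```

To use it on a file, read dict.xdxf as UTF-8 text, pass the content through clean_xdxf, and write the result to Wiktionnaire.xdxf.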

# 4 ## Convert to Pocketbook format:

The official converter can be downloaded here:

https://support.pocketbook-int.com/dictionaries/u/DictionaryConverter.zip

This version includes only a few "locale" files. The "fr" locale is not included.

These locales are available in an older version in this post:
https://www.mobileread.com/forums/sh...06&postcount=4

The file is here:
https://www.mobileread.com/forums/at...4&d=1435862948

I used this patched version of the converter, which avoids crashes with long XDXF content:
https://www.mobileread.com/forums/sh...82&postcount=4



To convert the file from XDXF to PocketBook format:
converter Wiktionnaire.xdxf fr
=> Creates: Wiktionnaire.dic

#2  ploum 02-08-2021, 06:11 AM
That’s great, thanks for the work. Any idea if the .dic file could work with KOReader?

#3  nhedgehog 02-08-2021, 06:29 AM
Quote ploum
That’s great, thanks for the work. Any idea if the .dic file could work with KOReader?
I guess you could say that. Since KOReader uses the StarDict file format, you should be able to use the StarDict files linked in the first post of the thread.

#4  InMyPocket 02-09-2021, 12:08 PM
Hi,

I put together a bundle of the tools required to convert dictionaries without needing Linguae (I use a modified version of the Penelope tool instead). So the conversion process can now be fully automated.

See my post in the developer corner:
https://www.mobileread.com/forums/sh...7&postcount=76

Note: this bundle also includes the Wiktionary (fr) and an adapted version of the Nouveau Littré.

#5  janno19 03-12-2021, 09:13 PM
Hi folks,

I came too late -> The "wiktionnaire" file is NOT available anymore.

I have already sent a message to InMyPocket and maybe he will help me, but
perhaps someone else could also upload it again.

Thanks anyway!

Have a nice day

#6  nhedgehog 03-14-2021, 05:26 AM
Same here, it seems that InMyPocket has gone missing. I tried to contact her/him too. No response.

#7  janno19 03-14-2021, 11:42 PM
Quote nhedgehog
Same here, it seems that InMyPocket has gone missing. I tried to contact her/him too. No response.
- Thank you, but maybe he's just busy. So I will wait, but if no help comes, we will have to find another way.


For me, it's just that I am too busy right now to learn how to do the work myself. InMyPocket even described how he created the dictionary.

Have a nice day!

#8  nhedgehog 03-15-2021, 02:11 AM
Hope InMyPocket is well. I do have a backup of the aforementioned Penelope hack and a Windows sed version; hope that helps.
[zip] penelope-3.1.3-XDXF-InMyPocket.zip (59.1 KB, 16 views)
[zip] sed-4.8-x64.exe.zip (202.2 KB, 17 views)

#9  janno19 03-15-2021, 02:33 PM
Quote nhedgehog
Hope InMyPocket is well. I do have a backup of the aforementioned Penelope hack and a Windows sed version; hope that helps.
Dear nhedgehog, thank you for uploading the tools, but quite frankly I will be busy for some time.

Have a nice evening!

#10  InMyPocket 03-16-2021, 12:45 PM
Quote nhedgehog
Hope InMyPocket is well
Yes, I'm well, thanks! I was just offline for a few days...

All files can be found here.

There are many Wiktionaries in there (I generated a German version for you) and several multilingual versions, e.g. an FR_EN_NL version or an ES_PT_PL version. See the discussion in the developer corner for more details.

The WiktDict_XXXX files were generated with a brand-new program I wrote to convert dbnary files to XDXF. I also uploaded that tool: dbnary2xdxf.py

To use it, Python must be installed on your computer. Then run:

Quote
dbnary2xdxf.py the_dbnary_ontolex_file lang_from lang_to [lang_to2...] > dest_file.xdxf
Rem: then convert the XDXF file with the PocketBook converter (included in PBConvertTools.zip):
Quote
converter dest_file.xdxf language_code
Enjoy!
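
For anyone who wants to chain the two commands above in one go, here is a rough Python sketch. It assumes that dbnary2xdxf.py and the converter executable are both reachable from the working directory or the PATH. The helper names (dbnary_command, converter_command, build_dictionary) are invented for illustration and are not part of either tool.

```python
import subprocess
import sys


def dbnary_command(dbnary_file, lang_from, langs_to):
    """Build the dbnary2xdxf.py command line (one or more target languages)."""
    if not langs_to:
        raise ValueError("at least one target language is required")
    return [sys.executable, "dbnary2xdxf.py", str(dbnary_file), lang_from, *langs_to]


def converter_command(xdxf_file, language_code):
    """Build the PocketBook converter command line."""
    return ["converter", str(xdxf_file), language_code]


def build_dictionary(dbnary_file, lang_from, langs_to, xdxf_file, language_code):
    """Run both steps; stops at the first command that fails."""
    # dbnary2xdxf.py writes the XDXF to standard output, so redirect it to a file.
    with open(xdxf_file, "wb") as out:
        subprocess.run(dbnary_command(dbnary_file, lang_from, langs_to),
                       stdout=out, check=True)
    subprocess.run(converter_command(xdxf_file, language_code), check=True)
```

The two small builder functions are kept separate so that the command lines can be printed and checked before anything is actually run.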