Thank you to both AndyM and DaleDe. For what I need, I'm finding Ebook Tidy does a smash-up job - and I don't have to learn regular expression programming!
Derek
Quote andym
You might find it worth getting a text editor (such as
e-texteditor that supports regular expressions. These allow you to do very clever search and replace operations - eg select all returns that follow a letter rather than a punctuation mark and delete them (or replace with a space). There's a bit of a learning curve but they offer a very flexible way of dealing with problem texts.
Have a look at the tutorial
here.
Quote delphidb96
I've got a *bunch* of .txt files which have - for some silly reason - *SHORT* lines, maybe 40-45 characters before hitting a hard line-break. Is there a simple executable or a Word 2002 script which will take out all the excess line-breaks within a paragraph. Most of these files have two line-breaks between paragraphs.
I have, on my system Atlantis Word Processor and Word 2002 if anyone has a script for one of those.
Derek
In the MobileaRead Wiki on the conversion page is a link to Stingo's Word Macro. I use it a lot and it turns hard returns into great text. If you already have Word then the total cost is $0.
Quote JSWolf
Got a good text editor that doesn't cost & doesn't need cygwin (hate it)?
Try the
AEdiX Suite. It's a very flexible editor and the search/replace has RE support. I use it for coding, but it's fine for generic text editing as well.
Quote RWood
In the MobileaRead Wiki on the conversion page is a link to Stingo's Word Macro. I use it a lot and it turns hard returns into great text. If you already have Word then the total cost is $0.
Just don't use word to create HTML files from text the results are pretty awful.