VIDEO 2 Converting a Formatted Word file to Clean HTML…
A clean HTML conversion is the best way to start when formatting for e-publication. Conversion from word to HTM is an easy process if you use one of the many free online sources. Simply do a search for “Word to HTML” and you will be presented with several choices.
We are here to help you. “Format Once – Use Everywhere™
NOTE: Information on this page is copyright and may not be shared in any other format. Please respect our other members.
Here’s a short tutorial and step-by-step Video on how to convert a Word file to clean HTML in one easy step.
For our Tutorial we’ve chosen the Word to HTML online tool Word2CleanHTML.com
- Watch the Video (2:38 minutes)
- Highlight (Cmd+A, Ctrl+A) and Copy (Cmd+C, Ctrl+C) all your Word content.
- Go to Word 2 Clean HTML: http://word2cleanhtml.com/
- Paste the document where indicated. (Cmd+V, Ctrl+V)
- Check only these three options:
a. Remove empty paragraphs
b. Convert <b> to <strong>, <i> to <em>
c. Replace non-ascii with HTML entities
NOTICE: DIY KIT Pre-made Style Sheet: The HTML conversion takes out any manual indents, or extra paragraph spacing you created; however, the pre-made style sheet with your DIY Kit automatically replaces indents in each paragraph. If you do no want to have the first paragraph of each chapter indented, we have how-to instructions on this further in the series.
Continue Tutorial
You are now ready to move onto Video 3 – Using the Formatting Template in SIGIL
Hi Suzanne,
I was wondering why you wouldn’t just save the word file as an html file directly. Why is it necessary to use a different program such as “http://word2cleanhtml.com/” to do it?
Thank you.
Tom
Tom, Because Word to HTML creates a lot of excess code that you need to take out to make it Clean — however, if the Microsoft CSS doesn’t bother you then you can do it that way.
Also, the new 5.9 BETA of Sigil now allows you to copy the Word file and paste it into Sigil directly, but the BETA still has bugs, so I would wait until they release the stable 6.0 version. :) Suz