Word HTML Cleaners
Here I made a summary of most used Word HTML cleaning tools. I wish it would be helpful in choosing a proper tool for a Word HTML cleaning work. And I don't have a intention to praise or complain some tool, if my opinion is not correct, please email me and I would correct it.- Dreamweaver
Dreamweaver has a function to cleanup Word HTML, but its speed is very slow. When I tried to clean a file of 500KB, it halted.
See the official article of using Dreamweaver to clean Word HTML - Word HTML Cleaner by Textism
An online service. Files smaller than 20K is free, I tried one, it works, generate basic HTML, but styles lost. And most Word generated HTML file is very fat, thus subscription is needed. - Word Cleaner by Zapodo
That's a powerful tool, it can convert DOC to HTML directly without MS Word. I tried, but only a piece of file was converted in trial version, so I don't know the real result of it. It prices $99. - Word to HTML by Maluke
This tool is a convertor from Word to HTML, - Gmail
Send a DOC as attachment to your Gmail address, don't open or download it, just view as HTML, then you get a very cleaned HTML file, but without styles, very like the HTML generated from Word 97. It's an alter way to generate sinple pure HTML.
| Solutions | Word HTML Cleaner | Word Cleaner | Word to HTML | Gmail | Dream-weaver | HTML Cleaner for Word |
|---|---|---|---|---|---|---|
| Producer | Textism | Zapadoo | Maluke | Macromedia (Adobe) | Wonder Studio | |
| Price | C49/year | $99 | $47 | Free | - | $39 |
| Type | Online Service | Software | Software | Online Function | Software Function | Software |
| Speed | Medium | Medium | Medium | Fast | Slow | Fast |
| Big-file |
|
| ||||
| Multi-file | ? |
|
| |||
| Input file type | HTML | DOC | DOC | DOC | HTML | HTML |
| Clean Office tag |
|
|
|
|
|
|
| Clean HTML redundancy |
|
|
|
|
| |
| Retain appearance | ? |
| ||||
| Can generate pure HTML |
|
|
|
|
|
|
| Various options | ? |
|
| |||
| Trial limitation | File size < 20K | a piece conversion | No saving |
- The data is only for reference.
- Some items is not exact for the poor comparability.
- This table will be updated when get new information instantly.
And some other tools may be helpful for cleaning.
- MS Office HTML Filter 2.0 This is a patch tool for Word 2000 originally. It will remove office specific tags. Since Word XP and 2003 has a intrinsic function to generate filtered web page, most users don't need it anymore, except those who still use Word 2000. it just clean unfiltered HTML to filtered, its a tool for Word 2000, which can't generate filter web page. From Word 2002, this function had been ebedded.
- Word 2007 Word 2007 has a new feature, "publish to blog", some says it can generate cleaned HTML, I am not sure, I didn't have Word 2007, I asked my friend to try, he said it will be published directly to some MS blog. I don't know how can it be saved as HTML file.
- HTML Tidy
Actually, I don't know why it's always picked up by some one, even in this field, seems it can do everything in HTML cleaning. After trying, I can't find what it can do to Word HTML cleaning. - Word HTML Cleaner by wordcleaner.co.uk
A free online cleaning website. Generates pure HTML. - Word HTML Clean-up by Bersoft
A small tool to clean . - word2cleanhtml by Oliver Cope
A free online converter. - Microsoft Word 2000 HTML Mess Cleaner by Morten Nilsson
A free online service by ASP and VBScript, source code can be bought.
Reference
Word HTML Cleaning Software Producers
- Zappado: Word Cleaner
- Textism: Word HTML Cleaner
- Maluke: Word to HTML
- Bersoft: Word HTML Clean-up
- WordCleaner.co : Word HTML Cleaner
- Morten Nilsson : Microsoft Word 2000 HTML Mess Cleaner
- Oliver Cope : word2cleanhtml
Articles
