@Switch BlaydeBoth Ernest and I have gone into the 'crap' that MS dumps into WORD, which we've always stripped out simply because it significantly impacts how effective the html is (i.e. if the processor is continually checking invalid codes, it's NOT process the codes which ARE meaningful nearly as quickly.
But the things that WORD does as html entires are for internal housekeeping only, like names, place names, numerics and other details like that which is ONLY there so that WORD can recreate the original WORD file from the html file.
Saving the file as "Filtered html" is the most effective method of stripping those details out, though it still leaves a LOT of crap behind (ex: < span> commands around internal formatting such as italicizing and emphasis commands: "< span style='font-family:"Garamond",serif'>"). Since that and the other commands doesn't DO anything, there's really no point to the commands, but when the page loads, the html code has to run through ALL the various commands to determine whether it applies in some way (the Span command is a handy way of saying 'pardon me, but I've got my own reason for burying this code, so please, don't worry about what it is) and is most often used because Apple has it's own formatting rules, and needs a duplicate "Centering" rule, because it'll ignore ANY center command that doesn't have the duel coding.
SOL, on the other hand, treats all html text as text only, only recognizing a few html commands, specifically in-line formatting < i> < b> and < u>, < a hlink> and < img commands>, as it strips out ALL predefined paragraph Styles (such as first line indenting, or lack thereof, or other specifically formatted paragraph types (such as indented text).
For those of us who also submit our stories to other sites for publishing, this is a real pain in the ass, as we've got to keep separate versions of each variant files of each chapter, section or book. Luckily, SOL does NOT choke when it encounter the various paragraph types, it simply ignores the various paragraph styles, so they essentially 'pass through' without negatively impacting anything.