Presently stymied by the utter BS that is MS Word.
This issue is a place to list content patterns and devise tactics for extracting content from Word.
This issue is a spec that is a work in progress.
Feel free to edit and augment existing comments rather than adding to the thread. We can create separate issues for individual tactics, cross-referencing them back here.