The SitePoint Forums have moved.

You can now find them here.
This forum is now closed to new posts, but you can browse existing content.
You can find out more information about the move and how to open a new account (if necessary) here.
If you get stuck you can get support by emailing forums@sitepoint.com

If this is your first visit, be sure to
check out the FAQ by clicking the
link above. You may have to register
before you can post: click the register link above to proceed. To start viewing messages,
select the forum that you want to visit from the selection below.

stripping garbage from Word

I am trying to strip all the tags and proprietary garbage from a word doc using PHP. I extract the doc's contents into a string, then pass that string to the function below. This gets rid of a lot of it, but unfortunately not all. Can anyone improve on this?