TagSoupSAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: poor, nasty and brutish. By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. Free and Open Source software, licensed under the Academic Free License
2007-01-24