By: | John Cowan |
Version: | 1.0rc3 |
Platforms: | Java |
Category: | General N-converters |
Info on web: | http://www.tagsoup.info/ |
Feedback: | Something wrong? Follow this link. |
TagSoup is an HTML parser which can parse HTML as it is found on the web (and not just valid HTML) and make it available through a SAX 2.0 interface. It attempts to emulate the behaviour of web browsers by correctly nesting elements, inferring missing tags, and adding default attribute values, and it never signals syntax errors.