Tag Soup 1.0.5
Sponsored Links
Tag Soup 1.0.5 Ranking & Summary
File size:
0.050 MB
Platform:
Any Platform
License:
GPL (GNU General Public License)
Price:
Downloads:
953
Date added:
2007-03-21
Publisher:
John Cowan
Tag Soup 1.0.5 description
TagSoup is a SAX2 parser written in Java that, instead of parsing well-formed or valid XML. Tag Soup parses HTML as it is found in the wild: nasty and brutish, though quite often far from short.
By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. It is a parser, not a whole application; it isnt intended to permanently clean up bad HTML, as HTML Tidy does, only to parse it on the fly.
The following options are understood:
--files
Output into individual files, with html extensions changed to xhtml. Otherwise, all output is sent to the standard output.
--html
Output is in clean HTML: the XML declaration is suppressed, as are end-tags for the known empty elements.
--omit-xml-declaration
The XML declaration is suppressed.
--method=html
End-tags for the known empty HTML elements are suppressed.
--pyx
Output is in PYX format.
--pyxin
Input is in PYXoid format (need not be well-formed).
--nons
Namespaces are suppressed. Normally, all elements are in the XHTML 1.x namespace, and all attributes are in no namespace.
--nobogons
Bogons (unknown elements) are suppressed. Normally, they are treated as empty.
--nodefaults
suppress default attribute values
--nocolons
change explicit colons in element and attribute names to underscores
--norestart
dont restart any normally restartable elements
--any
Bogons are given a content model of ANY rather than EMPTY.
--lexical
Pass through HTML comments. Has no effect when output is in PYX format.
--reuse
Reuse a single instance of TagSoup parser throughout. Normally, a new one is instantiated for each input file.
--nocdata
Change the content models of the script and style elements to treat them as ordinary #PCDATA (text-only) elements, as in XHTML, rather than with the special CDATA content model.
--encoding=encoding
Specify the input encoding. The default is the Java platform default.
--help
Print help.
--version
Print the version number.
By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. It is a parser, not a whole application; it isnt intended to permanently clean up bad HTML, as HTML Tidy does, only to parse it on the fly.
The following options are understood:
--files
Output into individual files, with html extensions changed to xhtml. Otherwise, all output is sent to the standard output.
--html
Output is in clean HTML: the XML declaration is suppressed, as are end-tags for the known empty elements.
--omit-xml-declaration
The XML declaration is suppressed.
--method=html
End-tags for the known empty HTML elements are suppressed.
--pyx
Output is in PYX format.
--pyxin
Input is in PYXoid format (need not be well-formed).
--nons
Namespaces are suppressed. Normally, all elements are in the XHTML 1.x namespace, and all attributes are in no namespace.
--nobogons
Bogons (unknown elements) are suppressed. Normally, they are treated as empty.
--nodefaults
suppress default attribute values
--nocolons
change explicit colons in element and attribute names to underscores
--norestart
dont restart any normally restartable elements
--any
Bogons are given a content model of ANY rather than EMPTY.
--lexical
Pass through HTML comments. Has no effect when output is in PYX format.
--reuse
Reuse a single instance of TagSoup parser throughout. Normally, a new one is instantiated for each input file.
--nocdata
Change the content models of the script and style elements to treat them as ordinary #PCDATA (text-only) elements, as in XHTML, rather than with the special CDATA content model.
--encoding=encoding
Specify the input encoding. The default is the Java platform default.
--help
Print help.
--version
Print the version number.
Tag Soup 1.0.5 Screenshot
Tag Soup 1.0.5 Keywords
HTML
TagSoup
XML
SAX2
Tag Soup 1.0.5
Java
written in Java
Tag Soup
Written in
SAX2 Parser
In Java
soup
tag
elements
parser
output
Bookmark Tag Soup 1.0.5
Tag Soup 1.0.5 Copyright
WareSeeker periodically updates pricing and software information of Tag Soup 1.0.5 full version from the publisher, so some information may be slightly out-of-date. You should confirm all information before relying on it. Software piracy is theft, Using crack, password, serial numbers, registration codes, key generators is illegal and prevent future development of Tag Soup 1.0.5 Edition. Download links are directly from our publisher sites, torrent files or links from rapidshare.com, yousendit.com or megaupload.com are not allowed
Featured Software
Want to place your software product here?
Please contact us for consideration.
Contact WareSeeker.com
Related Information
games written in java
yahoo news tag soup
html tags
programs written in java
tetris game written in java
news tag soup
xml parser
soups
web browser written in java
elements of a short story
cabbage soup diet
sax2 parser property
elegy written in a country churchyard
thinking in java
what is tag soup
periodic table of elements
soup and sandwich dishes
applications written in java
Related Software
Beautiful Soup is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping. Free Download
Tagneto is a web developer tool and JavaScript libraries to aid MVC development of XML user interfaces. Free Download
JawFlow is a Workflow Engine partially conformal to WfMC directives. Free Download
DeXSS project provides a SAX2 Parser to help protect against Cross-site scripting (XSS) attacks. Free Download
A classical CD music cataloging utility Free Download
Java-GNOME is a set of Java bindings for the GNOME and GTK libraries. Free Download
Distributed FTP is a distributed FTP daemon written in java. Free Download
CodePrinter is a tiny utility to print out source code or other text files. Free Download
Latest Software
Popular Software
Favourite Software