convert html documents
Sponsored Links
Sponsored Links
Secleted [ 0 ] software to compare
Results 1 - 15 of about 4386
Mail::Message::Convert::HtmlFormatPS 2.066
Mail::Message::Convert::HtmlFormatPS can convert HTML into PostScript. more>>
Mail::Message::Convert::HtmlFormatPS can convert HTML into PostScript.
INHERITANCE
Mail::Message::Convert::HtmlFormatPS
is a Mail::Message::Convert
is a Mail::Reporter
SYNOPSIS
use Mail::Message::Convert::HtmlFormatPS;
my $af = Mail::Message::Convert::HtmlFormatPS->new;
my $postscript = $af->format($body);
Translate an HTML/XHTML message body into a postscript body using HTML::FormatPS.
<<lessINHERITANCE
Mail::Message::Convert::HtmlFormatPS
is a Mail::Message::Convert
is a Mail::Reporter
SYNOPSIS
use Mail::Message::Convert::HtmlFormatPS;
my $af = Mail::Message::Convert::HtmlFormatPS->new;
my $postscript = $af->format($body);
Translate an HTML/XHTML message body into a postscript body using HTML::FormatPS.
Download (0.57MB)
Added: 2006-08-03 License: Perl Artistic License Price:
1178 downloads
Mail::Message::Convert::Html 2.066
Mail::Message::Convert::Html is a Perl module to format messages in HTML. more>>
Mail::Message::Convert::Html is a Perl module to format messages in HTML.
INHERITANCE
Mail::Message::Convert::Html
is a Mail::Message::Convert
is a Mail::Reporter
SYNOPSIS
use Mail::Message::Convert::Html;
my $Html = Mail::Message::Convert::Html->new;
print $html->fieldToHtml($head);
print $html->headToHtmlHead($head);
print $html->headToHtmlTable($head);
print $html->textToHtml($text);
The package contains various translators which handle HTML or XHTML without the help of external modules. There are more HTML related modules, which do require extra packages to be installed.
<<lessINHERITANCE
Mail::Message::Convert::Html
is a Mail::Message::Convert
is a Mail::Reporter
SYNOPSIS
use Mail::Message::Convert::Html;
my $Html = Mail::Message::Convert::Html->new;
print $html->fieldToHtml($head);
print $html->headToHtmlHead($head);
print $html->headToHtmlTable($head);
print $html->textToHtml($text);
The package contains various translators which handle HTML or XHTML without the help of external modules. There are more HTML related modules, which do require extra packages to be installed.
Download (0.57MB)
Added: 2006-08-12 License: Perl Artistic License Price:
1168 downloads
CyberNeko HTML Parser 0.9.5
NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents. more>>
NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
The parser can scan HTML files and "fix up" many common mistakes that human (and computer) authors make in writing HTML documents. NekoHTML adds missing parent elements; automatically closes elements with optional end tags; and can handle mismatched inline element tags.
NekoHTML is written using the Xerces Native Interface (XNI) that is the foundation of the Xerces2 implementation. This enables you to use the NekoHTML parser with existing XNI tools without modification or rewriting code.
Version restrictions:
- There are HTML documents for which NekoHTML cannot properly generate a well-formed XML document event stream. For example, documents with multiple tags are inherently ill-formed because XML documents may only have a single root element.
- Code added to the core DOM implementation in Xerces-J 2.0.1 introduced a bug in the HTML DOM implementation based on it.
The bug causes the element nodes in the resultant HTML document object to be of type org.apache.xerces.dom.ElementNSImpl instead of the appropriate HTML DOM element objects.
The problem affects NekoHTML users who use the parser with Xerces-J 2.0.1 and anyone using the HTML DOM implementation in Xerces-J 2.0.1.
- There are no other known major limitations with this release. However, additional work can always be done to improve performance, fix bugs, and add functionality.
<<lessThe parser can scan HTML files and "fix up" many common mistakes that human (and computer) authors make in writing HTML documents. NekoHTML adds missing parent elements; automatically closes elements with optional end tags; and can handle mismatched inline element tags.
NekoHTML is written using the Xerces Native Interface (XNI) that is the foundation of the Xerces2 implementation. This enables you to use the NekoHTML parser with existing XNI tools without modification or rewriting code.
Version restrictions:
- There are HTML documents for which NekoHTML cannot properly generate a well-formed XML document event stream. For example, documents with multiple tags are inherently ill-formed because XML documents may only have a single root element.
- Code added to the core DOM implementation in Xerces-J 2.0.1 introduced a bug in the HTML DOM implementation based on it.
The bug causes the element nodes in the resultant HTML document object to be of type org.apache.xerces.dom.ElementNSImpl instead of the appropriate HTML DOM element objects.
The problem affects NekoHTML users who use the parser with Xerces-J 2.0.1 and anyone using the HTML DOM implementation in Xerces-J 2.0.1.
- There are no other known major limitations with this release. However, additional work can always be done to improve performance, fix bugs, and add functionality.
Download (0.38MB)
Added: 2005-09-28 License: The Apache License Price:
1486 downloads
Convert::Wiki 0.05
Convert::Wiki is a Perl module that can convert HTML/POD/txt from/to Wiki code. more>>
Convert::Wiki is a Perl module that can convert HTML/POD/txt from/to Wiki code.
SYNOPSIS
use Convert::Wiki;
my $wiki = Convert::Wiki->new();
$wiki->from_txt ( $txt );
die ("Error: " . $wiki->error()) if $wiki->error;
print $wiki->as_wiki();
$wiki->from_html ( $html );
die ("Error: " . $wiki->error()) if $wiki->error;
print $wiki->as_wiki();
# clear the object manually
$wiki->clear();
$wiki->add_txt ( $txt );
die ("Error: " . $wiki->error()) if $wiki->error;
print $wiki->as_wiki();
Convert::Wiki converts from various formats to various Wiki formats.
Input can come as HTML, POD or plain TXT (like it is written in many READMEs). The data will be converted to an internal, node based format and can then be converted to Wikicode as used by many wikis like the Wikipedia.
<<lessSYNOPSIS
use Convert::Wiki;
my $wiki = Convert::Wiki->new();
$wiki->from_txt ( $txt );
die ("Error: " . $wiki->error()) if $wiki->error;
print $wiki->as_wiki();
$wiki->from_html ( $html );
die ("Error: " . $wiki->error()) if $wiki->error;
print $wiki->as_wiki();
# clear the object manually
$wiki->clear();
$wiki->add_txt ( $txt );
die ("Error: " . $wiki->error()) if $wiki->error;
print $wiki->as_wiki();
Convert::Wiki converts from various formats to various Wiki formats.
Input can come as HTML, POD or plain TXT (like it is written in many READMEs). The data will be converted to an internal, node based format and can then be converted to Wikicode as used by many wikis like the Wikipedia.
Download (0.019MB)
Added: 2006-08-04 License: Perl Artistic License Price:
1176 downloads
HTML::WikiConverter::Markdown 0.02
HTML::WikiConverter::Markdown is a Perl module to convert HTML to Markdown markup. more>>
HTML::WikiConverter::Markdown is a Perl module to convert HTML to Markdown markup.
SYNOPSIS
use HTML::WikiConverter;
my $wc = new HTML::WikiConverter( dialect => Markdown );
print $wc->html2wiki( $html );
This module contains rules for converting HTML into Markdown markup. You should not use this module directly; HTML::WikiConverter is the entry point for html->wiki conversion (eg, see synopsis above). See HTML::WikiConverter for additional usage details.
<<lessSYNOPSIS
use HTML::WikiConverter;
my $wc = new HTML::WikiConverter( dialect => Markdown );
print $wc->html2wiki( $html );
This module contains rules for converting HTML into Markdown markup. You should not use this module directly; HTML::WikiConverter is the entry point for html->wiki conversion (eg, see synopsis above). See HTML::WikiConverter for additional usage details.
Download (0.008MB)
Added: 2006-10-19 License: Perl Artistic License Price:
1101 downloads
Jericho HTML Parser 2.4
Jerich HTML Parser is a simple but powerful java library allowing analysis and manipulation of parts of an HTML document. more>>
Jerich HTML Parser is a simple but powerful java library allowing analysis and manipulation of parts of an HTML document, including some common server-side tags, while reproducing verbatim any unrecognised or invalid HTML. It also provides high-level HTML form manipulation functions.
Jericho HTML Parser project is an open source library released under the GNU Lesser General Public License (LGPL). You are therefore free to use it in commercial applications subject to the terms detailed in the licence document.
Main features:
- No parse tree of the entire document is ever generated. The document source text is searched only for the markup relevant to the current operation. This allows the library to analyse and modify documents containing incorrect or badly formatted HTML or any other server or client side code, script, macro or markup. Most other parsers cant handle content that they are not explicitly programmed to accept.
- The beginning and end positions in the source text of all parsed segments are accessible, allowing modification of only selected segments of the document without having to reconstruct the entire document from a parse tree. This feature, in combination with the one above, makes the toolkit extremely powerful in its simplicity.
- Provides a simple but comprehensive interface for the analysis and manipulation of HTML form controls, including the extraction and population of initial values, and conversion to read-only or data display modes. Analysis of the form controls also allows data received from the form to be stored and presented in an appropriate manner.
- ASP, JSP, PSP, PHP and Mason server tags can be registered for recognition by the parser, and are recognised as accurately as is possible without incorporating actual parsers for these languages into the library. The library then allows any of these segments to be ignored when parsing the rest of the document so that they do not interfere with the HTML syntax. (see Segment.ignoreWhenParsing())
- Custom tag types can be easily defined and registered for recognition by the parser.
Enhancements:
- This version has been released under a dual licence system, allowing a choice between the Eclipse Public License (EPL) and the LGPL.
- It includes important bugfixes and introduces the following major features: simple rendering of HTML markup into text, integrated logging with various logging frameworks, and easier parsing of HTML tags containing server tags.
<<lessJericho HTML Parser project is an open source library released under the GNU Lesser General Public License (LGPL). You are therefore free to use it in commercial applications subject to the terms detailed in the licence document.
Main features:
- No parse tree of the entire document is ever generated. The document source text is searched only for the markup relevant to the current operation. This allows the library to analyse and modify documents containing incorrect or badly formatted HTML or any other server or client side code, script, macro or markup. Most other parsers cant handle content that they are not explicitly programmed to accept.
- The beginning and end positions in the source text of all parsed segments are accessible, allowing modification of only selected segments of the document without having to reconstruct the entire document from a parse tree. This feature, in combination with the one above, makes the toolkit extremely powerful in its simplicity.
- Provides a simple but comprehensive interface for the analysis and manipulation of HTML form controls, including the extraction and population of initial values, and conversion to read-only or data display modes. Analysis of the form controls also allows data received from the form to be stored and presented in an appropriate manner.
- ASP, JSP, PSP, PHP and Mason server tags can be registered for recognition by the parser, and are recognised as accurately as is possible without incorporating actual parsers for these languages into the library. The library then allows any of these segments to be ignored when parsing the rest of the document so that they do not interfere with the HTML syntax. (see Segment.ignoreWhenParsing())
- Custom tag types can be easily defined and registered for recognition by the parser.
Enhancements:
- This version has been released under a dual licence system, allowing a choice between the Eclipse Public License (EPL) and the LGPL.
- It includes important bugfixes and introduces the following major features: simple rendering of HTML markup into text, integrated logging with various logging frameworks, and easier parsing of HTML tags containing server tags.
Download (0.85MB)
Added: 2007-05-20 License: LGPL (GNU Lesser General Public License) Price:
534 downloads

Html Code Convert 3.3
Speed up the conversion of HTML code into different format more>>
HTML Code Convert helps speed up the conversion of HTML code into different format including Java Script, JavaServer Pages, Microsoft ASP, PHP, Perl, Python, and the UNIX Shell. It is particularly useful in CGI scripting.
Enhancements:
- Colors and font selected in prefeferences box.
- Fixe bug with Quit button. First try to support accessibility.
- Updated schemas.
<<lessEnhancements:
- Colors and font selected in prefeferences box.
- Fixe bug with Quit button. First try to support accessibility.
- Updated schemas.
Download (184KB)
Added: 2009-04-29 License: Freeware Price:
198 downloads
YaHP Converter 1.2.17
YaHP is a java library that allows you to convert an HTML document into a PDF document. more>>
YaHP Converter is a java library that can convert an HTML document into a PDF document.
YaHP is licensed under the LGPL (GNU).
Method Summary
void convertToPdf(java.lang.String content, IHtmlToPdfTransformer.PageSize size, java.util.List hf, java.lang.String urlForBase, java.io.OutputStream out)
Convert the document in content in a PDF file.
void convertToPdf(java.net.URL url, IHtmlToPdfTransformer.PageSize size, java.util.List hf, java.io.OutputStream out)
Convert the document pointed by url in a PDF file.
private java.net.URL copyToTemp(java.net.URL url)
Copy the file pointed to url in the temp directory
private void init()
initialize the classloader, and the transforme.
Enhancements:
- A NPE which occurred if the FOP_TTF_FONT_PATH property is not set was fixed.
<<lessYaHP is licensed under the LGPL (GNU).
Method Summary
void convertToPdf(java.lang.String content, IHtmlToPdfTransformer.PageSize size, java.util.List hf, java.lang.String urlForBase, java.io.OutputStream out)
Convert the document in content in a PDF file.
void convertToPdf(java.net.URL url, IHtmlToPdfTransformer.PageSize size, java.util.List hf, java.io.OutputStream out)
Convert the document pointed by url in a PDF file.
private java.net.URL copyToTemp(java.net.URL url)
Copy the file pointed to url in the temp directory
private void init()
initialize the classloader, and the transforme.
Enhancements:
- A NPE which occurred if the FOP_TTF_FONT_PATH property is not set was fixed.
Download (29.3MB)
Added: 2007-07-06 License: LGPL (GNU Lesser General Public License) Price:
841 downloads
HTML::GenToc 2.30
HTML::GenToc is a Perl module that generate a Table of Contents for HTML documents. more>>
HTML::GenToc is a Perl module that generate a Table of Contents for HTML documents.
SYNOPSIS
use HTML::GenToc;
# create a new object
my $toc = new HTML::GenToc();
my $toc = new HTML::GenToc(title=>"Table of Contents",
toc=>$my_toc_file,
toc_entry=>{
H1=>1,
H2=>2
},
toc_end=>{
H1=>/H1,
H2=>/H2
}
);
# add further arguments
$toc->args(toc_tag=>"BODY",
toc_tag_replace=>0,
);
# generate anchors for a file
$toc->generate_anchors(infile=>$html_file,
overwrite=>0,
);
# generate a ToC from a file
$toc->generate_toc(infile=>$html_file,
footer=>$footer_file,
header=>$header_file
);
HTML::GenToc generates anchors and a table of contents for HTML documents. Depending on the arguments, it will insert the information it generates, or output to a string, a separate file or STDOUT.
While it defaults to taking H1 and H2 elements as the significant elements to put into the table of contents, any tag can be defined as a significant element. Also, it doesnt matter if the input HTML code is complete, pure HTML, one can input pseudo-html or page-fragments, which makes it suitable for using on templates and HTML meta-languages such as WML.
Also included in the distrubution is hypertoc, a script which uses the module so that one can process files on the command-line in a user-friendly manner.
<<lessSYNOPSIS
use HTML::GenToc;
# create a new object
my $toc = new HTML::GenToc();
my $toc = new HTML::GenToc(title=>"Table of Contents",
toc=>$my_toc_file,
toc_entry=>{
H1=>1,
H2=>2
},
toc_end=>{
H1=>/H1,
H2=>/H2
}
);
# add further arguments
$toc->args(toc_tag=>"BODY",
toc_tag_replace=>0,
);
# generate anchors for a file
$toc->generate_anchors(infile=>$html_file,
overwrite=>0,
);
# generate a ToC from a file
$toc->generate_toc(infile=>$html_file,
footer=>$footer_file,
header=>$header_file
);
HTML::GenToc generates anchors and a table of contents for HTML documents. Depending on the arguments, it will insert the information it generates, or output to a string, a separate file or STDOUT.
While it defaults to taking H1 and H2 elements as the significant elements to put into the table of contents, any tag can be defined as a significant element. Also, it doesnt matter if the input HTML code is complete, pure HTML, one can input pseudo-html or page-fragments, which makes it suitable for using on templates and HTML meta-languages such as WML.
Also included in the distrubution is hypertoc, a script which uses the module so that one can process files on the command-line in a user-friendly manner.
Download (0.043MB)
Added: 2006-06-16 License: Perl Artistic License Price:
1225 downloads
HTML::WikiConverter::Dialects 0.61
HTML::WikiConverter::Dialects is a Perl module which contains a tutorial on how to add a dialect. more>>
HTML::WikiConverter::Dialects is a Perl module which contains a tutorial on how to add a dialect.
SYNOPSIS
# In your dialect module:
package HTML::WikiConverter::MySlimWiki;
use HTML::WikiConverter -dialect;
rule b => { start => **, end => ** };
rule i => { start => //, end => // };
rule strong => { alias => b };
rule em => { alias => i };
rule hr => { replace => "n----n" };
# In a nearby piece of code:
package main;
use Test::More tests => 5;
my $wc = new HTML::WikiConverter(
dialect => MySlimWiki
);
is( $wc->html2wiki( text ), **text**, b );
is( $wc->html2wiki( text ), //text//, i );
is( $wc->html2wiki( text ), **text**, strong );
is( $wc->html2wiki( text ), //text//, em );
is( $wc->html2wiki(
), ----, hr );
HTML::WikiConverter (or H::WC, for short) is an HTML to wiki converter. It can convert HTML source into a variety of wiki markups, called wiki "dialects". This manual describes how you to create your own dialect to be plugged into HTML::WikiConverter.
DIALECTS
Each dialect has a separate dialect module containing rules for converting HTML into wiki markup specific for that dialect. Currently, all dialect modules are in the HTML::WikiConverter:: package space and subclass HTML::WikiConverter. For example, the MediaWiki dialect module is HTML::WikiConverter::MediaWiki, while PhpWikis is HTML::WikiConverter::PhpWiki. However, dialect modules need not be in the HTML::WikiConverter:: package space; you may just as easily use package MyWikiDialect; and H::WC will Do The Right Thing.
From now on, Ill be using the terms "dialect" and "dialect module" interchangeably.
Subclassing
To interface with H::WC, dialects need to subclass it. Because youll probably be wanting the rule() and attribute() functions as well, subclassing and importing these functions is done in a single step:
use HTML::WikiConverter -dialect;
This will add HTML::WikiConverter to your dialects @ISA and will import the attribute() and rule() functions into your dialects package.
Conversion rules
Dialects guide H::WCs conversion process with a set of rules that define how HTML elements are turned into their wiki counterparts. Each rule corresponds to an HTML tag (including nonstandard tags), and there may be any number of rules. Rules are added with the rule() function that was imported when you subclassed H::WC (see above).
The syntax for rule() is as follows:
rule $tag => %subrules;
where $tag is the name of the HTML tag (e.g., "b", "em", etc.) and %subrules contains subrules that specify how that tag will be converted.
<<lessSYNOPSIS
# In your dialect module:
package HTML::WikiConverter::MySlimWiki;
use HTML::WikiConverter -dialect;
rule b => { start => **, end => ** };
rule i => { start => //, end => // };
rule strong => { alias => b };
rule em => { alias => i };
rule hr => { replace => "n----n" };
# In a nearby piece of code:
package main;
use Test::More tests => 5;
my $wc = new HTML::WikiConverter(
dialect => MySlimWiki
);
is( $wc->html2wiki( text ), **text**, b );
is( $wc->html2wiki( text ), //text//, i );
is( $wc->html2wiki( text ), **text**, strong );
is( $wc->html2wiki( text ), //text//, em );
is( $wc->html2wiki(
), ----, hr );
HTML::WikiConverter (or H::WC, for short) is an HTML to wiki converter. It can convert HTML source into a variety of wiki markups, called wiki "dialects". This manual describes how you to create your own dialect to be plugged into HTML::WikiConverter.
DIALECTS
Each dialect has a separate dialect module containing rules for converting HTML into wiki markup specific for that dialect. Currently, all dialect modules are in the HTML::WikiConverter:: package space and subclass HTML::WikiConverter. For example, the MediaWiki dialect module is HTML::WikiConverter::MediaWiki, while PhpWikis is HTML::WikiConverter::PhpWiki. However, dialect modules need not be in the HTML::WikiConverter:: package space; you may just as easily use package MyWikiDialect; and H::WC will Do The Right Thing.
From now on, Ill be using the terms "dialect" and "dialect module" interchangeably.
Subclassing
To interface with H::WC, dialects need to subclass it. Because youll probably be wanting the rule() and attribute() functions as well, subclassing and importing these functions is done in a single step:
use HTML::WikiConverter -dialect;
This will add HTML::WikiConverter to your dialects @ISA and will import the attribute() and rule() functions into your dialects package.
Conversion rules
Dialects guide H::WCs conversion process with a set of rules that define how HTML elements are turned into their wiki counterparts. Each rule corresponds to an HTML tag (including nonstandard tags), and there may be any number of rules. Rules are added with the rule() function that was imported when you subclassed H::WC (see above).
The syntax for rule() is as follows:
rule $tag => %subrules;
where $tag is the name of the HTML tag (e.g., "b", "em", etc.) and %subrules contains subrules that specify how that tag will be converted.
Download (0.034MB)
Added: 2007-07-09 License: Perl Artistic License Price:
838 downloads
iCal::Parser::HTML 1.06
iCal::Parser::HTML is a Perl module to generate HTML calendars from iCalendars. more>>
iCal::Parser::HTML is a Perl module to generate HTML calendars from iCalendars.
SYNOPSIS
use iCal::Parser::HTML;
my $parser=iCal::Parser::HTML->new;
print $parser->parse(type=>$type,start=>$date,files=>[@icals]);
This module uses iCal::Parser::SAX and XML::LibXSLT with included stylesheets to generates html calendars from icalendars.
The html document generated includes (when appropriate) a sidebar containing a legend, a list of todos and a three month calendar for the previous, current and next months.
The stylesheets are stored in the HTML/stylesheet directory under the installed package directory.
Also included in this package are an optionally installed command line program "ical2html" in scripts and, in the example directory, a cgi handler ("ical.cgi" in examples) and a stylesheet ("calendar.css" in examples) for formatting the html output. Note that the html output will look quite broken without the stylesheet.
<<lessSYNOPSIS
use iCal::Parser::HTML;
my $parser=iCal::Parser::HTML->new;
print $parser->parse(type=>$type,start=>$date,files=>[@icals]);
This module uses iCal::Parser::SAX and XML::LibXSLT with included stylesheets to generates html calendars from icalendars.
The html document generated includes (when appropriate) a sidebar containing a legend, a list of todos and a three month calendar for the previous, current and next months.
The stylesheets are stored in the HTML/stylesheet directory under the installed package directory.
Also included in this package are an optionally installed command line program "ical2html" in scripts and, in the example directory, a cgi handler ("ical.cgi" in examples) and a stylesheet ("calendar.css" in examples) for formatting the html output. Note that the html output will look quite broken without the stylesheet.
Download (0.049MB)
Added: 2007-04-07 License: Perl Artistic License Price:
930 downloads
HTML::WikiConverter 0.61
HTML::WikiConverter is a Perl module that can convert HTML to wiki markup. more>>
HTML::WikiConverter is a Perl module that can convert HTML to wiki markup.
SYNOPSIS
use HTML::WikiConverter;
my $wc = new HTML::WikiConverter( dialect => MediaWiki );
# Provide HTML directly
print $wc->html2wiki( $html );
# ...which is the same as
print $wc->html2wiki( html => $html );
# Or fetch it from a file
print $wc->html2wiki( file => $path );
# ...slurp it all at once rather than parsing incrementally
print $wc->html2wiki( file => $path, slurp => 1 );
# Or from a URI
print $wc->html2wiki( uri => $uri );
# Get a list of installed dialects
my @dialects = HTML::WikiConverter->available_dialects;
HTML::WikiConverter is an HTML to wiki converter. It can convert HTML source into a variety of wiki markups, called wiki "dialects". The following dialects are supported:
DokuWiki
Kwiki
MediaWiki
MoinMoin
Oddmuse
PbWiki
PhpWiki
PmWiki
SlipSlap
TikiWiki
UseMod
WakkaWiki
WikkaWiki
Note that while dialects usually produce satisfactory wiki markup, not all features of all dialects are supported. Consult individual dialects documentation for details of supported features. Suggestions for improvements, especially in the form of patches, are very much appreciated.
<<lessSYNOPSIS
use HTML::WikiConverter;
my $wc = new HTML::WikiConverter( dialect => MediaWiki );
# Provide HTML directly
print $wc->html2wiki( $html );
# ...which is the same as
print $wc->html2wiki( html => $html );
# Or fetch it from a file
print $wc->html2wiki( file => $path );
# ...slurp it all at once rather than parsing incrementally
print $wc->html2wiki( file => $path, slurp => 1 );
# Or from a URI
print $wc->html2wiki( uri => $uri );
# Get a list of installed dialects
my @dialects = HTML::WikiConverter->available_dialects;
HTML::WikiConverter is an HTML to wiki converter. It can convert HTML source into a variety of wiki markups, called wiki "dialects". The following dialects are supported:
DokuWiki
Kwiki
MediaWiki
MoinMoin
Oddmuse
PbWiki
PhpWiki
PmWiki
SlipSlap
TikiWiki
UseMod
WakkaWiki
WikkaWiki
Note that while dialects usually produce satisfactory wiki markup, not all features of all dialects are supported. Consult individual dialects documentation for details of supported features. Suggestions for improvements, especially in the form of patches, are very much appreciated.
Download (0.034MB)
Added: 2006-08-15 License: Perl Artistic License Price:
1167 downloads
txt2html 2.45
txt2html is a Perl program that uses the HTML::TextToHTML perl module to convert plain text to HTML. more>>
txt2html is a Perl program that converts plain text to HTML. txt2html project uses the HTML::TextToHTML perl module to do so.
HTML::TextToHTML is a Perl module that converts plain text to HTML.
It supports headings, lists, some tables, simple character markup, and hyperlinking, and is highly customizable.
It recognizes some of the apparent structure of the source document (mostly whitespace and typographic layout), and attempts to mark that structure explicitly using HTML.
Our intent in writing this tool is to provide an easier way of converting existing text documents to HTML format.
txt2html is not a program to convert wordprocessor files or other marked-up document formats. It is also not a program to convert HTML to text. Most HTML browsers do that.
If you need to convert something other than plain text to HTML, or you need to convert from HTML, you should look for a more appropriate tool.
txt2html is not a program for automatically generating a table-of-contents from a file. If you want that, then use txt2html to generate a HTML file, and then use htmltoc or hypertoc on the HTML file.
Enhancements:
- This release fixes a bug with umlauts and a bug with UTF-8 characters.
- The "--underline_delimiter" option was added.
<<lessHTML::TextToHTML is a Perl module that converts plain text to HTML.
It supports headings, lists, some tables, simple character markup, and hyperlinking, and is highly customizable.
It recognizes some of the apparent structure of the source document (mostly whitespace and typographic layout), and attempts to mark that structure explicitly using HTML.
Our intent in writing this tool is to provide an easier way of converting existing text documents to HTML format.
txt2html is not a program to convert wordprocessor files or other marked-up document formats. It is also not a program to convert HTML to text. Most HTML browsers do that.
If you need to convert something other than plain text to HTML, or you need to convert from HTML, you should look for a more appropriate tool.
txt2html is not a program for automatically generating a table-of-contents from a file. If you want that, then use txt2html to generate a HTML file, and then use htmltoc or hypertoc on the HTML file.
Enhancements:
- This release fixes a bug with umlauts and a bug with UTF-8 characters.
- The "--underline_delimiter" option was added.
Download (0.11MB)
Added: 2007-01-26 License: GPL (GNU General Public License) Price:
1003 downloads
RTF to HTML convertor 3.6
The RTF to HTML convertor converts RTF files to HTML file. more>>
The RTF to HTML convertor converts RTF files (in Windows-1250 encoding) to HTML file (in ISO-8859-2 encoding).
Main features:
- Bullets
- Superscript and subscript look bad in html document.
- Subscript is transformed to number. Superscript is transformed to "[number]".
- Text: bold, italic and underline
- Footnotes
- Alignments: left, center and right. "Justify" alignment
- looks bad - program use left alignment. Centered text is greater.
- Tables
- Links: text "aaa@bbb.cz" and "http://www.aaaaaa.cz" convert
- to html links.
- Unicode: Commentary with the character
- name is added to the non ISO Latin2 characters. The program htm2htm will
- convert html with commentaries to the unicode.
- Rtf commands sa and sb.
- (sa>0) or (sb>0) New paragraph - "p" html command
- (sa=0) and (sb=0) New paragraph (left aligned text) "< br >"
Enhancements:
- Processing was fixed in the RTF commands "fldinst", "fldrslt", "plain", "bkmkstart", and "bkmend".
<<lessMain features:
- Bullets
- Superscript and subscript look bad in html document.
- Subscript is transformed to number. Superscript is transformed to "[number]".
- Text: bold, italic and underline
- Footnotes
- Alignments: left, center and right. "Justify" alignment
- looks bad - program use left alignment. Centered text is greater.
- Tables
- Links: text "aaa@bbb.cz" and "http://www.aaaaaa.cz" convert
- to html links.
- Unicode: Commentary with the character
- name is added to the non ISO Latin2 characters. The program htm2htm will
- convert html with commentaries to the unicode.
- Rtf commands sa and sb.
- (sa>0) or (sb>0) New paragraph - "p" html command
- (sa=0) and (sb=0) New paragraph (left aligned text) "< br >"
Enhancements:
- Processing was fixed in the RTF commands "fldinst", "fldrslt", "plain", "bkmkstart", and "bkmend".
Download (0.041MB)
Added: 2005-11-01 License: GPL (GNU General Public License) Price:
1456 downloads
HTML::FromText 2.0.5
HTML::FromText is a Perl module that can convert plain text to HTML. more>>
HTML::FromText is a Perl module that can convert plain text to HTML.
SYNOPSIS
use HTML::FromText;
text2html( $text, %options );
# or
use HTML::FromText ();
my $t2h = HTML::FromText->new( %options );
my $html = $t2h->parse( $html );
HTML::FromText converts plain text to HTML. There are a handfull of options that shape the conversion. There is a utility function, text2html, thats exported by default. This function is simply a short- cut to the Object Oriented interface described in detail below.
<<lessSYNOPSIS
use HTML::FromText;
text2html( $text, %options );
# or
use HTML::FromText ();
my $t2h = HTML::FromText->new( %options );
my $html = $t2h->parse( $html );
HTML::FromText converts plain text to HTML. There are a handfull of options that shape the conversion. There is a utility function, text2html, thats exported by default. This function is simply a short- cut to the Object Oriented interface described in detail below.
Download (0.013MB)
Added: 2006-08-05 License: Perl Artistic License Price:
1175 downloads
Secleted [ 0 ] software to compare
Copyright Notice:
Software piracy is theft, Using crack, password, serial numbers, registration codes, key generators is illegal and prevent future software development. The above convert html documents search only lists software in full, demo and trial versions for free download. Download links are directly from our mirror sites or publisher sites, torrent files or links from rapidshare.com, yousendit.com or megaupload.com are not allowed