Encode::HanExtra 0.10
Encode::HanExtra 0.10 Ranking & Summary
File size:
1.3 MB
Platform:
Any Platform
License:
Perl Artistic License
Price:
Downloads:
836
Date added:
2007-07-25
Publisher:
Autrijus Tang
Encode::HanExtra 0.10 description
The Encode::HanExtra Perl module provides extra sets of Chinese encodings.
SYNOPSIS
use Encode;
# Traditional Chinese
$euc_tw = encode("euc-tw", $utf8); # loads Encode::HanExtra implicitly
$utf8 = decode("euc-tw", $euc_tw); # ditto
# Simplified Chinese
$gb18030 = encode("gb18030", $utf8); # loads Encode::HanExtra implicitly
$utf8 = decode("gb18030", $gb18030); # ditto
DESCRIPTION
Perl 5.7.3 and later ships with an adequate set of Chinese encodings, including the commonly used CP950, CP936 (also known as GBK), Big5 (alias for Big5-Eten), Big5-HKSCS, EUC-CN, HZ, and ISO-IR-165.
However, the number of Chinese encodings is staggering, and complete coverage would easily increase the size of the perl distribution by several megabytes; hence, this CPAN module tries to provide the rest of them.
If you are using perl 5.8 or later, Encode::CN and Encode::TW will automatically load the extra encodings for you, so there's no need to explicitly write "use Encode::HanExtra" if you are already using one of them.
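The implicit loading described above can be checked at run time. The following is a minimal sketch that assumes only the core Encode module; the helper name encoding_available is hypothetical and is not part of Encode or Encode::HanExtra. It asks Encode::find_encoding whether a name resolves, which should succeed for euc-tw or gb18030 only when Encode::HanExtra is installed.

```perl
use strict;
use warnings;
use Encode ();

# Hypothetical helper: returns 1 if Encode can resolve $name to an
# installed encoding, 0 otherwise. find_encoding() returns undef for
# names it cannot resolve, including names whose module is not installed.
sub encoding_available {
    my ($name) = @_;
    return defined Encode::find_encoding($name) ? 1 : 0;
}

# cp936 and big5-eten ship with perl; euc-tw and gb18030 need the
# extra tables from Encode::HanExtra.
for my $name (qw(cp936 big5-eten euc-tw gb18030)) {
    printf "%-10s %s\n", $name,
        encoding_available($name) ? "available" : "missing";
}
```

Checking availability first lets a program fall back to a core encoding, or report a clear error, instead of failing inside encode() or decode().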
ENCODINGS
This version includes the following encoding tables:
Canonical     Alias                          Description
-----------------------------------------------------------------------------
big5-1984     /\b(tca-)?big5-?(19)?84$/i     TCA's original Big5-1984
big5ext       /\b(cmex-)?big5-?e(xt)?$/i     CMEX's Big5e Extension
big5plus      /\b(cmex-)?big5-?p(lus)?$/i    CMEX's Big5+ Extension
              /\b(cmex-)?big5\+$/i
cccii         /\b(ccag-)?cccii$/i            Chinese Character Code for
                                             Information Interchange
cns11643-1    /\bCNS[-_ ]?11643[-_]1$/i      Taiwan's CNS map, plane 1
cns11643-2    /\bCNS[-_ ]?11643[-_]2$/i      Taiwan's CNS map, plane 2
cns11643-3    /\bCNS[-_ ]?11643[-_]3$/i      Taiwan's CNS map, plane 3
cns11643-4    /\bCNS[-_ ]?11643[-_]4$/i      Taiwan's CNS map, plane 4
cns11643-5    /\bCNS[-_ ]?11643[-_]5$/i      Taiwan's CNS map, plane 5
cns11643-6    /\bCNS[-_ ]?11643[-_]6$/i      Taiwan's CNS map, plane 6
cns11643-7    /\bCNS[-_ ]?11643[-_]7$/i      Taiwan's CNS map, plane 7
cns11643-f    /\bCNS[-_ ]?11643[-_]f$/i      Taiwan's CNS map, plane F
euc-tw        /\beuc.*tw$/i                  EUC (Extended Unix Code)
              /\btw.*euc$/i
gb18030       /\bGB[-_ ]?18030$/i            GBK with Traditional Characters
unisys        /\bunisys$/i                   Unisys Traditional Chinese
unisys-sosi1                                 Unisys SOSI1 transport encoding
unisys-sosi2                                 Unisys SOSI2 transport encoding
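The relationship between alias and canonical name in the table can be explored with Encode::resolve_alias. This is a minimal sketch rather than the module's own code; the helper name canonical_names is hypothetical. It assumes that names from the table resolve only when Encode::HanExtra is installed and loaded, so it tries to load the module first and reports anything unresolved as not available. GBK and Big5 are core aliases and are included for comparison.

```perl
use strict;
use warnings;
use Encode ();

# Load the extra encodings if they are installed; a failure here only
# means the names from the table above will not resolve.
eval { require Encode::HanExtra; 1 };

# Hypothetical helper: map each requested name to its canonical
# encoding name, or to undef when no installed encoding matches.
sub canonical_names {
    my (@names) = @_;
    my %canonical;
    for my $name (@names) {
        my $resolved = Encode::resolve_alias($name);
        $canonical{$name} = $resolved ? $resolved : undef;
    }
    return \%canonical;
}

my $map = canonical_names(qw(GBK Big5 CNS-11643-1 tca-big5-84));
for my $name (sort keys %$map) {
    printf "%-12s => %s\n", $name,
        defined $map->{$name} ? $map->{$name} : "(not available)";
}
```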
Detailed descriptions are as follows:
BIG5-1984
This is the original Big5 encoding made by TCA Taiwan.
BIG5PLUS
This encoding, while not heavily used, is CMEX Taiwan's attempt to bring all of Taiwan's conflicting internal-use encodings together and fit them in as an extension to the widely deployed Big5 range.
BIG5EXT
CMEX's second (and less ambitious) attempt at unifying the most commonly used characters not covered by Big5, without spilling outside the 94x94 arrangement as BIG5PLUS did.
CCCII
The earliest (and most sophisticated) Traditional Chinese encoding, with a three-byte raw character map. It was made in 1980 by the Chinese Character Analysis Group (CCAG) and is used mostly in library systems.
EUC-TW
The EUC transport version of CNS11643 (planes 1-7), the comprehensive character set used by the Taiwan government.
CNS11643-*
The raw character maps extracted from the Unihan database, including plane F, which wasn't included in EUC-TW.
GB18030
An extension to GBK, this encoding covers most Han characters (both simplified and traditional), as well as some other scripts used by other peoples in China.
UNISYS
Unisys System's internal Chinese mapping.
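Because the encodings described above differ in which characters they cover, it can help to test whether a given string survives encoding without loss. The sketch below is one possible approach, not something provided by Encode::HanExtra; the helper name encodings_covering is hypothetical. It relies on the Encode::FB_CROAK check mode, which makes encode() die on the first character the target encoding cannot represent.

```perl
use strict;
use warnings;
use Encode ();

# Hypothetical helper: return the names from @candidates that can encode
# $text without loss. Encodings that are not installed are skipped.
sub encodings_covering {
    my ($text, @candidates) = @_;
    my @ok;
    for my $name (@candidates) {
        my $enc = Encode::find_encoding($name) or next;
        # FB_CROAK dies on an unmappable character; LEAVE_SRC keeps
        # $text unmodified so it can be reused for the next candidate.
        my $check = Encode::FB_CROAK() | Encode::LEAVE_SRC();
        my $bytes = eval { $enc->encode($text, $check) };
        push @ok, $name if defined $bytes;
    }
    return @ok;
}

# U+9AD4 is a Traditional Chinese character; gb18030 is only reported
# when Encode::HanExtra is installed.
my $traditional = "\x{9AD4}";
print join(", ", encodings_covering($traditional, qw(euc-cn cp936 gb18030))), "\n";
```

A check of this kind makes the trade-off between the smaller core encodings and the broader tables in this module visible for a specific piece of text.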