lingua ru charset 0.02
Sponsored Links
Sponsored Links
Secleted [ 0 ] software to compare
Results 1 - 15 of about 376
Lingua::RU::Charset 0.02
Lingua::RU::Charset is a Perl extension for detecting and converting various russian character sets. more>>
Lingua::RU::Charset is a Perl extension for detecting and converting various russian character sets: KOI8-r, Windows-1251, CP866, ISO-8859-5, X-Mac-Cyrillic, russian text in english letters, russian part of Unicode and UTF-8. Lingua::RU::Charset module can be especially useful for computers with broken cyrillic locales (like foreign web hosts).
SYNOPSIS
use Lingua::RU::Charset qw (:CHARSET);
use Lingua::RU::Charset qw (:CONVERT);
use Lingua::RU::Charset qw (:CONVERT :CHARCASE);
use Lingua::RU::Charset qw (any2koi koi2lc koi2uc);
More documentation and examples coming soon...
NOTE
Unfortunately I dont have time to implement the Unicode and UTF-8 subroutines. But I am sure that such functions would be useful for interesting Perl scripts exchanging russian data with Java servlets. So you are welcome to submit some code!
<<lessSYNOPSIS
use Lingua::RU::Charset qw (:CHARSET);
use Lingua::RU::Charset qw (:CONVERT);
use Lingua::RU::Charset qw (:CONVERT :CHARCASE);
use Lingua::RU::Charset qw (any2koi koi2lc koi2uc);
More documentation and examples coming soon...
NOTE
Unfortunately I dont have time to implement the Unicode and UTF-8 subroutines. But I am sure that such functions would be useful for interesting Perl scripts exchanging russian data with Java servlets. So you are welcome to submit some code!
Download (0.004MB)
Added: 2007-06-12 License: Perl Artistic License Price:
864 downloads
MARC::Charset 0.95
MARC::Charset is a Perl module to convert MARC-8 encoded strings to UTF-8. more>>
MARC::Charset is a Perl module to convert MARC-8 encoded strings to UTF-8.
SYNOPSIS
# import the marc8_to_utf8 function
use MARC::Charset marc8_to_utf8;
# prepare STDOUT for utf8
binmode(STDOUT, utf8);
# print out some marc8 as utf8
print marc8_to_utf8($marc8_string);
MARC::Charset allows you to turn MARC-8 encoded strings into UTF-8 strings. MARC-8 is a single byte character encoding that predates unicode, and allows you to put non-Roman scripts in MARC bibliographic records.
<<lessSYNOPSIS
# import the marc8_to_utf8 function
use MARC::Charset marc8_to_utf8;
# prepare STDOUT for utf8
binmode(STDOUT, utf8);
# print out some marc8 as utf8
print marc8_to_utf8($marc8_string);
MARC::Charset allows you to turn MARC-8 encoded strings into UTF-8 strings. MARC-8 is a single byte character encoding that predates unicode, and allows you to put non-Roman scripts in MARC bibliographic records.
Download (0.18MB)
Added: 2006-08-24 License: Perl Artistic License Price:
1158 downloads
Lingua::ZH::DateTime 0.01
Lingua::ZH::DateTime is a Perl module to convert time to chinese format. more>>
Lingua::ZH::DateTime is a Perl module to convert time to chinese format.
SYNOPSIS
This module can convert asctime to chinese format in your locale. you can select china singapore hongkong taiwan locale and gb2312/big5 charset. asctime is ASC C standard format will from localtime like this:
$asctime = localtime();
<<lessSYNOPSIS
This module can convert asctime to chinese format in your locale. you can select china singapore hongkong taiwan locale and gb2312/big5 charset. asctime is ASC C standard format will from localtime like this:
$asctime = localtime();
Download (0.003MB)
Added: 2006-08-09 License: Perl Artistic License Price:
1173 downloads
MIME::Charset 0.044
MIME::Charset module contains charset informations for MIME. more>>
MIME::Charset module contains charset information for MIME.
SYNOPSIS
Getting charset informations:
use MIME::Charset qw(:info);
$benc = body_encoding("iso-8859-2"); # "Q"
$cset = canonical_charset("ANSI X3.4-1968"); # "US-ASCII"
$henc = header_encoding("utf-8"); # "S"
$cset = output_charset("shift_jis"); # "ISO-2022-JP"
Translating text data:
use MIME::Charset qw(:trans);
($text, $charset, $encoding) =
header_encode(
"xc9xc2xc5xaaxc0xdexc3xefxc5xaa".
"xc7xd1xcaxaaxbdxd0xcexcfxb4xef",
"euc-jp");
# ...returns (< converted >, "ISO-2022-JP", "B");
($text, $charset, $encoding) =
body_encode(
"Collectioneur pathxe9tiquement ".
"xe9clectique de dxe9chets",
"latin1");
# ...returns (< original >, "ISO-8859-1", "QUOTED-PRINTABLE");
$len = encoded_header_len(
"Perlxe8xa8x80xe8xaax9e", "b", "utf-8"); # 28
Manipulating module defaults:
use MIME::Charset;
MIME::Charset::alias("csEUCKR", "euc-kr");
MIME::Charset::default("iso-8859-1");
MIME::Charset::fallback("us-ascii");
<<lessSYNOPSIS
Getting charset informations:
use MIME::Charset qw(:info);
$benc = body_encoding("iso-8859-2"); # "Q"
$cset = canonical_charset("ANSI X3.4-1968"); # "US-ASCII"
$henc = header_encoding("utf-8"); # "S"
$cset = output_charset("shift_jis"); # "ISO-2022-JP"
Translating text data:
use MIME::Charset qw(:trans);
($text, $charset, $encoding) =
header_encode(
"xc9xc2xc5xaaxc0xdexc3xefxc5xaa".
"xc7xd1xcaxaaxbdxd0xcexcfxb4xef",
"euc-jp");
# ...returns (< converted >, "ISO-2022-JP", "B");
($text, $charset, $encoding) =
body_encode(
"Collectioneur pathxe9tiquement ".
"xe9clectique de dxe9chets",
"latin1");
# ...returns (< original >, "ISO-8859-1", "QUOTED-PRINTABLE");
$len = encoded_header_len(
"Perlxe8xa8x80xe8xaax9e", "b", "utf-8"); # 28
Manipulating module defaults:
use MIME::Charset;
MIME::Charset::alias("csEUCKR", "euc-kr");
MIME::Charset::default("iso-8859-1");
MIME::Charset::fallback("us-ascii");
Download (0.029MB)
Added: 2007-07-09 License: Perl Artistic License Price:
840 downloads
Lingua::ZH::HanConvert 0.12
Lingua::ZH::HanConvert is a Perl module to convert between Traditional and Simplified Chinese characters. more>> <<less
Download (0.030MB)
Added: 2006-08-09 License: Perl Artistic License Price:
1171 downloads
Lingua::EO::Supersignoj 0.02
Lingua::EO::Supersignoj is a Perl module that can convert Esperanto characters. more>>
Lingua::EO::Supersignoj is a Perl module that can convert Esperanto characters.
SYNOPSIS
use Lingua::EO::Supersignoj;
my $transkodigilo = Lingua::EO::Supersignoj->nova(
de => fronte,
al => X,
u => u*
);
print $transkodigilo->transkodigu(Mia ^suoj estas ankau* en la ^cambro.);
# prints: Mia sxuoj estas ankaux en la cxambro.
my $transkodigilo = Lingua::EO::Supersignoj->nova(de => X);
for (qw(X x H h poste fronte apostrofoj iso unikodo)) {
$transkodigilo->al = $_;
print $transkodigilo->transkodigu(
Laux Ludoviko Zamenhof bongustas .
fresxa cxecxa mangxajxo kun spicoj.
);
}
Esperanto has 6 letters that ASCII doesnt have. These characters do exist in Unicode and ISO-8859-3. This object orientated module makes conversion easier.
<<lessSYNOPSIS
use Lingua::EO::Supersignoj;
my $transkodigilo = Lingua::EO::Supersignoj->nova(
de => fronte,
al => X,
u => u*
);
print $transkodigilo->transkodigu(Mia ^suoj estas ankau* en la ^cambro.);
# prints: Mia sxuoj estas ankaux en la cxambro.
my $transkodigilo = Lingua::EO::Supersignoj->nova(de => X);
for (qw(X x H h poste fronte apostrofoj iso unikodo)) {
$transkodigilo->al = $_;
print $transkodigilo->transkodigu(
Laux Ludoviko Zamenhof bongustas .
fresxa cxecxa mangxajxo kun spicoj.
);
}
Esperanto has 6 letters that ASCII doesnt have. These characters do exist in Unicode and ISO-8859-3. This object orientated module makes conversion easier.
Download (0.004MB)
Added: 2006-08-08 License: Perl Artistic License Price:
1172 downloads
Math::String::Charset 1.27
Math::String::Charset is a simple charset for Math::String objects. more>>
Math::String::Charset is a simple charset for Math::String objects.
SYNOPSIS
use Math::String::Charset;
$a = new Math::String::Charset; # default a-z
$b = new Math::String::Charset [a..z]; # same
$c = new Math::String::Charset
{ start => [a..z], sep => }; # with between chars
print $b->length(); # a-z => 26
# construct a charset from bigram table, and an initial set (containing
# valid start-characters)
# Note: After an a, either an b, c or a can follow, in this order
# After an d only an a can follow
$bi = new Math::String::Charset ( {
start => a..d,
bi => {
a => [ b, c, a ],
b => [ c, b ],
c => [ a, c ],
d => [ a, ],
q => [ ], # q will be automatically in end
}
end => [ a, b, ],
} );
print $bi->length(); # a,b => 2 (cross of end and start)
print scalar $bi->class(2); # count of combinations with 2 letters
# will be 3+2+2+1 => 8
$d = new Math::String::Charset ( { start => [a..z],
minlen => 2, maxlen => 4, } );
print $d->first(0),"n"; # undef, too short
print $d->first(1),"n"; # undef, to short
print $d->first(2),"n"; # aa
$d = new Math::String::Charset ( { start => [a..z] } );
print $d->first(0),"n"; #
print $d->first(1),"n"; # a
print $d->last(1),"n"; # z
print $d->first(2),"n"; # aa
This module lets you create an charset object, which is used to contruct Math::String objects. This object knows how to handle simple charsets as well as complex onex consisting of bi-grams (later tri and more).
In case of more complex charsets, a reference to a Math::String::Charset::Nested or Math::String::Charset::grouped will be returned.
The default charset is the set containing "abcdefghijklmnopqrstuvwxyz" (thus producing always lower case output).
<<lessSYNOPSIS
use Math::String::Charset;
$a = new Math::String::Charset; # default a-z
$b = new Math::String::Charset [a..z]; # same
$c = new Math::String::Charset
{ start => [a..z], sep => }; # with between chars
print $b->length(); # a-z => 26
# construct a charset from bigram table, and an initial set (containing
# valid start-characters)
# Note: After an a, either an b, c or a can follow, in this order
# After an d only an a can follow
$bi = new Math::String::Charset ( {
start => a..d,
bi => {
a => [ b, c, a ],
b => [ c, b ],
c => [ a, c ],
d => [ a, ],
q => [ ], # q will be automatically in end
}
end => [ a, b, ],
} );
print $bi->length(); # a,b => 2 (cross of end and start)
print scalar $bi->class(2); # count of combinations with 2 letters
# will be 3+2+2+1 => 8
$d = new Math::String::Charset ( { start => [a..z],
minlen => 2, maxlen => 4, } );
print $d->first(0),"n"; # undef, too short
print $d->first(1),"n"; # undef, to short
print $d->first(2),"n"; # aa
$d = new Math::String::Charset ( { start => [a..z] } );
print $d->first(0),"n"; #
print $d->first(1),"n"; # a
print $d->last(1),"n"; # z
print $d->first(2),"n"; # aa
This module lets you create an charset object, which is used to contruct Math::String objects. This object knows how to handle simple charsets as well as complex onex consisting of bi-grams (later tri and more).
In case of more complex charsets, a reference to a Math::String::Charset::Nested or Math::String::Charset::grouped will be returned.
The default charset is the set containing "abcdefghijklmnopqrstuvwxyz" (thus producing always lower case output).
Download (0.060MB)
Added: 2006-06-16 License: GPL (GNU General Public License) Price:
1226 downloads
Lingua::Phonology::Segment 0.32
Lingua::Phonology::Segment is a module to represent a segment as a bundle of feature values. more>>
Lingua::Phonology::Segment is a module to represent a segment as a bundle of feature values.
SYNOPSIS
use Lingua::Phonology;
$phono = new Lingua::Phonology;
# Define a feature set
$features = $phono->features;
$features->loadfile;
# Make a segment
$segment = $phono->segment;
# Set some values
$segment->labial(1);
$segment->continuant(0);
$segment->voice(1);
# Segment is now voiced labial stop, i.e. [b]
# Reset the segment
$segment->clear;
A Lingua::Phonology::Segment object provides a programmatic representation of a linguistic segment. Such a segment is associated with a Lingua::Phonology::Features object that lists the available features and the relationships between them. The segment itself is a list of the values for those features.
This module provides methods for returning and setting these feature values. A segment may also be associated with a Lingua::Phonology::Symbols object, which allows the segment to return the symbol that it best matches.
OVERLOADING
As of Lingua::Phonology v0.32 (Lingua::Phonology::Segment v0.4), string conversion of segments is overloaded. When you use a Lingua::Phonology::Segment in string context, the spell() method is automatically called, and the representation of the segment from the current symbolset is returned. String comparison operators (cmp eq ne lt le gt ge) are also overloaded. Therefore, the following work correctly, assuming that you have a Lingua::Phonology object correctly set up in $phono.
my ($b, $k) = $phono->symbols->segment(b, k);
print "Segments: $b, $kn"; # Prints "Segments: b, k";
print "$b is greater than $kn" if $b gt $k; # Wont print
print "$b is less than $kn" if $b lt $k; # Prints b is less than k;
print "$b is equal to $kn" if $b eq $k; # Wont print
my $b2 = $b->duplicate;
print "$b is equal to $b2n" if $b eq $b2 # Prints b is equal to b;
Note that stringification is not overloaded if the symbolset is not properly set. However, it turns on as soon as a symbolset is available:
my $b = Lingua::Phonology::Segment->new($features);
$b->voice(1);
$b->labial(1);
print "$bn"; # Prints Lingua::Phonology::Segment=HASH(0x88af598)
# or something similar, because there is no symbolset
# defined for spelling the segment.
$b->symbolset($symbols);
print "$bn"; # Prints b
Number conversion is not overloaded.
<<lessSYNOPSIS
use Lingua::Phonology;
$phono = new Lingua::Phonology;
# Define a feature set
$features = $phono->features;
$features->loadfile;
# Make a segment
$segment = $phono->segment;
# Set some values
$segment->labial(1);
$segment->continuant(0);
$segment->voice(1);
# Segment is now voiced labial stop, i.e. [b]
# Reset the segment
$segment->clear;
A Lingua::Phonology::Segment object provides a programmatic representation of a linguistic segment. Such a segment is associated with a Lingua::Phonology::Features object that lists the available features and the relationships between them. The segment itself is a list of the values for those features.
This module provides methods for returning and setting these feature values. A segment may also be associated with a Lingua::Phonology::Symbols object, which allows the segment to return the symbol that it best matches.
OVERLOADING
As of Lingua::Phonology v0.32 (Lingua::Phonology::Segment v0.4), string conversion of segments is overloaded. When you use a Lingua::Phonology::Segment in string context, the spell() method is automatically called, and the representation of the segment from the current symbolset is returned. String comparison operators (cmp eq ne lt le gt ge) are also overloaded. Therefore, the following work correctly, assuming that you have a Lingua::Phonology object correctly set up in $phono.
my ($b, $k) = $phono->symbols->segment(b, k);
print "Segments: $b, $kn"; # Prints "Segments: b, k";
print "$b is greater than $kn" if $b gt $k; # Wont print
print "$b is less than $kn" if $b lt $k; # Prints b is less than k;
print "$b is equal to $kn" if $b eq $k; # Wont print
my $b2 = $b->duplicate;
print "$b is equal to $b2n" if $b eq $b2 # Prints b is equal to b;
Note that stringification is not overloaded if the symbolset is not properly set. However, it turns on as soon as a symbolset is available:
my $b = Lingua::Phonology::Segment->new($features);
$b->voice(1);
$b->labial(1);
print "$bn"; # Prints Lingua::Phonology::Segment=HASH(0x88af598)
# or something similar, because there is no symbolset
# defined for spelling the segment.
$b->symbolset($symbols);
print "$bn"; # Prints b
Number conversion is not overloaded.
Download (0.098MB)
Added: 2006-06-02 License: Perl Artistic License Price:
1239 downloads
Net::Akismet 0.02
Net::Akismet is a Perl interface to Akismet - comment and trackback spam fighter. more>>
Net::Akismet is a Perl interface to Akismet - comment and trackback spam fighter.
SYNOPSIS
my $akismet = Net::Akismet->new(
KEY => secret-baba-API-key,
URL => http://example.blog.net/,
) or die(Key verification failure!);
my $verdict = $akismet->check(
USER_IP => 10.10.10.11,
COMMENT_CONTENT => Run, Lola, Run, the spam will catch you!,
COMMENT_AUTHOR => dosser,
COMENT_AUTHOR_EMAIL => dosser@subway.de,
REFERRER => http://lola.home/,
) or die(Is the server here?);
if (true eq $verdict) {
print "I found spam. I am a spam-founder!n";
}
<<lessSYNOPSIS
my $akismet = Net::Akismet->new(
KEY => secret-baba-API-key,
URL => http://example.blog.net/,
) or die(Key verification failure!);
my $verdict = $akismet->check(
USER_IP => 10.10.10.11,
COMMENT_CONTENT => Run, Lola, Run, the spam will catch you!,
COMMENT_AUTHOR => dosser,
COMENT_AUTHOR_EMAIL => dosser@subway.de,
REFERRER => http://lola.home/,
) or die(Is the server here?);
if (true eq $verdict) {
print "I found spam. I am a spam-founder!n";
}
Download (0.004MB)
Added: 2007-03-23 License: Perl Artistic License Price:
945 downloads
Lingua::Phonology::Rules 0.32
Lingua::Phonology::Rules is a Perl module for defining and applying phonological rules. more>>
Lingua::Phonology::Rules is a Perl module for defining and applying phonological rules.
SYNOPSIS
use Lingua::Phonology;
$phono = new Lingua::Phonology;
$rules = $phono->rules;
# Adding and manipulating rules is discussed in the "WRITING RULES"
# section
This module allows for the creation of linguistic rules, and the application of those rules to "words" of Segment objects. You, the user, add rules to a Rules object, defining various parameters and code references that actually perform the action of the rule. Lingua::Phonology::Rules will take care of the guts of applying and creating rules.
The rules you create may have the following parameters. This is just a brief description of the parameters--a more detailed discussion of their effect is in the "WRITING RULES" section.
domain
Defines the domain within which the rule applies. This should be the name of a feature in the featureset of the segments which the rule is applied to.
tier
Defines the tier on which the rule applies. Must be the name of a feature in the feature set for the segments of the word you pass in.
direction
Defines the direction that the rule applies in. Must be either leftward or rightward. If no direction is given, defaults to rightward.
filter
Defines a filter for the segments that the rule applies on. Must a code reference that returns a truth value.
linguistic
Defines a linguistic-style rule to be parsed. When you provide a linguistic-style rule, it is parsed into code references that take the place of the where and do properties listed below. The format of linguistic rules is described in "LINGUISTIC-STYLE RULES" in Lingua::Phonology::FileFormatPOD.
where - defines the condition or conditions where the rule applies. Must be a coderef that returns a truth value. If no value is given, defaults to always true.
do - defines the action to take when the where condition is met. Must be a code reference. If no value is given, does nothing.
result - EXPERIMENTAL. Defines a condition that must be true after the do code has applied. Must be a code reference that returns a truth value. NOTE: This parameter depends on the module Whatif (available from CPAN), and will behave differently if this module is not present. See "Using result".
Lingua::Phonology::Rules is flexible and powerful enough to handle any sequential type of rule system. It cannot handle Optimality Theory-style processes, because those require a fundamentally different kind of algorithm.
<<lessSYNOPSIS
use Lingua::Phonology;
$phono = new Lingua::Phonology;
$rules = $phono->rules;
# Adding and manipulating rules is discussed in the "WRITING RULES"
# section
This module allows for the creation of linguistic rules, and the application of those rules to "words" of Segment objects. You, the user, add rules to a Rules object, defining various parameters and code references that actually perform the action of the rule. Lingua::Phonology::Rules will take care of the guts of applying and creating rules.
The rules you create may have the following parameters. This is just a brief description of the parameters--a more detailed discussion of their effect is in the "WRITING RULES" section.
domain
Defines the domain within which the rule applies. This should be the name of a feature in the featureset of the segments which the rule is applied to.
tier
Defines the tier on which the rule applies. Must be the name of a feature in the feature set for the segments of the word you pass in.
direction
Defines the direction that the rule applies in. Must be either leftward or rightward. If no direction is given, defaults to rightward.
filter
Defines a filter for the segments that the rule applies on. Must a code reference that returns a truth value.
linguistic
Defines a linguistic-style rule to be parsed. When you provide a linguistic-style rule, it is parsed into code references that take the place of the where and do properties listed below. The format of linguistic rules is described in "LINGUISTIC-STYLE RULES" in Lingua::Phonology::FileFormatPOD.
where - defines the condition or conditions where the rule applies. Must be a coderef that returns a truth value. If no value is given, defaults to always true.
do - defines the action to take when the where condition is met. Must be a code reference. If no value is given, does nothing.
result - EXPERIMENTAL. Defines a condition that must be true after the do code has applied. Must be a code reference that returns a truth value. NOTE: This parameter depends on the module Whatif (available from CPAN), and will behave differently if this module is not present. See "Using result".
Lingua::Phonology::Rules is flexible and powerful enough to handle any sequential type of rule system. It cannot handle Optimality Theory-style processes, because those require a fundamentally different kind of algorithm.
Download (0.097MB)
Added: 2007-07-16 License: Perl Artistic License Price:
831 downloads
dvdmenuauthor 0.02
dvdmenuauthor makes it easy and efficient to author a DVD with menus in an indirect (non-WYSIWYG) way. more>>
dvdmenuauthor project makes it easy and efficient to author a DVD with menus in an indirect (non-WYSIWYG) way. An XML project file drives the DVD authoring, from which both menus and a dvdauthor XML file are generated.
dvdauthor and spumux are then used to author the DVD filesystem. Menu items (buttons and static items such as images and text) can be specified conscisely in the project XML file with LaTeX markup (to be processed by pdfLaTeX and rendered by xpdf).
<<lessdvdauthor and spumux are then used to author the DVD filesystem. Menu items (buttons and static items such as images and text) can be specified conscisely in the project XML file with LaTeX markup (to be processed by pdfLaTeX and rendered by xpdf).
Download (0.097MB)
Added: 2007-04-17 License: GPL (GNU General Public License) Price:
921 downloads
TechnoBar 0.02
TechnoBar is an extension which provides the ultimate Technorati reverse browsing experience. more>>
TechnoBar is an extension which provides the ultimate Technorati reverse browsing experience.
Techno(rati)Bar provides the ultimate "reverse" browsing experience by showing live Technorati content as sidebar on the web pages youre browsing.
TechnoBar will show blogs that link to the current site while browsing and open blogs by clicking on the sidebar entry.
<<lessTechno(rati)Bar provides the ultimate "reverse" browsing experience by showing live Technorati content as sidebar on the web pages youre browsing.
TechnoBar will show blogs that link to the current site while browsing and open blogs by clicking on the sidebar entry.
Download (0.019MB)
Added: 2007-04-18 License: MPL (Mozilla Public License) Price:
919 downloads
Lingua::ID::Nums2Words 0.01
Lingua::ID::Nums2Words is a Perl module to convert number to Indonesian verbage. more>>
Lingua::ID::Nums2Words is a Perl module to convert number to Indonesian verbage.
SYNOPSIS
use Lingua::ID::Nums2Words ;
print nums2words(123) ; # "seratus dua puluh tiga"
print nums2words_simple(123) ; # "satu dua tiga"
nums2words currently can handle real numbers in normal and scientific form in the order of hundreds of trillions. It also preserves formatting in the number string (e.g, given "1.00" nums2words will pronounce the zeros).
<<lessSYNOPSIS
use Lingua::ID::Nums2Words ;
print nums2words(123) ; # "seratus dua puluh tiga"
print nums2words_simple(123) ; # "satu dua tiga"
nums2words currently can handle real numbers in normal and scientific form in the order of hundreds of trillions. It also preserves formatting in the number string (e.g, given "1.00" nums2words will pronounce the zeros).
Download (0.003MB)
Added: 2006-08-23 License: Perl Artistic License Price:
1157 downloads
Lingua::EN::WordsToNumbers 0.11
Lingua::EN::WordsToNumbers is Perl module to convert numbers written in English to actual numbers. more>>
Lingua::EN::WordsToNumbers is Perl module to convert numbers written in English to actual numbers.
This module is an OO wrapper for Lingua::EN::Words2Nums, which is very nice, but has an unfortunate name (which doesnt help when youre searching CPAN for modules that turn words into numbers).
SYNOPSIS
use Lingua::EN::WordsToNumbers;
my $numberizer = Lingua::EN::WordsToNumbers->new;
print $numberizer->parse("Forty-two"); # "42"
$numberizer->debug(1); # debug on
$numberizer->debug(undef); # debug off
$numberizer->billion(10**12); # use "English billion"
METHODS
new
my $numberizer = Lingua::EN::WordsToNumbers->new;
Gives you a new "object". Doesnt really do much.
parse
print $numberizer->parse("Forty-two"); # "42"
An alias for Lingua::EN::Words2Numss words2nums method. Read that modules documentation to see how it works. You can also call this method as words2nums if you really want to.
debug
$numberizer->debug(1);
Set Lingua::EN::Words2Nums debug mode. True values are on, anything else is off.
billion
$numberizer->billion(10**12);
Define the value of a billion ($Lingua::EN::Words2Nums::billion).
<<lessThis module is an OO wrapper for Lingua::EN::Words2Nums, which is very nice, but has an unfortunate name (which doesnt help when youre searching CPAN for modules that turn words into numbers).
SYNOPSIS
use Lingua::EN::WordsToNumbers;
my $numberizer = Lingua::EN::WordsToNumbers->new;
print $numberizer->parse("Forty-two"); # "42"
$numberizer->debug(1); # debug on
$numberizer->debug(undef); # debug off
$numberizer->billion(10**12); # use "English billion"
METHODS
new
my $numberizer = Lingua::EN::WordsToNumbers->new;
Gives you a new "object". Doesnt really do much.
parse
print $numberizer->parse("Forty-two"); # "42"
An alias for Lingua::EN::Words2Numss words2nums method. Read that modules documentation to see how it works. You can also call this method as words2nums if you really want to.
debug
$numberizer->debug(1);
Set Lingua::EN::Words2Nums debug mode. True values are on, anything else is off.
billion
$numberizer->billion(10**12);
Define the value of a billion ($Lingua::EN::Words2Nums::billion).
Download (0.003MB)
Added: 2006-08-23 License: Perl Artistic License Price:
1158 downloads
Lingua::EN::VarCon 1.00
Lingua::EN::VarCon is a Perl module that provides access to the VarCon (Variant Conversion Info). more>>
VarCon is a data set provided as part of the Word List project.
It contains a number of lists and tables of words that can be used to help automatically convert the English language between the differing spellings of its local dialects (American, British, Canadian).
In an age where Spanish application translations are common, but finding a British, Canadian or Australian translation (as used by FAR more people than American English) can be almost impossible, and when "English" on the internet is increasingly seeming to be "American", it is hoped that by providing access to the raw data in a variety of different ways (optimised for different application types with different load profiles) it will encourage developers to integrate support for automated dialect translation into internationalisation toolkits, and thus into many applications.
METHODS
For this initial release, only methods to locate the files are provided.
Additional access methods will be provided later, or on request.
abbc_file
The abbc_file method returns the location of the abbc.tab file from the VarCon data set.
also_file
The also_file method returns the location of the variant-also.tab file from the VarCon data set.
infl_file
The infl_file method returns the location of the variant-infl.tab file from the VarCon data set.
wroot_file
The infl_file method returns the location of the variant-wroot.tab file from the VarCon data set.
voc_file
The voc_file method returns the location of the voc.tab file from the VarCon data set.
Download (0.18MB)
Added: 2007-02-22 License: Perl Artistic License Price:
974 downloads
Secleted [ 0 ] software to compare
Copyright Notice:
Software piracy is theft, Using crack, password, serial numbers, registration codes, key generators is illegal and prevent future software development. The above lingua ru charset 0.02 search only lists software in full, demo and trial versions for free download. Download links are directly from our mirror sites or publisher sites, torrent files or links from rapidshare.com, yousendit.com or megaupload.com are not allowed