cpdetector 1.05
Sponsored Links
cpdetector 1.05 Ranking & Summary
File size:
0.70 MB
Platform:
Any Platform
License:
MPL (Mozilla Public License)
Price:
Downloads:
925
Date added:
2007-04-21
Publisher:
Achim Westermann
cpdetector 1.05 description
cpdetector project is a small yet clever framework for codepage detection.
cpdetector is a small yet clever framework for codepage detection that integrates different strategies. It may be used as a library for third party software that accesses textual data over network.
It also includes a best-practice implementation in form of a command line tool that allows sorting and transforming large collections of documents based on their codepage.
Available strategies include: jchardet (exclusion, frequency analysis, and guessing), detection of the HTML charset property, and detection of the XML encoding declaration.
What is a code page?
At first, a textual document is nothing more than sequences of bits. A computer has to decide, how he can display this data in form of characters (which are identified by the computer as numbers).
A code page - which is also known as charset encoding - maps the raw data of a textual document to characters. The original ASCII code page for example only uses 7 bits of an octet (byte) for deciding the character that is represented thus allowing only to map 128 different characters. In the past memory was expensive and computers most often only had registers and busses for 8 bit.
When a mainframe was conceived it had to be decided, which characters it should support. Physicians and mathematicians for example needed special characters for equations. As a result, a computer often shipped with a special codepage.
cpdetector is a small yet clever framework for codepage detection that integrates different strategies. It may be used as a library for third party software that accesses textual data over network.
It also includes a best-practice implementation in form of a command line tool that allows sorting and transforming large collections of documents based on their codepage.
Available strategies include: jchardet (exclusion, frequency analysis, and guessing), detection of the HTML charset property, and detection of the XML encoding declaration.
What is a code page?
At first, a textual document is nothing more than sequences of bits. A computer has to decide, how he can display this data in form of characters (which are identified by the computer as numbers).
A code page - which is also known as charset encoding - maps the raw data of a textual document to characters. The original ASCII code page for example only uses 7 bits of an octet (byte) for deciding the character that is represented thus allowing only to map 128 different characters. In the past memory was expensive and computers most often only had registers and busses for 8 bit.
When a mainframe was conceived it had to be decided, which characters it should support. Physicians and mathematicians for example needed special characters for equations. As a result, a computer often shipped with a special codepage.
cpdetector 1.05 Screenshot
cpdetector 1.05 Keywords
codepage detection
clever framework
cpdetector
detection
Codepage
framework
clever
characters
cpdetector 1.05
Information Management
Miscellaneous
Bookmark cpdetector 1.05
cpdetector 1.05 Copyright
WareSeeker periodically updates pricing and software information of cpdetector 1.05 full version from the publisher, so some information may be slightly out-of-date. You should confirm all information before relying on it. Software piracy is theft, Using crack, password, serial numbers, registration codes, key generators is illegal and prevent future development of cpdetector 1.05 Edition. Download links are directly from our publisher sites, torrent files or links from rapidshare.com, yousendit.com or megaupload.com are not allowed
Featured Software
Want to place your software product here?
Please contact us for consideration.
Contact WareSeeker.com
Related Information
detection dogs
early pregnancy detection
asp codepage
detection times
codepage 1252
detection logic
detection software
american leak detection
codepage 950
oil leak detection
detection dog training
detection devices
intrusion detection
detection limit
detection of ovarian cancer
codepage 65001
unicode codepage
virus detection
Related Software
Pod::TOC is a Perl module to extract a table of contents from a Pod file. Free Download
Peng project consists of an AOL Linux dialer. Free Download
Pod::Perldoc::ToToc is a Perl module to translate Pod to a Table of Contents. Free Download
Data::Walker is a tool for navigating through Perl data structures. Free Download
Test::Cmd is a Perl module for portable testing of commands and scripts. Free Download
NetCARD Config project helps linux users to configure network cards for two ip one for DSL network one for Local Network. Free Download
Object adapter for ODBC Free Download
Plosxom is a blogsoftware written in PHP by Pali Dhar. Free Download
Latest Software
Popular Software
Favourite Software