unicode data browser 1.5

Unicode Data Browser 1.5
UnicodeDataBrowser is a very useful browser designed for the UnicodeData.txt file which consists of much useful information but is not easily read by humans. more>>
UnicodeDataBrowser 1.5 is a very useful browser designed for the UnicodeData.txt file which consists of much useful information but is not easily read by humans. The browser creates a scrollable table in which columns represent properties.
The table may be sorted on any column. Abbreviations are expanded and characters cross-referenced in decomposition and casing fields are named. Regular expression search restricted to a selected column is available. The set of characters for which information is displayed may be restricted to those characters matching a regular expression on a specified property.
Each such filtering operation applies to the output of the previous filtering operation unless the table is reset to the original full set of characters, so filtering on multiple properties is possible.
Enhancements: Adds canonical decomposition info for Hangul syllables.
<<lessICD Browser 0.1
ICD Browser for the ICD-10 codes from W.H.O. more>>
This program is an easy to use browser with search capability for the International Classification of Diseases (ICD) codes, as they are published from the W.H.O.
Main features:
- Three versions, for Windows (XP, 2000, 98), Linux and Pocket PC (WM2003, WM5).
- Two ways for displaying codes, Grouped all together into a treebox, or separeted into chapters, sections and subsections.
- Search codes.
- Easy to use interface.
KDE CIM Browser 0.3
The KDE CIM Browser is a KDE based frontend to management brokers using the CIM-XML protocol. more>>
The tool will be useful to view and modify CIM class hierarchies as well as instance and association structures.
Enhancements:
- The latest release of kim-browser is mainly a maintenance release.
- One new feature is namespace browsing, which allows you to pick the namespace to navigate in once connected to a CIMOM.
- In addition to this, some bugfixes have been incorporated and the visual appearance has been improved a bit.
Unicode Error Detector 1.0
Unicode Error Detector is a product for Plone used to pinpoint errors in your application leading to UnicodeDecodeErrors. more>>
Do not use this product unless you are actively debugging a Unicode Error. Never use this product in production sites.
UnicodeDecodeErrors typically occur when you try to add a Unicode string to a non-ascii string. This product patches StringIO used by page templates to check if the appended string is a Unicode string, and if it is, it replaces the string with an error marker.
As there is some overhead associated with inspecting the strings instead of just appending to the output, this product is meant for debugging purposes only.
Usage
Put the product in your Products directory and restart Zope. Load the template causing the UnicodeDecodeError, and this tool will indicate the location by printing THIS IS WHERE THE ERROR IS in the rendered template.
You can then inspect the template and/or code more closely to figure out where the decode error happens.
The Modular Manual Browser 1.2
The Modular Manual Browser is a set scripts designed as a man/apropos work-alike. more>>
It is easy to set up and includes highlighting, linking support in man pages, browsing and searching of pages, categories, and manuals.
It can also optionally set up a database containing descriptions of pages from the page titles.
Enhancements:
- BUGS, COPYING, INSTALL, INSTALL.roff, Makefile, README, README.roff, TODO.sh, config.php, index.php, mandb.php, modfunc.php, modman.php, api/files.php, api/modfunc.php, api/pages.php, api/whatis.php, install/BUGS, install/COPYING, install/INSTALL, install/INSTALL.roff, install/Makefile, install/README, install/README.roff, install/TODO.sh, tmp/.local: api split up, install data moved, so that modman may be dropped directly into webspace. mandb.php can pick up multi-line descriptions now.
Apropos results can be filtered by Section or Manual, but not Page (obviously).
whatis pseudo-database implemented, enabled by default in config.
Sections include Subsections (3->3ucb, 3ucb->3ucblib). Local Apropos and description support added via apropos/whatis programs.
- Release 1.2 -- The Small-Box/WhatIS Release.
Cheetah web browser 0.10
Cheetah web browser has been started to create a fully functional. more>> The cheetah web browser is a project that has been started to create a fully functional, light-weight, bloat-free web browser for Linux (and other free unix clones) that is not dependant on KDE, GNOME or Mozilla.
Why another web browser?
There are a lot of web browsers available for Unix and all of them that I have tried are great. However, I have yet to find one that truly satisfies my needs.
This is what I require in a web browser:
* It must not require another more developed web browser, such as mozilla, to operate. It can certainly use code from another web browser (providing the applications authors permit that), but the code must be included natively in its source, or in a shared library that comes with the browser. For example, I do not want to have mozilla installed to use my browser. I am using my browser not mozilla.
* It must not depend on an entire desktop suite. For example, if I use icewm I should not have to install kde to use my web browser.
* It must be extremely fast and provide an enjoyable browsing experience (duh).
* It must provide only the ability to surf the internet and download files through http and ftp. I do not want my web browser to provide the means to send/receive e-mail, compose web pages, chat on irc etcetera. I have other programs to do that.
When will Cheetah be usable?
I dont have a roadmap layed out yet. I wont have one for awhile. It all depends upon how many people are willing and have the time to contribute.<<less
TVBrowser 2.53
TV-Browser is a TV guide that is easily extensible using plugins. more>>
At the moment, it only loads data in its own binary format, but its possible to write a plugin that loads XMLTV data.
The tv data are retrieved from multiple sources and can be downloaded from our servers after having been automatically processed.
The whole project - both the client and the server part - is distributed under the GNU GPL.
Currently, we offer only channels in german language but we are highly motivated to extend TV-Browser for other countries.
Enhancements:
- Minor bugfixes.
Unicode::Collate 0.52
Unicode::Collate is a Unicode Collation Algorithm. more>>
SYNOPSIS
use Unicode::Collate;
#construct
$Collator = Unicode::Collate->new(%tailoring);
#sort
@sorted = $Collator->sort(@not_sorted);
#compare
$result = $Collator->cmp($a, $b); # returns 1, 0, or -1.
# If %tailoring is false (i.e. empty),
# $Collator should do the default collation.
This module is an implementation of Unicode Technical Standard #10 (a.k.a. UTS #10) - Unicode Collation Algorithm (a.k.a. UCA).
Constructor and Tailoring
The new method returns a collator object.
$Collator = Unicode::Collate->new(
UCA_Version => $UCA_Version,
alternate => $alternate, # deprecated: use of variable is recommended.
backwards => $levelNumber, # or @levelNumbers
entry => $element,
hangul_terminator => $term_primary_weight,
ignoreName => qr/$ignoreName/,
ignoreChar => qr/$ignoreChar/,
katakana_before_hiragana => $bool,
level => $collationLevel,
normalization => $normalization_form,
overrideCJK => &overrideCJK,
overrideHangul => &overrideHangul,
preprocess => &preprocess,
rearrange => @charList,
table => $filename,
undefName => qr/$undefName/,
undefChar => qr/$undefChar/,
upper_before_lower => $bool,
variable => $variable,
);
Yukatan data model 1.0
Yukatan data model project is the schema definition of the Yukatan webmail database. more>>
The PostgreSQL database structures defined in this file can be used as a backend store of an email message handling application. The database should be created with the "UNICODE" encoding to properly support messages in different languages.
New data types
The special data types commonly used in the Yukatan data model have been made explicit by the introduction of seven new domains. The domains and the related COMMENT statements make field semantics more clear than before.
See the SQL schema file for more detailed documentation on these domains.
Explicitly named constraints
All the table constraints in the database are now explicitly named and documented. This change makes the database implementation more orthogonal and cleans up the documentation.
Renamed fields and tables
All the *address field names have been truncated to *addr, to make it visually clearer that they are always paired with the corresponding *name fields. The change also makes parts of the documentation less repetitive.
The referencesfield table has been renamed to referencefield to avoid the plural form in the table name. Also all the contained references* field names have been renamed to reference*.
Semantic changes
Quite a few changes have been made to the semantics of various fields. The unnecessarily tight constraints on sequence numbers have been replaced with clearer documentation, the format and encoding of most fields has been explicitly documented, and the previously allowed dual use of the enttext and enddata fields has been prohibited.
Dropped envelope data
The envelope data added in version 0.5 of the data model has for now been removed. The reason for the removal is that the envelope data is not an integral part of an email message, and I wanted to make the version 1.0 as clear as possible. The database now stores "email messages" - nothing less, nothing more. Envelope data can and probably will be reintroduced in an incremental version 1.x along with other extensions.
Enhancements:
- cleans up and documents the data model that has developed since version 0.1
- removal of the envelope data added in version 0.5
- enaming and redefinition of some of the fields and tables
- database structure has also been extensively documented
Unicode Utilities 2.25
Unicode Utilities project are a set of programs for manipulating and analyzing Unicode text. more>>
unidesc reports the character ranges to which different portions of the text belong. unihist generates a histogram of the characters in its input. ExplicateUTF8 is intended for debugging or for learning about Unicode. It determines and explains the validity of a sequence of bytes as a UTF-8 encoding. unirev reverses UTF-8 strings.
Enhancements:
- Adds to unidesc the option -r which causes it to list the ranges detected after reading all input rather than listing them as they are encountered, and adds to uniname the option -B which causes it to ignore characters within the Basic Multilingual Plane.
Hermes JMS Browser 1.12
Hermes JMS Browser is a console that helps you interact with JMS providers. more>>
Many providers include a plugin that uses the native API to do non-JMS things like getting queue depths (and other statistics) or finding queue and topic names.
Hypertext DOM Browser 1.0.2
Hypertext DOM Browser is a Firefox extension for Web developers and Firefox developers (and JavaScript calculator). more>>
Other features include: shows exceptions, can show several DOM objects (or JavaScript expressions) on one page, shows number of function arguments, automatically updates and emphasizes changed properties (in this version only in unsafe mode), in-page search, back and forward buttons, can use tabbed browsing, displays current expression in window title, etc.
To call the DOM browser select "Tools/Hypetext DOM Browser" in the navigator menu. Type "content" to show the properties of the content window, "content.document" to show properties of the HTML document in the calling window, "self" or "window" will show properties of that chrome window where youve selected "Hypetext DOM Browser" menu item, "document" will show properies of the calling XUL frame.
Unicode::MapUTF8 1.11
Unicode::MapUTF8 is a Perl module with conversions to and from arbitrary character sets and UTF8. more>>
SYNOPSIS
use Unicode::MapUTF8 qw(to_utf8 from_utf8 utf8_supported_charset);
# Convert a string in ISO-8859-1 to UTF8
my $output = to_utf8({ -string => An example, -charset => ISO-8859-1 });
# Convert a string in UTF8 encoding to encoding ISO-8859-1
my $other = from_utf8({ -string => Other text, -charset => ISO-8859-1 });
# List available character set encodings
my @character_sets = utf8_supported_charset;
# Add a character set alias
utf8_charset_alias({ ms-japanese => sjis });
# Convert between two arbitrary (but largely compatible) charset encodings
# (SJIS to EUC-JP)
my $utf8_string = to_utf8({ -string =>$sjis_string, -charset => sjis});
my $euc_jp_string = from_utf8({ -string => $utf8_string, -charset => euc-jp })
# Verify that a specific character set is supported
if (utf8_supported_charset(ISO-8859-1) {
# Yes
}
Provides an adapter layer between core routines for converting to and from UTF8 and other encodings. In essence, a way to give multiple existing Unicode modules a single common interface so you dont have to know the underlaying implementations to do simple UTF8 to-from other character set encoding conversions. As such, it wraps the Unicode::String, Unicode::Map8, Unicode::Map and Jcode modules in a standardized and simple API.
This also provides general character set conversion operation based on UTF8 - it is possible to convert between any two compatible and supported character sets via a simple two step chaining of conversions.
As with most things Perlish - if you give it a few big chunks of text to chew on instead of lots of small ones it will handle many more characters per second.
By design, it can be easily extended to encompass any new charset encoding conversion modules that arrive on the scene.
This module is intended to provide good Unicode support to versions of Perl prior to 5.8. If you are using Perl 5.8.0 or later, you probably want to be using the Encode module instead. This module does work with Perl 5.8, but Encode is the preferred method in that environment.
phpDIR browser 3.1
hpDIR is an easy to use and install script that can list files and folders. more>>
phpDIR browser application also supports admin sessions, to also be able to delete files. In addition, invisible files can be configured. Folders can be downloaded as archive files. There is no need for additional PHP modules on your Web server.
Enhancements:
- On some servers, hidden file support was broken due to double slashes being returned by some functions.
- This has been fixed.
ustring C++ Unicode library 0.5
ustring is a C++ library to implement the complete Unicode 3.0 standard. more>>
ustring C++ Unicode library provides a wide string class which fits with STL, implementing a full Unicode normalizer, uppercase and lowercase transformation, and diacritical mark removal. A Unicode level-1 regular expression class is also provided.