clustering
Sponsored Links
Sponsored Links
Secleted [ 0 ] software to compare
Results 1 - 15 of about 268
Document clustering 0.2
Document clustering project is a data mining suite to cluster a document set. more>>
Document clustering project is a data mining suite to cluster a document set. This set of tools was implemented from a series of papers: "Clustering Web Pages Semantically using Combinatorial Topology", "Data mining using granular computing", and "A fast association rule algorithm based on bitmap and granular computing".
Enhancements:
- A bug with hash table has been fixed.
<<lessEnhancements:
- A bug with hash table has been fixed.
Download (0.060MB)
Added: 2007-05-18 License: GPL (GNU General Public License) Price:
906 downloads
MySQL High Availability clustering Alpha-0.7
MySQL High Availability clustering is a set of scripts and programs that provide a high availability database cluster. more>>
MySQL High Availability clustering is a set of scripts and programs that provide a high availability database cluster using MySQL replication.
MySQL High Availability clustering is transparent to client applications, as the cluster uses a shared logical IP to provide the service.
Enhancements:
- References have been changed from MASTER_NODE to CLUSTER_IP in takeover, failover, and slave_routine, when it was appropiate.
- This release introduces changes in compat.sh, several main cluster files, and the installation documentation, according to bug reports 1707251 and 1707212.
<<lessMySQL High Availability clustering is transparent to client applications, as the cluster uses a shared logical IP to provide the service.
Enhancements:
- References have been changed from MASTER_NODE to CLUSTER_IP in takeover, failover, and slave_routine, when it was appropiate.
- This release introduces changes in compat.sh, several main cluster files, and the installation documentation, according to bug reports 1707251 and 1707212.
Download (0.007MB)
Added: 2007-05-19 License: GPL (GNU General Public License) Price:
889 downloads
Cluster Live 1.0
Cluster Live is a live cd that boots a cluster of diskless Thin Clients. more>>
Cluster Live can do the following:
- Boots a cluster of diskless Thin Clients.
- Automatically loads a Web browser.
- Share workload between Thin Clients.
Why?
- No individual installations required.
- Fast deployment of centralised software over multiple computers.
- No need to access the hard disk of your existing system.
- Portable yet scalable infrastructure.
- Less spending on high end computer hardware such as hard drives, processors and memory.
Who can benefit from it?
- Public kiosk
- Internet cafe
- Home, Office and other intranet premises
- Training centre
- Government sectors, and other organisations requiring large deployment of dedicated applications.
- Ideal for academics in developing countries to make use of computers cost effectively.
- Open Source developers can freely create their own Cluster Live CDs.
How does it work?
- A server gets booted by a Cluster Live CD.
- The clients BIOS is configured to boot through the LAN by default.
- When the server has finished the bootup sequence, the client can be booted through the LAN.
What was it tested with?
- IBM Thinkpad T30 (CPU: Pentium 4 - 1GHz; RAM: 512MB)
- Virtual Machines (Software to allow running of multiple O.S. simultaneously)
- IDE CDRW (To rewrite the ISO image on to the CD for testing)
- A hub with several LAN cables connecting the infrastructure.
- Source built with Red Hat Fedora Core 3 Linux
<<less- Boots a cluster of diskless Thin Clients.
- Automatically loads a Web browser.
- Share workload between Thin Clients.
Why?
- No individual installations required.
- Fast deployment of centralised software over multiple computers.
- No need to access the hard disk of your existing system.
- Portable yet scalable infrastructure.
- Less spending on high end computer hardware such as hard drives, processors and memory.
Who can benefit from it?
- Public kiosk
- Internet cafe
- Home, Office and other intranet premises
- Training centre
- Government sectors, and other organisations requiring large deployment of dedicated applications.
- Ideal for academics in developing countries to make use of computers cost effectively.
- Open Source developers can freely create their own Cluster Live CDs.
How does it work?
- A server gets booted by a Cluster Live CD.
- The clients BIOS is configured to boot through the LAN by default.
- When the server has finished the bootup sequence, the client can be booted through the LAN.
What was it tested with?
- IBM Thinkpad T30 (CPU: Pentium 4 - 1GHz; RAM: 512MB)
- Virtual Machines (Software to allow running of multiple O.S. simultaneously)
- IDE CDRW (To rewrite the ISO image on to the CD for testing)
- A hub with several LAN cables connecting the infrastructure.
- Source built with Red Hat Fedora Core 3 Linux
Download (147.1MB)
Added: 2007-02-21 License: GPL (GNU General Public License) Price:
977 downloads
Gluster 1.2.2 (GlusterFS)
GlusterFS package contains clustered file storage that can scale to peta bytes. more>>
GlusterFS package contains clustered file storage that can scale to peta bytes. GlusterFS is a programmable system. With little thinking, you can even redesign the GlusterFS file system by re-arranging the GlusterFS components using translator interface. It is all achieved through volume specification file. This allows GlusterFS to be flexible for all kinds of storage needs. Even with all these advanced features, GlusterFS is very easy to setup and manage.
Gluster is a GNU cluster distribution aimed at commoditizing Supercomputing and Superstorage. Core of the Gluster provides a platform for developing clustering applications tailored for a specific tasks such as HPC Clustering, Storage Clustering, Enterprise Provisioning, Database Clustering etc.
<<lessGluster is a GNU cluster distribution aimed at commoditizing Supercomputing and Superstorage. Core of the Gluster provides a platform for developing clustering applications tailored for a specific tasks such as HPC Clustering, Storage Clustering, Enterprise Provisioning, Database Clustering etc.
Download (0.26MB)
Added: 2007-01-17 License: GPL (GNU General Public License) Price:
1012 downloads
Other version of Gluster
License:GPL (GNU General Public License)
Bio::ClusterI 1.4
Bio::ClusterI module is a cluster Perl interface. more>>
Bio::ClusterI module is a cluster Perl interface.
SYNOPSIS
# see the implementations of this interface for details but # basically
my $cluster= $cluster->new(-description=>"POLYUBIQUITIN",
-members =>[$seq1,$seq2]);
my @members = $cluster->get_members();
my @sub_members = $cluster->get_members(-species=>"homo sapiens");
This interface is the basic structure for a cluster of bioperl objects. In this case it is up to the implementer to check arguments and initialize whatever new object the implementing class is designed for.
<<lessSYNOPSIS
# see the implementations of this interface for details but # basically
my $cluster= $cluster->new(-description=>"POLYUBIQUITIN",
-members =>[$seq1,$seq2]);
my @members = $cluster->get_members();
my @sub_members = $cluster->get_members(-species=>"homo sapiens");
This interface is the basic structure for a cluster of bioperl objects. In this case it is up to the implementer to check arguments and initialize whatever new object the implementing class is designed for.
Download (4.7MB)
Added: 2007-08-16 License: Perl Artistic License Price:
799 downloads
Rocks Cluster 4.3
Rocks Cluster Tool Kit is a Turnkey Linux COTS Clusters for x86 and IA64. more>>
Rocks Cluster is a complete "cluster on a CD" solution for x86 and IA64 Red Hat Linux COTS clusters.
Building a Rocks cluster does not require any experience in clustering, yet a cluster architect will find a flexible and programmatic way to redesign the entire software stack just below the surface (appropriately hidden from the majority of users).
Although Rocks includes the tools expected from any clustering software stack (PBS, Maui, GM support, Ganglia, etc), it is unique in its simplicity of installation.
From a hardware component and raw processing power perspective, commodity clusters are phenomenal price/performance compute engines. However, if a scalable ``cluster management strategy is not adopted, the favorable economics of clusters are offset by the additional on-going personnel costs involved to ``care and feed for the machine. The complexity of cluster management (e.g., determining if all nodes have a consistent set of software) often overwhelms part-time cluster administrators, who are usually domain application scientists. When this occurs, machine state is forced to either of two extremes: the cluster is not stable due to configuration problems, or software becomes stale, security holes abound, and known software bugs remain unpatched.
While earlier clustering toolkits expend a great deal of effort (i.e., software) to compare configurations of nodes, Rocks makes complete Operating System (OS) installation on a node the basic management tool. With attention to complete automation of this process, it becomes faster to reinstall all nodes to a known configuration than it is to determine if nodes were out of synchronization in the first place. Unlike a users desktop, the OS on a cluster node is considered to be soft state that can be changed and/or updated rapidly.
This is clearly more heavywieght than the philosophy of configuration management tools [Cfengine] that perform exhaustive examination and parity checking of an installed OS. At first glance, it seems wrong to reinstall the OS when a configuration parameter needs to be changed. Indeed, for a single node this might seem too severe. However, this approach scales exceptionally well, making it a preferred mode for even a modest-sized cluster. Because the OS can be installed from scratch in a short period of time, different (and perhaps incompatible) application-specific configurations can easily be installed on nodes. In addition, this structure insures any upgrade will not interfere with actively running jobs.
One of the key ingredients of Rocks is a robust mechanism to produce customized distributions (with security patches pre-applied) that define the complete set of software for a particular node. A cluster may require several node types including compute nodes, frontend nodes file servers, and monitoring nodes. Each of these roles requires a specialized software set. Within a distribution, different node types are defined with a machine specific Red Hat Kickstart file, made from a Rocks Kickstart Graph.
A Kickstart file is a text-based description of all the software packages and software configuration to be deployed on a node. The Rocks Kickstart Graph is an XML-based tree structure used to define RedHat Kickstart files. By using a graph, Rocks can efficiently define node types without duplicating shared components. Similiar to mammalian species sharing 80% of their genes, Rocks node types share much of their software set. The Rocks Kickstart Graph easily defines the differences between node types without duplicating the description of their similarities. See the Bibliography section for papers that describe the design of this structure in more depth.
By leveraging this installation technology, we can abstract out many of the hardware differences and allow the Kickstart process to autodetect the correct hardware modules to load (e.g., disk subsystem type: SCSI, IDE, integrated RAID adapter; Ethernet interfaces; and high-speed network interfaces). Further, we benefit from the robust and rich support that commercial Linux distributions must have to be viable in todays rapidly advancing marketplace.
Wherever possible, Rocks uses automatic methods to determine configuration differences. Yet, because clusters are unified machines, there are a few services that require ``global knowledge of the machine -- e.g., a listing of all compute nodes for the hosts database and queuing system. Rocks uses an SQL database to store the definitions of these global configurations and then generates database reports to create service-specific configuration files (e.g., DHCP configuration file, /etc/hosts, and PBS nodes file).
Enhancements:
- Rocks v4.3 is released for i386 and x86_64 CPU architectures. New features: Rocks command line - initial release of the Rocks command line which facilitates non-SQL administrative access to the database; PXE First - hosts can now be configured in BIOS with a boot order of CD, PXE, hard disk. Enhancements: based on CentOS 4.5 and all updates as of July 4, 2007; Anaconda installer updated to 10.1.1.63; performance improvement when building torrent files for the Avalanche Installer; database indirects, more flexibility with Rocks variables; Globus updated to gt4.0.4 with web services....
<<lessBuilding a Rocks cluster does not require any experience in clustering, yet a cluster architect will find a flexible and programmatic way to redesign the entire software stack just below the surface (appropriately hidden from the majority of users).
Although Rocks includes the tools expected from any clustering software stack (PBS, Maui, GM support, Ganglia, etc), it is unique in its simplicity of installation.
From a hardware component and raw processing power perspective, commodity clusters are phenomenal price/performance compute engines. However, if a scalable ``cluster management strategy is not adopted, the favorable economics of clusters are offset by the additional on-going personnel costs involved to ``care and feed for the machine. The complexity of cluster management (e.g., determining if all nodes have a consistent set of software) often overwhelms part-time cluster administrators, who are usually domain application scientists. When this occurs, machine state is forced to either of two extremes: the cluster is not stable due to configuration problems, or software becomes stale, security holes abound, and known software bugs remain unpatched.
While earlier clustering toolkits expend a great deal of effort (i.e., software) to compare configurations of nodes, Rocks makes complete Operating System (OS) installation on a node the basic management tool. With attention to complete automation of this process, it becomes faster to reinstall all nodes to a known configuration than it is to determine if nodes were out of synchronization in the first place. Unlike a users desktop, the OS on a cluster node is considered to be soft state that can be changed and/or updated rapidly.
This is clearly more heavywieght than the philosophy of configuration management tools [Cfengine] that perform exhaustive examination and parity checking of an installed OS. At first glance, it seems wrong to reinstall the OS when a configuration parameter needs to be changed. Indeed, for a single node this might seem too severe. However, this approach scales exceptionally well, making it a preferred mode for even a modest-sized cluster. Because the OS can be installed from scratch in a short period of time, different (and perhaps incompatible) application-specific configurations can easily be installed on nodes. In addition, this structure insures any upgrade will not interfere with actively running jobs.
One of the key ingredients of Rocks is a robust mechanism to produce customized distributions (with security patches pre-applied) that define the complete set of software for a particular node. A cluster may require several node types including compute nodes, frontend nodes file servers, and monitoring nodes. Each of these roles requires a specialized software set. Within a distribution, different node types are defined with a machine specific Red Hat Kickstart file, made from a Rocks Kickstart Graph.
A Kickstart file is a text-based description of all the software packages and software configuration to be deployed on a node. The Rocks Kickstart Graph is an XML-based tree structure used to define RedHat Kickstart files. By using a graph, Rocks can efficiently define node types without duplicating shared components. Similiar to mammalian species sharing 80% of their genes, Rocks node types share much of their software set. The Rocks Kickstart Graph easily defines the differences between node types without duplicating the description of their similarities. See the Bibliography section for papers that describe the design of this structure in more depth.
By leveraging this installation technology, we can abstract out many of the hardware differences and allow the Kickstart process to autodetect the correct hardware modules to load (e.g., disk subsystem type: SCSI, IDE, integrated RAID adapter; Ethernet interfaces; and high-speed network interfaces). Further, we benefit from the robust and rich support that commercial Linux distributions must have to be viable in todays rapidly advancing marketplace.
Wherever possible, Rocks uses automatic methods to determine configuration differences. Yet, because clusters are unified machines, there are a few services that require ``global knowledge of the machine -- e.g., a listing of all compute nodes for the hosts database and queuing system. Rocks uses an SQL database to store the definitions of these global configurations and then generates database reports to create service-specific configuration files (e.g., DHCP configuration file, /etc/hosts, and PBS nodes file).
Enhancements:
- Rocks v4.3 is released for i386 and x86_64 CPU architectures. New features: Rocks command line - initial release of the Rocks command line which facilitates non-SQL administrative access to the database; PXE First - hosts can now be configured in BIOS with a boot order of CD, PXE, hard disk. Enhancements: based on CentOS 4.5 and all updates as of July 4, 2007; Anaconda installer updated to 10.1.1.63; performance improvement when building torrent files for the Avalanche Installer; database indirects, more flexibility with Rocks variables; Globus updated to gt4.0.4 with web services....
Download (601MB)
Added: 2007-07-07 License: BSD License Price:
511 downloads
OSCAR Cluster 5.0
OSCAR Cluster is a Linux cluster installer based on best known practices. more>>
OSCAR version 4.0 is a snapshot of the best known methods for building, programming, and using clusters. OSCAR Cluster project consists of a fully integrated and easy to install software bundle designed for high performance cluster computing.
Everything needed to install, build, maintain, and use a modest sized Linux cluster is included in the suite, making it unnecessary to download or even install any individual software packages on your cluster.
<<lessEverything needed to install, build, maintain, and use a modest sized Linux cluster is included in the suite, making it unnecessary to download or even install any individual software packages on your cluster.
Download (5.8MB)
Added: 2006-11-12 License: GPL (GNU General Public License) Price:
1088 downloads
Image Cluster 0.1
Image Cluster copies and renames images based on Exif data and file number names. more>>
Image Cluster copies and renames images based on Exif data and file number names. Image Cluster also clusters those images into directories based on a variable sliding window (with a default of 36 hours), which makes it easy to group images based on events without manual intervention.
The inspiration for this program came from recently getting a new Canon SD500 camera to replace my Canon S30 that Id had for years. The upside, the Canon SD500 rocks! The downside, I now have 2 cameras that are burning through the same sequence numbers, so my previous solution of just putting all the files in to directories by the first 2 digits of the sequence numbers was no longer going to work.
Imagecluster solves this problem, plus another grouping problem that Id been thinking about, by extracting the CreateDate and FileNumber exif tags from the images, and using that as the basis of a new filename (typically YYYY:mm:dd_HH:MM:SS_FileNumber.jpg). This ensures that 2 images taken at the same second have an even smaller chance of colliding, as their camera sequence numbers would have to also be the same at that second.
But that is just the first step. I have noticed that I am an occational photographer, so take pictures in bursts, often for a weekend of hanging out with folks, though sometimes for a vacation as well. This got me thinking. What I really needed is a tool that also creates directories that allows for some minimum tollerance between CreateDate, that is used to cluster images. For me, the optimum value seems to be 36 hours, though this is configurable via the command line.
This took me an afternoon to pull together, Im sure it could be smarter, but it is useful enough to post for others to use.
Options:
-d directory
Set the target directory for images. Defaults to /tmp/photos, which is probably not what you want.
-D
Dryrun. Tells you what the program would have done.
-h
Print out help message
-s
Seperator character. It defaults to : (i.e. 2005:10:09...), but is user configurable because my friend Clemens wants to use - (i.e. 2005-10-09) instead.
-t
Set the tollerance for image clustering. This is the maximum time between any 2 pictures in a cluster, which will cause a new cluster to be created. The name of the cluster will be YYYY:MM:DD of the first image in the cluster, even if it spans multiple days. Because this tollerance is the maximum time between any two images in the cluster, it is possible that all images you have ever taken could be in 1 cluster, if you took a picture every day of your life. Hence, this feature isnt useful to everyone. If you are that kind of person, set tollerance to 16 hours or something, and youll tend to get 1 day sized buckets.
-v
Prints verbose output
<<lessThe inspiration for this program came from recently getting a new Canon SD500 camera to replace my Canon S30 that Id had for years. The upside, the Canon SD500 rocks! The downside, I now have 2 cameras that are burning through the same sequence numbers, so my previous solution of just putting all the files in to directories by the first 2 digits of the sequence numbers was no longer going to work.
Imagecluster solves this problem, plus another grouping problem that Id been thinking about, by extracting the CreateDate and FileNumber exif tags from the images, and using that as the basis of a new filename (typically YYYY:mm:dd_HH:MM:SS_FileNumber.jpg). This ensures that 2 images taken at the same second have an even smaller chance of colliding, as their camera sequence numbers would have to also be the same at that second.
But that is just the first step. I have noticed that I am an occational photographer, so take pictures in bursts, often for a weekend of hanging out with folks, though sometimes for a vacation as well. This got me thinking. What I really needed is a tool that also creates directories that allows for some minimum tollerance between CreateDate, that is used to cluster images. For me, the optimum value seems to be 36 hours, though this is configurable via the command line.
This took me an afternoon to pull together, Im sure it could be smarter, but it is useful enough to post for others to use.
Options:
-d directory
Set the target directory for images. Defaults to /tmp/photos, which is probably not what you want.
-D
Dryrun. Tells you what the program would have done.
-h
Print out help message
-s
Seperator character. It defaults to : (i.e. 2005:10:09...), but is user configurable because my friend Clemens wants to use - (i.e. 2005-10-09) instead.
-t
Set the tollerance for image clustering. This is the maximum time between any 2 pictures in a cluster, which will cause a new cluster to be created. The name of the cluster will be YYYY:MM:DD of the first image in the cluster, even if it spans multiple days. Because this tollerance is the maximum time between any two images in the cluster, it is possible that all images you have ever taken could be in 1 cluster, if you took a picture every day of your life. Hence, this feature isnt useful to everyone. If you are that kind of person, set tollerance to 16 hours or something, and youll tend to get 1 day sized buckets.
-v
Prints verbose output
Download (0.010MB)
Added: 2006-02-10 License: GPL (GNU General Public License) Price:
1355 downloads
Cluster SSH 3.19.1
Cluster SSH opens terminal windows with connections to specified hosts and an administration console. more>>
Cluster SSH opens terminal windows with connections to specified hosts and an administration console. Any text typed into the administration console is replicated to all other connected and active windows.
This tool is intended for, but not limited to, cluster administration where the same configuration or commands must be run on each node within the cluster. Performing these commands all at once via this tool ensures all nodes are kept in sync.
Enhancements:
- Check for failure to connect to X session
- Totally rework character mapping and events to cope with non-QWERTY keyboards
- Rework pasting code to cope with non-QWERTY charatcters
- Manpage/help doc updates and corrections
- Check for missing definitions for cluster tags in .csshrc
- Run through perltidy -b -i=2
- Apply patch to add in optional port information from D. Dumont
- Amend hotkey code to not pick up - as default clientname shortcut
- Alter repeat function to improve efficiency
- Rework retiling code
- Add "-e " to evaluate terminal and communcation methods
- Add in toggle option on hosts menu
- Fix check in find_binary to ensure one is actually found
- Search $PATH and other standard places for binaries incase $PATH is incompleteAmend code to allow getting help when no X display available
- Allow override of both key and mouse paste key sequences
- Added icons and desktop file
- Amended clusterssh.spec to cope with icons and desktop file
- Improve cluster file import efficiency as was taking faaar too long previouslyFixed bug whereby when pids of the xterm changes records were not updated
- Do not die when pipe open fails, but continue as others may be connected
- Remove code that breaks the minimize/maximise stuff;
- Catch X button presses on title bar to close all windows correctly
- Delay map event capture at program start to avoid infinite loop
- Fix execvp error on Solaris 10
<<lessThis tool is intended for, but not limited to, cluster administration where the same configuration or commands must be run on each node within the cluster. Performing these commands all at once via this tool ensures all nodes are kept in sync.
Enhancements:
- Check for failure to connect to X session
- Totally rework character mapping and events to cope with non-QWERTY keyboards
- Rework pasting code to cope with non-QWERTY charatcters
- Manpage/help doc updates and corrections
- Check for missing definitions for cluster tags in .csshrc
- Run through perltidy -b -i=2
- Apply patch to add in optional port information from D. Dumont
- Amend hotkey code to not pick up - as default clientname shortcut
- Alter repeat function to improve efficiency
- Rework retiling code
- Add "-e " to evaluate terminal and communcation methods
- Add in toggle option on hosts menu
- Fix check in find_binary to ensure one is actually found
- Search $PATH and other standard places for binaries incase $PATH is incompleteAmend code to allow getting help when no X display available
- Allow override of both key and mouse paste key sequences
- Added icons and desktop file
- Amended clusterssh.spec to cope with icons and desktop file
- Improve cluster file import efficiency as was taking faaar too long previouslyFixed bug whereby when pids of the xterm changes records were not updated
- Do not die when pipe open fails, but continue as others may be connected
- Remove code that breaks the minimize/maximise stuff;
- Catch X button presses on title bar to close all windows correctly
- Delay map event capture at program start to avoid infinite loop
- Fix execvp error on Solaris 10
Download (0.036MB)
Added: 2006-07-26 License: GPL (GNU General Public License) Price:
1202 downloads
SenseClusters 0.95
SenseClusters is a natural language processing package that allows you to cluster similar contexts or to identify clusters. more>>
SenseClusters is a natural language processing package that allows you to cluster similar contexts or to identify clusters of related words.
SenseClusters supports its own native methods based on first and second order representations of context, and also supports Latent Semantic Analysis. It is fully unsupervised, and can automatically discover the optimal number of clusters in your text.
SenseClusters is a complete system that takes users from preprocessing of raw text to providing clustered output.
Enhancements:
- Full support for Latent Semantic Analysis was introduced.
- Both contexts and words may be clustered using either native SenseClusters methods (first or second order) or Latent Semantic Analysis.
<<lessSenseClusters supports its own native methods based on first and second order representations of context, and also supports Latent Semantic Analysis. It is fully unsupervised, and can automatically discover the optimal number of clusters in your text.
SenseClusters is a complete system that takes users from preprocessing of raw text to providing clustered output.
Enhancements:
- Full support for Latent Semantic Analysis was introduced.
- Both contexts and words may be clustered using either native SenseClusters methods (first or second order) or Latent Semantic Analysis.
Download (20.1MB)
Added: 2006-08-29 License: GPL (GNU General Public License) Price:
1153 downloads
Algorithm::Cluster 1.35
Algorithm::Cluster is a Perl interface to the C Clustering Library. more>>
Algorithm::Cluster is a Perl interface to the C Clustering Library.
This module is an interface to the C Clustering Library, a general purpose library implementing functions for hierarchical clustering (pairwise simple, complete, average, and centroid linkage), along with k-means and k-medians clustering, and 2D self-organizing maps.
This library was developed at the Human Genome Center of the University of Tokyo. The C Clustering Library is distributed along with Cluster 3.0, an enhanced version of the famous Cluster program originally written by Michael Eisen while at Stanford University.
<<lessThis module is an interface to the C Clustering Library, a general purpose library implementing functions for hierarchical clustering (pairwise simple, complete, average, and centroid linkage), along with k-means and k-medians clustering, and 2D self-organizing maps.
This library was developed at the Human Genome Center of the University of Tokyo. The C Clustering Library is distributed along with Cluster 3.0, an enhanced version of the famous Cluster program originally written by Michael Eisen while at Stanford University.
Download (0.048MB)
Added: 2007-05-16 License: Perl Artistic License Price:
894 downloads
BioCluster 0.1 Beta
BioCluster is a peer-to-peer clustering platform for Asterisk, the open source PBX. more>>
BioCluster is a peer-to-peer clustering platform for Asterisk, the open source PBX, which allows Asterisk to be used as a full carrier-grade telephony solution. This project is meant to be installed on several machines together with Asterisk, turning them into a VoIP cluster.
While the BioCluster peer-to-peer protocol was initially designed to cater to Asterisk-based clustering solutions, the BioCluster framework is capable of being extended to support various forms of normally unclustered devices or software packages.
<<lessWhile the BioCluster peer-to-peer protocol was initially designed to cater to Asterisk-based clustering solutions, the BioCluster framework is capable of being extended to support various forms of normally unclustered devices or software packages.
Download (1.1MB)
Added: 2007-07-24 License: GPL (GNU General Public License) Price:
823 downloads
Linux Cluster Manager 2.75-1
Linux Cluster Manager is a graphical tool for managing multiple Linux systems from a central location. more>>
Linux Cluster Manager is a graphical tool for managing multiple Linux systems from a central location. Meant primarily for Beowulf style clusters, it has many useful features for general system administration and even some non Linux specific possibilities like block level imaging.
Imaging with LCM is quick and easy as each node has its IP and hostname changed ready for boot as an independant node preloaded with all of your applications. Using a file based image, items can be changed such as the target device, the file system capacity, or even the file system type. Imagine requiring a new partition on boot disks already filled across hundreds of machines. It is a simple property change, re-image the node(s), and you are ready to go.
Image creation can be done from a running system or LCM can remote power on a node through WOL, PXE boot it, collect an image, and power off the node when complete.
All operations are simple point and click, no command line options to remember (these are optional), or complicated setup.
Main features:
- Easy to use GUI for all operations
- Real time status information for all nodes
- Connect to individual nodes via a user specified protocol (ssh, rsh, rlogin ,etc)
- Report on running processes across the cluster
Imaging Features
- Imaged nodes have IP and hostname information changed automatically, just image and boot
- Block level system imaging (can be used for any x86 based operating system)
- File level system imaging
- Wake on LAN for imaging
- Preconfigured for PXE boot of clients
- Images can be customized - target device changed, file system size, file system type
Monitor Features
- Real time status monitoring for all nodes, CPU and Network
- Scrolling, scalable graphs
- View all nodes or a subset you are interested in
Scripting Features
- Run scripts across the cluster or on select nodes without a client agent
- Connects via a user selected protocol with secure authentication
- Command line or GUI interface
- Can be incorporated into external scripts
<<lessImaging with LCM is quick and easy as each node has its IP and hostname changed ready for boot as an independant node preloaded with all of your applications. Using a file based image, items can be changed such as the target device, the file system capacity, or even the file system type. Imagine requiring a new partition on boot disks already filled across hundreds of machines. It is a simple property change, re-image the node(s), and you are ready to go.
Image creation can be done from a running system or LCM can remote power on a node through WOL, PXE boot it, collect an image, and power off the node when complete.
All operations are simple point and click, no command line options to remember (these are optional), or complicated setup.
Main features:
- Easy to use GUI for all operations
- Real time status information for all nodes
- Connect to individual nodes via a user specified protocol (ssh, rsh, rlogin ,etc)
- Report on running processes across the cluster
Imaging Features
- Imaged nodes have IP and hostname information changed automatically, just image and boot
- Block level system imaging (can be used for any x86 based operating system)
- File level system imaging
- Wake on LAN for imaging
- Preconfigured for PXE boot of clients
- Images can be customized - target device changed, file system size, file system type
Monitor Features
- Real time status monitoring for all nodes, CPU and Network
- Scrolling, scalable graphs
- View all nodes or a subset you are interested in
Scripting Features
- Run scripts across the cluster or on select nodes without a client agent
- Connects via a user selected protocol with secure authentication
- Command line or GUI interface
- Can be incorporated into external scripts
Download (18.5MB)
Added: 2007-04-15 License: GPL (GNU General Public License) Price:
926 downloads
Cluster Installation Finishing Scripts 3.1.1
Cluster Installation Finishing Scripts is a post-installation adjustment system for compute nodes. more>>
The Finishing Scripts for Cluster Installations handle specific post-installation configuration that might not be convienent nor possible using existing cluster installation methods.
The usual installation process is used to build a reasonably configured node, and the system then reboots into normal mode, achieves network visibility, and executes the finishing script.
The finishing script handles all of the finer details of installing packaged or non-packaged software, tweaking installation, setting host/net specific parameters/files, etc. It is controlled via a single, easily modified script.
<<lessThe usual installation process is used to build a reasonably configured node, and the system then reboots into normal mode, achieves network visibility, and executes the finishing script.
The finishing script handles all of the finer details of installing packaged or non-packaged software, tweaking installation, setting host/net specific parameters/files, etc. It is controlled via a single, easily modified script.
Download (0.011MB)
Added: 2005-04-07 License: Artistic License Price:
1661 downloads
CLUTO 2.1.2a
CLUTO is a software package for clustering low- and high-dimensional datasets. more>>
CLUTO is a software package for clustering low- and high-dimensional datasets and for analyzing the characteristics of the various clusters. The project is well-suited for clustering data sets arising in many diverse application areas including information retrieval, customer purchasing transactions, web, GIS, science, and biology.
CLUTOs distribution consists of both stand-alone programs and a library via which an application program can access directly the various clustering and analysis algorithms implemented in CLUTO.
Main features:
- Multiple classes of clustering algorithms:
partitional, agglomerative, & graph-partitioning based.
- Multiple similarity/distance functions:
Euclidean distance, cosine, correlation coefficient, extended Jaccard, user-defined.
- Numerous novel clustering criterion functions and agglomerative merging schemes.
- Traditional agglomerative merging schemes:
single-link, complete-link, UPGMA
- Extensive cluster visualization capabilities and output options:
postscript, SVG, gif, xfig, etc.
- Multiple methods for effectively summarizing the clusters:
most descriptive and discriminating dimensions, cliques, and frequent itemsets.
- Can scale to very large datasets containing hundreds of thousands of objects and tens of thousands of dimensions.
Enhancements:
- Support for Windows X86_64 was added.
<<lessCLUTOs distribution consists of both stand-alone programs and a library via which an application program can access directly the various clustering and analysis algorithms implemented in CLUTO.
Main features:
- Multiple classes of clustering algorithms:
partitional, agglomerative, & graph-partitioning based.
- Multiple similarity/distance functions:
Euclidean distance, cosine, correlation coefficient, extended Jaccard, user-defined.
- Numerous novel clustering criterion functions and agglomerative merging schemes.
- Traditional agglomerative merging schemes:
single-link, complete-link, UPGMA
- Extensive cluster visualization capabilities and output options:
postscript, SVG, gif, xfig, etc.
- Multiple methods for effectively summarizing the clusters:
most descriptive and discriminating dimensions, cliques, and frequent itemsets.
- Can scale to very large datasets containing hundreds of thousands of objects and tens of thousands of dimensions.
Enhancements:
- Support for Windows X86_64 was added.
Download (16.5MB)
Added: 2007-01-14 License: GPL (GNU General Public License) Price:
1014 downloads
Secleted [ 0 ] software to compare
Copyright Notice:
Software piracy is theft, Using crack, password, serial numbers, registration codes, key generators is illegal and prevent future software development. The above clustering search only lists software in full, demo and trial versions for free download. Download links are directly from our mirror sites or publisher sites, torrent files or links from rapidshare.com, yousendit.com or megaupload.com are not allowed