JDuplicate 0.5.9
JDuplicate is a multiplayer word game suite supporting popular word games such as Scrabble and Duplicate.
JDuplicate Swing Client, commonly called JDSC, is the JDuplicate reference
client. It lets you connect to a JDuplicate server and play without
having to input complex commands, since its main goal is usability.
Unlike JDCC, which was developed for debugging purposes, JDSC was developed
from the ground up with ease of use in mind. JDSC is fully graphical and
features user-friendly characteristics such as drag-and-drop tiles, full sound
support and a click-and-play interface for game selection.
Like the rest of the JDuplicate "official" package, running JDSC will require
that you install Java.
Duplicate Music Matcher 2.3
Duplicate Music Matcher is a script to quickly find duplicate MP3 files based on letter matching.
Duplicate Music Matcher is helpful for weeding out duplicates that may not be of the same encoding, format, or even the same filename.
Enhancements:
- The pymad, pyvorbis and python-flac dependencies were all removed in favor of mutagen
- GUI code was updated (no more SimpleGladeApp)
- The Delete key has been bound in the GUI
- More accurate matching for ogg and flac files
- The filename column is now resizable
- The GUI play and stop buttons now use audacious
- FLAC should now also give a length value
- Some oggs might have a huge bitrate value due to a bug in mutagen.
Remove MH duplicates 0.0.1
Removes duplicates from MH mail folders.
Remove MH duplicates software removes duplicate mail messages from MH-style folders.
Remove playlist duplicates 0.2
Remove playlist duplicates detects all duplicate playlist entries and leaves just the first one of each of them.
I have files duplicated on my mp3 player and hard drive, and when I plugged my player in, my collection contained duplicate entries. This was a bit annoying, so I've made a script to remove songs that were in the playlist twice. There is probably an easier way to do it...
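The "leave just the first one" behaviour can be sketched in a few lines of Python. This is only an illustration, not the script's actual code: the function name is hypothetical, and it assumes the playlist has already been read into a list of entry strings.

```python
def remove_playlist_duplicates(entries):
    """Return the entries with later repeats dropped, preserving order."""
    seen = set()
    unique = []
    for entry in entries:
        if entry not in seen:  # first time this entry appears: keep it
            seen.add(entry)
            unique.append(entry)
    return unique


playlist = ["a.mp3", "b.mp3", "a.mp3", "c.mp3", "b.mp3"]
print(remove_playlist_duplicates(playlist))
```

Entries are compared as exact strings here, so two different paths to the same song would not be treated as duplicates.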
Duplidel 0.91
Duplidel finds and removes the duplicate messages that often occur when downloading email from multiple accounts.
Duplidel supports maildir and mbox formats. It works, seems quite stable, and makes a backup of your email before doing anything to it.
Dupseek 1.3
Dupseek is a command-line interactive Perl program to find and remove duplicate files.
A few strategies are possible for finding duplicate files in a big set, such as a heavily populated directory.
One of the most widely used consists of grouping files by size (because files of different sizes can't be identical) and then computing a short digital fingerprint (such as an MD5 checksum) for the files.
Files with a different fingerprint are different, and files with the same digital fingerprint are very probably the same. Just to be sure, one can further check possible duplicates.
Main features:
- It starts by grouping files by size.
- Then it starts reading small chunks of the files of the same size and comparing them. It creates smaller groups depending on these comparisons.
- It goes on with bigger and bigger chunks (of size up to a hard-coded limit).
- It stops reading from files as soon as they form a single-element group or they are read completely (which only happens when they have a very high probability of having duplicates).
This algorithm is much more efficient than competitors when dealing with large files of the same size. When files differ, reading usually stops after very few reads.
Dupseek (and destroy) can be interrupted at any moment. The user is then presented with partial results and can either intervene manually or go on with the reading and computation, on a group-by-group basis. Since subsequent reads happen sparsely in the file, if some files are still in the same group after many iterations, they are most probably identical, unless the differences are very small.
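The chunk-by-chunk grouping described above can be sketched as follows. This is a simplified Python illustration, not Dupseek's Perl code: the function name and the chunk sizes are assumptions, and it reads the files contiguously from start to end rather than sparsely, so surviving groups are exact matches.

```python
import os
from collections import defaultdict


def find_duplicates(paths, first_chunk=4096, max_chunk=1 << 20):
    """Group files by size, then split groups by comparing growing chunks."""
    by_size = defaultdict(list)
    for path in paths:
        by_size[os.path.getsize(path)].append(path)

    duplicates = []
    for size, group in by_size.items():
        if len(group) < 2:
            continue  # a unique size cannot have duplicates
        groups = [group]
        offset, chunk = 0, first_chunk
        while offset < size and groups:
            next_groups = []
            for candidates in groups:
                split = defaultdict(list)
                for path in candidates:
                    with open(path, "rb") as fh:
                        fh.seek(offset)
                        split[fh.read(chunk)].append(path)
                # Single-element groups are dropped and never read again.
                next_groups.extend(g for g in split.values() if len(g) > 1)
            groups = next_groups
            offset += chunk
            chunk = min(chunk * 2, max_chunk)  # bigger chunks, up to a limit
        duplicates.extend(groups)
    return duplicates
```

Calling find_duplicates on a list of file paths returns a list of groups, each holding the paths whose contents matched in every chunk that was read.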
DigicaMerge 2.00
DigicaMerge is a command-line tool to merge directories of pictures taken with digital cameras.
This utility allows you to merge the contents of such directories into a new directory, and renames all the pictures on the fly, ensuring no filename clash will occur.
You can define your own naming scheme, using either a set of predefined variables or any recognized Exif tag which may be present in your pictures, and also specify a pattern to select only certain files.
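As a rough illustration of merging with on-the-fly renaming, here is a minimal Python sketch. It is not DigicaMerge's implementation: the function name, the counter-based pattern and its {index} and {ext} placeholders are hypothetical, and Exif tags are not read at all.

```python
import os
import shutil


def merge_directories(sources, target, pattern="picture_{index:04d}{ext}"):
    """Copy every file from the source directories into target under new names."""
    os.makedirs(target, exist_ok=True)
    index = 1
    copied = []
    for source in sources:
        for name in sorted(os.listdir(source)):
            path = os.path.join(source, name)
            if not os.path.isfile(path):
                continue
            ext = os.path.splitext(name)[1].lower()
            # Advance the counter until the generated name is unused,
            # so no existing file in the target is overwritten.
            while True:
                destination = os.path.join(
                    target, pattern.format(index=index, ext=ext))
                index += 1
                if not os.path.exists(destination):
                    break
            shutil.copy2(path, destination)
            copied.append(destination)
    return copied
```

Because names come from a running counter rather than from the original filenames, two cameras that both produced a file with the same name end up as two distinct files in the merged directory.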
Installation:
- Extract it:
$ gzip -dc digicamerge-x.xx.tar.gz | tar -xf -
where x.xx is digicamerge's latest version number.
- Install it:
Go to digicamerge's directory:
$ cd digicamerge-x.xx
Just type:
$ python setup.py install
You may need to be logged in with sufficient privileges (e.g. root).
This will generally install digicamerge in /usr/local/bin or an equivalent path depending on your system.
- Launch it:
Just type:
$ digicamerge
And read the long help message, which contains examples.
Enhancements:
- A command line option was added to automatically remove duplicate pictures when merging directories.
- The manual page is now included in the package.
File Splitter 1.3
Split large text/html files into smaller files. You embed commands in the big file telling it which pieces of it are to go where, then let Splitter do the work. It is much faster and more accurate than trying to select huge blocks of text in an editor and cut and paste them. You don't accidentally lose or duplicate text. Keeping files small makes the site more responsive.
In the following, pretend that the square brackets [...] are actually
less-than and greater-than signs. You embed multiple [split] tags in the
file to be split, of the form:
[split charlie.html]
...
stuff that will end up in the charlie.html file.
...
[/split]
The text between the [split xxx] and [/split] tags is split
off into that named file and the text is removed from the
original file along with the tags.
1. Filenames may be absolute or relative, with no quotes or spaces.
2. Tags may be nested, but they must balance (equal number
of [split xxx] and [/split]).
3. Tags are case-insensitive, i.e. may be lower or upper case.
4. Multiple [split xxx] tags may be directed to the same
file, where they will be appended.
5. If the files mentioned in the split tags already exist,
they will be overwritten.
6. Anything not inside [split xxx].. [/split] is retained in
the original file. Everything else is removed.
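The tag rules above can be sketched in Python as follows. This is an illustration only, not the Java implementation of File Splitter: the function name is hypothetical, it uses the real angle-bracket form of the tags, it returns the pieces instead of writing files, and it assumes that text inside a nested tag goes only to the innermost file.

```python
import re

# Matches an opening <split name> tag or a closing </split> tag, any case.
TAG = re.compile(r"<split\s+([^\s>]+)>|</split>", re.IGNORECASE)


def split_text(text):
    """Return (remainder, outputs) where outputs maps filename to content."""
    outputs = {}
    stack = []       # filenames of the currently open split tags
    remainder = []   # text outside every split tag
    pos = 0

    def emit(chunk):
        if stack:
            # Several tags naming the same file are appended together.
            outputs[stack[-1]] = outputs.get(stack[-1], "") + chunk
        else:
            remainder.append(chunk)

    for match in TAG.finditer(text):
        emit(text[pos:match.start()])
        pos = match.end()
        if match.group(1) is not None:      # opening tag
            stack.append(match.group(1))
            outputs.setdefault(match.group(1), "")
        elif stack:                          # closing tag
            stack.pop()
        else:
            raise ValueError("unbalanced </split> tag")
    if stack:
        raise ValueError("missing </split> tag")
    emit(text[pos:])
    return "".join(remainder), outputs
```

A caller would then write each outputs entry to its file (overwriting any existing file) and save the remainder back to the original.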
The file being split must be small enough to fit in RAM.
Java array addressing limits the file to 2GB, though other
considerations mean in practice the largest file you can
handle will be smaller still.
To install, extract the zip download with WinZip, available from
http://www.winzip.com (or similar unzip utility) into any
directory you please, often C:\ -- ticking off the (user
folder names) option. To run as an application, type:
java -jar C:\com\mindprod\splitter\splitter.jar x.html
adjusting as necessary to account for where the jar file is.
Enhancements:
- Version 1.3: allows you to specify the encoding.
Duper 20041213
Duper is a duplicate file removal tool.
Duper first scans all file sizes. If any two file sizes are identical, an MD5 sum is computed for each file and then used to determine if they are identical. After scanning, it allows you to browse through the duplicates found and select those you wish to delete.
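The size-then-MD5 scan can be sketched in Python like this. It is a minimal illustration of the approach described, not Duper's own code; the function names and the block size are assumptions, and the interactive browse-and-delete step is left out.

```python
import hashlib
import os
from collections import defaultdict


def md5_of(path, block_size=65536):
    """Compute the MD5 hex digest of a file, reading it in blocks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for block in iter(lambda: fh.read(block_size), b""):
            digest.update(block)
    return digest.hexdigest()


def duplicate_groups(root):
    """Return groups of paths under root that share both size and MD5 sum."""
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    groups = []
    for same_size in by_size.values():
        if len(same_size) < 2:
            continue  # only hash files whose size is shared
        by_digest = defaultdict(list)
        for path in same_size:
            by_digest[md5_of(path)].append(path)
        groups.extend(sorted(g) for g in by_digest.values() if len(g) > 1)
    return groups
```

Hashing only the files that share a size keeps the expensive step small, since most files in a typical directory tree have a unique size.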
iCalDoubleRemover
iCalDoubleRemover is a script that simply removes duplicate entries in iCal files.
Be aware that you need Perl and the Config::IniFiles module, which can be installed from CPAN ( perl -MCPAN -e 'install Config::IniFiles' ).
DupeFinder 1.0.2
DupeFinder is a simple application for locating, moving, renaming, and deleting duplicate files in a directory structure.
It's perfect both for users who haven't kept their hard drives very well organized and need to do some cleaning to free space, and for users who like to keep lots of backup copies of important data "just in case" something bad should happen.
Main features:
Although DupeFinder is a quite small application, it should have all of the features you will need to remove and reorganize large directories full of duplicate files:
- Well-designed graphical interface with full tooltip and "What's This?" question button support, useful in an application which you probably won't need to use frequently
- Quick processing by eliminating analysis of unwanted data through file extension filtering
- View files in external applications by double-clicking
- Rename files in place or move to new locations
- Default settings disallow deletion of all copies of duplicate files to prevent accidental data loss
- Generate simple reports identifying groups of duplicate files for later processing
While everything works pretty well in most cases, there are a few issues with DupeFinder to be aware of. I hope to fix most of the following bugs sometime soon:
- May crash if files containing "~" or ":" characters are encountered
- May crash if self referencing symlinks are encountered
- Zero byte files cannot be deleted
- May not be able to delete files with Unicode characters in filename
- Display does not update if identified duplicates are moved, renamed or modified external to DupeFinder
mp3dup 0.3
mp3dup looks for duplicate files in recursively searched directories.
Parsed files are inserted in a hash table keyed with the size of the file. Files with the same size are md5summed, and those md5sums are inserted in a new hash table. Files with the same size and md5sum are brute-force-diffed. Non-regular files are just skipped. Empty files are printed as such.
mp3dup can also find files whose first or last N bytes are equal. This is accomplished by hashing the head and tail parts of the files, and byte-comparing the collisions.
Enhancements:
- Identical files are no longer reported as Match:ID3 too.
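The head and tail matching can be sketched in Python as below. This is an illustration under assumptions, not mp3dup's code: the function names are hypothetical, MD5 is used as the hash, and files shorter than N bytes are simply compared on whatever bytes they have.

```python
import hashlib
import os
from collections import defaultdict


def read_edge(path, n, from_end=False):
    """Read the first n bytes of a file, or the last n if from_end is set."""
    with open(path, "rb") as fh:
        if from_end:
            fh.seek(0, os.SEEK_END)
            fh.seek(max(fh.tell() - n, 0))
        return fh.read(n)


def edge_groups(paths, n, from_end=False):
    """Group files whose first (or last) n bytes are byte-for-byte equal."""
    edges = {path: read_edge(path, n, from_end) for path in paths}

    # Hash table keyed on the digest of the head or tail part.
    by_digest = defaultdict(list)
    for path, data in edges.items():
        by_digest[hashlib.md5(data).hexdigest()].append(path)

    groups = []
    for candidates in by_digest.values():
        # Byte-compare the collisions to rule out accidental hash matches.
        confirmed = defaultdict(list)
        for path in candidates:
            confirmed[edges[path]].append(path)
        groups.extend(g for g in confirmed.values() if len(g) > 1)
    return groups
```

Matching on the tail is useful when files differ only in a leading tag block, and matching on the head covers the opposite case.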
DuMP3 0.2.16 Beta
DuMP3 (derived from Duplicate MP3) is a Java program to find any duplicate or similar file.
It finds files by calculating a fingerprint based on the image, audio or text data for each file and then comparing the fingerprints. The project does not compare filenames or even ID3 tags (even though plugin classes could be written that perform these operations). Calculated fingerprints can be stored in a MySQL database so that they do not have to be calculated again.
As an extra bonus, DuMP3 will mark the files that cannot be read or decoded correctly as corrupt or as having a signature mismatch.
DuMP3 can find files that are not exact duplicates:
- Binary files are compared by SHA1 hash (configurable to any MD hash)
- Text files that were changed by addition or deletion (2 fingerprint algorithms available)
- Pictures in different formats, sizes and/or rotations (BMP, GIF, JPEG, JPEG2000, PNG, PNM, RAW, TIFF)
- Audio files that were recorded at different bit rates or saved in different formats (AU, AIF, WAV, MP3, OGG)
- plugin fingerprint classes can be written for any file where inexact matching is needed (fonts, videos, etc)
Version restrictions:
- DuMP3 is subject to some Java limitations as well as some limitations in the libraries I have chosen.

CheckPlaces 1.2.0
CheckPlaces is a professionally designed Firefox extension which lets you check your bookmarks in different ways. There is a basic check of your bookmarks, plus optional checks for pages that no longer exist, duplicate bookmarks, and empty folders. You can also use it to restore favicons that have gone missing.
Enhancements:
- Failed page results are now grouped into five different types for easier handling
- When you select more than one result, the details of the last result are now displayed
- When you delete a result, the next one in the list is automatically selected
- When you resize the results window, the list boxes now increase in size to display more results
- Added CheckPlaces icons
- Added toolbar button and status bar icon as extra ways of starting up
- BUG FIX: 'Delete all' buttons were not working
gquilt 0.20
quilt is a tool for managing a series of patches by keeping track of the changes each patch makes.
Patches can be applied, un-applied, refreshed, etc. gquilt is a PyGTK GUI wrapper for quilt.
Main features:
- Currently, point-and-click interfaces are only provided for a small (frequently used) subset of quilt's commands:
- an Open menu item in the Playground that allows the root directory for the quilt playground to be selected,
- tool bar buttons that execute common operations on the top applied patch, and
- pop up menus (triggered by the right mouse button) over the playground files tree, patch files tree and patch series list that execute common operations such as adding files to the top patch, etc.
- To compensate for the incomplete coverage of quilt's command set, a text entry widget is provided in the tool bar where arbitrary quilt commands may be executed. All quilt commands initiated by the user (either by point and click or through the entry widget) are echoed (with a time stamp) to the console widget at the bottom of the main window, as is the resulting output from the quilt command. To make stdout and stderr output easy to tell apart, the former is printed in black and the latter in red.
Enhancements:
Add new functionality to the patch list panel:
- (where possible) "files" now displays files in unapplied patches;
- (where possible) "diff" now displays diffs for unapplied patches;
- (where possible) "diff" allows unapplied diffs to be edited and saved;
- new item "duplicate" can be used to import a duplicate of a selected unapplied patch;
- if the interdiff program from patchutils is available, an "interdiff" of the top patch and a selected unapplied patch can be imported.
This new functionality is intended to facilitate the splitting of patches into a number of smaller patches.