extract data
Sponsored Links
Sponsored Links
Secleted [ 0 ] software to compare
Results 1 - 15 of about 5301
GNU awk 3.1.4
GNU awk is a pattern scanning and processing language. more>>
If you are like many computer users, you would frequently like to make changes in various text files wherever certain patterns appear, or extract data from parts of certain lines while discarding the rest.
To write a program to do this in a language such as C or Pascal is a time-consuming inconvenience that may take many lines of code. The job is easy with awk, especially the GNU implementation: gawk.
The awk utility interprets a special-purpose programming language that makes it possible to handle simple data-reformatting jobs with just a few lines of code.
<<lessTo write a program to do this in a language such as C or Pascal is a time-consuming inconvenience that may take many lines of code. The job is easy with awk, especially the GNU implementation: gawk.
The awk utility interprets a special-purpose programming language that makes it possible to handle simple data-reformatting jobs with just a few lines of code.
Download (2.2MB)
Added: 2005-04-03 License: GPL (GNU General Public License) Price:
1666 downloads
Data Display Debugger 3.3.11
Data Display Debugger is a common graphical user interface for GDB, DBX and XDB. more>>
GNU DDD is a graphical front-end for command-line debuggers such as GDB, DBX, WDB, Ladebug, JDB, XDB, the Perl debugger, the bash debugger, or the Python debugger.
Besides "usual front-end features such as viewing source texts, DDD has become famous through its interactive graphical data display, where data structures are displayed as graphs.
<<lessBesides "usual front-end features such as viewing source texts, DDD has become famous through its interactive graphical data display, where data structures are displayed as graphs.
Download (7.3MB)
Added: 2005-04-18 License: GPL (GNU General Public License) Price:
1661 downloads
ADS Dexter 0.40
ADS Dexter is a utility for extracting data from scanned graphs. more>>
The following is an excerpt from a poster presented at the American Astronomical Societys 2000 Summer meeting in Rochester, NY.
ADS roughly 1,000,000 scanned pages contain numerous diagrams and figures for which the original data sets are lost or inaccessible. Having scans for the figures invites digitizing the data points to recover at least a part of these data.
Performing this digitization automatically is still beyond the capabilities of current OCR systems, but the computer can ease this process for a human.
This was the starting point for Dexter, a Java applet that runs in the users browsers and provides an interface for selecting the part of the page that is of interest. On that selection, coordinate axes, points and error bars can be marked and, of course, corrected. [...]
In the future, we plan to implement some recognition algorithms that would, e.g., trace a line for the user or automatically search for markers.
<<lessADS roughly 1,000,000 scanned pages contain numerous diagrams and figures for which the original data sets are lost or inaccessible. Having scans for the figures invites digitizing the data points to recover at least a part of these data.
Performing this digitization automatically is still beyond the capabilities of current OCR systems, but the computer can ease this process for a human.
This was the starting point for Dexter, a Java applet that runs in the users browsers and provides an interface for selecting the part of the page that is of interest. On that selection, coordinate axes, points and error bars can be marked and, of course, corrected. [...]
In the future, we plan to implement some recognition algorithms that would, e.g., trace a line for the user or automatically search for markers.
Download (0.079MB)
Added: 2005-05-20 License: GPL (GNU General Public License) Price:
1620 downloads
Nabu svn235
Nabu is a simple framework that extracts chunks of various types of information. more>>
Nabu is a simple framework that extracts chunks of various types of information from documents written in simple text files (written with reStructuredText conventions) and that stores this information (including the document) in a remote database for later retrieval.
The processing and extraction of the document is handled on a server, and there is a small and simple client that is used to push the files to the server for processing and storage (think rsync). The client requires only Python to work.
The presentation layer is left unspecified: you can use whichever web application framework you like to present the extracted data in the way that you prefer.
Main features:
- It is flexible: you can use
- any text editor you like to edit the files;
- any source code control system you like to store and maintain them (or none);
- any database for storage;
- and any web application framework for presentation. Nabu does not dictate how the information is presented/served to the clients;
- you edit files locally, not in a bleeping web browser window (programmers will appreciate the value of this), in your favourite editor environment;
- the organization of the source files in subdirectories has nothing to do with how the content is presented. We use a unique ID system (similar to arch) where your document to be published must contain a unique string to mark it with that id. You can put that string in a reStructuredText comment or a bibliographic field. Unlike Wikis, this allows you to change the title of your documents while keeping the possibility of a permanent link to them.
- It effectively offers you a sandbox for creating content, and then how you organize and present the content is dictated by ways that you decide, most likely independent of the source file organization structure;
- the input data can be scattered over many files, it does not have to be stored in files per-category (for example, you dont have to store all your "contacts" in a single "address book" file, they can be found within/across all your body of published file and a server might present as a single list if desired). I conjecture that this may be closer to how humans think of this data. This body of files can be used to create a mind-mapping system;
- we recognize that the value of the information lies in the source itself, the text files. This valuable source remains with you, and you are free to manage them in any way you prefer, with any version control system you like (if you want to do that). You can completely dump the data stored in the database and rebuild it from the text files;
- various semantic chunks of content are automatically extracted from your document. These semantic things are easily written with little code and are configurable. Nabu comes with example content extractors;
- a light-weight program with minimal dependencies is used to upload the files to the server. The server processes the files for content. This maximizes the potential that you will be able to use Nabu anywhere, on any platform. The client only requires Python to work;
<<lessThe processing and extraction of the document is handled on a server, and there is a small and simple client that is used to push the files to the server for processing and storage (think rsync). The client requires only Python to work.
The presentation layer is left unspecified: you can use whichever web application framework you like to present the extracted data in the way that you prefer.
Main features:
- It is flexible: you can use
- any text editor you like to edit the files;
- any source code control system you like to store and maintain them (or none);
- any database for storage;
- and any web application framework for presentation. Nabu does not dictate how the information is presented/served to the clients;
- you edit files locally, not in a bleeping web browser window (programmers will appreciate the value of this), in your favourite editor environment;
- the organization of the source files in subdirectories has nothing to do with how the content is presented. We use a unique ID system (similar to arch) where your document to be published must contain a unique string to mark it with that id. You can put that string in a reStructuredText comment or a bibliographic field. Unlike Wikis, this allows you to change the title of your documents while keeping the possibility of a permanent link to them.
- It effectively offers you a sandbox for creating content, and then how you organize and present the content is dictated by ways that you decide, most likely independent of the source file organization structure;
- the input data can be scattered over many files, it does not have to be stored in files per-category (for example, you dont have to store all your "contacts" in a single "address book" file, they can be found within/across all your body of published file and a server might present as a single list if desired). I conjecture that this may be closer to how humans think of this data. This body of files can be used to create a mind-mapping system;
- we recognize that the value of the information lies in the source itself, the text files. This valuable source remains with you, and you are free to manage them in any way you prefer, with any version control system you like (if you want to do that). You can completely dump the data stored in the database and rebuild it from the text files;
- various semantic chunks of content are automatically extracted from your document. These semantic things are easily written with little code and are configurable. Nabu comes with example content extractors;
- a light-weight program with minimal dependencies is used to upload the files to the server. The server processes the files for content. This maximizes the potential that you will be able to use Nabu anywhere, on any platform. The client only requires Python to work;
Download (0.20MB)
Added: 2005-07-13 License: GPL (GNU General Public License) Price:
1563 downloads
Spammergrok 0.96beta
Spammergrok is a bash script to extract URLs and download data from them a set number of times. more>>
Spammergrok is a simple bash script that will take URLs on the command line, from one or more files, or extract them from one or more files containing a single email message each (via ripmime).
Spammergrok will then proceed to download data from these URLs a set number of times (via wget) in an effort to waste a spammers bandwidth.
Main features:
- Autoconf configuration and installation of the spammergrok script
- User configuration of operational parameters via resource file
- Sane default values, with user-overrides via resource file (updated/expanded soon)
- Configurable limitations on download speed and number of download iterations to prevent spammergrok from overpowering a host
- Ability to download in the background
- Logging of actions and results, with automatic compression of log files when completed (coming soon), and automatic removal of old log files after a configurable amount of time (coming soon)
- Spam mail or URLs may be piped to spammergrok, given on the command line, or given in one or more files (one email message per file)
<<lessSpammergrok will then proceed to download data from these URLs a set number of times (via wget) in an effort to waste a spammers bandwidth.
Main features:
- Autoconf configuration and installation of the spammergrok script
- User configuration of operational parameters via resource file
- Sane default values, with user-overrides via resource file (updated/expanded soon)
- Configurable limitations on download speed and number of download iterations to prevent spammergrok from overpowering a host
- Ability to download in the background
- Logging of actions and results, with automatic compression of log files when completed (coming soon), and automatic removal of old log files after a configurable amount of time (coming soon)
- Spam mail or URLs may be piped to spammergrok, given on the command line, or given in one or more files (one email message per file)
Download (0.083MB)
Added: 2005-09-30 License: GPL (GNU General Public License) Price:
1485 downloads
Xephyrus Data Structures Tag Library 1.5
Xephyrus Data Structures Tag Library is a tag library to provide access to common data-structures. more>>
Xephyrus Data Structures Tag Library provides an easy way to create and manipulate the contents of common Java data-structures such as maps and lists.
Enhancements:
- The library was polished up.
- Several improvements were made and a few bugs were fixed.
- This version is aimed at Java 5 and JSP 2.0.
<<lessEnhancements:
- The library was polished up.
- Several improvements were made and a few bugs were fixed.
- This version is aimed at Java 5 and JSP 2.0.
Download (0.021MB)
Added: 2005-10-13 License: BSD License Price:
1471 downloads
Data.FormValidator 0.04
Data.FormValidators aim is to bring all the benefits of the perl module Data::FormValidator over to javascript. more>>
Data.FormValidators aim is to bring all the benefits of the perl module Data::FormValidator over to javascript, using the same input profiles (they can be dumped into javascript objects using the perl module Data::JavaScript.
Data.FormValidator library lets you define profiles which declare the required and optional fields and any constraints they might have.
The results are provided as an object which makes it easy to handle missing and invalid results, return error messages about which constraints failed, or process the resulting valid data.
IMPORTANT NOTE: JavaScript form validation is NOT a replacement for data validation in your backend scripts. This is the primary reason this module was written... so that it would be easy to share the same validation profile for both the frontend (via Data.FormValidator.js) and backend (via Data::FormValidator.pm).
Enhancements:
- A problem where some functions were not terminated by a semi-colon, so JavaScript compactors would end up creating broken code was fixed.
<<lessData.FormValidator library lets you define profiles which declare the required and optional fields and any constraints they might have.
The results are provided as an object which makes it easy to handle missing and invalid results, return error messages about which constraints failed, or process the resulting valid data.
IMPORTANT NOTE: JavaScript form validation is NOT a replacement for data validation in your backend scripts. This is the primary reason this module was written... so that it would be easy to share the same validation profile for both the frontend (via Data.FormValidator.js) and backend (via Data::FormValidator.pm).
Enhancements:
- A problem where some functions were not terminated by a semi-colon, so JavaScript compactors would end up creating broken code was fixed.
Download (0.047MB)
Added: 2006-01-20 License: GPL (GNU General Public License) Price:
1372 downloads
THC-ManipulateData 1.3
THC-ManipulateData can search data on a harddisk/partition/file. more>>
THC-ManipulateData can search data on a harddisk/partition/file, extract the part you are interested in, and write it back after you modified it.
Useful to find and modify really all unencrypted Logfiles on a system. Does everything in RAW mode, and hence does not tamper a/m/ctimes.
It comes with 4 tools:
Syntax of search_data: ./search_data [-i] [-d] blockdevice searchstring
-i - the only parameter which is optional. This does the
search case insensitive.
-d - dump the found occasions in hex
blockdevice - a blockdevice you want to search for data. It need
not to be a blockdevice, it can be anything, but normaly
you use it on these.
searchstring - a string you want to search for
The blockdevice is searched for the occurance of searchstring, which are printed with location when found.
Example: ./search_data -i /dev/hda3 "connect from 10.0.0.1"
Output looks like:
found at 234600: connect from 10.0.0.1/unresolved (UNKNOWN)
Syntax of read_data: ./read_data blockdevice start_address no_of_bytes
blockdevice - a blockdevice you want to get your data from. It need not to be a blockdevice, it can be anything, but normaly you use it on these.
start_address - from which offset of the blockdevice you want to extract data from
no_of_bytes - how many bytes of data starting at the start_address you want to extract in a file.
The output filename is always START_ADDRESS.NO_OF_BYTES
Example: ./read_data /dev/hda3 234653 1024
writes 1024 bytes of data from /dev/hda3 starting from offset 234653 to the file "234653.1024"
Syntax of write_data: ./write_data blockdevice filename
blockdevice - a blockdevice you want to write your data to. It need not to be a blockdevice, it can be anything, but normaly you use it on these.
filename - the data you want to write to the blockdevice. For error protection, the location where it is put it gathered from the filename - as you can see above from read_data. If you modified the data extracted with read_data into the file, it may not have a different size than defined in the filename! The data in filename is written to the blockdevice
Example: ./write_data /dev/hda3 234653.1024
writes 1024 bytes of data to /dev/hda3 starting at offset 234653 with the
data read from the file "234653.1024"
Syntax of replace_data: ./replace_data [-i] blockdevice searchstring replacestring
-i - the only parameter which is optional. This does the search case insensitive.
blockdevice - a blockdevice you want to search for data. It need not to be a blockdevice, it can be anything, but normaly you use it on these.
searchstring - a string you want to search for
replacestring - the string you want to replace the found entries with
The blockdevice is searched for the occurance of searchstring, and is then replaced.
Example: ./replace_data -i /dev/hda3 "connect from 1.0.0.1" "Remap table failure "
Output looks like:
found at 234600 - replaced
Enhancements:
- fixed a bug in read/write_data, seeks over 2gb now succeed
- added -d for hexdump display of occasions found in search_data
<<lessUseful to find and modify really all unencrypted Logfiles on a system. Does everything in RAW mode, and hence does not tamper a/m/ctimes.
It comes with 4 tools:
Syntax of search_data: ./search_data [-i] [-d] blockdevice searchstring
-i - the only parameter which is optional. This does the
search case insensitive.
-d - dump the found occasions in hex
blockdevice - a blockdevice you want to search for data. It need
not to be a blockdevice, it can be anything, but normaly
you use it on these.
searchstring - a string you want to search for
The blockdevice is searched for the occurance of searchstring, which are printed with location when found.
Example: ./search_data -i /dev/hda3 "connect from 10.0.0.1"
Output looks like:
found at 234600: connect from 10.0.0.1/unresolved (UNKNOWN)
Syntax of read_data: ./read_data blockdevice start_address no_of_bytes
blockdevice - a blockdevice you want to get your data from. It need not to be a blockdevice, it can be anything, but normaly you use it on these.
start_address - from which offset of the blockdevice you want to extract data from
no_of_bytes - how many bytes of data starting at the start_address you want to extract in a file.
The output filename is always START_ADDRESS.NO_OF_BYTES
Example: ./read_data /dev/hda3 234653 1024
writes 1024 bytes of data from /dev/hda3 starting from offset 234653 to the file "234653.1024"
Syntax of write_data: ./write_data blockdevice filename
blockdevice - a blockdevice you want to write your data to. It need not to be a blockdevice, it can be anything, but normaly you use it on these.
filename - the data you want to write to the blockdevice. For error protection, the location where it is put it gathered from the filename - as you can see above from read_data. If you modified the data extracted with read_data into the file, it may not have a different size than defined in the filename! The data in filename is written to the blockdevice
Example: ./write_data /dev/hda3 234653.1024
writes 1024 bytes of data to /dev/hda3 starting at offset 234653 with the
data read from the file "234653.1024"
Syntax of replace_data: ./replace_data [-i] blockdevice searchstring replacestring
-i - the only parameter which is optional. This does the search case insensitive.
blockdevice - a blockdevice you want to search for data. It need not to be a blockdevice, it can be anything, but normaly you use it on these.
searchstring - a string you want to search for
replacestring - the string you want to replace the found entries with
The blockdevice is searched for the occurance of searchstring, and is then replaced.
Example: ./replace_data -i /dev/hda3 "connect from 1.0.0.1" "Remap table failure "
Output looks like:
found at 234600 - replaced
Enhancements:
- fixed a bug in read/write_data, seeks over 2gb now succeed
- added -d for hexdump display of occasions found in search_data
Download (0.007MB)
Added: 2006-03-08 License: GPL (GNU General Public License) Price:
1325 downloads
Common Data Format 3.1
Common Data Format is a self-describing data abstraction for the storage and manipulation of multidimensional data. more>>
Common Data Format is a self-describing data abstraction for the storage and manipulation of multidimensional data in a platform- and discipline-independent fashion.
It consists of a scientific data management package (known as the "CDF Library") that allows programmers and application developers to manage and manipulate scalar, vector, and multi-dimensional data arrays.
Enhancements:
- Adds new sets of APIs to allow Standard Interface to interact with zVariables and other CDF-related information.
- Adds MingW and FreeBSD ports.
- Adds support for Intel C++ and Fortran for Linux.
- Adds the ability to create legacy CDF 2.7 files.
- Fixes a bug that prevented directories from having .cdf or .skt extensions.
<<lessIt consists of a scientific data management package (known as the "CDF Library") that allows programmers and application developers to manage and manipulate scalar, vector, and multi-dimensional data arrays.
Enhancements:
- Adds new sets of APIs to allow Standard Interface to interact with zVariables and other CDF-related information.
- Adds MingW and FreeBSD ports.
- Adds support for Intel C++ and Fortran for Linux.
- Adds the ability to create legacy CDF 2.7 files.
- Fixes a bug that prevented directories from having .cdf or .skt extensions.
Download (1.5MB)
Added: 2006-03-13 License: Public Domain Price:
1320 downloads
htmlSQL 0.5
htmlSQL is a PHP class to query the web by an SQL like language. more>>
htmlSQL is a PHP class to query the web by an SQL like language. htmlSQL is a experimental PHP class which allows you to access HTML values by an SQL like syntax.
This means that you dont have to write complex functions (regular expressions) to extract specific values.
How to use
Just include the "snoopy.class.php" and the "htmlsql.class.php" files into your PHP scripts and look at the examples (examples/) to get an idea of how to use the htmlSQL class. It should be very simple.
Background & Idea
I had this idea while extracting some data from a website. As I realized that the algorithms and functions to extract links and other tags are often the same - I had the idea to combine all functions to an universal usable class. While drinking a coffee and thinking on that problem, I thought it would be cool to access HTML elements by using SQL. So I started creating this class...
Warning!
The eval() function is used for the WHERE statement. Make sure that all user data is checked and filtered against malicious PHP code. Never trust user input!
<<lessThis means that you dont have to write complex functions (regular expressions) to extract specific values.
How to use
Just include the "snoopy.class.php" and the "htmlsql.class.php" files into your PHP scripts and look at the examples (examples/) to get an idea of how to use the htmlSQL class. It should be very simple.
Background & Idea
I had this idea while extracting some data from a website. As I realized that the algorithms and functions to extract links and other tags are often the same - I had the idea to combine all functions to an universal usable class. While drinking a coffee and thinking on that problem, I thought it would be cool to access HTML elements by using SQL. So I started creating this class...
Warning!
The eval() function is used for the WHERE statement. Make sure that all user data is checked and filtered against malicious PHP code. Never trust user input!
Download (0.041MB)
Added: 2006-05-09 License: BSD License Price:
1264 downloads
HampusDB 1.0.1
HampusDB is a small, flexible and efficient hierarchical database. more>>
HampusDB is a small, flexible and efficient hierarchical database. It comes with a wide support of command line utilities for manipulating and extracting data.
Its designed for both embedded and bigger systems. HampusDB currently runs on Linux and has interfaces to C, C++, Java and Perl.
HDB aims to fill the gap when storing data in a relational database is too rigid and storing data in textfiles is too cumbersome.
A typical example would be when you have heirarchical data such as XML or configuration data that you want to store and retrieve in a flexible manner.
<<lessIts designed for both embedded and bigger systems. HampusDB currently runs on Linux and has interfaces to C, C++, Java and Perl.
HDB aims to fill the gap when storing data in a relational database is too rigid and storing data in textfiles is too cumbersome.
A typical example would be when you have heirarchical data such as XML or configuration data that you want to store and retrieve in a flexible manner.
Download (0.34MB)
Added: 2006-05-26 License: LGPL (GNU Lesser General Public License) Price:
1247 downloads
wavextract 1.0.0
wavextract is a program for extracting embedded audio data from JPEG images. more>>
wavextract is a program for extracting embedded audio data from JPEG images. wavextract project is useful if you have a digital camera that can record audio notes and embed them in photos (e.g. HP, Kodak, Fujifilm, Canon, etc.)
Wavextract is written in Python and is tested on Linux (but it should probably
work also on other operating systems).
You must have Python 2.4 (maybe it will work also with Python 2.3, but I didnt
test it) and Python Imaging Library (PIL) installed.
<<lessWavextract is written in Python and is tested on Linux (but it should probably
work also on other operating systems).
You must have Python 2.4 (maybe it will work also with Python 2.3, but I didnt
test it) and Python Imaging Library (PIL) installed.
Download (0.008MB)
Added: 2006-06-01 License: GPL (GNU General Public License) Price:
1241 downloads
Convert::BinHex 1.119
Convert::BinHex can extract data from Macintosh BinHex files. more>>
Convert::BinHex can extract data from Macintosh BinHex files.
ALPHA WARNING: this code is currently in its Alpha release. Things may change drastically until the interface is hammered out: if you have suggestions or objections, please speak up now!
SYNOPSIS
Simple functions:
use Convert::BinHex qw(binhex_crc macbinary_crc);
# Compute HQX7-style CRC for data, pumping in old CRC if desired:
$crc = binhex_crc($data, $crc);
# Compute the MacBinary-II-style CRC for the data:
$crc = macbinary_crc($data, $crc);
Hex to bin, low-level interface. Conversion is actually done via an object ("Convert::BinHex::Hex2Bin") which keeps internal conversion state:
# Create and use a "translator" object:
my $H2B = Convert::BinHex->hex2bin; # get a converter object
while (< STDIN >) {
print $STDOUT $H2B->next($_); # convert some more input
}
print $STDOUT $H2B->done; # no more input: finish up
Hex to bin, OO interface. The following operations must be done in the order shown!
# Read data in piecemeal:
$HQX = Convert::BinHex->open(FH=>*STDIN) || die "open: $!";
$HQX->read_header; # read header info
@data = $HQX->read_data; # read in all the data
@rsrc = $HQX->read_resource; # read in all the resource
Bin to hex, low-level interface. Conversion is actually done via an object ("Convert::BinHex::Bin2Hex") which keeps internal conversion state:
# Create and use a "translator" object:
my $B2H = Convert::BinHex->bin2hex; # get a converter object
while (< STDIN >) {
print $STDOUT $B2H->next($_); # convert some more input
}
print $STDOUT $B2H->done; # no more input: finish up
Bin to hex, file interface. Yes, you can convert to BinHex as well as from it!
# Create new, empty object:
my $HQX = Convert::BinHex->new;
# Set header attributes:
$HQX->filename("logo.gif");
$HQX->type("GIFA");
$HQX->creator("CNVS");
# Give it the data and resource forks (either can be absent):
$HQX->data(Path => "/path/to/data"); # here, data is on disk
$HQX->resource(Data => $resourcefork); # here, resource is in core
# Output as a BinHex stream, complete with leading comment:
$HQX->encode(*STDOUT);
PLANNED!!!! Bin to hex, "CAP" interface. Thanks to Ken Lunde for suggesting this.
# Create new, empty object from CAP tree:
my $HQX = Convert::BinHex->from_cap("/path/to/root/file");
$HQX->encode(*STDOUT);
BinHex is a format used by Macintosh for transporting Mac files safely through electronic mail, as short-lined, 7-bit, semi-compressed data streams. Ths module provides a means of converting those data streams back into into binary data.
<<lessALPHA WARNING: this code is currently in its Alpha release. Things may change drastically until the interface is hammered out: if you have suggestions or objections, please speak up now!
SYNOPSIS
Simple functions:
use Convert::BinHex qw(binhex_crc macbinary_crc);
# Compute HQX7-style CRC for data, pumping in old CRC if desired:
$crc = binhex_crc($data, $crc);
# Compute the MacBinary-II-style CRC for the data:
$crc = macbinary_crc($data, $crc);
Hex to bin, low-level interface. Conversion is actually done via an object ("Convert::BinHex::Hex2Bin") which keeps internal conversion state:
# Create and use a "translator" object:
my $H2B = Convert::BinHex->hex2bin; # get a converter object
while (< STDIN >) {
print $STDOUT $H2B->next($_); # convert some more input
}
print $STDOUT $H2B->done; # no more input: finish up
Hex to bin, OO interface. The following operations must be done in the order shown!
# Read data in piecemeal:
$HQX = Convert::BinHex->open(FH=>*STDIN) || die "open: $!";
$HQX->read_header; # read header info
@data = $HQX->read_data; # read in all the data
@rsrc = $HQX->read_resource; # read in all the resource
Bin to hex, low-level interface. Conversion is actually done via an object ("Convert::BinHex::Bin2Hex") which keeps internal conversion state:
# Create and use a "translator" object:
my $B2H = Convert::BinHex->bin2hex; # get a converter object
while (< STDIN >) {
print $STDOUT $B2H->next($_); # convert some more input
}
print $STDOUT $B2H->done; # no more input: finish up
Bin to hex, file interface. Yes, you can convert to BinHex as well as from it!
# Create new, empty object:
my $HQX = Convert::BinHex->new;
# Set header attributes:
$HQX->filename("logo.gif");
$HQX->type("GIFA");
$HQX->creator("CNVS");
# Give it the data and resource forks (either can be absent):
$HQX->data(Path => "/path/to/data"); # here, data is on disk
$HQX->resource(Data => $resourcefork); # here, resource is in core
# Output as a BinHex stream, complete with leading comment:
$HQX->encode(*STDOUT);
PLANNED!!!! Bin to hex, "CAP" interface. Thanks to Ken Lunde for suggesting this.
# Create new, empty object from CAP tree:
my $HQX = Convert::BinHex->from_cap("/path/to/root/file");
$HQX->encode(*STDOUT);
BinHex is a format used by Macintosh for transporting Mac files safely through electronic mail, as short-lined, 7-bit, semi-compressed data streams. Ths module provides a means of converting those data streams back into into binary data.
Download (0.083MB)
Added: 2006-08-04 License: Perl Artistic License Price:
1234 downloads
pkcrack 1.2.2
pkcrack is an implementation of a known plaintext attack on password encrypted ZIP archives. more>>
pkcrack is an implementation of a known plaintext attack on password encrypted ZIP archives. For the attack to be successful you have to have at least 106 bytes of unencrypted compressed data from the given ZIP archive (the data must be compressed using the same compression method as the encrypted counterpart). Providing this, the sophisticated attack can be thousand times (or even more) faster than a brute force attack (considering non-trivial long password). The speed is affected by the amount of plaintext you transmit.
Enhancements:
- "Relf" has found and fixed a bug that caused pkcrack to fail under certain circumstances (roughly one out of 32 keys couldnt be found). Thanks!
- Two more options for pkcrack, also by "Relf" (Thanks again!): -a will stop stage2 immediately when a combination of key0, key1, key2 has been found -n disables the (new!) progress indicator
- corrected handling of data descriptor in extract-functions (Thanks to "Relf" and Andreas Lessig !)
- zipdecrypt now uses dynamic allocation for directory entries, which means the 200 files limit is gone
- The extract program now has a -v flag to print a verbose description of the ZIP file
- Considerable speedups in stage1
- The source distribution now includes an automated test suite
<<lessEnhancements:
- "Relf" has found and fixed a bug that caused pkcrack to fail under certain circumstances (roughly one out of 32 keys couldnt be found). Thanks!
- Two more options for pkcrack, also by "Relf" (Thanks again!): -a will stop stage2 immediately when a combination of key0, key1, key2 has been found -n disables the (new!) progress indicator
- corrected handling of data descriptor in extract-functions (Thanks to "Relf" and Andreas Lessig !)
- zipdecrypt now uses dynamic allocation for directory entries, which means the 200 files limit is gone
- The extract program now has a -v flag to print a verbose description of the ZIP file
- Considerable speedups in stage1
- The source distribution now includes an automated test suite
Download (0.17MB)
Added: 2006-07-06 License: GPL (GNU General Public License) Price:
1220 downloads
OutGuess 0.2
OutGuess is a universal tool that allows the insertion of hidden information into the redundant bits of data sources. more>>
OutGuess is a universal tool that allows the insertion of hidden information into the redundant bits of data sources.
The program relies on data specific handlers that will extract redundant bits and write them back after modification. In this version the PNM and JPEG image formats are supported. In the next paragraphs, images will be used as concrete example of data objects, though OutGuess can use any kind of data, as long as a handler is provided.
For JPEG images, OutGuess preserves statistics which are based on frequency counts. As a result, statistical tests based on frequency counts are unable to detect the presence of steganographic content. Before embedding data into an image, OutGuess can determine the maximum message size that can be hidden while still being able to maintain statistics based on frequency counts.
Enhancements:
- Use statistical corrections to defend against steganalysis.
- A lot of cleanup.
- Use all DCT coefficients for JPG now. This version is not any more compatible with the previous versions.
<<lessThe program relies on data specific handlers that will extract redundant bits and write them back after modification. In this version the PNM and JPEG image formats are supported. In the next paragraphs, images will be used as concrete example of data objects, though OutGuess can use any kind of data, as long as a handler is provided.
For JPEG images, OutGuess preserves statistics which are based on frequency counts. As a result, statistical tests based on frequency counts are unable to detect the presence of steganographic content. Before embedding data into an image, OutGuess can determine the maximum message size that can be hidden while still being able to maintain statistics based on frequency counts.
Enhancements:
- Use statistical corrections to defend against steganalysis.
- A lot of cleanup.
- Use all DCT coefficients for JPG now. This version is not any more compatible with the previous versions.
Download (0.45MB)
Added: 2006-07-14 License: GPL (GNU General Public License) Price:
1208 downloads
Secleted [ 0 ] software to compare
Copyright Notice:
Software piracy is theft, Using crack, password, serial numbers, registration codes, key generators is illegal and prevent future software development. The above extract data search only lists software in full, demo and trial versions for free download. Download links are directly from our mirror sites or publisher sites, torrent files or links from rapidshare.com, yousendit.com or megaupload.com are not allowed