Main > Free Download Search >

Free how to extract data software for linux

how to extract data

Sponsored Links
Sponsored Links
Secleted [ 0 ] software to compare
Results 1 - 15 of about 5351
MDF audio extracter 0.1

MDF audio extracter 0.1


MDF audio extracter project is a tool to extract audio data from MDF/MDS audio cd images to wav or raw files. more>>
MDF audio extracter project is a tool to extract audio data from MDF/MDS audio cd images to wav or raw files. Tracks will be extracted to the current directory and named track_[num].wav or track_[num].raw. Output can also be redirected to stdout to allow the audio data to be fed to an encoder or player directly.
You need both a .mds and .mdf file to be able to extract tracks. The .mds file contains the information about the tracks, while the .mdf file contains the actual data. They usually should have the same name (except for the extension ofcourse). This is case sensitive, so for example FileName is not the same as filename.
Usage: mdfextract [options] file.mds
Options:
-i output mds info instead of extracting
-r extract raw track data
-s output to stdout instead of file
-t < num > extract single track (num > 0)
-q quiet, no output
Version restrictions:
- Probably does not work on big endian archs.
<<less
Download (0.010MB)
Added: 2007-03-27 License: GPL (GNU General Public License) Price:
949 downloads
wavextract 1.0.0

wavextract 1.0.0


wavextract is a program for extracting embedded audio data from JPEG images. more>>
wavextract is a program for extracting embedded audio data from JPEG images. wavextract project is useful if you have a digital camera that can record audio notes and embed them in photos (e.g. HP, Kodak, Fujifilm, Canon, etc.)

Wavextract is written in Python and is tested on Linux (but it should probably
work also on other operating systems).

You must have Python 2.4 (maybe it will work also with Python 2.3, but I didnt
test it) and Python Imaging Library (PIL) installed.

<<less
Download (0.008MB)
Added: 2006-06-01 License: GPL (GNU General Public License) Price:
1241 downloads
libextractor 0.5.18a

libextractor 0.5.18a


libextractor is a library used to extract meta-data from files of arbitrary type. more>>
libextractor is a library that is used to extract meta-data from files of arbitrary type. It is designed to use helper-libraries to perform the actual extraction, and to be trivially extendable by linking against external extractors for additional file types. libextractor is part of the GNU project. Our official GNU website can be found at http://www.gnu.org/software/libextractor/. libextractor can be downloaded from this site or the GNU mirrors.
The goal is to provide developers of file-sharing networks or WWW-indexing bots with a universal library to obtain simple keywords to match against queries. libextractor contains a shell-command "extract" that, similar to the well-known "file" command, can extract meta-data from a file an print the results to stdout.
Currently, libextractor supports the following formats: HTML, PDF, PS, OLE2 (DOC, XLS, PPT), OpenOffice (sxw), StarOffice (sdw), DVI, MAN, MP3 (ID3v1 and ID3v2), OGG, WAV, JPEG, GIF, PNG, TIFF, DEB, RPM, TAR(.GZ), ZIP, ELF, REAL, RIFF (AVI), MPEG, QT and ASF.
Also, various additional MIME types are detected.
libextractor is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
Enhancements:
- This release fixes various build problems and a crash with recent versions of libgsf.
vAn incomplete manual was added.
<<less
Download (7.5MB)
Added: 2007-07-05 License: (FDL) GNU Free Documentation License Price:
842 downloads
Extract archive 1.11

Extract archive 1.11


Extract archive is a service menu that allows you to extract any archive file that ark is capable to read + ACE files. more>>
Extract archive is a service menu that allows you to extract any archive file that ark is capable to read + ACE files.
You can chose to extract here, or in a subdirectory. In that case, it asks you the name of the subdirectory where you want to extract, and suggests you the archive name by default.
To install, simply extract, and move the 2 files in ~/.kde/share/apps/konqueror/servicemenus
Its highly recommended to download the latest version of unace on www.winace.com and to run this command line in the unace directory location : sudo cp unace /usr/bin/unace
You should desactivate ark integration into kde by going into the configuration menu of ark.
If you want translation to be added, post me comments.
Enhancements:
- thanks to shirka, extract archive path is now "test" for an "test.tar.bz2" file
- Name that appears in the menu is now "Extract archive" as suggested by dovidhalevi to differentiate with ark
<<less
Download (MB)
Added: 2007-05-16 License: GPL (GNU General Public License) Price:
896 downloads
The layer extract plug-in

The layer extract plug-in


The layer extract plug-in is a Gimp plug-in that creates a new image/film by periodically extracting layers from another image. more>>
The layer extract plug-in is a Gimp plug-in that creates a new image/film by periodically extracting layers from another image.

It takes one interval (number of sequent layers) every period. The number of periods can be limited.

"< Image >/Filters/Animation/Layer Extract"

<<less
Download (0.014MB)
Added: 2006-09-14 License: GPL (GNU General Public License) Price:
1156 downloads
Obscure-Extractor-GTK 0.2

Obscure-Extractor-GTK 0.2


Obscure-Extractor-GTK can extract data from simple and unusual archives as used by games. more>>
Obscure-Extractor-GTK can extract data from simple and unusual archives as used by games, e.g. Neverwinter Nights, Homeworld 2, BloodRayne.

Mostly a framework where I can easily add new modules when I want to have a look at the inner workings of games, though the Delphi version has some more advanced stuff like support for old InstallShield archives that would need to be ported.

<<less
Download (0.012MB)
Added: 2006-07-24 License: GPL (GNU General Public License) Price:
1202 downloads
Test::Data 1.20

Test::Data 1.20


Test::Data is a Perl module to test functions for particular variable types. more>>
Test::Data is a Perl module to test functions for particular variable types.

SYNOPSIS

use Test::Data qw(Scalar Array Hash Function);

Test::Data provides utility functions to check properties and values of data and variables.

Functions

Plug-in modules define functions for each data type. See the appropriate module.

How it works

The Test::Data module simply emports functions from Test::Data::* modules. Each module defines a self-contained function, and puts that function name into @EXPORT. Test::Data defines its own import function, but that does not matter to the plug-in modules.

If you want to write a plug-in module, follow the example of one that already exists. Name the module Test::Data::Foo, where you replace Foo with the right name. Test::Data should automatically find it.

<<less
Download (0.008MB)
Added: 2007-05-03 License: Perl Artistic License Price:
904 downloads
Locale::Maketext::Extract 0.62

Locale::Maketext::Extract 0.62


Locale::Maketext::Extract is a Perl module used to extract translatable strings from source. more>>
Locale::Maketext::Extract is a Perl module used to extract translatable strings from source.

SYNOPSIS

my $Ext = Locale::Maketext::Extract->new;
$Ext->read_po(messages.po);
$Ext->extract_file($_) for ;

# Set $entries_are_in_gettext_format if the .pl files above use
# loc(%1) instead of loc([_1])
$Ext->compile($entries_are_in_gettext_format);

$Ext->write_po(messages.po);

This module can extract translatable strings from files, and write them back to PO files. It can also parse existing PO files and merge their contents with newly extracted strings.

A command-line utility, xgettext.pl, is installed with this module as well.

Following formats of input files are supported:

Perl source files
Valid localization function names are: translate, maketext, gettext, loc, x, _ and __.

HTML::Mason
Strings inside ... and ... are extracted.

Template Toolkit
Strings inside [%|l%]...[%END%] or [%|loc%]...[%END%] are extracted.

Text::Template
Sentences between STARTxxx and ENDxxx are extracted individually.

Generic Template
Strings inside {{...}} are extracted.

<<less
Download (0.082MB)
Added: 2007-02-13 License: MIT/X Consortium License Price:
983 downloads
Text::Scraper 0.02

Text::Scraper 0.02


Text::Scraper contains structured data from (un)structured text. more>>
Text::Scraper contains structured data from (un)structured text.

SYNOPSIS

use Text::Scraper;

use LWP::Simple;
use Data::Dumper;

#
# 1. Get our template and source text
#
my $tmpl = Text::Scraper->slurp(*DATA);
my $src = get(http://search.cpan.org/recent) || die $!;

#
# 2. Extract data from source
#
my $obj = Text::Scraper->new(tmpl => $tmpl);
my $data = $obj->scrape($src);

#
# 3. Do something really neat...(left as excercise)
#
print "Newest Submission: ", $data->[0]{submissions}[0]{name}, "nn";
print "Scraper model:n", Dumper($obj), "nn";
print "Parsed model:n", Dumper($data) , "nn";

__DATA__

< div class=path>< center>< table>< tr>
< ?tmpl stuff pre_nav ?>
< td class=datecell>< span>< big>< b> < ?tmpl var date_string ?> < /b>< /big>< /span>< /td>
< ?tmpl stuff post_nav ?>
< /tr>< /table>< /center>< /div>

< ul>
< ?tmpl loop submissions ?>
< li>< a href="< ?tmpl var link ?>">< ?tmpl var name ?>< /a>
< ?tmpl if has_description ?>
< small> -- < ?tmpl var description ?>< /small>
< ?tmpl end has_description ?>
< /li>
< ?tmpl end submissions ?>
< /ul>

ABSTRACT

Text::Scraper provides a fully functional base-class to quickly develop Screen-Scrapers and other text extraction tools. Programmatically generated text such as dynamic webpages are trivially reversed engineered.

Using templates, the programmer is freed from staring at fragile, heavily escaped regular expressions, mapping capture groups to named variables or wrestling with the DOM and badly formed HTML. In addition, extracted data can be hierarchical, which is beyond the capabilities of vanilla regular expressions.

Text::Scrapers functionality overlaps some existing CPAN modules - Template::Extract and WWW::Scraper.
Text::Scraper is much more lightweight than either and has a more general application domain than the latter. It has no dependencies on other frameworks, modules or design-decisions. On average, Text::Scraper benchmarks around 250% faster than Template::Extract - and uses significantly less memory.

Unlike both existing modules, Text::Scraper generalizes its functionality to allow the programmer to refine template capture groups beyond (.*?), fully redefine the template syntax and introduce new template constructs bound to custom classes.

<<less
Download (0.045MB)
Added: 2007-08-22 License: Perl Artistic License Price:
796 downloads
Locale::Maketext::Extract::Run 0.62

Locale::Maketext::Extract::Run 0.62


Locale::Maketext::Extract::Run is a Perl module interface to xgettext.pl. more>>
Locale::Maketext::Extract::Run is a Perl module interface to xgettext.pl.

SYNOPSIS

use Locale::Maketext::Extract::Run xgettext;
xgettext(@ARGV);

<<less
Download (0.082MB)
Added: 2007-02-13 License: Perl Artistic License Price:
987 downloads
CPAN::Mini::Extract 1.16

CPAN::Mini::Extract 1.16


CPAN::Mini::Extract is a Perl module that can create CPAN::Mini mirrors with the archives extracted. more>>
CPAN::Mini::Extract is a Perl module that can create CPAN::Mini mirrors with the archives extracted.

SYNOPSIS

# Create a CPAN extractor
my $cpan = CPAN::Mini::Extract->new(
remote => http://mirrors.kernel.org/cpan/,
local => /home/adam/.minicpan,
trace => 1,
extract => /home/adam/.cpanextracted,
extract_filter => sub { /.pm$/ and ! /b(inc|t)b/ },
extract_check => 1,
);

# Run the minicpan process
my $changes = $cpan->run;

CPAN::Mini::Extract provides a base for implementing systems that download "all" of CPAN, extract the dists and then process the files within.
It provides the same syncronisation functionality as CPAN::Mini except that it also maintains a parallel directory tree that contains a directory located at an identical path to each archive file, with a controllable subset of the files in the archive extracted below.

How does it work

CPAN::Mini::Extract starts with a CPAN::Mini local mirror, which it will optionally update before each run. Once the CPAN::Mini directory is current, it will scan both directory trees, extracting any new archives and removing any extracted archives no longer in the minicpan mirror.

<<less
Download (0.026MB)
Added: 2007-07-25 License: Perl Artistic License Price:
821 downloads
ADS Dexter 0.40

ADS Dexter 0.40


ADS Dexter is a utility for extracting data from scanned graphs. more>>
The following is an excerpt from a poster presented at the American Astronomical Societys 2000 Summer meeting in Rochester, NY.

ADS roughly 1,000,000 scanned pages contain numerous diagrams and figures for which the original data sets are lost or inaccessible. Having scans for the figures invites digitizing the data points to recover at least a part of these data.

Performing this digitization automatically is still beyond the capabilities of current OCR systems, but the computer can ease this process for a human.

This was the starting point for Dexter, a Java applet that runs in the users browsers and provides an interface for selecting the part of the page that is of interest. On that selection, coordinate axes, points and error bars can be marked and, of course, corrected. [...]

In the future, we plan to implement some recognition algorithms that would, e.g., trace a line for the user or automatically search for markers.
<<less
Download (0.079MB)
Added: 2005-05-20 License: GPL (GNU General Public License) Price:
1620 downloads
Google::Adwords::Data 0.6.0

Google::Adwords::Data 0.6.0


Google::Adwords::Data is base class for the Data modules. more>>
Google::Adwords::Data is base class for the Data modules.

This module is not supposed to be used directly. Use the child data modules.

<<less
Download (0.041MB)
Added: 2006-11-28 License: Perl Artistic License Price:
1060 downloads
THC-ManipulateData 1.3

THC-ManipulateData 1.3


THC-ManipulateData can search data on a harddisk/partition/file. more>>
THC-ManipulateData can search data on a harddisk/partition/file, extract the part you are interested in, and write it back after you modified it.
Useful to find and modify really all unencrypted Logfiles on a system. Does everything in RAW mode, and hence does not tamper a/m/ctimes.
It comes with 4 tools:
Syntax of search_data: ./search_data [-i] [-d] blockdevice searchstring
-i - the only parameter which is optional. This does the
search case insensitive.
-d - dump the found occasions in hex
blockdevice - a blockdevice you want to search for data. It need
not to be a blockdevice, it can be anything, but normaly
you use it on these.
searchstring - a string you want to search for
The blockdevice is searched for the occurance of searchstring, which are printed with location when found.
Example: ./search_data -i /dev/hda3 "connect from 10.0.0.1"
Output looks like:
found at 234600: connect from 10.0.0.1/unresolved (UNKNOWN)
Syntax of read_data: ./read_data blockdevice start_address no_of_bytes
blockdevice - a blockdevice you want to get your data from. It need not to be a blockdevice, it can be anything, but normaly you use it on these.
start_address - from which offset of the blockdevice you want to extract data from
no_of_bytes - how many bytes of data starting at the start_address you want to extract in a file.
The output filename is always START_ADDRESS.NO_OF_BYTES
Example: ./read_data /dev/hda3 234653 1024
writes 1024 bytes of data from /dev/hda3 starting from offset 234653 to the file "234653.1024"
Syntax of write_data: ./write_data blockdevice filename
blockdevice - a blockdevice you want to write your data to. It need not to be a blockdevice, it can be anything, but normaly you use it on these.
filename - the data you want to write to the blockdevice. For error protection, the location where it is put it gathered from the filename - as you can see above from read_data. If you modified the data extracted with read_data into the file, it may not have a different size than defined in the filename! The data in filename is written to the blockdevice
Example: ./write_data /dev/hda3 234653.1024
writes 1024 bytes of data to /dev/hda3 starting at offset 234653 with the
data read from the file "234653.1024"
Syntax of replace_data: ./replace_data [-i] blockdevice searchstring replacestring
-i - the only parameter which is optional. This does the search case insensitive.
blockdevice - a blockdevice you want to search for data. It need not to be a blockdevice, it can be anything, but normaly you use it on these.
searchstring - a string you want to search for
replacestring - the string you want to replace the found entries with
The blockdevice is searched for the occurance of searchstring, and is then replaced.
Example: ./replace_data -i /dev/hda3 "connect from 1.0.0.1" "Remap table failure "
Output looks like:
found at 234600 - replaced
Enhancements:
- fixed a bug in read/write_data, seeks over 2gb now succeed
- added -d for hexdump display of occasions found in search_data
<<less
Download (0.007MB)
Added: 2006-03-08 License: GPL (GNU General Public License) Price:
1325 downloads
WWW::Myspace::Data 0.13

WWW::Myspace::Data 0.13


WWW::Myspace::Data is a WWW::Myspace database interaction. more>>
WWW::Myspace::Data is a WWW::Myspace database interaction.

SYNOPSIS

This module is the database interface for the WWW::Myspace modules. It imports methods into the callers namespace which allow the caller to bypass the loader object by calling the methods directly. This module is intended to be used as a back end for the Myspace modules, but it can also be called directly from a script if you need direct database access.

my %db = (
dsn => dbi:mysql:database_name,
user => username,
password => password,
);

# create a new object
my $data = WWW::Myspace::Data->new( $myspace, { db => %db } );

# set up a database connection
my $loader = $data->loader();

# initialize the database with Myspace login info
my $account_id = $data->set_account( $username, $password );

# now do something useful...
my $update = $data->update_friend( $friend_id );

<<less
Download (0.016MB)
Added: 2007-07-26 License: Perl Artistic License Price:
824 downloads
Secleted [ 0 ] software to compare
  • Page: 1 of 5
  • 1
  • 2
  • 3
  • 4
  • 5