Main > Free Download Search >

Free data extractor downloads software for linux

data extractor downloads

Sponsored Links
Sponsored Links
Secleted [ 0 ] software to compare
Results 1 - 15 of about 6515
Top Downloads 2.0

Top Downloads 2.0


Top Downloads provides a script that counts how many times a file has been downloaded. more>>
Top Downloads provides a script that counts how many times a file has been downloaded.

It keeps statistics of your downloads and can generate a TOP10 of your downloaded files. The administration function of this script allows you to delete, rename or change count numbers for your downloads.

It can check a directory that you specify for new files and add it to a database. Count numbers and TOP10 can be included into a HTML document.

Most web servers require SSI documents to have the extension .shtml or .shtm

For Apache web servers:
- AddType text/html .shtml
- AddHandler server-parsed .shtml

<<less
Download (0.035MB)
Added: 2007-04-27 License: GPL (GNU General Public License) Price:
916 downloads
Flat File Extractor 0.2.2

Flat File Extractor 0.2.2


Flat File Extractor can be used for reading different flat file structures and printing them in different formats. more>>
Flat File Extractor can be used for reading different flat file structures and printing them in different formats. ffe is a command line tool developed in GNU/Linux environment and it is distributed under GNU General Public License 2 or later.
Main areas of use are:
- Extracting particular fields or records from a flat file
- Converting data from one format to an other, e.g. from CSV to fixed length
- Verifying a flat file structure
- Testing tool for flat file development
- Displaying flat file content in human readable form
Main features:
- Command-line tool
- Reads standard input and writes to standard output as default
- One input file can contain several types of records (lines)
- Fields in a flat file can be fixed length or separated
- Input file structure and output definitions are independent, meaning one output format can be used with several input files
- Input file structure and output format are freely configurable, they are not predefined
- Output can be formatted e.g. as: fixed length, separated, tokenized, XML, SQL,...
- ffe tries to guess the input format, user needs not to give it as a parameter
Enhancements:
- Configuration keyword const has been added
<<less
Download (0.23MB)
Added: 2007-05-30 License: GPL (GNU General Public License) Price:
882 downloads
XML Extractor 0.3.0

XML Extractor 0.3.0


XML Extractor is a set of tools for transforming XML-like markup into entities or well-formed XML files. more>>
XML Extractor is a set of tools for transforming XML-like markup into entities or well-formed XML files.

The sourcecode XML metadata extraction tools are intended to be used for extracting and transforming XML-like markup embedded in source code comments into syntactically correct external entities or well-formed XML files.

This can be used for JavaDoc-like code annotation, providing structured comments, or even embedding metadata used by the build process or configuration management tools.

INSTALLATION

For info and options about installing this tool, type:
# python setup.py --help

USAGE

To see usage info for this tool, type:
# python xlf_to_wfx_cli.py --help
<<less
Download (0.020MB)
Added: 2006-10-04 License: LGPL (GNU Lesser General Public License) Price:
1116 downloads
ccextractor 0.30

ccextractor 0.30


ccextractor is a fast closed captions extractor for MPEG files. more>>
ccextractor project is a fast closed captions extractor for MPEG files.
ccextractor is mostly a mildly optimized C port of McPoodles excellent but painfully slow Perl script SCC_RIP. It lets you rip the raw closed captions (read: subtitles) data from a number of sources, such as DVD or replay TV.
As an added bonus compared to the original SCC_RIP, ccextractor can extract subtitles from the HDTV transport streams that are becoming more common.
At this point ccextractor extracts the line 21 captions (which must legally be present for a number of years until the transition to digital is complete). Note that in most .ts you can find, there will be subtitle data for both analog (EIA-608) decoders and digital (EIA-708). AFAIK there are not
freely available EIA-708 rippers.
Anyway, since line 21 captions will be available for some time, we have time to build a decent 708 ripper.
Basic Usage
For details on CC, please go to McPoodles page:
http://www.geocities.com/mcpoodle43/SCC_TOOLS/DOCS/SCC_TOOLS.HTML
You will need his tools to use ccextrators output.
The basic idea is that you get the raw closed caption dump from ccextractor.
Then you need other tools (which vary depending on what you want to do) to continue processing.
To get a transcript from a .ts file in .srt (I assume this will be the most common use) do this:
ccextractor -12 input_file
-12 means "extract both subtitle tracks" (actually technical names are fields but tracks is easier to understand). 1 is almost always English. 2 is Spanish in HBO (at least in the few samples Ive seen) but could be anything. Just extract both of them and check.
Example: cctractor -12 house315.ts
ccextractor will create two files, called house315_1.bin and _2.
Then use McPoodles RAW2SCC to create a temporary SCC file (means Scenerist, which is originally the native format for some program, its not important here).
raw2scc house315_1.bin
This creates house315_1.scc
From this .scc file, you can get the final .srt by using McPoodles CCASDI:
ccasdi -s house315_1.srt
Which looks like this (just 3 random lines shown).
514
00:24:07,400 --> 00:24:09,300
Theyve got another trial
going on at Duke.
515
00:24:09,367 --> 00:24:12,567
15% extend their lives
beyond five years.
516
00:24:12,634 --> 00:24:13,701
If youre positive
for protein PHF--
Enhancements:
- This release adds support for DVR-MS files.
- It improves the CC decoder.
- There are several bugfixes, a major speed boost (20%-40%), improved timing for non-TS files, improved format autodetection, and other minor improvements.
<<less
Download (0.033MB)
Added: 2007-05-24 License: GPL (GNU General Public License) Price:
893 downloads
Unix configuration extractor 4

Unix configuration extractor 4


The Unix configuration extractor is a script more>> The Unix configuration extractor is a script that runs on the server to extract necessary security configurations. This script doesnt make any changes to the server other than creating the dump files<<less
Download (19KB)
Added: 2009-03-31 License: Freeware Price: Free
206 downloads
Retriever Download Manager 1.05

Retriever Download Manager 1.05


Retriever allows you to download files from the internet easier and with more control than from your web browser. more>>
Retriever Download Manager program was written completely in the Java programming language and will run on any operating system with a Java Virtual Machine installed (JVM 1.4 or higher).
Retriever allows you to download files from the internet easier and with more control than from your web browser, with it you can resume interrupted downloads at any time thats convenient or even schedule downloads to occur when youre away from your computer. Even if one server is too busy to accomodate you Retriever will automatically switch to another to continue your download.
You can even temporarily pause your download and resume it instantly a few minutes later.
No matter how many times your download is interrupted or stopped Retriever gives you a successful download.
Retriever is highly reliable and customizable.
Downloads can be input manually into the program or captured from the clipboard.
You can also drag and drop links onto the download list to capture them. If a selection of text is dropped, it will be parsed for links, you can use this behaviour to add multiple files at once.
Main features:
- Resumable FTP and HTTP downloads.
- Support for BitTorrent transfers (starting with version 0.905).
- FTP and HTTP server authentication.
- supports HTTP, FTP and SOCKS proxies with authentication.
- Ability to add file download mirrors.
- Scheduled downloading.
- Pausable downloading.
- Execute external application on download start and/or completion.
- User customizable icons. See the file "Icon-Set.txt"in the icons/default.zip archive for details.
Enhancements:
- Removed "resume" button, paused downloads are now resumed with the "start download" button.
- Referer information can now be edited or input manually.
- Fixed problem with FTP servers that did not use the default port.
- Fixed issue where status information would be incorrect.
<<less
Download (0.44MB)
Added: 2007-01-03 License: Freeware Price:
1029 downloads
Data.FormValidator 0.04

Data.FormValidator 0.04


Data.FormValidators aim is to bring all the benefits of the perl module Data::FormValidator over to javascript. more>>
Data.FormValidators aim is to bring all the benefits of the perl module Data::FormValidator over to javascript, using the same input profiles (they can be dumped into javascript objects using the perl module Data::JavaScript.
Data.FormValidator library lets you define profiles which declare the required and optional fields and any constraints they might have.
The results are provided as an object which makes it easy to handle missing and invalid results, return error messages about which constraints failed, or process the resulting valid data.
IMPORTANT NOTE: JavaScript form validation is NOT a replacement for data validation in your backend scripts. This is the primary reason this module was written... so that it would be easy to share the same validation profile for both the frontend (via Data.FormValidator.js) and backend (via Data::FormValidator.pm).
Enhancements:
- A problem where some functions were not terminated by a semi-colon, so JavaScript compactors would end up creating broken code was fixed.
<<less
Download (0.047MB)
Added: 2006-01-20 License: GPL (GNU General Public License) Price:
1372 downloads
Data::ICal::Entry::Todo 0.11

Data::ICal::Entry::Todo 0.11


Data::ICal::Entry::Todo is a Perl module that represents a to-do entry in an iCalendar file. more>>
Data::ICal::Entry::Todo is a Perl module that represents a to-do entry in an iCalendar file.

SYNOPSIS

my $vtodo = Data::ICal::Entry::Todo->new();
$vtodo->add_properties(
summary => "go to sleep",
status => INCOMPLETE,
# Dat*e*::ICal is not a typo here
dtstart => Date::ICal->new( epoch => time )->ical,
);

$calendar->add_entry($vtodo);

$vtodo->add_entry($alarm);

A Data::ICal::Entry::Todo object represents a single to-do entry in an iCalendar file. (Note that the iCalendar RFC refers to entries as "components".) It is a subclass of Data::ICal::Entry and accepts all of its methods.

<<less
Download (0.10MB)
Added: 2007-01-13 License: Perl Artistic License Price:
1014 downloads
Data::Generate 0.01

Data::Generate 0.01


Data::Generate allows you to create various types of synthetic data by parsing regex-like data creation rules. more>>
Data::Generate allows you to create various types of synthetic data by parsing "regex-like" data creation rules.

This module generates data by parsing given text statements (data creation rules). These statements are flexible and powerful regex-like way to control the production of synthetic data. Think about a program that instead of selecting data which matches a regex filter expression, produces it. For example, from the rule [a-c], the generator would produce the array a,b,c. The module works as following:

Specify data creation rules.
my $generator= Data::Generate::parse(VC(24) [0-9][2-3]);
At this step first you define one kind of output datatype (for ex. VC(24)= "output is a string with max length 24") and then with the rest of the expression define what it should look like. If parsing is successful a Data Generator object is instantiated.

Get data
my $Data= $generator->get_unique_data(10);
To really get the data, users must call the get_unique_data method by indicating the desired number of output values. The generator returns the values contained in an array reference. Please remark that output format is fixed according to the data type.

<<less
Download (0.025MB)
Added: 2007-03-31 License: Perl Artistic License Price:
937 downloads
Obscure-Extractor-GTK 0.2

Obscure-Extractor-GTK 0.2


Obscure-Extractor-GTK can extract data from simple and unusual archives as used by games. more>>
Obscure-Extractor-GTK can extract data from simple and unusual archives as used by games, e.g. Neverwinter Nights, Homeworld 2, BloodRayne.

Mostly a framework where I can easily add new modules when I want to have a look at the inner workings of games, though the Delphi version has some more advanced stuff like support for old InstallShield archives that would need to be ported.

<<less
Download (0.012MB)
Added: 2006-07-24 License: GPL (GNU General Public License) Price:
1202 downloads
Data::Phrasebook::Loader::XML 0.12

Data::Phrasebook::Loader::XML 0.12


Data::Phrasebook::Loader::XML Perl module can abstract your phrases with XML. more>>
Data::Phrasebook::Loader::XML Perl module can abstract your phrases with XML.

SYNOPSIS

use Data::Phrasebook;

my $q = Data::Phrasebook->new(
class => Fnerk,
loader => XML,
file => phrases.xml,
dict => Dictionary, # optional
);

OR

my $q = Data::Phrasebook->new(
class => Fnerk,
loader => XML,
file => {
file => phrases.xml,
ignore_whitespace => 1,
}
);

# simple keyword to phrase mapping
my $phrase = $q->fetch($keyword);

# keyword to phrase mapping with parameters
$q->delimiters( qr{ [% s* (w+) s* %] }x );
my $phrase = $q->fetch($keyword,{this => that});

<<less
Download (0.017MB)
Added: 2007-07-24 License: Perl Artistic License Price:
822 downloads
Erwin Data Structures 2.1.58633

Erwin Data Structures 2.1.58633


Erwin Data Structures is a library that is meant to be the ultimate data structure library for mixed usage of C and C++. more>>
Erwin Data Structures is a library that is meant to be the ultimate data structure library for mixed usage of C and C++.

Arbitrary key and value types are implemented by template files that dont use C++ templates, but are instantiated by a Perl script.

This way, mixed usage in C and C++ is possible. However, a C++ interface is generated to support the advantages of the C++ language. No templates, no void*.
Erwin contains a number of tools, too, all of them written in Perl. The following list shows the data structures and tools, together with some typical examples.
<<less
Download (0.67MB)
Added: 2007-02-09 License: Freely Distributable Price:
997 downloads
Data::Faker::StreetAddress 0.07

Data::Faker::StreetAddress 0.07


Data::Faker::StreetAddress is a Data::Faker plugin. more>>
Data::Faker::StreetAddress is a Data::Faker plugin.

DATA PROVIDERS

us_zip_code

Return a random zip or zip+4 zip code in the US zip code format. Note that this is not necessarily a valid zip code, just a 5 or 9 digit number in the correct format.

us_state

Return a random US state name.

us_state_abbr

Return a random US state abbreviation. (Includes US Territories and AE, AA, AP military designations.)

From the USPS list at http://www.usps.com/ncsc/lookups/usps_abbreviations.html

street_suffix

Return a random street suffix (Drive, Street, Road, etc.)

From the USPS list at http://www.usps.com/ncsc/lookups/usps_abbreviations.html

street_name

Return a fake street name.

street_address

Return a fake street address.

secondary_unit_designator

Return a random secondary unit designator, with a range if needed (secondary unit designators are things like apartment number, building number, suite, penthouse, etc that differentiate different units with a common address.)

secondary_unit_number

Return a random secondary unit number, for the secondary unit designators that take ranges.

<<less
Download (0.020MB)
Added: 2006-10-25 License: Perl Artistic License Price:
1100 downloads
Data::ICal::Entry::Journal 0.12

Data::ICal::Entry::Journal 0.12


Data::ICal::Entry::Journal is a Perl module that represents a journal entry in an iCalendar file. more>>
Data::ICal::Entry::Journal is a Perl module that represents a journal entry in an iCalendar file.

SYNOPSIS

my $vjournal = Data::ICal::Entry::Journal->new();
$vjournal->add_properties(
summary => "Minutes of my party",
description => "I cried because I wanted to.",
# Dat*e*::ICal is not a typo here
dtstart => Date::ICal->new( epoch => time )->ical,
);

$calendar->add_entry($vjournal);

A Data::ICal::Entry::Journal object represents a single journal entry in an iCalendar file. (Note that the iCalendar RFC refers to entries as "components".) It is a subclass of Data::ICal::Entry and accepts all of its methods.

<<less
Download (0.10MB)
Added: 2007-01-17 License: Perl Artistic License Price:
1012 downloads
Data::FormValidator::Constraints 4.40

Data::FormValidator::Constraints 4.40


Data::FormValidator::Constraints is a Perl module with basic sets of constraints on input profile. more>>
Data::FormValidator::Constraints is a Perl module with basic sets of constraints on input profile.

SYNOPSIS

use Data::FormValidator::Constraints qw(:all);
In an Data::FormValidator profile:
constraint_methods => {
email => email(),
fax => american_phone(),
phone => american_phone(),
state => state(),
},

These are the builtin constraints that can be specified by name in the input profiles.
Be sure to check out the SEE ALSO section for even more pre-packaged constraints you can use.

<<less
Download (0.086MB)
Added: 2006-10-04 License: Perl Artistic License Price:
1115 downloads
Secleted [ 0 ] software to compare
  • Page: 1 of 5
  • 1
  • 2
  • 3
  • 4
  • 5