Free web spider software for Linux

web spider

Results 1 - 15 of about 4011
Silver Spider

Silver Spider is a KDE theme in the style of Fluxbox.

It is designed for a 1024x768 screen resolution. The wallpaper is from Deviant, with copyright by dayvid.net. The icon style is "Amaranth" by Michael Doches (Doches@linuxcult.com).

Window decoration, bar and button style: Keramik

You have to make some changes in KDE Design Control to get the same look as in the preview. Look at all the screenshots to see what you have to change (KDE Control, Look&Feel, Symbols, Extra..). That is why the theme depends on KDE 3.3, and also because of the transparency effect for the control bar and, if you like it, for the menus.

Download (0.47MB)
Added: 2007-04-19 License: GPL (GNU General Public License) Price:
924 downloads
The Spider 1.71

The Spider project is a complete object-oriented environment for machine learning in Matlab.
Aside from ease of use for base learning algorithms, algorithms can be plugged together and can be compared with, for example, model selection, statistical tests, and visual plots.
This gives all the power of objects (reusability, the ability to plug together, sharing of code), but also all the power of Matlab for machine learning research.
Enhancements:
- A WEKA interface was added, supporting decision trees.
- Additional code, documentation, and demos were added.
Download (4.2MB)
Added: 2006-07-27 License: GPL (GNU General Public License) Price:
1185 downloads
Web Site Robot 2.4

Web Site Robot is an easy to use Web site builder with a blog tool, shopping cart, and online photo album. It is free and does not require additional software, HTML, programming, or a hosting company.
It enables secure online payments with credit cards. There is virtually unlimited storage space for HTML, documents, blogs, photos, music, videos, or any multimedia content.
Enhancements:
- The Robot is now able to discover and inform UPnP routers about its presence and the need to have a NAT port available for its Web server.
- In most SOHO environments with UPnP enabled routers, manual configuration is not needed.
- The operator just installs, registers the Robot and the Web site is online effortlessly.
Download (26MB)
Added: 2005-09-13 License: Freeware Price:
3097 downloads
Web Business Pro 4a

Web Business Pro is a Web application designed for everyday business activities. The project includes support for customer lists, inventory, receipt printing, and accounting. Fully featured search functions and filters make it easy to find what you need quickly.

Download (1.3MB)
Added: 2007-07-22 License: LGPL (GNU Lesser General Public License) Price:
828 downloads
TkSpider 0.97

TkSpider is a solitaire game played with the equivalent of two packs of cards.

The last release includes several small new features, including a redo feature.

Some code and the card images are taken from Jeff Godrey's "Once in a Lifetime".

It is based on the Spider card game shipped with Windows XP.

Download (MB)
Added: 2006-11-22 License: GPL (GNU General Public License) Price:
1069 downloads
Web Secretary 1.9.0

Web Secretary is web page monitoring software. However, it goes beyond the normal functionality offered by such software. It detects changes based on content analysis, making sure that it's not just the HTML that changed, but the actual content.
You can tell it what to ignore in the page (hit counters and such), and it can mail you the document with the changes highlighted or load the highlighted page in a browser.
Web Secretary is actually a suite of two Perl scripts called websec and webdiff. websec retrieves web pages and emails them to you based on a URL list that you provide. webdiff compares two web pages (current and archive) and creates a new page based on the current page but with all the differences highlighted in a predefined color.
For example you can look at the Web Secretary page as it was monitored:
- Old Page - The original page before the change.
- Modified Page - The page after the change.
- Highlighted Page - The page as WebSec sends, highlighted.
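The compare-and-highlight step that webdiff performs can be illustrated with a minimal Python sketch. This is not the webdiff code (which is Perl); the function name highlight_changes, the line-by-line comparison, and the default color are assumptions made for illustration only.

```python
import difflib
import html


def highlight_changes(archive_text: str, current_text: str, color: str = "yellow") -> str:
    """Return the current page, line by line, with new or changed lines highlighted.

    Hypothetical helper: compares plain text line by line, which is far
    simpler than a real HTML-aware comparison.
    """
    archive_lines = archive_text.splitlines()
    current_lines = current_text.splitlines()
    matcher = difflib.SequenceMatcher(a=archive_lines, b=current_lines, autojunk=False)

    out = []
    for tag, _a1, _a2, b1, b2 in matcher.get_opcodes():
        # "delete" opcodes cover no lines of the current page, so they add nothing.
        for line in current_lines[b1:b2]:
            escaped = html.escape(line)
            if tag == "equal":
                out.append(escaped)
            else:
                # "replace" or "insert": this line is new in the current page.
                out.append(f'<span style="background:{color}">{escaped}</span>')
    return "\n".join(out)


if __name__ == "__main__":
    old = "Hits: 10\nNews: old\nFooter"
    new = "Hits: 10\nNews: new\nFooter"
    print(highlight_changes(old, new))
```

The output is based on the current page only, as described above; lines that exist only in the archived copy are not shown.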
Personally, I run Web Secretary from crontab to monitor a large number of web pages. When the highlighted pages are delivered to me, I use procmail to sort them and file them into another folder. Sometimes, when I am busy, I do not have time to access the web for a few days. However, with Web Secretary, I can always access the "archive" that it has created for me at my leisure.
Download (0.031MB)
Added: 2006-01-20 License: GPL (GNU General Public License) Price:
1375 downloads
Web of Life 1.0

Web of Life is a life simulation game.

An isometric game done with C++ and SDL.

In this game you control some beings in an ecosystem; the goal is to make sure that only the beings you control survive.

Your beings survive by fighting other beings, reproducing to mount a massive attack, and eating.
But sometimes you will have to eat some of your own living beings so that others can stay alive. Well, that's life.
Download (MB)
Added: 2007-03-01 License: GPL (GNU General Public License) Price:
971 downloads
Sws Web Server 0.1.7

Sws Web Server is a fast, secure, and simple Web server.
Installation:
./make
./make install
Usage:
First create the /etc/sws directory, copy the sws.conf file into /etc/sws, and edit sws.conf.
To start the server:
Red Hat and Debian users:
/etc/init.d/sws_web_server.init start
Slackware users:
/etc/rc.d/init.d/rc.sws start
Enhancements:
- A file transfer problem has been solved.
Download (0.005MB)
Added: 2005-07-13 License: GPL (GNU General Public License) Price:
1569 downloads
WWW::Spyder 0.19

WWW::Spyder is a Perl module that acts like a web spider.

A web spider that returns plain text, HTML, and other information per page crawled and can determine what pages to get and parse based on supplied terms compared to the text in links as well as page content.

METHODS

$spyder->new()

Construct a new spyder object. Without at least the seed() set, or go_to_seed() turned on, the spyder isn't ready to crawl.

$spyder = WWW::Spyder->new(shift||die"Gimme a URL!\n");
# ...or...
$spyder = WWW::Spyder->new( %options );

Options include: sleep_base (in seconds), exit_on (hash of methods and settings). Examples below.

$spyder->seed($url)

Adds a URL (or URLs) to the top of the queues for crawling. If the spyder is constructed with a single scalar argument, that is considered the seed_url.

$spyder->bell([bool])

This will print a bell ("\a") to STDERR on every successfully crawled page. It might seem annoying, but it is an excellent way to know your spyder is behaving and working. A true value turns it on. Right now it can't be turned off.

$spyder->spyder_time([bool])

Returns raw seconds since Spyder was created if given a boolean value, otherwise returns "D day(s) HH::MM:SS."

$spyder->terms([list of terms to match])

The more terms, the more the spyder is going to grasp at. If you give a straight list of strings, they will be turned into very open regexes. E.g.: "king" would match "sulking" and "kinglet" but not "King." It is case sensitive right now. If you want more specific matching or different behavior, pass your own regexes instead of strings.

$spyder->terms( qr/\bkings?\b/i, qr/\bqueens?\b/i );

terms() is only settable once right now; then it's a done deal.
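The difference between an open string term and a stricter word-boundary regex can be illustrated with a short Python sketch. It only mirrors the matching behavior described above; it is not the module's code, and treating a plain string as an unanchored, case-sensitive search is an assumption about how the "very open regexes" behave.

```python
import re

# Plain string term, assumed to act as an unanchored, case-sensitive pattern.
open_term = re.compile(re.escape("king"))

# Stricter pattern: whole word, optional plural, case-insensitive.
strict_term = re.compile(r"\bkings?\b", re.IGNORECASE)


def matches(pattern: re.Pattern, text: str) -> bool:
    """Return True if the pattern is found anywhere in the text."""
    return pattern.search(text) is not None


if __name__ == "__main__":
    for word in ("sulking", "kinglet", "King.", "kings"):
        print(f"{word!r}: open={matches(open_term, word)} strict={matches(strict_term, word)}")
```

The open term accepts "sulking" and "kinglet" but misses "King." because of case, while the word-boundary pattern does the opposite.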

$spyder->spyder_data()

A comma-formatted number of kilobytes retrieved so far. Don't give it an argument. It's a set/get routine.

$spyder->slept()

Returns the total number of seconds the spyder has slept while running. Useful for getting accurate page/time counts (spyder performance) discounting the added courtesy naps.

$spyder->UA->...

The LWP::UserAgent. You can reset them, I do believe, by calling methods on the UA. Here are the initialized values you might want to tweak (see LWP::UserAgent for more information):

$spyder->UA->timeout(30);
$spyder->UA->max_size(250_000);
$spyder->UA->agent('Mozilla/5.0');

Changing the agent name can hurt your spyder because some servers won't return content unless it's requested by a "browser" they recognize.

You should probably add your email with from() as well.

$spyder->UA->from('bluefintuna@fish.net');

$spyder->cookie_file([local_file])

They live in $ENV{HOME}/spyderCookie by default but you can set your own file if you prefer or want to save different cookie files for different spyders.

Download (0.017MB)
Added: 2007-08-02 License: Perl Artistic License Price:
814 downloads
Web Sharing 0.0.3

Web Sharing is a small tray applet for sharing files on the web.

Publish anything on the Internet or your local area network at home, office or school from a folder on your hard disk with just a few clicks of a mouse!

Download (1.9MB)
Added: 2005-07-28 License: GPL (GNU General Public License) Price:
1553 downloads
Web 1.35

The Web Perl module contains a set of useful routines for many web-related purposes.

This module was primarily made for UNIX/Linux systems. Parts of it cannot be used on other systems. For example, the file-locking procedures require a system that supports symlinks. If you use the module on a system where symlinks cannot be used, fatal errors may happen.

ABSTRACT

This Perl module provides several useful routines for many purposes, such as generating web pages, processing CGI scripts, and working with XML data files and network connections. It also uses its own variants of routines that were first introduced in the well-known libraries CGI.pm and cgi-lib.pl.

INSTALLATION

If you don't have sufficient privileges to install web.pm in the Perl library directory, you can put web.pm into some convenient spot, such as your home directory or cgi-bin itself, and prefix all Perl scripts that call it with something along the lines of the following preamble:

use lib '/home/myname/perl/lib';
use web;

NLock

This routine sets a file lock across NFS boundaries. The commonly used Perl routine flock() fails in this situation, so this routine is a usable alternative for bigger file systems. It uses the functions link() and unlink() to mark a file as locked. In addition, it gives the locked file a counter: a file that has been locked for more than $web::MAX_LOCKTIME seconds will be freed by the next process that calls NLock() on it. A calling process gets either 0 or 1 as a return value, where 1 means the file was locked successfully. 0 is returned only if the process waits for more than $web::MAX_WAITLOCK seconds or if symlink() fails.

Example 1:

$filename = "data.txt";
NLock($filename);
open(f1,"$filename");
# do something
close f1;
NUnlock($filename);

Example 2:

#!/local/bin/perl5
use web;

$stat= &NLock("jump.pl");
print "Lock: stat= $stat\n";
$stat= &NLock("jump.pl");
print "Lock this file again: stat= $stat\n";
sleep 8;
$stat= &NLock("jump.pl");
print "Lock this file again: stat= $stat\n";
$stat= &NUnlock("jump.pl");
print "Unlock: stat= $stat\n";
exit;
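
The general idea of locking with link() and unlink() can be sketched in Python as follows. This is an illustration of the technique only, not the web.pm implementation: the names nlock and nunlock, the ".lock" suffix, the temporary-file step, and the default timeouts are all assumptions.

```python
import os
import time


def nlock(path: str, max_locktime: float = 20.0, max_waitlock: float = 5.0) -> int:
    """Try to lock `path`; return 1 on success, 0 after waiting too long.

    Assumed scheme: a unique temporary file is hard-linked to
    `path + ".lock"`. Creating a hard link fails if the target name
    already exists, which makes it usable as a lock primitive.
    """
    lock_path = path + ".lock"
    tmp_path = f"{lock_path}.{os.getpid()}.{time.monotonic_ns()}"
    with open(tmp_path, "w") as tmp:
        tmp.write(str(os.getpid()))
    deadline = time.monotonic() + max_waitlock
    try:
        while True:
            try:
                os.link(tmp_path, lock_path)
                return 1
            except FileExistsError:
                pass  # somebody else holds the lock
            try:
                # The lock shares its inode with the temp file that created
                # it, so its mtime approximates the time the lock was taken.
                if time.time() - os.stat(lock_path).st_mtime > max_locktime:
                    os.unlink(lock_path)  # free a stale lock, then retry
                    continue
            except FileNotFoundError:
                continue  # lock vanished in the meantime, retry at once
            if time.monotonic() >= deadline:
                return 0
            time.sleep(0.1)
    finally:
        os.unlink(tmp_path)


def nunlock(path: str) -> int:
    """Release the lock on `path`; return 1 if a lock was removed, else 0."""
    try:
        os.unlink(path + ".lock")
        return 1
    except FileNotFoundError:
        return 0
```

As in Example 2, a second call to nlock() on an already locked file returns 0 once the wait limit has passed, unless the existing lock is older than the lock-time limit, in which case it is freed and the call succeeds.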

Download (0.017MB)
Added: 2007-07-21 License: Perl Artistic License Price:
864 downloads
WWW::Orkut::Spider 0.03

WWW::Orkut::Spider is a Perl extension for spidering the orkut community.

SYNOPSIS

use WWW::Orkut::Spider;
my $orkut = WWW::Orkut::Spider->new;
$orkut->login($user,$pass);
$orkut->get_hisfriends($uid);
print $orkut->get_xml_profile($uid);

WWW::Orkut::Spider uses WWW::Mechanize to scrape orkut.com.
Output is a simple XML format containing friends, communities, and profiles for a given Orkut UID.

- Access to orkut.com via WWW::Mechanize
- Collects UIDs
- Fetches Profiles/Communities/Friends for a given UID
- Output via simple xml format

Download (0.005MB)
Added: 2006-09-13 License: Perl Artistic License Price:
682 downloads
Abyss Web Server X1 2.6

Abyss Web Server X1 is a free and compact Web server. It supports SSL, compression, CGI/FastCGI, ISAPI, XSSI, URL rewriting, bandwidth throttling, anti-leeching, anti-hacking, and features a remote web management interface.
Download (0.64MB)
Added: 2009-04-04 License: Freeware Price: $0
73043 downloads
 
Other version of Abyss Web Server X1
Abyss Web Server X1 2.0.6
Abyss Web Server is a compact web server available for Windows, MacOS X ... interface that makes its configuration as easy as browsing a web site. Main features: - Virtual
Download (0.25MB)
Added: 2005-09-07 License: GPL (GNU General Public License)
1512 downloads
Mobile Web Proxy 1.0

Mobile Web Proxy is a (CGI) tool to make more Web pages available from some mobile devices (cell phone Web browsers, etc.).
When I began using my cell phone's mobile web browser, one of the things I realized was that there were a lot of web pages that my cell phone's web browser simply wouldn't let me see. My shiny new Motorola Razr from T-Mobile, for many pages, would display an error message like "413: Page cannot be displayed." It happened annoyingly often; browsing the web from my cell phone was a lot like trying to read a book or a magazine with large chunks simply ripped out. And when I tried troubleshooting, I might have missed something, but it seemed like a big problem without an easily available solution.
So I tried to provide my own solution. I made a proxy that would present web pages for the mobile web so that they would display in a way that would work for my cell phone's web browser. Technologies include paging, tag cleaning, and optional caching to improve performance, but without getting into technical details, this means that I can access some pages on the web that I couldn't access earlier.
Enhancements:
- A bug that caused the script to spuriously report 404 errors was fixed.
- A smaller default page size was set so that tag-rich pages would be rendered in a way more likely to be displayable on mobile Web browsers.
Download (0.017MB)
Added: 2007-08-04 License: Perl Artistic License Price:
911 downloads
X personal web server 0.1

X personal web server is a personal Web server that sits in your desktop notification area (as a tray icon).

X personal web server is a (much) simpler cousin of KPF, but doesn't require kicker and uses the GTK 2 library for its minimal user interface.

Download (0.059MB)
Added: 2006-09-05 License: GPL (GNU General Public License) Price:
1148 downloads