documents and designs
Results 1 - 15 of about 4811
Document clustering 0.2
Document clustering project is a data mining suite to cluster a document set. This set of tools was implemented from a series of papers: "Clustering Web Pages Semantically using Combinatorial Topology", "Data mining using granular computing", and "A fast association rule algorithm based on bitmap and granular computing".
Enhancements:
- A bug in the hash table has been fixed.
Download (0.060MB)
Added: 2007-05-18 License: GPL (GNU General Public License) Price:
906 downloads
Document Archiver 1.0.1
Document Archiver allows automatic archiving of documents.
The Document Archiver is a standalone Java application which should be scheduled to run nightly to facilitate the automatic archiving of documents in the KnowledgeTree DMS.
The program checks whether any documents are due to be archived, based on the Date or Utilisation criteria set in the web application (e.g. archive after 2 months, or archive 15 days after the last download).
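The two kinds of criteria can be illustrated with a minimal Python sketch. This is not the Document Archiver's code (which is a Java application); the function name, the parameter names, and the treatment of "2 months" as 60 days are assumptions made for illustration only.

```python
from datetime import date, timedelta
from typing import Optional


def is_due_for_archiving(
    today: date,
    added_on: date,
    last_download: Optional[date],
    archive_after_days: Optional[int] = None,
    archive_after_idle_days: Optional[int] = None,
) -> bool:
    """Return True if either configured criterion says the document is due."""
    # Date criterion (assumed): a fixed period has passed since the document was added.
    if archive_after_days is not None:
        if today - added_on >= timedelta(days=archive_after_days):
            return True
    # Utilisation criterion (assumed): a fixed period has passed since the last download.
    if archive_after_idle_days is not None and last_download is not None:
        if today - last_download >= timedelta(days=archive_after_idle_days):
            return True
    return False


# "Archive after 2 months", approximated here as 60 days.
print(is_due_for_archiving(date(2007, 3, 10), date(2007, 1, 1), None, archive_after_days=60))
```

A nightly scheduled run would presumably apply a check of this kind to every document and archive those for which it succeeds.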
Download (0.54MB)
Added: 2007-02-07 License: Other/Proprietary License Price:
989 downloads
Borges Documents Management System 0.14.5
Borges is an open-source project aimed at XML-aware documentation projects which care about internationalisation, reusable contents, teamwork, etc. The system currently supports the DocBook DTD.
Main features:
- Easily write modular documentation in XML DocBook;
- Compile the source files into various formats, including PDF and HTML;
- Manage different versions of a single manual by easily defining derived versions based on conditional parts;
- Track the state of each module according to six predefined states (from "writing" to "final language proofreading"). Once a task is completed, the corresponding state is passed and the module switches to the next one;
- Each module is assigned a set of contributors (author, translators, proofreaders), each one responsible for one state of a module. Each contributor can easily review their assignments through web pages, and can receive e-mails with their current todo list;
- Track the work in progress, from the whole project (made of various manuals) to the most basic components (paragraphs), and their translations.
Enhancements:
- The module image compilation issue has been fixed.
- xmllint is now required.
Download (0.13MB)
Added: 2006-07-25 License: GPL (GNU General Public License) Price:
1187 downloads
NOAH Document Management System 3.1.3
NOAH is a Web-based document management system that is feature-rich, easy to use, and simple to install. Through a browser interface, users can easily create, update, lock, search, and manage document revisions for access by multiple users across many geographical sites.
NOAH includes many powerful features such as client scripting, hierarchical document access control, and email notification. Administrators can manage users and archives and monitor database activity through an extensive suite of tools.
Main features:
- no expiry date.
- maximum number of users: 5 plus one Super Admin user.
- maximum number of documents: 100 (unlimited number of versions per document).
- includes Content Search option.
- does NOT include Client Scripting.
- does NOT include Email Notification.
- does NOT include multiple databases.
- does NOT include Support; however, emails will be answered if time permits.
Enhancements:
- A user can now get NOAH to report on recently changed files for a given directory tree and integrate this report in a wiki or HTML page.
- A user can also associate a filename with a custom icon, link a DokuWiki page with NOAH's content search engine, link critical NOAH documents and directories to a wiki project page, and manage usernames and passwords smoothly from NOAH to DokuWiki.
Download (MB)
Added: 2007-07-11 License: Free To Use But Restricted Price:
839 downloads
Simple Document Management System 1.1.4
Simple Document Management System is a document repository system.
SDMS is an open-source system that allows you to store any document in a database via a bunch of webpages.
That was the short description; now comes the long one. SDMS uses PHP to provide you with a pretty interface to a MySQL server that allows you to store and retrieve documents and to share those documents between users.
In addition, the system uses ACLs (Access Control Lists) to grant access rights to documents on a per-user basis.
So what is all this good for? Well, for instance, it allows you to distribute project documentation on a need-to-know basis, whilst keeping a central repository of documents that is accessible to all team members and easy to manage.
Download (0.05MB)
Added: 2005-04-28 License: GPL (GNU General Public License) Price:
1643 downloads
Document Library 1.2b2
Document Library is a Web application for document management in larger organizations with a lot of documents.
Organizations deal with numerous documents, such as word processor documents and PDFs. These documents often reside on someone's computer and are not network accessible. Versions of documents are hard to track - the same document may be passed around using email in multiple versions over time. In large organizations it therefore becomes important to structure the flow of documents and present them in a common format. This is typically done using a document management system. Document Library is one such document management system.
Information in the Document Library can be accessed using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH), meaning that besides being open source, the Document Library is also a good example of an open data application. Because it is open data, the Document Library is easier to integrate with other applications, such as the Silva CMS or any other application capable of OAI-PMH harvesting.
Main features:
- Automatic conversion service: using OpenOffice, the Document Library can convert Word documents into PDFs and plain text, PDFs into plain text. The plain text version is important in that it allows for full-text indexing of document contents, and also makes documents more accessible to people with disabilities.
- Publication workflow: documents only become available for harvesting and download after a review process.
- Delegation of control: reviewers ("librarians") can be assigned to particular sections.
- Dynamic access: authors have automatic access to all the documents that list them as an author.
- Versions: multiple versions of the same document can coexist, one public and one under preparation.
- Email reminder functionality: users receive emails about the progress of the document through the workflow.
- OAI-PMH data provider: allows other systems to harvest document metadata using a standard protocol.
- Integration with Silva CMS (using OAI-PMH).
- Fast upload and download integration with Apache using Tramline.
- Easy overview screens for librarians.
- Smart file upload user interface: files need to be uploaded only once, even if the rest of the form needs to be amended.
- Document Library is built using the powerful Zope 3 application server platform.
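To give an idea of what harvesting through OAI-PMH looks like from the client side, here is a minimal Python sketch that builds a ListRecords request URL. The verb and the oai_dc metadata prefix come from the OAI-PMH protocol itself; the endpoint URL and the function name are hypothetical, and a real Document Library installation may expose a different path.

```python
from urllib.parse import urlencode


def build_list_records_url(base_url: str, metadata_prefix: str = "oai_dc") -> str:
    """Build an OAI-PMH ListRecords request URL for a repository."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return f"{base_url}?{query}"


# Hypothetical endpoint, used only for illustration.
url = build_list_records_url("http://example.org/doclib/oai")
print(url)
```

A harvester would fetch this URL over HTTP and parse the XML response to collect the metadata of each published document.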
Enhancements:
- Installation was made easier by using zc.buildout.
- Optional LDAP support was added.
- Filesystem storage was integrated with Tramline.
- The conversion provided by OooConv was improved.
- Table rendering was improved using zc.table.
- Zope was updated to 3.3.
Download (0.30MB)
Added: 2006-12-07 License: GPL (GNU General Public License) Price:
1055 downloads
Uber Project Document Management System 1.0
Uber Project Document Management System is a PHP/SQL-based project document management system.
Project document and management tracking system written in PHP using PostgreSQL to store user, project and document related data and MySQL to store the documents, with revision control.
How to:
How to set up the system:
Just untar everything in your web directory:
cd /var/www/html/
tar -xvjf ueber-1.0.tar.bz2
How to set up the DB:
- Create the ueberp user in PostgreSQL.
[ueberp@psql]$ psql -U admin -c "CREATE USER ueberp WITH PASSWORD 'ueberp' CREATEDB"
- Create a ueberp database in PostgreSQL.
[ueberp@psql]$ createdb -U ueberp ueberp
- Execute db-setup.psql to set up the PostgreSQL database.
[ueberp@psql]$ psql -U ueberp -d ueberp -f db-setup.psql
- Basic data for setting up the system is in db-data.psql.
[ueberp@psql]$ psql -U ueberp -d ueberp -f db-data.psql
- Create a database in MySQL (e.g. ueberp).
- Execute db-setup.mysql to set up the MySQL database.
- Execute db-data.mysql to insert the basic data.
Finally, edit ueberp/lib/config.inc and try to log into the system (user: admin, pass: admin).
Note: In order to have correct "Account Manager" and "Project Manager" entries in the projects, the acronym for the "Account Manager" must be AM and for the "Project Manager" PM.
Download (0.20MB)
Added: 2005-04-12 License: GPL (GNU General Public License) Price:
1658 downloads
XML::DOM::Document 1.44
XML::DOM::Document is an XML document node in XML::DOM.
XML::DOM::Document extends XML::DOM::Node.
It is the main root of the XML document structure as returned by XML::DOM::Parser::parse and XML::DOM::Parser::parsefile.
Since elements, text nodes, comments, processing instructions, etc. cannot exist outside the context of a Document, the Document interface also contains the factory methods needed to create these objects. The Node objects created have a getOwnerDocument method which associates them with the Document within whose context they were created.
METHODS
getDocumentElement
This is a convenience method that allows direct access to the child node that is the root Element of the document.
getDoctype
The Document Type Declaration (see DocumentType) associated with this document. For HTML documents as well as XML documents without a document type declaration this returns undef. The DOM Level 1 does not support editing the Document Type Declaration.
Not In DOM Spec: This implementation allows editing the doctype. See XML::DOM::ignoreReadOnly for details.
getImplementation
The DOMImplementation object that handles this document. A DOM application may use objects from multiple implementations.
createElement (tagName)
Creates an element of the type specified. Note that the instance returned implements the Element interface, so attributes can be specified directly on the returned object.
DOMExceptions:
INVALID_CHARACTER_ERR
Raised if the tagName does not conform to the XML spec.
createTextNode (data)
Creates a Text node given the specified string.
createComment (data)
Creates a Comment node given the specified string.
createCDATASection (data)
Creates a CDATASection node given the specified string.
createAttribute (name [, value [, specified ]])
Creates an Attr of the given name. Note that the Attr instance can then be set on an Element using the setAttribute method.
Not In DOM Spec: The DOM Spec does not allow passing the value or the specified property in this method. In this implementation they are optional.
Parameters:
value: The attribute's value. See Attr::setValue for details. If the value is not supplied, the specified property is set to 0.
specified: Whether the attribute value was specified or whether the default value was used. If not supplied, it's assumed to be 1.
DOMExceptions:
INVALID_CHARACTER_ERR
Raised if the name does not conform to the XML spec.
createProcessingInstruction (target, data)
Creates a ProcessingInstruction node given the specified name and data strings.
Parameters:
target: The target part of the processing instruction.
data: The data for the node.
DOMExceptions:
INVALID_CHARACTER_ERR
Raised if the target does not conform to the XML spec.
createDocumentFragment
Creates an empty DocumentFragment object.
createEntityReference (name)
Creates an EntityReference object.
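The factory-method pattern described above can be illustrated with a short Python sketch. It uses Python's standard xml.dom.minidom module, which follows the same W3C DOM naming, rather than XML::DOM itself, so it shows the idea and not the Perl module's exact behaviour; the element names and values are made up for the example.

```python
from xml.dom.minidom import getDOMImplementation

# Create an empty document whose root element is <library>.
impl = getDOMImplementation()
doc = impl.createDocument(None, "library", None)
root = doc.documentElement

# Nodes are created by the Document and only then attached to the tree.
book = doc.createElement("book")
book.setAttribute("id", "b1")
book.appendChild(doc.createTextNode("Example title"))
root.appendChild(doc.createComment("created through Document factory methods"))
root.appendChild(book)

# Every created node remembers the Document it was created in.
owner_matches = book.ownerDocument is doc
xml_text = root.toxml()
print(owner_matches)
print(xml_text)
```

In minidom the owner is exposed as the ownerDocument attribute; the XML::DOM counterpart named in the text is the getOwnerDocument method.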
Download (0.11MB)
Added: 2006-07-14 License: Perl Artistic License Price:
1200 downloads
Print OpenOffice 2 documents 1
Print OpenOffice 2 documents is a service menu that enables the user to right click on an OpenOffice 2 document and print it.
Download (MB)
Added: 2006-05-25 License: GPL (GNU General Public License) Price:
1247 downloads
Epiware Project and Document Management 4.8.0
Epiware is an AJAX-enabled Project and Document Management Web application.
Epiware Project and Document Management is an AJAX-enabled Web application that provides access to a complete set of file management functions, including document check-in, check-out, version control, approval, change notification, and access history.
Upload, Manage and Share documents online
Epiware enables organizations to easily share information and effectively collaborate on documents in a browser-based environment.
Bring together people, content, and process
Epiware enables the creation of virtual teams by providing an online workspace for users to collaborate and exchange information in a secure, protected setting. It is intuitive and easy to use, which allows users to concentrate on what they do best rather than on trying to master an application.
With Epiware, costs can be lowered, projects can be expedited, decisions can be made in a more efficient and timely manner, changes can be executed more quickly, and errors due to miscommunications can be reduced or eliminated.
Enhancements:
- Login issues have been fixed.
- There are document history updates.
- Delete to has been added to Tasking.
- LDAP bugs and other minor issues have been fixed.
Download (8.6MB)
Added: 2007-07-17 License: Free To Use But Restricted Price:
831 downloads
Structured Document Validator 0.7.9
Structured Document Validator project implements a generalized method for validating both the structure and content of structured documents.
Any data format that can be deterministically divided into tags and data is classed as a structured document. This definition applies to a wide array of data formats, including XML, Java properties files, and delimited value files.
The application performs validations based on user-defined Structured Document Definitions (SDDs). It provides an environment for validation, SDD development, and document editing.
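The idea of checking both structure and content against a user-defined definition can be sketched in Python as follows. This is only an illustration for a properties-style document: the definition format (a mapping from tag to a regular expression), the function names, and the sample tags are all hypothetical and do not reflect the project's actual SDD format.

```python
import re
from typing import Dict, List

# Hypothetical definition: each tag (key) maps to a regular expression
# that the associated data (value) must match in full.
DEFINITION: Dict[str, str] = {
    "host": r"[A-Za-z0-9.-]+",
    "port": r"\d{1,5}",
}


def parse_properties(text: str) -> Dict[str, str]:
    """Split simple 'key=value' lines into tags and data, skipping comments."""
    pairs = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        pairs[key.strip()] = value.strip()
    return pairs


def validate(text: str, definition: Dict[str, str]) -> List[str]:
    """Return a list of problems; an empty list means the document is valid."""
    pairs = parse_properties(text)
    problems = []
    # Structure check: every defined tag is present and no unknown tag appears.
    for tag in definition:
        if tag not in pairs:
            problems.append(f"missing tag: {tag}")
    for tag in pairs:
        if tag not in definition:
            problems.append(f"unexpected tag: {tag}")
    # Content check: each value matches the pattern for its tag.
    for tag, pattern in definition.items():
        if tag in pairs and not re.fullmatch(pattern, pairs[tag]):
            problems.append(f"invalid data for {tag}: {pairs[tag]!r}")
    return problems


print(validate("host=example.org\nport=8080", DEFINITION))
```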
Download (0.59MB)
Added: 2006-01-06 License: LGPL (GNU Lesser General Public License) Price:
1387 downloads
KinoSearch::Document::Doc 0.15
KinoSearch::Document::Doc is a Doc object that is akin to a row in a database, in that it is made up of several fields, each of which has a value. Doc objects are only created via factory methods of other classes.
SYNOPSIS
my $doc = $invindexer->new_doc;
$doc->set_value( title => $title );
$doc->set_value( bodytext => $bodytext );
$invindexer->add($doc);
METHODS
set_value get_value
$doc->set_value( title => $title_text );
my $text = $doc->get_value('title');
set_value and get_value are used to modify and access the values of the fields within a Doc object.
set_boost get_boost
$doc->set_boost(2.5);
boost is a scoring multiplier. Setting boost to something other than 1 causes a document to score better or worse against a given query relative to other documents.
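The effect of a boost on ranking can be shown with a toy Python sketch. It assumes the boost is applied as a plain multiplication of a document's raw score, which matches the phrase "scoring multiplier" but is not taken from KinoSearch's source; the function name and the sample scores are hypothetical.

```python
from typing import Dict, List, Tuple


def rank(raw_scores: Dict[str, float], boosts: Dict[str, float]) -> List[Tuple[str, float]]:
    """Multiply each raw score by the document's boost (default 1) and sort descending."""
    boosted = {
        doc_id: score * boosts.get(doc_id, 1.0)
        for doc_id, score in raw_scores.items()
    }
    return sorted(boosted.items(), key=lambda item: item[1], reverse=True)


# Document "b" scores lower on its own, but a boost of 2.5 lifts it above "a".
print(rank({"a": 1.0, "b": 0.5}, {"b": 2.5}))
```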
Download (0.22MB)
Added: 2007-03-09 License: Perl Artistic License Price:
959 downloads
Auto Save Document 1.2.0.28
Auto Save Document is an extension used to store the currently shown document in a separate directory automatically.
Autosave is a small extension that can be used to store the currently shown document in a separate directory automatically when browsing a certain site. You can configure the sites whose pages are stored in the cache by adding them to a list. The documents are stored as HTML text only (no embedded objects such as pictures).
The files are reused cyclically, e.g. if you set the cache size to 100, Autosave will store the document contents to X:/MyCache/AutoSave_0.html ... AutoSave_100.html and start again with AutoSave_0.html when the 101st page is browsed. A small icon in the status bar changes its colour each time to indicate that a page was written.
If you click this small icon, the current page will be dumped, regardless of whether the site is in the autosave list or not.
NOTE: this extension is really, really simple! Only plain text is saved, with no external .css or .js files, so the result may look ugly! The one and only intention of this extension is to save the text for external parsing or similar! Look for Scrapbook etc. in case you want to store more than the plain text!
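The cyclic reuse of cache files amounts to a modulo calculation, sketched below in Python. The function name is hypothetical, and the sketch assumes slots numbered from 0 to cache size minus 1, so that the 101st page overwrites slot 0; the upper file name given in the description suggests the extension's actual numbering may differ by one.

```python
def cache_file_name(page_number: int, cache_size: int, directory: str = "X:/MyCache") -> str:
    """Return the file a page is written to; page_number counts from 1."""
    if cache_size <= 0:
        raise ValueError("cache_size must be positive")
    # Wrap around once every slot has been used.
    slot = (page_number - 1) % cache_size
    return f"{directory}/AutoSave_{slot}.html"


for n in (1, 100, 101):
    print(n, cache_file_name(n, 100))
```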
Download (0.009MB)
Added: 2007-04-03 License: MPL (Mozilla Public License) Price:
973 downloads
Xapian and Omega 1.0.2
Xapian is an Open Source Probabilistic Information Retrieval library, released under the GPL. Xapian is written in C++, with bindings to allow use from other languages (Perl, Java, Python, PHP, and TCL are currently supported; Guile and C# are being worked on).
Xapian is designed to be a highly adaptable toolkit to allow developers to easily add advanced indexing and search facilities to their own applications.
If you're after a packaged search engine for your website, you should take a look at Omega, which is an application we supply built upon Xapian. But unlike most other website search solutions, Xapian's versatility allows you to extend Omega to meet your needs as they grow.
Main features:
- Free Software/Open Source - licensed under the GPL.
- Highly portable - runs on many Linux distributions, Mac OS X, many other Unix platforms, and Microsoft Windows.
- Written in C++. Perl bindings are available in the module Search::Xapian on CPAN. Java JNI bindings are included in the xapian-bindings module. We also support SWIG which can generate bindings for 13 languages. At present those for Python, PHP4, and TCL are working. Guile and C# are being worked on.
- Ranked probabilistic search - important words get more weight than unimportant words, so the most relevant documents are more likely to come near the top of the results list.
- Relevance feedback - given one or more documents, Xapian can suggest the most relevant index terms to expand a query, suggest related documents, categorise documents, etc.
- Phrase and proximity searching - users can search for words occurring in an exact phrase or within a specified number of words, either in a specified order, or in any order.
- Full range of structured boolean search operators ("stock NOT market", etc). The results of the boolean search are ranked by the probabilistic weights. Boolean filters can also be applied to restrict a probabilistic search.
- Supports stemming of search terms (e.g. a search for "football" would match documents which mention "footballs" or "footballer"). This helps to find relevant documents which might otherwise be missed. Stemmers are currently included for Danish, Dutch, English, Finnish, French, German, Italian, Norwegian, Portuguese, Russian, Spanish, and Swedish.
- Supports database files > 2GB - essential for scaling to large document collections.
- Platform independent data formats - you can build a database on one machine and search it on another.
- Allows simultaneous update and searching. New documents become searchable right away.
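The ranking, stemming, and boolean-filter features listed above can be illustrated with a small self-contained sketch. This is not Xapian's API or its actual weighting scheme: the functions `stem`, `build_index`, and `search` are hypothetical names invented for illustration, the stemmer is a toy suffix stripper, and the score is a plain term-frequency times inverse-document-frequency sum standing in for a probabilistic weight.

```python
import math
from collections import Counter


def stem(word):
    """Toy suffix stripper (hypothetical); real stemmers are far more careful."""
    word = word.lower()
    for suffix in ("ers", "er", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def tokenize(text):
    """Split on whitespace and stem every word."""
    return [stem(w) for w in text.split()]


def build_index(docs):
    """Map each document id to a Counter of its stemmed terms."""
    return {doc_id: Counter(tokenize(text)) for doc_id, text in docs.items()}


def search(index, query, exclude=()):
    """Rank documents by summed tf * idf; drop documents containing an excluded term."""
    n_docs = len(index)
    excluded = {stem(t) for t in exclude}
    scores = {}
    for doc_id, terms in index.items():
        if any(t in terms for t in excluded):
            continue  # boolean NOT filter
        score = 0.0
        for term in tokenize(query):
            # Document frequency: rarer terms get a larger weight.
            df = sum(1 for t in index.values() if term in t)
            if df and terms[term]:
                score += terms[term] * math.log(1 + n_docs / df)
        if score > 0:
            scores[doc_id] = score
    return sorted(scores, key=scores.get, reverse=True)


docs = {
    "a": "stock market report",
    "b": "stock footballs",
    "c": "footballer signs for football club",
    "d": "stock prices",
}
index = build_index(docs)
print(search(index, "football"))  # → ['c', 'b']
```

In this sketch a query for "football" matches documents mentioning "footballs" and "footballer" because all three reduce to the same stem, and `search(index, "stock", exclude=["market"])` mirrors the "stock NOT market" example by ranking only the documents that survive the filter.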
As well as the library, we supply a number of small example programs and a larger application, an indexer and CGI-based search front-end called Omega:
- The indexer supplied can index HTML, PHP, PDF, PostScript, and plain text. Adding support for indexing other formats is easy where conversion filters are available (e.g. Microsoft Word). This indexer works using the filing system, but we also provide a script to allow the htdig web crawler to be hooked in, allowing remote sites to be searched using Omega.
- You can also index data from any SQL or other RDBMS supported by the Perl DBI module. That includes MySQL, PostgreSQL, SQLite, Sybase, MS SQL, LDAP, and ODBC.
- CGI search front-end supplied with highly customisable appearance. This can also be customised to output results in XML or CSV, which is useful if you are dynamically generating pages (e.g. with PHP or mod_perl) and just want raw search results which you can process in your own page layout code.
Enhancements:
- This release adds support for spelling correction and synonym expansion.
- Some Btree tables are now optional, so opening a database is slightly quicker.
- The Xapian::NumberValueRangeProcessor class has been reworked to fix a design flaw.
- A few bugs have been fixed.
- Performance was improved in several cases.
Download (2.7MB)
Added: 2007-07-05 License: GPL (GNU General Public License) Price:
843 downloads
OpenOffice::OODoc::Document 2.027
OpenOffice::OODoc::Document is a top level component for content and layout processing.
SYNOPSIS
use OpenOffice::OODoc;

# get an OOo file handler
my $oofile = ooFile("myfile.odt");
# connect a content-focused document interface
my $content = ooDocument
    (
    file   => $oofile,
    member => 'content'
    );
# connect a style-focused document interface
my $styles = ooDocument
    (
    file   => $oofile,
    member => 'styles'
    );
# process any content and style element
$content->appendParagraph
    (
    text  => "An additional paragraph",
    style => "BlueStyle"
    );
$styles->createStyle
    (
    "BlueStyle",
    parent     => 'Text body',
    family     => 'paragraph',
    properties =>
        {
        area       => 'text',
        'fo:color' => rgb2oo('blue')
        }
    );
# commit the changes using the file handler
$oofile->save;
This module defines the top level Document class, which is a connector allowing any kind of content and presentation processing. It inherits from OODoc::XPath, OODoc::Text, OODoc::Styles and OODoc::Image.
The usual instruction to get access to any member of a document, with the exception of the metadata (meta.xml), should be something like:
my $doc = ooDocument([options]);
This constructor, if successful, returns an object that can be used (according to its "member" option) to process styles, images and text.
This module is designed simply to create objects which include all the functionality of OODoc::Text, OODoc::Image, OODoc::Styles and OODoc::XPath (which should not be called directly by applications).
For example
my $styles = ooDocument(file => "source.odt", member => "styles");
is generally better than
my $styles = ooStyles(file => "source.odt");
While OODoc::Document inherits all the methods and properties of these classes, its detailed documentation is essentially provided in the following manual pages:
OpenOffice::OODoc::Text -> text content
OpenOffice::OODoc::Styles -> style & layout
OpenOffice::OODoc::Image -> graphic objects
OpenOffice::OODoc::XPath -> common features & low-level API
For example, the appendParagraph() and createStyle() methods used in the synopsis above are respectively described in OpenOffice::OODoc::Text and OpenOffice::OODoc::Styles.
The present manual page only describes those methods (there are very few) which combine layout and content processing.
Download (0.21MB)
Added: 2006-08-29 License: Perl Artistic License Price:
1152 downloads
Copyright Notice:
Software piracy is theft. Using cracks, passwords, serial numbers, registration codes, or key generators is illegal and prevents future software development. The above documents and designs search lists only software in full, demo, and trial versions for free download. Download links come directly from our mirror sites or publisher sites; torrent files and links from rapidshare.com, yousendit.com, or megaupload.com are not allowed.