Terrier 1.1.0
Sponsored Links
Terrier 1.1.0 Ranking & Summary
File size:
MB
Platform:
Any Platform
License:
MPL (Mozilla Public License)
Price:
Downloads:
861
Date added:
2007-06-18
Publisher:
University of Glasgow
Terrier 1.1.0 description
Terrier project is a probabilistic Java toolkit for building search engines.
Terrier is software for the rapid development of Web, intranet, and desktop search engines.
More generally, it is a modular platform for building large-scale information retrieval applications, providing indexing and probabilistic retrieval functionalities.
It comes with a desktop search application.
Terrier has various cutting-edge features including parameter-free probabilistic retrieval approaches (such as Divergence from Randomness models), automatic query expansion/re-formulation methodologies, and efficient data compression techniques.
Terrier comes with a powerful proof-of-concept Desktop search application [Screenshots], and full TREC capabilities including the ability to index, query and evaluate the standard TREC collections, such as AP, WSJ, WT10G, .GOV and .GOV2.
Terrier is written in Java [Requirements] and has been successfully used for adhoc retrieval, Web search and cross-language retrieval, in a centralised or distributed setting.
Currently, it is also being used for running various applications.
Main features:
- Open Source (Mozilla Public Licence)
- Written in cross-platform Java
- Highly compressed disk data structures.
- Handling large-scale document collections.
- Direct file for efficient query expansion.
- Modular and open indexing and querying APIs.
- Testbed for indexing and retrieval from standard TREC test collections.
- Interactive querying application.
- Desktop search application for searching various types of documents.
- Input/output of gamma, unary and binary encoded integers for compressing streams or random access files.
- Standard evaluation of TREC ad-hoc and known-item search retrieval results.
- Indexing of tagged document collections, as well as documents of various formats, such as HTML, PDF, or Microsoft Word, Excel and Powerpoint files.
- Indexing of field information.
- Indexing of position information on a word, or a block level.
- Support for classic retrieval models, such as tf-idf, BM25 and Ponte-Croft language model, and Rocchios query expansion.
- Provides a number of Divergence From Randomness (DFR) document ranking models.
- Provides a number of parameter-free DFR term weighting models for automatic query expansion.
- Advanced query language that supports AND/NOT operators, phrase and proximity search.
- Flexible processing of terms through a pipeline of components, such as stop-words removers and stemmers.
Enhancements:
- This is a major update with improvements in indexing and retrieval functionalities, including faster indexing and retrieval, and new retrieval models (including models from Divergence from Randomness and Language modeling).
- It has support for much larger collections of documents, including TREC GOV2 collections (25M documents), merging of indices, and multi-lingual and non-English collections of documents.
- The documentation has been vastly improved.
Terrier is software for the rapid development of Web, intranet, and desktop search engines.
More generally, it is a modular platform for building large-scale information retrieval applications, providing indexing and probabilistic retrieval functionalities.
It comes with a desktop search application.
Terrier has various cutting-edge features including parameter-free probabilistic retrieval approaches (such as Divergence from Randomness models), automatic query expansion/re-formulation methodologies, and efficient data compression techniques.
Terrier comes with a powerful proof-of-concept Desktop search application [Screenshots], and full TREC capabilities including the ability to index, query and evaluate the standard TREC collections, such as AP, WSJ, WT10G, .GOV and .GOV2.
Terrier is written in Java [Requirements] and has been successfully used for adhoc retrieval, Web search and cross-language retrieval, in a centralised or distributed setting.
Currently, it is also being used for running various applications.
Main features:
- Open Source (Mozilla Public Licence)
- Written in cross-platform Java
- Highly compressed disk data structures.
- Handling large-scale document collections.
- Direct file for efficient query expansion.
- Modular and open indexing and querying APIs.
- Testbed for indexing and retrieval from standard TREC test collections.
- Interactive querying application.
- Desktop search application for searching various types of documents.
- Input/output of gamma, unary and binary encoded integers for compressing streams or random access files.
- Standard evaluation of TREC ad-hoc and known-item search retrieval results.
- Indexing of tagged document collections, as well as documents of various formats, such as HTML, PDF, or Microsoft Word, Excel and Powerpoint files.
- Indexing of field information.
- Indexing of position information on a word, or a block level.
- Support for classic retrieval models, such as tf-idf, BM25 and Ponte-Croft language model, and Rocchios query expansion.
- Provides a number of Divergence From Randomness (DFR) document ranking models.
- Provides a number of parameter-free DFR term weighting models for automatic query expansion.
- Advanced query language that supports AND/NOT operators, phrase and proximity search.
- Flexible processing of terms through a pipeline of components, such as stop-words removers and stemmers.
Enhancements:
- This is a major update with improvements in indexing and retrieval functionalities, including faster indexing and retrieval, and new retrieval models (including models from Divergence from Randomness and Language modeling).
- It has support for much larger collections of documents, including TREC GOV2 collections (25M documents), merging of indices, and multi-lingual and non-English collections of documents.
- The documentation has been vastly improved.
Terrier 1.1.0 Screenshot
Terrier 1.1.0 Keywords
TREC
Terrier 1.1.0
for building
Java toolkit
Desktop Search
terrier
retrieval
search
indexing
query
probabilistic
Terrier 1.1.0
Information Management
Miscellaneous
Bookmark Terrier 1.1.0
Terrier 1.1.0 Copyright
WareSeeker periodically updates pricing and software information of Terrier 1.1.0 full version from the publisher, so some information may be slightly out-of-date. You should confirm all information before relying on it. Software piracy is theft, Using crack, password, serial numbers, registration codes, key generators is illegal and prevent future development of Terrier 1.1.0 Edition. Download links are directly from our publisher sites, torrent files or links from rapidshare.com, yousendit.com or megaupload.com are not allowed
Featured Software
Want to place your software product here?
Please contact us for consideration.
Contact WareSeeker.com
Related Information
Version History
Related Software
bengsaver is a fascinating screensaver for KDE. Free Download
Reliby is a Firefox extension that can reload all your Live Bookmarks aka RSS feeds with a push of a button. Free Download
DateSite v1.0 is a multi-browser compatible perl script. Free Download
crapsearch project enhances privacy by polling targeted Web search engines with random search terms. Free Download
KMameRun project is a KDE frontend for M.A.M.E. Free Download
SMA consists of a small collection of programs that perform different tests for association between genotypes. Free Download
ScriptSite v1.0 is a multi-browser compatible perl script. Free Download
Search::Lemur is a Perl class to query a Lemur server, and parse the results. Free Download
Latest Software
Popular Software
Favourite Software