OrcaTec LLC Partners with rPath to Launch V 2.0 Of its Information Retrieval Toolkit as a Software Appliance

OrcaTec LLC announced the release of Version 2.0 of the OrcaTec Information Retrieval Toolkit. The Toolkit will be distributed as an rPath-based software appliance, to make it simple to install and maintain. This software appliance provides an integrated collection of information analysis and management services, including concept search, near-duplicate clustering, language identification, and an interesting-phrase finder. These services are ideal for building scalable, reliable, and effective information analysis and management applications. The OrcaTec Information Retrieval Toolkit is designed to be a key component of systems for enterprise search, legal discovery, business intelligence, text data mining, content management, email archiving, knowledge management, and many other applications. OrcaTec Concept Searching learns the meaning of words from the documents that it reads. Concept searching allows users to find information even when they may not know exactly the specific words that a document's author used. Built on top of Lucene, the Toolkit also includes the full complement of Boolean and proximity searching. Version 2.0 supports data ingest rates as high as two million documents per day per system. These documents can be in any language from any source. The Toolkit is based on language modeling, which is the process of analyzing the patterns of language usage in a text and using these patterns to organize and retrieve it. The Toolkit has a REST-based API. http://www.rpath.com

0 TrackBacks

Listed below are links to blogs that reference this entry: OrcaTec LLC Partners with rPath to Launch V 2.0 Of its Information Retrieval Toolkit as a Software Appliance.

TrackBack URL for this entry: http://gilbane.com/blog/mt-tb.cgi/3645

Leave a comment

About this Entry

This page contains a single entry by NewsShark published on January 15, 2008 9:39 AM.

GridIron Introduces Flow was the previous entry in this blog.

W3C Opens Data on the Web with SPARQL is the next entry in this blog.

Find recent content on the main index or look in the archives to find all content.

Gilbane Boston 2008 conference banner

SAJAN-LOGO-with-Tagline.gif

Now available! "Beyond Search: What to do When Your Enterprise Search System Doesn't Work, by Stephen Arnold

Beyond Search Report cover

Gilbane Links

NewsShark

Sign-up for our weekly NewsShark newsletter.
Content technology industry news without the hype:

* Email

* First Name

* Last Name

* = Required Field