| <HTML> |
| <HEAD> |
| <META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1"> |
| <META NAME="Author" CONTENT="Doug Cutting"> |
| <META NAME="GENERATOR" CONTENT="Mozilla/4.04 [en] (Win95; U) [Netscape]"> |
| <TITLE>Lucene: a full-text search engine in Java</TITLE> |
| </HEAD> |
| <BODY> |
| |
| <H1> |
| Lucene</H1> |
| Lucene is a full-text search engine written in Java. It is efficient, |
| providing high-performance indexing and searching using few system resources. |
| State-of-the-art search algorithms produce highest-quality search results. |
| The use of Java allows easy integration with cross-platform applications. |
| <H2> |
| Potential Applications</H2> |
| |
| <UL> |
| <LI> |
| <B>Searchable E-Mail</B></LI> |
| |
| <BR>Search large e-mail archives instantly; update index as new messages |
| arrive. |
| <LI> |
| <B>CD-ROM-based Online Documentation Search</B></LI> |
| |
| <BR>Search large publications quickly with platform-independent system. |
| <LI> |
| <B>Search Previously-Visited Web Pages</B></LI> |
| |
| <BR>Relocate a page seen weeks or months ago. |
| <LI> |
| <B>Web Site Searching</B></LI> |
| |
| <BR>Let users search all the pages on your website.</UL> |
| |
| <H2> |
| Features</H2> |
| |
| <UL> |
| <LI> |
| <B>Scalable, High-Performance Indexing</B></LI> |
| |
| <DL> |
| <DL> |
| <LI> |
| over 200MB/hour on Pentium II/266</LI> |
| |
| <LI> |
| incremental indexing as fast as batch indexing</LI> |
| |
| <LI> |
| small RAM requirements -- only 1MB heap</LI> |
| |
| <LI> |
| index size roughly 30% the size of text indexed</LI> |
| </DL> |
| </DL> |
| |
| <LI> |
| <B>Powerful, Accurate and Efficient Search Algorithms</B></LI> |
| |
| <DL> |
| <DL> |
| <LI> |
| ranked searching -- best results returned first</LI> |
| |
| <LI> |
| boolean and phrase queries</LI> |
| |
| <LI> |
| fielded searching (e.g., title, author, contents)</LI> |
| |
| <LI> |
| date-range searching</LI> |
| |
| <LI> |
| <B><I>coming soon:</I></B></LI> |
| |
| <DL> |
| <DL> |
| <LI> |
| <I>multiple-index searching with merged results</I></LI> |
| |
| <LI> |
| <I>distributed searching over a network</I></LI> |
| </DL> |
| </DL> |
| </DL> |
| </DL> |
| |
| <LI> |
| <B>Simple API's allow developers to:</B></LI> |
| |
| <DL> |
| <DL> |
| <LI> |
| incorporate new document types</LI> |
| |
| <LI> |
| localize for new languages (already handles most European languages)</LI> |
| |
| <LI> |
| develop new user interfaces</LI> |
| </DL> |
| </DL> |
| |
| <LI> |
| <B>Cross-Platform Solution</B></LI> |
| |
| <DL> |
| <DL> |
| <LI> |
| 100%-pure Java <I>(not yet certified)</I></LI> |
| </DL> |
| </DL> |
| </UL> |
| |
| <H2> |
| Contact</H2> |
| |
| <UL><B>Douglass R. Cutting</B> |
| <BR>Email: cutting@lucene.com |
| <BR>Phone: 1 (510) 595-0232</UL> |
| |
| </BODY> |
| </HTML> |