diff --git a/Attic/api/overview.html b/Attic/api/overview.html new file mode 100644 index 00000000000..71738ae5260 --- /dev/null +++ b/Attic/api/overview.html @@ -0,0 +1,174 @@ + + + + + + + + Jakarta Lucene API Documentation + + + +

Jakarta Lucene API Documentation

+The Jakarta Lucene API is divided into several +packages: + +To use Lucene, an application should: +
    +
  1. +Create Documents by +adding +Fields.
  2. +Create an IndexWriter +and add documents to it with addDocument();
  3. +Call QueryParser.parse() +to build a query from a string; and
  4. +Create an IndexSearcher +and pass the query to its search() +method.
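The four steps above can be sketched without the Lucene jar on the classpath. What follows is a minimal, self-contained illustration of the same flow: build documents from fields, add them to an index, parse a query string, and search. `ToyIndex` and its methods are hypothetical names invented for this sketch. They are not Lucene's `IndexWriter`, `QueryParser` or `IndexSearcher`, and the tokenizing and matching are far simpler than what Lucene does.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

/** Toy in-memory index; a hypothetical stand-in, NOT Lucene's API. */
class ToyIndex {
    private final List<Map<String, String>> docs = new ArrayList<>();
    // "field:term" -> ids of the documents containing that term in that field
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    /** Step 1: a document is a set of named fields. */
    static Map<String, String> document(String path, String contents) {
        Map<String, String> doc = new LinkedHashMap<>();
        doc.put("path", path);
        doc.put("contents", contents);
        return doc;
    }

    /** Step 2: add a document, splitting each field on non-alphanumerics. */
    void addDocument(Map<String, String> doc) {
        int id = docs.size();
        docs.add(doc);
        for (Map.Entry<String, String> field : doc.entrySet()) {
            String text = field.getValue().toLowerCase(Locale.ROOT);
            for (String term : text.split("[^a-z0-9]+")) {
                if (!term.isEmpty()) {
                    postings.computeIfAbsent(field.getKey() + ":" + term,
                            k -> new LinkedHashSet<>()).add(id);
                }
            }
        }
    }

    /** Steps 3 and 4: read "field:term" (default field "contents") and look it up. */
    List<String> search(String query) {
        String q = query.toLowerCase(Locale.ROOT);
        String key = q.contains(":") ? q : "contents:" + q;
        List<String> paths = new ArrayList<>();
        for (int id : postings.getOrDefault(key, Collections.<Integer>emptySet())) {
            paths.add(docs.get(id).get("path"));
        }
        return paths;
    }

    public static void main(String[] args) {
        ToyIndex index = new ToyIndex();
        index.addDocument(document("soups/abalone-chowder", "Simmer the abalone slowly."));
        index.addDocument(document("soups/spam-chowder", "A chowder made with spam."));
        System.out.println(index.search("chowder"));       // matches on contents only
        System.out.println(index.search("path:chowder"));  // matches on the path field
    }
}
```

The sample paths and texts are made up for the sketch. The default field is `contents` only because that keeps the example short; a real application would choose its own field names.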
+Some simple examples of code which does this are: + +To demonstrate these, try: +
F:\> java demo.IndexFiles rec.food.recipes\soups +
adding rec.food.recipes\soups\abalone-chowder +
  [ ... ] +

F:\> java demo.SearchFiles +
Query: chowder +
Searching for: chowder +
34 total matching documents +
0. rec.food.recipes\soups\spam-chowder +
  [ ... thirty-four documents contain the word "chowder", +"spam-chowder" with the greatest density.] +

Query: path:chowder +
Searching for: path:chowder +
31 total matching documents +
0. rec.food.recipes\soups\abalone-chowder +
  [ ... only thirty-one have "chowder" in the "path" +field. ] +

Query: path:"clam chowder" +
Searching for: path:"clam chowder" +
10 total matching documents +
0. rec.food.recipes\soups\clam-chowder +
  [ ... only ten have "clam chowder" in the "path" field. +] +

Query: path:"clam chowder" AND manhattan +
Searching for: +path:"clam chowder" +manhattan +
2 total matching documents +
0. rec.food.recipes\soups\clam-chowder +
  [ ... only two also have "manhattan" in the contents. +] +
    [ Note: "+" and "-" are canonical, but "AND", "OR" +and "NOT" may be used. ]
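The rewriting shown in the session, where `AND` becomes a `+` prefix on both clauses, can be approximated as follows. This is a rough sketch under stated assumptions, not the QueryParser implementation. `BooleanSugar` is a hypothetical name, and the rules used here (AND marks both neighbouring clauses required, NOT marks the next clause prohibited, OR leaves clauses optional) are a simplification that ignores parentheses and all other query syntax.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper that rewrites AND/OR/NOT into "+" and "-" prefixes. */
class BooleanSugar {
    /** Splits on spaces, keeping double-quoted phrases in one piece. */
    static List<String> tokens(String query) {
        List<String> out = new ArrayList<>();
        StringBuilder cur = new StringBuilder();
        boolean inQuotes = false;
        for (char c : query.toCharArray()) {
            if (c == '"') {
                inQuotes = !inQuotes;
            }
            if (c == ' ' && !inQuotes) {
                if (cur.length() > 0) {
                    out.add(cur.toString());
                    cur.setLength(0);
                }
            } else {
                cur.append(c);
            }
        }
        if (cur.length() > 0) {
            out.add(cur.toString());
        }
        return out;
    }

    /** Returns the query with every clause carrying "+", "-" or no prefix. */
    static String canonical(String query) {
        List<String> prefixes = new ArrayList<>();
        List<String> clauses = new ArrayList<>();
        boolean and = false;
        boolean not = false;
        for (String token : tokens(query)) {
            if (token.equals("AND")) { and = true; continue; }
            if (token.equals("OR")) { and = false; continue; }
            if (token.equals("NOT")) { not = true; continue; }
            int last = prefixes.size() - 1;
            if (and && last >= 0 && prefixes.get(last).isEmpty()) {
                prefixes.set(last, "+"); // AND also makes the previous clause required
            }
            String prefix = not ? "-" : (and ? "+" : "");
            String clause = token;
            if (token.length() > 1 && (token.charAt(0) == '+' || token.charAt(0) == '-')) {
                prefix = token.substring(0, 1); // an explicit prefix wins
                clause = token.substring(1);
            }
            prefixes.add(prefix);
            clauses.add(clause);
            and = false;
            not = false;
        }
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < clauses.size(); i++) {
            if (i > 0) {
                out.append(' ');
            }
            out.append(prefixes.get(i)).append(clauses.get(i));
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(canonical("path:\"clam chowder\" AND manhattan"));
    }
}
```

For the last query in the session, `main` prints the same string that appears after "Searching for:".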

+The IndexHTML demo is more sophisticated. +It incrementally maintains an index of HTML files, adding new files as +they appear, deleting old files as they disappear and re-indexing files +as they change. +
F:\>java demo.IndexHTML -create java\jdk1.1.6\docs\relnotes +
adding java/jdk1.1.6/docs/relnotes/SMICopyright.html +
  [ ... create an index containing all the relnotes ] +

F:\>del java\jdk1.1.6\docs\relnotes\smicopyright.html +

F:\>java demo.IndexHTML java\jdk1.1.6\docs\relnotes +
deleting java/jdk1.1.6/docs/relnotes/SMICopyright.html
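One way to decide which files to add, delete or re-index is to compare what the index recorded against what is on disk now. The sketch below assumes the index remembers a last-modified time per path. The source does not say how IndexHTML detects changes, so that mechanism, the class name `IncrementalPlan`, its fields and the sample paths are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical planner for an incremental re-index; not the IndexHTML source. */
class IncrementalPlan {
    final List<String> toAdd = new ArrayList<>();
    final List<String> toDelete = new ArrayList<>();
    final List<String> toReindex = new ArrayList<>();

    /**
     * Compares the recorded state (path -> last-modified time) with the
     * current state on disk and sorts each path into one of three actions.
     * Paths are visited in sorted order so the result is deterministic.
     */
    static IncrementalPlan plan(Map<String, Long> indexed, Map<String, Long> onDisk) {
        IncrementalPlan p = new IncrementalPlan();
        Map<String, Long> current = new TreeMap<>(onDisk);
        for (Map.Entry<String, Long> e : current.entrySet()) {
            Long seen = indexed.get(e.getKey());
            if (seen == null) {
                p.toAdd.add(e.getKey());        // new file appeared
            } else if (!seen.equals(e.getValue())) {
                p.toReindex.add(e.getKey());    // file changed since it was indexed
            }
        }
        Map<String, Long> recorded = new TreeMap<>(indexed);
        for (String path : recorded.keySet()) {
            if (!onDisk.containsKey(path)) {
                p.toDelete.add(path);           // file disappeared
            }
        }
        return p;
    }

    public static void main(String[] args) {
        Map<String, Long> indexed = new TreeMap<>();
        indexed.put("relnotes/SMICopyright.html", 100L);
        indexed.put("relnotes/index.html", 100L);
        Map<String, Long> onDisk = new TreeMap<>();
        onDisk.put("relnotes/index.html", 100L); // unchanged, so no action is planned
        IncrementalPlan p = plan(indexed, onDisk);
        System.out.println("deleting " + p.toDelete);
    }
}
```

A file that is present in both maps with the same timestamp falls into none of the three lists, which is what lets a second run over an unchanged directory do no work.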

+HTML indexes are searched using SUN's JavaWebServer +(JWS) and Search.jhtml.  To use +this: + +Note that indexes can be updated while searches are going on.  Search.jhtml +will re-open the index when it is updated so that the latest version is +immediately available. +
+
+
diff --git a/build.bat b/build.bat
new file mode 100755
index 00000000000..98147a324a2
--- /dev/null
+++ b/build.bat
@@ -0,0 +1,9 @@
+@echo off
+
+for %%i in (.\lib\*.jar) do call cpappend.bat %%i
+
+echo CLASSPATH="%_CP%"
+
+java -classpath "%_CP%" org.apache.tools.ant.Main -Dant.home=%_AH% %1 %2 %3
+
+SET _CP=
diff --git a/build.sh b/build.sh
new file mode 100755
index 00000000000..3397ffd779f
--- /dev/null
+++ b/build.sh
@@ -0,0 +1,36 @@
+#!/bin/sh
+
+if [ "$JAVA_HOME" = "" ] ; then
+    echo You must set JAVA_HOME to point at your Java Development Kit directory
+    exit 1
+fi
+
+# convert the existing path to unix
+if [ "$OSTYPE" = "cygwin32" ] || [ "$OSTYPE" = "cygwin" ] ; then
+    CLASSPATH=`cygpath --path --unix "$CLASSPATH"`
+fi
+
+# Add in your .jar files first
+for i in ./lib/*.jar
+do
+    CLASSPATH=$CLASSPATH:"$i"
+done
+# Add in the jakarta-site2 library files
+for i in ../jakarta-site2/lib/*.jar
+do
+    CLASSPATH=$CLASSPATH:"$i"
+done
+
+# convert the unix path to windows
+if [ "$OSTYPE" = "cygwin32" ] || [ "$OSTYPE" = "cygwin" ] ; then
+    CLASSPATH=`cygpath --path --windows "$CLASSPATH"`
+fi
+
+BUILDFILE=build.xml
+
+#echo $CLASSPATH
+
+java $ANT_OPTS -classpath "$CLASSPATH" org.apache.tools.ant.Main \
+    -Dant.home=$ANT_HOME \
+    -buildfile ${BUILDFILE} \
+    "$@"
diff --git a/build.xml b/build.xml
new file mode 100644
index 00000000000..26c2b7325e1
--- /dev/null
+++ b/build.xml
@@ -0,0 +1,47 @@
+ + + + + + + + + + + + + + AnakiaTask is not present! Please check to make sure that
+ velocity.jar is in your classpath. 
+ + + + + + + + + + + + + + + + + + diff --git a/cpappend.bat b/cpappend.bat new file mode 100755 index 00000000000..c62555342e3 --- /dev/null +++ b/cpappend.bat @@ -0,0 +1 @@ +set _CP=%1;%_CP% diff --git a/docs/applications.html b/docs/applications.html new file mode 100644 index 00000000000..ce4984fed67 --- /dev/null +++ b/docs/applications.html @@ -0,0 +1,169 @@ + + + + + + + + + + + + + + + + + Jakarta Lucene - Applications - Jakarta Lucene + + + + + + + + + +
+ + +Jakarta Lucene +
+ + + + + + + + + + + + +
+
+
+

About

+ +

Documentation

+ +

Download

+ +

Community

+ +

Jakarta

+ +
+ + + + +
+ + APPLICATIONS + +
+
+

Lucene can be used for a wide range of applications

+
+

+

+ + + + +
+ + Example Applications + +
+
+

Lucene is designed to be used in a wide range of applications--from small, +desktop applications with a few hundred documents, to large internet server-based +applications with a few million documents.

+

Searchable E-Mail

+
    +
  • Search large e-mail archives instantly; update index as new messages +arrive.
  • +
+

CD-ROM-based Online Documentation Search

+
    +
  • Search large publications quickly with platform-independent system.
  • +
+

Search Previously-Visited Web Pages

+
    +
  • Relocate a page seen weeks or months ago.
  • +
+

Web Site Searching

+
    +
  • Let users search all the pages on your website.
  • +
+
+

+

+
+
+
+
+ Copyright © 1999-2001, Apache Software Foundation +
+
+ + + + + + + + + + + + + + + + + diff --git a/docs/background.html b/docs/background.html new file mode 100644 index 00000000000..760f8dcb5df --- /dev/null +++ b/docs/background.html @@ -0,0 +1,177 @@ + + + + + + + + + + + + + + + + + Jakarta Lucene - Background - Jakarta Lucene + + + + + + + + + +
+ + +Jakarta Lucene +
+ + + + + + + + + + + + +
+
+
+

About

+ +

Documentation

+ +

Download

+ +

Community

+ +

Jakarta

+ +
+ + + + +
+ + BACKGROUND + +
+
+

[Image: Doug Cutting] How Lucene came to be

+

Lucene is the brainchild of Doug Cutting (pictured), who has been working in the +field of information retrieval for over a decade.

+

Beginning in 1988, Doug spent five years at Xerox's Palo Alto Research +Center (PARC) developing novel +approaches to information access. These included a high-performance +retrieval engine, several innovative search paradigms, advanced linguistic +analysis methods, and high-quality text summarization algorithms. +This work resulted in seven publications +and six issued patents. Some of these technologies are now marketed by +Inxight. +

+

In 1993, Doug moved to Apple's Advanced Technology Group (ATG). There +he developed a state-of-the-art retrieval engine code-named V-Twin. +This engine was to be a part of the Copland +operating system, automatically indexing the content of all files as they +were created so that the entire file system could be efficiently searched +at any time. Copland was cancelled, but V-Twin has been used in several +other Apple products. +

+

In April of 1996, Doug left Apple and joined Excite. +Here he took over development of the core search technology. This included +growing Excite's web index from two million to fifty million +pages; substantially optimizing Excite's search performance; adding phrase-searching +capabilities; and creating a thesaurus-like feature which suggests related +terms to add to queries. +

+

In the fall of 1997, Doug reduced his commitment at Excite to +part-time so that he could write Lucene, +an efficient, full-featured text search engine written in Java. In early 1998 he +returned to Excite full-time for two more years. Lucene sat on the shelf for +much of that time, and was made open-source in the spring of 2000.

+

Lucene quickly became recognized as the leading server-side searching +solution for Java, and attracted several other open source developers, eager +to help refine the Lucene codebase.

+

In the fall of 2001, Lucene joined the Apache Jakarta Project, where +the product is maintained by a team of volunteer developers.

+

Doug now works for Grand +Central, a web services network. In his spare time he still +tries to help out with Lucene.

+

Please do not email Doug directly about Lucene. Instead use +the Jakarta-Lucene mailing lists.

+
+

+

+
+
+
+
+ Copyright © 1999-2001, Apache Software Foundation +
+
+ + + + + + + + + + + + + + + + + diff --git a/docs/features.html b/docs/features.html new file mode 100644 index 00000000000..7ae1e5cf5bb --- /dev/null +++ b/docs/features.html @@ -0,0 +1,225 @@ + + + + + + + + + + + + + + + + + Jakarta Lucene - Features - Jakarta Lucene + + + + + + + + + +
+ + +Jakarta Lucene +
+ + + + + + + + + + + + +
+
+
+

About

+ +

Documentation

+ +

Download

+ +

Community

+ +

Jakarta

+ +
+ + + + +
+ + FEATURES + +
+
+

Lucene offers powerful features through a simple API

+
+

+

+ + + + +
+ + Scalable, High-Performance Indexing + +
+
+
    +
  • over 200MB/hour on Pentium II/266
  • +
  • incremental indexing as fast as batch indexing
  • +
  • small RAM requirements -- only 1MB heap
  • +
  • index size roughly 30% of the size of the text indexed
  • +
+
+

+

+ + + + +
+ + Powerful, Accurate and Efficient Search Algorithms + +
+
+
    +
  • ranked searching -- best results returned first
  • +
  • boolean and phrase queries
  • +
  • fielded searching (e.g., title, author, contents)
  • +
  • date-range searching
  • +
+
+

+

+ + + + +
+ + Simple APIs allow developers to ... + +
+
+
    +
  • incorporate new document types
  • +
  • localize for new languages (already handles most European languages)
  • +
  • develop new user interfaces
  • +
+
+

+

+ + + + +
+ + Cross-Platform Solution + +
+
+
    +
  • 100%-pure Java (not yet certified)
  • +
+
+

+

+ + + + +
+ + Coming soon + +
+
+
    +
  • multiple-index searching with merged results
  • +
  • distributed searching over a network
  • +
+
+

+

+
+
+
+
+ Copyright © 1999-2001, Apache Software Foundation +
+
+ + + + + + + + + + + + + + + + + diff --git a/docs/images/dcutting.gif b/docs/images/dcutting.gif new file mode 100644 index 00000000000..7a05fb7ce0f Binary files /dev/null and b/docs/images/dcutting.gif differ diff --git a/docs/images/lucene-logo.gif b/docs/images/lucene-logo.gif new file mode 100644 index 00000000000..98bdf4bc0c7 Binary files /dev/null and b/docs/images/lucene-logo.gif differ diff --git a/docs/images/lucene.gif b/docs/images/lucene.gif new file mode 100644 index 00000000000..0f834eb6933 Binary files /dev/null and b/docs/images/lucene.gif differ diff --git a/docs/index.html b/docs/index.html new file mode 100644 index 00000000000..c0efc18edd1 --- /dev/null +++ b/docs/index.html @@ -0,0 +1,222 @@ + + + + + + + + + + + + + + + + + + + Jakarta Lucene - Overview - Jakarta Lucene + + + + + + + + + +
+ + +Jakarta Lucene +
+ + + + + + + + + + + + +
+
+
+

About

+ +

Documentation

+ +

Download

+ +

Community

+ +

Jakarta

+ +
+ + + + +
+ + OVERVIEW + +
+
+

Lucene: Better Searching Through Java

+
+

+

+ + + + +
+ + *** UNDER CONSTRUCTION *** + +
+
+

+This site will be the home of Jakarta Lucene. The product is still being transferred. +For now, the home of the Lucene project is still at SourceForge. See +www.lucene.com for those links. +

+
+

+

+ + + + +
+ + What is Lucene? + +
+
+

+Jakarta Lucene is a high-performance, full-featured text search engine +written entirely in Java. It is a technology suitable for nearly any +application that requires full-text search, especially +one that must run cross-platform. +

+

+Jakarta Lucene is an open source project available for +free download from Apache Jakarta. +Please use the links on the left to access Lucene. +

+
+

+

+ + + + +
+ + Lucene News + +
+
+

+Lucene v1.02 released - This release repackages Lucene as a product +of the Apache Software Foundation. Download it +here. +

+

+Lucene Joins Jakarta - The Lucene Team is happy to announce that +Lucene is now part of the Apache Jakarta Project. This move will +help Lucene continue to grow, and enhance its position as the leading +server-side searching solution for Java. +

+
+

+

+ + + + +
+ + About Apache Jakarta + +
+
+

+The goal of the Apache Jakarta Project +is to provide commercial-quality server solutions based on the Java Platform that +are developed in an open and cooperative fashion. +

+
+

+

+
+
+
+
+ Copyright © 1999-2001, Apache Software Foundation +
+
+ + + + + + + + + + + + + + + + + diff --git a/docs/powered.html b/docs/powered.html new file mode 100644 index 00000000000..63bd546e1d8 --- /dev/null +++ b/docs/powered.html @@ -0,0 +1,141 @@ + + + + + + + + + + + + + + + + + Jakarta Lucene - Powered by Lucene - Jakarta Lucene + + + + + + + + + +
+ + +Jakarta Lucene +
+ + + + + + + + + + + + +
+
+
+

About

+ +

Documentation

+ +

Download

+ +

Community

+ +

Jakarta

+ +
+ + + + +
+ + POWERED BY LUCENE + +
+
+

Several public applications are using Lucene

+ +
+

+

+
+
+
+
+ Copyright © 1999-2001, Apache Software Foundation +
+
+ + + + + + + + + + + + + + + + + diff --git a/docs/resources.html b/docs/resources.html new file mode 100644 index 00000000000..287e79ee675 --- /dev/null +++ b/docs/resources.html @@ -0,0 +1,140 @@ + + + + + + + + + + + + + + + + + Jakarta Lucene - Resources - Jakarta Lucene + + + + + + + + + +
+ + +Jakarta Lucene +
+ + + + + + + + + + + + +
+
+
+

About

+ +

Documentation

+ +

Download

+ +

Community

+ +

Jakarta

+ +
+ + + + +
+ + RESOURCES + +
+
+

Further reading

+ +
+

+

+
+
+
+
+ Copyright © 1999-2001, Apache Software Foundation +
+
+ + + + + + + + + + + + + + + + + diff --git a/docs/whoweare.html b/docs/whoweare.html new file mode 100644 index 00000000000..507451089df --- /dev/null +++ b/docs/whoweare.html @@ -0,0 +1,178 @@ + + + + + + + + + + + + + + + + + Jakarta Lucene - Who We Are - Jakarta Lucene + + + + + + + + + +
+ + +Jakarta Lucene +
+ + + + + + + + + + + + +
+
+
+

About

+ +

Documentation

+ +

Download

+ +

Community

+ +

Jakarta

+ +
+ + + + +
+ + WHO WE ARE + +
+
+

Lucene is maintained by a team of volunteer developers

+
+

+

+ + + + +
+ + Committers + +
+
+
    +
  • Doug Cutting (cutting at apache.org)
  • +
  • Otis Gospodnetic (otis at apache.org)
  • +
  • Brian Goetz (briangoetz at apache.org)
  • +
  • Scott Ganyo (scottganyo at apache.org)
  • +
  • Eugene Gluzberg (drag0n at apache.org)
  • +
  • Matt Tucker (mtucker at apache.org)
  • +
  • Cory Hubert (clhubert at apache.org)
  • +
  • Dave Kor (davekor at apache.org)
  • +
  • Jon Stevens (jon at latchkey.com)
  • +
  • Tal Dayan (zapta at apache.org)
  • +
+
+

+

+ + + + +
+ + Other Contributors + +
+
+
    +
  • Josh Bloch
  • +
  • Ted Husted
  • +
+
+

+

+
+
+
+
+ Copyright © 1999-2001, Apache Software Foundation +
+
+ + + + + + + + + + + + + + + + + diff --git a/lib/ant-1.3.jar b/lib/ant-1.3.jar new file mode 100644 index 00000000000..13acd1e128d --- /dev/null +++ b/lib/ant-1.3.jar @@ -0,0 +1,2 @@ +AnyObjectId[e4efab3ced62cc2c310add71f678bb899e2e51cd] was removed in git history. +Apache SVN contains full history. \ No newline at end of file diff --git a/lib/jdom-b7.jar b/lib/jdom-b7.jar new file mode 100644 index 00000000000..ac5b958103f --- /dev/null +++ b/lib/jdom-b7.jar @@ -0,0 +1,2 @@ +AnyObjectId[6be1f2de7126b91b14a11a4b00326859a85e5981] was removed in git history. +Apache SVN contains full history. \ No newline at end of file diff --git a/lib/velocity-1.2-dev.jar b/lib/velocity-1.2-dev.jar new file mode 100644 index 00000000000..9aa4c31fdfd --- /dev/null +++ b/lib/velocity-1.2-dev.jar @@ -0,0 +1,2 @@ +AnyObjectId[5f8889968e8e6ebbee53e6425ccaca3130c12fef] was removed in git history. +Apache SVN contains full history. \ No newline at end of file diff --git a/lib/xerces-1.3.0.jar b/lib/xerces-1.3.0.jar new file mode 100644 index 00000000000..fb3d01308f0 --- /dev/null +++ b/lib/xerces-1.3.0.jar @@ -0,0 +1,2 @@ +AnyObjectId[311ab8d15990e3d0b0234e37afb89fd7f21f8eb8] was removed in git history. +Apache SVN contains full history. \ No newline at end of file diff --git a/velocity.log b/velocity.log new file mode 100644 index 00000000000..1fe67c7136d --- /dev/null +++ b/velocity.log @@ -0,0 +1,57 @@ +Tue Sep 11 16:23:09 EDT 2001 [debug] AvalonLogSystem initialized using logfile D:\jakarta\CVS\jakarta-lucene\velocity.log +Tue Sep 11 16:23:09 EDT 2001 [info] RuntimeInstance v1.2x initializing : org.apache.velocity.runtime.RuntimeInstance@63b895 +Tue Sep 11 16:23:09 EDT 2001 [info] Default Properties File: org\apache\velocity\runtime\defaults\velocity.properties +Tue Sep 11 16:23:09 EDT 2001 [info] Resource manager initializing. 
+Tue Sep 11 16:23:09 EDT 2001 [info] Resource Loader Instantiated: org.apache.velocity.runtime.resource.loader.FileResourceLoader +Tue Sep 11 16:23:09 EDT 2001 [info] FileResourceLoader : initialization starting. +Tue Sep 11 16:23:09 EDT 2001 [info] FileResourceLoader : adding path 'D:\jakarta\CVS\jakarta-site2\xdocs\stylesheets' +Tue Sep 11 16:23:09 EDT 2001 [info] FileResourceLoader : initialization complete. +Tue Sep 11 16:23:09 EDT 2001 [info] Loaded Pluggable Directive: org.apache.velocity.runtime.directive.Literal +Tue Sep 11 16:23:09 EDT 2001 [info] Loaded Pluggable Directive: org.apache.velocity.runtime.directive.Macro +Tue Sep 11 16:23:09 EDT 2001 [info] Loaded Pluggable Directive: org.apache.velocity.runtime.directive.Parse +Tue Sep 11 16:23:09 EDT 2001 [info] Loaded Pluggable Directive: org.apache.velocity.runtime.directive.Include +Tue Sep 11 16:23:09 EDT 2001 [info] Loaded Pluggable Directive: org.apache.velocity.runtime.directive.Foreach +Tue Sep 11 16:23:10 EDT 2001 [info] Created: 20 parsers. +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : initialization starting. +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : adding VMs from VM library template : VM_global_library.vm +Tue Sep 11 16:23:10 EDT 2001 [error] ResourceManager : unable to find resource 'VM_global_library.vm' in any resource loader. +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : error using VM library template VM_global_library.vm : org.apache.velocity.exception.ResourceNotFoundException: Unable to find resource 'VM_global_library.vm' +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : VM library template macro registration complete. 
+Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : allowInline = true : VMs can be defined inline in templates +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : allowInlineToOverride = false : VMs defined inline may NOT replace previous VM definitions +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : allowInlineLocal = false : VMs defined inline will be global in scope if allowed. +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : messages on : VM system will output logging messages +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : autoload off : VM system will not automatically reload global library macros +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : initialization complete. +Tue Sep 11 16:23:10 EDT 2001 [info] Velocity successfully started. +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #table( table ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #tr( tr ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #td( value ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #th( value ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #projectanchor( name value ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #metaauthor( author email ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #image( value ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #source( value ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #subsection( subsection ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #section( section ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #makeProject( ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : 
#getProjectImage( ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [info] Velocimacro : added new VM : #document( ) : source = ./site.vsl +Tue Sep 11 16:23:10 EDT 2001 [error] VM #tr: error : too few arguments to macro. Wanted 1 got 0 --> +Tue Sep 11 16:23:10 EDT 2001 [error] VM #td: error : too few arguments to macro. Wanted 1 got 0 --> +Tue Sep 11 16:23:10 EDT 2001 [error] VM #th: error : too few arguments to macro. Wanted 1 got 0 --> +Tue Sep 11 16:23:11 EDT 2001 [info] ResourceManager : found ./site.vsl with loader org.apache.velocity.runtime.resource.loader.FileResourceLoader +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : table : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : tr : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : td : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : th : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : projectanchor : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : metaauthor : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : image : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : source : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : subsection : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : section : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : makeProject : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 
2001 [warn] Velocimacro : VM addition rejected : getProjectImage : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [warn] Velocimacro : VM addition rejected : document : inline not allowed to replace existing VM +Tue Sep 11 16:23:11 EDT 2001 [info] ResourceManager : found ./site.vsl with loader org.apache.velocity.runtime.resource.loader.FileResourceLoader diff --git a/xdocs/applications.xml b/xdocs/applications.xml new file mode 100644 index 00000000000..3b0eaa2f796 --- /dev/null +++ b/xdocs/applications.xml @@ -0,0 +1,38 @@ + + + +Ted Husted +Applications - Jakarta Lucene + + + +
+

Lucene can be used for a wide range of applications

+
+ +
+

Lucene is designed to be used in a wide range of applications--from small, +desktop applications with a few hundred documents, to large internet server-based +applications with a few million documents.

+

Searchable E-Mail

+ +

CD-ROM-based Online Documentation Search

+ +

Search Previously-Visited Web Pages

+ +

Web Site Searching

+ +
+ + +
+ diff --git a/xdocs/background.xml b/xdocs/background.xml new file mode 100644 index 00000000000..7b7e812176a --- /dev/null +++ b/xdocs/background.xml @@ -0,0 +1,56 @@ + + + +Ted Husted +Background - Jakarta Lucene + + + +
+

[Image: Doug Cutting] How Lucene came to be

+

Lucene is the brainchild of Doug Cutting (pictured), who has been working in the +field of information retrieval for over a decade.

+

Beginning in 1988, Doug spent five years at Xerox's Palo Alto Research +Center (PARC) developing novel +approaches to information access. These included a high-performance +retrieval engine, several innovative search paradigms, advanced linguistic +analysis methods, and high-quality text summarization algorithms. +This work resulted in seven publications +and six issued patents. Some of these technologies are now marketed by +Inxight. +

+

In 1993, Doug moved to Apple's Advanced Technology Group (ATG). There +he developed a state-of-the-art retrieval engine code-named V-Twin. +This engine was to be a part of the Copland +operating system, automatically indexing the content of all files as they +were created so that the entire file system could be efficiently searched +at any time. Copland was cancelled, but V-Twin has been used in several +other Apple products. +

+

In April of 1996, Doug left Apple and joined Excite. +Here he took over development of the core search technology. This included +growing Excite's web index from two million to fifty million +pages; substantially optimizing Excite's search performance; adding phrase-searching +capabilities; and creating a thesaurus-like feature which suggests related +terms to add to queries. +

+

In the fall of 1997, Doug reduced his commitment at Excite to +part-time so that he could write Lucene, +an efficient, full-featured text search engine written in Java. In early 1998 he +returned to Excite full-time for two more years. Lucene sat on the shelf for +much of that time, and was made open-source in the spring of 2000.

+

Lucene quickly became recognized as the leading server-side searching +solution for Java, and attracted several other open source developers, eager +to help refine the Lucene codebase.

+

In the fall of 2001, Lucene joined the Apache Jakarta Project, where +the product is maintained by a team of volunteer developers.

+

Doug now works for Grand +Central, a web services network. In his spare time he still +tries to help out with Lucene.

+

Please do not email Doug directly about Lucene. Instead use +the Jakarta-Lucene mailing lists.

+
+ + +
+ diff --git a/xdocs/features.xml b/xdocs/features.xml new file mode 100644 index 00000000000..506581d805b --- /dev/null +++ b/xdocs/features.xml @@ -0,0 +1,54 @@ + + + +Ted Husted +Features - Jakarta Lucene + + + +
+

Lucene offers powerful features through a simple API

+
+ +
+ +
+ +
+ +
+ +
+ +
+ +
+ +
+ +
+ +
+ + +
+ diff --git a/xdocs/images/dcutting.gif b/xdocs/images/dcutting.gif new file mode 100644 index 00000000000..7a05fb7ce0f Binary files /dev/null and b/xdocs/images/dcutting.gif differ diff --git a/xdocs/images/lucene-logo.gif b/xdocs/images/lucene-logo.gif new file mode 100644 index 00000000000..98bdf4bc0c7 Binary files /dev/null and b/xdocs/images/lucene-logo.gif differ diff --git a/xdocs/index.xml b/xdocs/index.xml new file mode 100644 index 00000000000..6cd2137e6c4 --- /dev/null +++ b/xdocs/index.xml @@ -0,0 +1,60 @@ + + + +Jon S. Stevens +Ted Husted +Overview - Jakarta Lucene + + + +
+

Lucene: Better Searching Through Java

+
+ +
+

+This site will be the home of Jakarta Lucene. The product is still being transferred. +For now, the home of the Lucene project is still at SourceForge. See +www.lucene.com for those links. +

+
+ +
+

+Jakarta Lucene is a high-performance, full-featured text search engine +written entirely in Java. It is a technology suitable for nearly any +application that requires full-text search, especially +one that must run cross-platform. +

+

+Jakarta Lucene is an open source project available for +free download from Apache Jakarta. +Please use the links on the left to access Lucene. +

+
+ +
+

+Lucene v1.02 released - This release repackages Lucene as a product +of the Apache Software Foundation. Download it +here. +

+

+Lucene Joins Jakarta - The Lucene Team is happy to announce that +Lucene is now part of the Apache Jakarta Project. This move will +help Lucene continue to grow, and enhance its position as the leading +server-side searching solution for Java. +

+
+ +
+

+The goal of the Apache Jakarta Project +is to provide commercial-quality server solutions based on the Java Platform that +are developed in an open and cooperative fashion. +

+
+ + +
+ diff --git a/xdocs/powered.xml b/xdocs/powered.xml new file mode 100644 index 00000000000..eb474a03d56 --- /dev/null +++ b/xdocs/powered.xml @@ -0,0 +1,20 @@ + + + +Ted Husted +Powered by Lucene - Jakarta Lucene + + + +
+

Several public applications are using Lucene

+ +
+ + +
+ diff --git a/xdocs/resources.xml b/xdocs/resources.xml new file mode 100644 index 00000000000..c20597eaab9 --- /dev/null +++ b/xdocs/resources.xml @@ -0,0 +1,19 @@ + + + +Ted Husted +Resources - Jakarta Lucene + + + +
+

Further reading

+ +
+ + +
+ diff --git a/xdocs/stylesheets/project.xml b/xdocs/stylesheets/project.xml new file mode 100644 index 00000000000..51fdcc43c31 --- /dev/null +++ b/xdocs/stylesheets/project.xml @@ -0,0 +1,42 @@ + + + + Jakarta Lucene + Jakarta Lucene + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + diff --git a/xdocs/whoweare.xml b/xdocs/whoweare.xml new file mode 100644 index 00000000000..cbb0366cb2d --- /dev/null +++ b/xdocs/whoweare.xml @@ -0,0 +1,37 @@ + + + +Ted Husted +Who We Are - Jakarta Lucene + + + +
+

Lucene is maintained by a team of volunteer developers

+
+ +
+ +
+ +
+ +
+ + +
+