Apache Lucene open-source search software
Go to file
David Spencer 1d68f8c88d Logic ignored stop words were in a early version of this code but it was taken out in the belief that there
was no point in explicitly looking for them as the scoring algorithm would effictively ignore them.

I did a test and indexed 700 pages on a corporate web site and then ran the MoreLikeThis code on them
and 1/2 of the docs had stop words identified as interesting.

So - I added code in to ignore stop words, but make it backward compatible so that by default this code
is not used.




git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169512 13f79535-47bb-0310-9956-ffa450edef68
2005-05-10 19:29:56 +00:00
contrib Logic ignored stop words were in a early version of this code but it was taken out in the belief that there 2005-05-10 19:29:56 +00:00
docs fixing typos; WordNet url update 2005-05-04 23:10:37 +00:00
site fixing property 2005-04-29 22:58:33 +00:00
src throw a more helpful exception if supposed directory is a file 2005-05-08 14:51:29 +00:00
xdocs fixing typos; WordNet url update 2005-05-04 23:10:37 +00:00
.cvsignore CVS should ignore build and dist directories. 2004-09-16 21:29:20 +00:00
BUILD.txt update build instructions and version numbers 2005-05-05 13:38:34 +00:00
CHANGES.txt only delete our own files when re-creating an index (#34695) 2005-05-04 23:34:52 +00:00
LICENSE.txt - Updated ASL from version 1.1 to 2.0 2004-01-29 12:26:56 +00:00
README.txt update homepage url and mailing list address 2005-04-21 20:47:18 +00:00
build-deprecated.xml belated checkin - moved deprecated build/test targets to separate easily removable import build file 2005-05-02 00:53:41 +00:00
build.xml #34816 - adjust for contrib/WordNet renaming 2005-05-10 01:19:03 +00:00
common-build.xml prefix all JARs with lucene- 2005-05-06 23:43:54 +00:00
index.html add redirect to docs 2001-11-04 17:19:24 +00:00

README.txt

Lucene README file

$Id$

INTRODUCTION

Lucene is a Java full-text search engine.  Lucene is not a complete
application, but rather a code library and API that can easily be used
to add search capabilities to applications.

The Lucene web site is at:
  http://lucene.apache.org/

Please join the Lucene-User mailing list by sending a message to:
  java-user-subscribe@lucene.apache.org

FILES

lucene-XX.jar
  The compiled lucene library.

docs/index.html
  The contents of the Lucene website.

docs/api/index.html
  The Javadoc Lucene API documentation.

src/java
  The Lucene source code.

src/demo
  Some example code.