Commit Graph

23482 Commits

Author SHA1 Message Date
Doug Cutting 8016aa3ea5 Fix StandardTokenizer's handling of CJK characters.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150159 13f79535-47bb-0310-9956-ffa450edef68
2003-12-22 22:12:24 +00:00
Doug Cutting fb13af2325 Updated for change to DocumentWriter.java.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150158 13f79535-47bb-0310-9956-ffa450edef68
2003-12-22 21:42:48 +00:00
Doug Cutting c03491e6a2 Distinguish between positions and length when indexing a field. The
length is now defined as the total number of tokens, not the final
position.  Length is used for score normalization
(Similarity.lengthNorm()) and for controlling memory usage
(IndexWriter.maxFieldLength).  In both cases the total number of
tokens is more reasonable than the final position.  Position is used
in phrase searching (see PhraseQuery and Token.setPositionIncrement()).


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150157 13f79535-47bb-0310-9956-ffa450edef68
2003-12-22 21:40:18 +00:00
Doug Cutting bd2acf0bf8 Add new method IndexReader.setNorm(), to permit altering boosts after an index is created.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150156 13f79535-47bb-0310-9956-ffa450edef68
2003-12-15 23:04:42 +00:00
Doug Cutting 321b292be9 Fixed a NPE in Query.explain().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150155 13f79535-47bb-0310-9956-ffa450edef68
2003-12-15 22:26:13 +00:00
Erik Hatcher 859e6eee20 noted TooManyClauses catch in QueryParser
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150154 13f79535-47bb-0310-9956-ffa450edef68
2003-12-14 17:35:32 +00:00
Erik Hatcher 537368266a remove failing StopAnalyzer tests since the positional change was reverted
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150153 13f79535-47bb-0310-9956-ffa450edef68
2003-12-08 16:16:32 +00:00
Erik Hatcher d83ae1586c revert position increment change due to conflict with PhraseQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150152 13f79535-47bb-0310-9956-ffa450edef68
2003-12-05 14:30:12 +00:00
Otis Gospodnetic 7298a4a49e - Added The Server Side.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150151 13f79535-47bb-0310-9956-ffa450edef68
2003-12-02 12:43:44 +00:00
Doug Cutting 5bf1b92b14 Removed a line of dead code.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150150 13f79535-47bb-0310-9956-ffa450edef68
2003-12-01 18:23:47 +00:00
Christoph Goller 32f3d1794a two unused variables deleted
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150149 13f79535-47bb-0310-9956-ffa450edef68
2003-11-28 17:03:25 +00:00
Erik Hatcher fd5806ddf2 Use position increments to account for removed stop words
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150148 13f79535-47bb-0310-9956-ffa450edef68
2003-11-28 02:03:14 +00:00
Christoph Goller 40dd950e3f closing tags added
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150147 13f79535-47bb-0310-9956-ffa450edef68
2003-11-27 12:50:21 +00:00
Erik Hatcher e869bd9c87 noted TooManyClauses catch in QueryParser
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150146 13f79535-47bb-0310-9956-ffa450edef68
2003-11-26 11:10:54 +00:00
Erik Hatcher fcc15f9a46 new QueryParser tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150145 13f79535-47bb-0310-9956-ffa450edef68
2003-11-26 11:03:30 +00:00
Erik Hatcher 2fd0c0205b Catch TooManyClauses and throw ParseException instead
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150144 13f79535-47bb-0310-9956-ffa450edef68
2003-11-26 11:00:58 +00:00
Doug Cutting a0384c45ff Update for RC3.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150142 13f79535-47bb-0310-9956-ffa450edef68
2003-11-25 22:01:47 +00:00
Doug Cutting ef2d6effb7 Updated for RC3.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150141 13f79535-47bb-0310-9956-ffa450edef68
2003-11-25 21:56:08 +00:00
Doug Cutting c76efdf450 Optimize patch of yesterday: only clone when needed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150140 13f79535-47bb-0310-9956-ffa450edef68
2003-11-25 21:16:36 +00:00
Doug Cutting 393e5c2655 Fixed bug 24786. Correctly rewrite clauses.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150139 13f79535-47bb-0310-9956-ffa450edef68
2003-11-25 00:21:20 +00:00
Christoph Goller 41ab0e084c Fix for problem with "<" and ">" within script.
Fixes bug #16952


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150138 13f79535-47bb-0310-9956-ffa450edef68
2003-11-23 18:37:32 +00:00
Christoph Goller 4e9ed24307 Add space for certain html tags. This is a fix for bug 19253.
I committed a slightly modified version of Daniel's patch.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150137 13f79535-47bb-0310-9956-ffa450edef68
2003-11-23 17:24:32 +00:00
Christoph Goller dc4da65595 Fix for deadlock between indexing thread and parsing
thread that occurs with long titles. Parsing thread waits
for indexing thread to read from pipeIn, indexing thread
waits for summary. This fixes bug #24301


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150136 13f79535-47bb-0310-9956-ffa450edef68
2003-11-23 15:37:26 +00:00
Christoph Goller e535a52af6 file timestamp of segments file substituted by version
number in segments file


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150135 13f79535-47bb-0310-9956-ffa450edef68
2003-11-20 19:10:41 +00:00
Christoph Goller dfaa46dd93 seek checks removed; such ckecks are not implemented
by the other subclasses of InputStream either. Unit test
adapted accordingly.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150134 13f79535-47bb-0310-9956-ffa450edef68
2003-11-20 19:03:22 +00:00
Christoph Goller 841e65e492 avoid double buffering in CSInputStream
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150133 13f79535-47bb-0310-9956-ffa450edef68
2003-11-20 19:01:44 +00:00
Otis Gospodnetic 054d9a4029 - Renamed a variable to reflect its functionality and fixed indentation.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150132 13f79535-47bb-0310-9956-ffa450edef68
2003-11-18 13:05:13 +00:00
Otis Gospodnetic 80071b33e3 - Added section 1.3 RC3 and the first change in this version.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150131 13f79535-47bb-0310-9956-ffa450edef68
2003-11-18 12:00:12 +00:00
Otis Gospodnetic 3a80552bd0 - Patch from case 23754:
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23754


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150130 13f79535-47bb-0310-9956-ffa450edef68
2003-11-18 11:58:19 +00:00
Otis Gospodnetic 1381c24605 - Removed tabs.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150129 13f79535-47bb-0310-9956-ffa450edef68
2003-11-18 11:35:57 +00:00
Erik Hatcher 1a39c55c4c only throw ParseException from parse methods
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150128 13f79535-47bb-0310-9956-ffa450edef68
2003-11-13 11:42:26 +00:00
Erik Hatcher 812d3329d7 detabify and removed redundant (char) cast
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150127 13f79535-47bb-0310-9956-ffa450edef68
2003-11-12 01:00:50 +00:00
Otis Gospodnetic cad41dbce6 - Fixed indentation.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150126 13f79535-47bb-0310-9956-ffa450edef68
2003-11-10 14:37:08 +00:00
Otis Gospodnetic 0401cf63e4 - Fixed indentation.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150125 13f79535-47bb-0310-9956-ffa450edef68
2003-11-10 14:31:19 +00:00
Erik Hatcher e0c4641ac2 harmless qualification of variable
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150124 13f79535-47bb-0310-9956-ffa450edef68
2003-11-06 15:30:57 +00:00
Otis Gospodnetic 97232bf114 - Added webfood.us.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150123 13f79535-47bb-0310-9956-ffa450edef68
2003-11-06 12:48:24 +00:00
Christoph Goller 9cf06f739c After undeleteAll the reader does not have deletions any
longer. So hasDeletions has to become false.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150122 13f79535-47bb-0310-9956-ffa450edef68
2003-10-31 09:46:54 +00:00
Christoph Goller af3fc298bd docCount was computed incorrectly by merge causing
problems during indexing together with my last change in
IndexWriter.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150121 13f79535-47bb-0310-9956-ffa450edef68
2003-10-31 09:28:44 +00:00
Christoph Goller 14b64e1380 use docCount from merger in mergeSegments
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150120 13f79535-47bb-0310-9956-ffa450edef68
2003-10-25 13:54:08 +00:00
Christoph Goller 1a3d49c1ff modify to use VM property for base data directory, so IDEs can run tests without Ant,
simply copied Eriks modification from TestRussianAnalyzer


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150119 13f79535-47bb-0310-9956-ffa450edef68
2003-10-25 11:23:49 +00:00
Erik Hatcher 7d03980697 remove System.out.printlns from tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150118 13f79535-47bb-0310-9956-ffa450edef68
2003-10-23 02:20:48 +00:00
Erik Hatcher f538150c0b change tests to use new sys property
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150117 13f79535-47bb-0310-9956-ffa450edef68
2003-10-23 02:02:06 +00:00
Erik Hatcher ae5b2cc6b7 cleanup velocity.log; set sys property for test cases to use for test files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150116 13f79535-47bb-0310-9956-ffa450edef68
2003-10-23 01:59:55 +00:00
Otis Gospodnetic 4fd1cea634 - Commented out left-over debugging/print statements
- Nuked trailing spaces


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150115 13f79535-47bb-0310-9956-ffa450edef68
2003-10-22 16:24:25 +00:00
Doug Cutting 3c52ab7b7b Updated for 1.2 RC2 release.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150113 13f79535-47bb-0310-9956-ffa450edef68
2003-10-22 15:09:33 +00:00
Doug Cutting 4e6bd151d1 prepare for rc2 release
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150112 13f79535-47bb-0310-9956-ffa450edef68
2003-10-22 14:43:17 +00:00
Doug Cutting ede163802d Add IndexReader.undeleteAll().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150111 13f79535-47bb-0310-9956-ffa450edef68
2003-10-21 18:24:23 +00:00
Doug Cutting f0d57d81a3 Changed IndexReader so that it can be subclassed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150110 13f79535-47bb-0310-9956-ffa450edef68
2003-10-21 17:59:17 +00:00
Erik Hatcher ebd6dc86fa unify stop word lists with StopAnalyzer - they were identical, so just reuse and avoid possible mismatch later
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150109 13f79535-47bb-0310-9956-ffa450edef68
2003-10-20 18:20:48 +00:00
Erik Hatcher eda583780c modify to use VM property for base data directory, so IDEs can run tests without Ant
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150108 13f79535-47bb-0310-9956-ffa450edef68
2003-10-20 18:07:57 +00:00