Erik Hatcher
32fb624ebc
LUCENE-437 - Add position increment pass through on SnowballFilter tokens
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@290943 13f79535-47bb-0310-9956-ffa450edef68
2005-09-22 13:38:58 +00:00
Mark Harwood
c00b260ecf
Added fix to QueryScorer - if a query has multiple WeightedTerms with different weights for the same term the highest weight is used for scoring that term (previously selected last weight in list)
...
SimpleHTMLEncoder now encodes characters outside of ASCII range as character entities as per suggestion here: http://issues.apache.org/bugzilla/show_bug.cgi?id=36333
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@279088 13f79535-47bb-0310-9956-ffa450edef68
2005-09-06 20:19:50 +00:00
Mark Harwood
f6b07dabe8
Changed TokenGroup.isDistinct after problems reported with JapaneseAnalyzer (no gaps between tokens)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@279078 13f79535-47bb-0310-9956-ffa450edef68
2005-09-06 19:38:12 +00:00
Daniel Naber
dd5c74112f
a query parser by Ronnie Kolehmainen that also sends PrefixQuerys etc. through the analyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@231523 13f79535-47bb-0310-9956-ffa450edef68
2005-08-11 21:28:58 +00:00
Erik Hatcher
6e9c0b6f45
remove unused file
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@209184 13f79535-47bb-0310-9956-ffa450edef68
2005-07-05 02:29:34 +00:00
Erik Hatcher
9d70229506
#34331 - Add Paul Elschot's Surround query language parser
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@209183 13f79535-47bb-0310-9956-ffa450edef68
2005-07-05 02:29:03 +00:00
Mark Harwood
0062898ada
Updated version of MemoryIndex - reliant on new Term.createTerm() method in Trunk
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@208688 13f79535-47bb-0310-9956-ffa450edef68
2005-06-30 21:40:05 +00:00
Mark Harwood
7894a0c0c0
Added (simple) SpanQuery support - matches any terms declared in Spans - proper impl should check for distances
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@208673 13f79535-47bb-0310-9956-ffa450edef68
2005-06-30 20:09:58 +00:00
Daniel Naber
6da2ef197d
update to Apache Software License 2.0
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@189623 13f79535-47bb-0310-9956-ffa450edef68
2005-06-08 19:48:19 +00:00
Mark Harwood
07cee0b287
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@179637 13f79535-47bb-0310-9956-ffa450edef68
2005-06-02 20:27:06 +00:00
Daniel Naber
a3f99b1f43
small javadoc improvements
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178893 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 22:58:17 +00:00
Daniel Naber
27597a5c71
small javadoc fixes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178892 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 22:40:36 +00:00
Daniel Naber
fe52019614
javadoc fixes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178880 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 19:21:49 +00:00
Daniel Naber
06bb3230ff
make this non-public, as it's not documented properly and has a confusing name
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178878 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 19:05:54 +00:00
Daniel Naber
bd2345d856
small javadoc fixes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178839 13f79535-47bb-0310-9956-ffa450edef68
2005-05-27 23:07:00 +00:00
Daniel Naber
952cfd54be
small javadoc fixes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178833 13f79535-47bb-0310-9956-ffa450edef68
2005-05-27 23:02:07 +00:00
Daniel Naber
816f370c0e
small javadoc fixes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178832 13f79535-47bb-0310-9956-ffa450edef68
2005-05-27 23:00:49 +00:00
Daniel Naber
9d2d4ead75
use entity for umlaut
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178239 13f79535-47bb-0310-9956-ffa450edef68
2005-05-24 18:44:20 +00:00
Daniel Naber
69b1f490df
javadoc: fix typo and use HTML entity so generated HTML is correct
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169681 13f79535-47bb-0310-9956-ffa450edef68
2005-05-11 19:33:12 +00:00
David Spencer
1d68f8c88d
Logic ignored stop words were in a early version of this code but it was taken out in the belief that there
...
was no point in explicitly looking for them as the scoring algorithm would effictively ignore them.
I did a test and indexed 700 pages on a corporate web site and then ran the MoreLikeThis code on them
and 1/2 of the docs had stop words identified as interesting.
So - I added code in to ignore stop words, but make it backward compatible so that by default this code
is not used.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169512 13f79535-47bb-0310-9956-ffa450edef68
2005-05-10 19:29:56 +00:00
David Spencer
81087e8bb6
Touchup javadoc.
...
Make retrieveInterestingTerms only return the top terms, not all terms.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169511 13f79535-47bb-0310-9956-ffa450edef68
2005-05-10 19:10:28 +00:00
David Spencer
175cf8a9fd
[1] Added comments to retrieveTerms() to document the return value.
...
[2] Added convenience routine retrieveInterestingTerms() which makes it easier to get at the "interesting words" in a document.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169508 13f79535-47bb-0310-9956-ffa450edef68
2005-05-10 18:49:43 +00:00
David Spencer
c696188668
don't print out summary unless it's present
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169366 13f79535-47bb-0310-9956-ffa450edef68
2005-05-09 21:37:50 +00:00
David Spencer
7f8bf69311
cleanup deprecated warnings so it compiles cleanly w/ the current lucene code, lucene-core-1.9-rc1-dev.jar
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169365 13f79535-47bb-0310-9956-ffa450edef68
2005-05-09 21:36:22 +00:00
David Spencer
c680751f63
test checkin of README, just to verify my permissions
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169349 13f79535-47bb-0310-9956-ffa450edef68
2005-05-09 19:25:40 +00:00
Erik Hatcher
78dbe41805
prefix all JARs with lucene-
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168986 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 23:43:54 +00:00
Erik Hatcher
e8c90fb050
rename WordNet to wordnet, required intermediate move due to OS case insensitivity
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168480 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:32:00 +00:00
Erik Hatcher
5fd5169a6f
temporary move to lowercase WordNet
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168479 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:31:11 +00:00
Erik Hatcher
dd472377dd
adjust code to fix compile/javadoc errors on JDK 1.5
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168478 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:26:08 +00:00
Erik Hatcher
a12dac37b4
adjust project names for consistency
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168476 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:24:18 +00:00
Mark Harwood
12a91b4395
Fixed bug where docs larger than maxDocBytesToAnalyze would cause last fragment to be sized as remainder of doc (which could be huge).
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168452 13f79535-47bb-0310-9956-ffa450edef68
2005-05-05 22:40:45 +00:00
Erik Hatcher
8f70c09b9b
Wolfgang is non-stop with the additions. Easy enough to paste in, so here it is with a Collection-based TokenStream
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168029 13f79535-47bb-0310-9956-ffa450edef68
2005-05-04 00:24:17 +00:00
Erik Hatcher
f94ebdb41e
applied norm caching path from Wolfgang
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@167958 13f79535-47bb-0310-9956-ffa450edef68
2005-05-03 19:01:58 +00:00
Erik Hatcher
2a37a3e820
Apply wolfgangs fix to the tests
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@167835 13f79535-47bb-0310-9956-ffa450edef68
2005-05-03 00:33:27 +00:00
Andreas Vajda
572633f8c4
- reworked store I/O to use new IndexInput and IndexOutput classes
...
- reworked store I/O to avoid upstream buffering giving better txn control
- added DbStoreTest unit test adapted from StoreTest
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165674 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 20:06:00 +00:00
Erik Hatcher
8f9e2a15e7
Enhancement #34585 - high-performance in-memory index contributed by Wolfgang Hoschek
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165606 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 09:04:07 +00:00
Erik Hatcher
c3847f26ea
overhaul of build system to facilitate building and packaging of contrib sub-projects. some work still to be done, but core Lucene build still working fine
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165566 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 00:11:11 +00:00
Erik Hatcher
21431112fe
adjust license headers to be ASL 2.0
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165565 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 00:08:04 +00:00
Erik Hatcher
df52ba1ec6
standardizing source layout
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165562 13f79535-47bb-0310-9956-ffa450edef68
2005-05-01 23:57:31 +00:00
Erik Hatcher
f56d33e2d4
Add ASL header - sorry for the oversight on this.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165559 13f79535-47bb-0310-9956-ffa450edef68
2005-05-01 22:57:39 +00:00
Andreas Vajda
77130721ce
- replaced db.jar with db-4.3.27.jar
...
- downloading db-4.3.27.jar from http://downloads.osafoundation.org/db
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165319 13f79535-47bb-0310-9956-ffa450edef68
2005-04-29 17:33:27 +00:00
Erik Hatcher
d9042b00d8
move PrecedenceQueryParser to contrib/misc until the kinks are worked out
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164964 13f79535-47bb-0310-9956-ffa450edef68
2005-04-27 09:32:33 +00:00
Erik Hatcher
7b8f43ec7c
move misc over to official contrib area
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164963 13f79535-47bb-0310-9956-ffa450edef68
2005-04-27 09:16:31 +00:00
Erik Hatcher
5c9ccb2442
Add Lucene's test classes to contrib test classpath, some tests rely on the utility methods in the core tests
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164937 13f79535-47bb-0310-9956-ffa450edef68
2005-04-27 01:52:17 +00:00
Erik Hatcher
790dfc1490
javadoc fixup
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164742 13f79535-47bb-0310-9956-ffa450edef68
2005-04-26 04:41:54 +00:00
Erik Hatcher
26aab23901
add ignores
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164698 13f79535-47bb-0310-9956-ffa450edef68
2005-04-26 00:30:08 +00:00
Erik Hatcher
d650384d4b
add GreekAnalyzer, contributed by Panagiotis Astithas (past@ebs.gr)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164686 13f79535-47bb-0310-9956-ffa450edef68
2005-04-25 23:23:37 +00:00
Erik Hatcher
2fe0a80189
rename misspelled indexDictionnary method
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160988 13f79535-47bb-0310-9956-ffa450edef68
2005-04-12 00:11:33 +00:00
Erik Hatcher
ec522fc1c8
Fixed deprecation issues, adjusted test cases to use assertEquals better, reformatted style
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160987 13f79535-47bb-0310-9956-ffa450edef68
2005-04-11 23:48:02 +00:00
Erik Hatcher
0c99b57cc1
Fixed issue with ctor parameter being ignored
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160984 13f79535-47bb-0310-9956-ffa450edef68
2005-04-11 23:43:57 +00:00