Michael McCandless
24d76157a0
LUCENE-1420: let Similarity.computeNorm compute the norm; add option to discount overlap tokens when computing lengthNorm
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@710117 13f79535-47bb-0310-9956-ffa450edef68
2008-11-03 18:03:58 +00:00
Michael McCandless
f2c988ae2b
don't include contrib/memory class files in highlighter's jar
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@707834 13f79535-47bb-0310-9956-ffa450edef68
2008-10-25 10:19:30 +00:00
Grant Ingersoll
8dfe073760
LUCENE-1406. Added Arabic stemming and normalization. Also added new method to WordListLoader to allow for comments in word lists.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@706342 13f79535-47bb-0310-9956-ffa450edef68
2008-10-20 17:19:29 +00:00
Michael McCandless
da78e20bd0
LUCENE-1416: don't fail contrib/ant's unit test when file path contains non-URL safe characters like space
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@706160 13f79535-47bb-0310-9956-ffa450edef68
2008-10-20 09:12:46 +00:00
Karl-Johan Wettin
456b10fdf9
LUCENE-1423
...
InstantiatedTermEnum#skipTo(Term) throws ArrayIndexOutOfBoundsException on an empty index.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@705893 13f79535-47bb-0310-9956-ffa450edef68
2008-10-18 16:29:53 +00:00
Daniel Naber
20ddb46fec
minor javadoc fix
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@703717 13f79535-47bb-0310-9956-ffa450edef68
2008-10-11 19:45:16 +00:00
Michael McCandless
3f27b17a89
fix non-1.4-compatible throws clause
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@701827 13f79535-47bb-0310-9956-ffa450edef68
2008-10-05 16:40:59 +00:00
Mark Harwood
92d6a0ae7b
Fix for a potential null-pointer-introducing bug which came about as part of the DocIdSet changes. TermsFilter no longer implemented bits(IndexReader) and the Filter base class' version of this was changed to return null.
...
When dropping 2.4 Lucene in as a direct replacement for 2.3.2 my client code was getting NullPointer errors - returning null was never part of the Filter.bits contract and so this could be a problem for others using this class.
Fix is for TermsFilter to implement bits(IndexReader) - this can safely be removed in later versions because it is a deprecated method going forward.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@700815 13f79535-47bb-0310-9956-ffa450edef68
2008-10-01 15:37:05 +00:00
Michael McCandless
98e1129a14
break off contrib/CHANGES.txt's 2.4.0 release section
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@700743 13f79535-47bb-0310-9956-ffa450edef68
2008-10-01 11:22:58 +00:00
Andreas Vajda
88283a7027
- upgraded to bdb-je 3.3.69
...
- re-enabled testBytes() and testArrays() (per Aaron, Michael's instructions)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699961 13f79535-47bb-0310-9956-ffa450edef68
2008-09-29 02:21:51 +00:00
Andreas Vajda
96809b1238
upgraded db lib to version 4.7.25
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699882 13f79535-47bb-0310-9956-ffa450edef68
2008-09-28 19:07:39 +00:00
Mark Harwood
ba6344a4d7
Fixed bug in FuzzyLikeThisQuery.java. Queries that contain a term with no fuzzy variants caused the query construction logic to exit loop early, producing no fuzzy variants for all subsequent terms in the query string.
...
Added a JUnit test that recreates the problem conditions, and a fix to FuzzyLikeThisQuery that resolves the issue.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699512 13f79535-47bb-0310-9956-ffa450edef68
2008-09-26 21:44:55 +00:00
Michael McCandless
0e708a21fe
LUCENE-1401: fix accidentally lost 'create=true' in CreateIndexTask
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@698975 13f79535-47bb-0310-9956-ffa450edef68
2008-09-25 14:11:41 +00:00
Michael McCandless
a40530e4a4
LUCENE-1401: remove new deprecated IndexWriter ctors; default autoCommit=false for new ctors
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@698932 13f79535-47bb-0310-9956-ffa450edef68
2008-09-25 12:04:38 +00:00
Michael McCandless
b866828bab
add graph to package.html for contrib/instantiated showing queries per sec vs number of docs indexed
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@697126 13f79535-47bb-0310-9956-ffa450edef68
2008-09-19 15:15:36 +00:00
Otis Gospodnetic
0195fcd03d
LUCENE-1378 - Removed the remaining 199 @author references
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@695514 13f79535-47bb-0310-9956-ffa450edef68
2008-09-15 15:42:11 +00:00
Karl-Johan Wettin
c6e5618674
Added a bunch of <p> in the Javadocs to make the rendered html view look better.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694396 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 18:26:46 +00:00
Karl-Johan Wettin
71f2d8199b
LUCENE-1320
...
ShingleMatrixFilter JDK downgrade 1.5 -> 1.4
Grant Ingersoll via Karl Wettin
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694393 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 18:23:18 +00:00
Michael McCandless
d31fb4556c
fix wordnet's Syns2Index to not fiddle with mergeFactor & maxBufferedDocs (the latter was hitting an exception)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694222 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 12:11:03 +00:00
Michael McCandless
4218996230
LUCENE-1366: rename Field.Index.* options
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694004 13f79535-47bb-0310-9956-ffa450edef68
2008-09-10 21:38:52 +00:00
Grant Ingersoll
dadaebc290
LUCENE-1243: Added new benchmark tasks
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693495 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 15:56:41 +00:00
Mark Robert Miller
5147f0563f
LUCENE-1357: SpanScorer does not respect ConstantScoreRangeQuery setting
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693432 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 10:52:50 +00:00
Michael McCandless
d5a40278bc
LUCENE-1369: switch from Hashtable to HashMap and from Vector to List, when possible
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@692921 13f79535-47bb-0310-9956-ffa450edef68
2008-09-07 19:22:40 +00:00
Michael McCandless
120d01bb09
remove DOCTYPE declaration in testEncoding (it's causing intermittent false failures in nightly build when xerces has trouble loading the DTD from w3.org)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@690914 13f79535-47bb-0310-9956-ffa450edef68
2008-09-01 09:07:16 +00:00
Karl-Johan Wettin
bf238a5743
Javadocs fix
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@690779 13f79535-47bb-0310-9956-ffa450edef68
2008-08-31 20:46:47 +00:00
Karl-Johan Wettin
82c70c018e
LUCENE-1016 : TermVectorAccessor, transparent vector space access via stored vectors or by resolving the inverted index.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688745 13f79535-47bb-0310-9956-ffa450edef68
2008-08-25 15:02:20 +00:00
Karl-Johan Wettin
3034575f66
LUCENE-1142 : Updated Snowball package, org.tartarus distribution revision 500.
...
Introducing Hungarian, Turkish and Romanian support, updated older stemmers and optimized (reflectionless) SnowballFilter.
IMPORTANT NOTICE ON BACKWARDS COMPATIBILITY: an index created using 2.3.2 (or older) might not be compatible with these updated classes, as some algorithms have changed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688420 13f79535-47bb-0310-9956-ffa450edef68
2008-08-23 22:02:47 +00:00
Karl-Johan Wettin
9fe7a35378
Contrib level CHANGES.txt. I forgot to add this some time ago.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688370 13f79535-47bb-0310-9956-ffa450edef68
2008-08-23 17:12:57 +00:00
Michael McCandless
003a853cc8
LUCENE-1333: don't use LuceneTestCase in contrib until we can fix the build dependency
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687539 13f79535-47bb-0310-9956-ffa450edef68
2008-08-21 02:45:37 +00:00
Grant Ingersoll
09467ad18e
LUCENE-1001: Make payloads accessible from SpanQuery's
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687379 13f79535-47bb-0310-9956-ffa450edef68
2008-08-20 16:06:36 +00:00
Michael McCandless
bb6b711718
LUCENE-1333: improvements to Token reuse API and full cutover to reuse API for all core and contrib analyzers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687357 13f79535-47bb-0310-9956-ffa450edef68
2008-08-20 14:38:07 +00:00
Mark Robert Miller
7675606908
LUCENE-1355: highlighter can incorrectly produce negative idf when index has deletes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687052 13f79535-47bb-0310-9956-ffa450edef68
2008-08-19 13:16:41 +00:00
Michael McCandless
e31a9da835
LUCENE-1334: add Term(String fieldName) constructor that sets term text to empty string
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687014 13f79535-47bb-0310-9956-ffa450edef68
2008-08-19 10:40:39 +00:00
Doron Cohen
027307857e
LUCENE-1353: Javacc target to generate precedence query parser.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@685114 13f79535-47bb-0310-9956-ffa450edef68
2008-08-12 10:49:59 +00:00
Michael McCandless
12e4e67403
add missing synchronized to PerfRunData.getQueryMaker()
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@677555 13f79535-47bb-0310-9956-ffa450edef68
2008-07-17 10:50:24 +00:00
Mark Harwood
74c551fb88
Added option to allow UserQuery tag to define a different default fieldName.
...
Standard use case for this is where users are presented with a GUI form with multiple input boxes, each targeting a different field and allowing "lucene syntax". The XML query template behind such a form would have a <UserQuery> tag for each form field, each defined with the appropriate choice of default field name.
Added JUnit test for changing default field name, updated DTD for XML query syntax and regenerated HTML documentation.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@674975 13f79535-47bb-0310-9956-ffa450edef68
2008-07-08 21:50:06 +00:00
Karl-Johan Wettin
ddc7c290d0
LUCENE-1320
...
ShingleMatrixFilter, a multidimensional shingle token filter.
Bug fix, did not support empty input token streams.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@674367 13f79535-47bb-0310-9956-ffa450edef68
2008-07-07 00:08:41 +00:00
Karl-Johan Wettin
bca43ea3ea
LUCENE-1320
...
ShingleMatrixFilter, a multidimensional shingle token filter.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@673549 13f79535-47bb-0310-9956-ffa450edef68
2008-07-02 23:53:51 +00:00
Mark Robert Miller
943bf37e66
LUCENE-1321: Highlight fragment does not extend to maxDocCharsToAnalyze
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@673220 13f79535-47bb-0310-9956-ffa450edef68
2008-07-01 21:44:33 +00:00
Karl-Johan Wettin
4f7ead8112
LUCENE-1318: InstantiatedIndexReader.norms(String, b[], int) didn't treat the array offset right.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@672568 13f79535-47bb-0310-9956-ffa450edef68
2008-06-28 18:52:12 +00:00
Karl-Johan Wettin
b8fc54e72a
LUCENE-1312: Added full support for InstantiatedIndexReader#getFieldNames() and extended the test case to assert that deleted documents behave as they should (they did).
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@672556 13f79535-47bb-0310-9956-ffa450edef68
2008-06-28 17:23:35 +00:00
Erik Hatcher
3c80f97f7c
MemoryIndex implements Serializable
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@671579 13f79535-47bb-0310-9956-ffa450edef68
2008-06-25 15:18:29 +00:00
Grant Ingersoll
44dcbe184f
add a getter for the StringDistance
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669476 13f79535-47bb-0310-9956-ffa450edef68
2008-06-19 12:52:11 +00:00
Grant Ingersoll
d6dcae49dd
make public
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669455 13f79535-47bb-0310-9956-ffa450edef68
2008-06-19 11:45:08 +00:00
Otis Gospodnetic
8b11f3b077
- Renamed TRStringDistance to LevensteinDistance (related to LUCENE-1297)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669086 13f79535-47bb-0310-9956-ffa450edef68
2008-06-18 05:09:11 +00:00
Otis Gospodnetic
ce19aa28c0
LUCENE-1297 - Allow other string distance measures for the SpellChecker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669085 13f79535-47bb-0310-9956-ffa450edef68
2008-06-18 05:01:57 +00:00
Mark Harwood
4adc3f7ba6
Fixed bug parsing boolean attributes. Boolean.getBoolean(s) was being used by mistake; it reads a system property named s, whereas the attribute value s itself needs to be parsed as the string "true" or "false".
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@667294 13f79535-47bb-0310-9956-ffa450edef68
2008-06-12 23:38:39 +00:00
Doron Cohen
4d9e280788
LUCENE-1090: remove relative paths from benchmark's build.xml.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@666079 13f79535-47bb-0310-9956-ffa450edef68
2008-06-10 11:58:00 +00:00
Grant Ingersoll
8c97e9b87b
LUCENE-1299: Properly handle when IndexReader is not null and Field is in the spell checker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@663649 13f79535-47bb-0310-9956-ffa450edef68
2008-06-05 16:12:08 +00:00
Grant Ingersoll
f89cda6dde
LUCENE-1298: Allow MLT to use custom similarity
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@663054 13f79535-47bb-0310-9956-ffa450edef68
2008-06-04 10:41:41 +00:00