Commit Graph

921 Commits

Author SHA1 Message Date
Mark Harwood ba6344a4d7 Fixed bug in FuzzyLikeThisQuery.java. Queries that contain a term with no fuzzy variants caused the query construction logic to exit loop early, producing no fuzzy variants for all subsequent terms in the query string.
JUnit test added which recreates the problem conditions and added fix to FuzzyLikeThisQuery that solves the issue.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699512 13f79535-47bb-0310-9956-ffa450edef68
2008-09-26 21:44:55 +00:00
Michael McCandless 0e708a21fe LUCENE-1401: fix accidentally lost 'create=true' in CreateIndexTask
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@698975 13f79535-47bb-0310-9956-ffa450edef68
2008-09-25 14:11:41 +00:00
Michael McCandless a40530e4a4 LUCENE-1401: remove new deprecated IndexWriter ctors; default autoCommit=false for new ctors
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@698932 13f79535-47bb-0310-9956-ffa450edef68
2008-09-25 12:04:38 +00:00
Michael McCandless b866828bab add graph to package.html for contrib/instantiated showing queries per sec vs number of docs indexed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@697126 13f79535-47bb-0310-9956-ffa450edef68
2008-09-19 15:15:36 +00:00
Otis Gospodnetic 0195fcd03d LUCENE-1378 - Removed the remaining 199 @author references
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@695514 13f79535-47bb-0310-9956-ffa450edef68
2008-09-15 15:42:11 +00:00
Karl-Johan Wettin c6e5618674 Added a bunch of <p> in the Javadocs to make the rendered html view look better.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694396 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 18:26:46 +00:00
Karl-Johan Wettin 71f2d8199b LUCENE-1320
ShingleMatrixFilter JDK downgrade 1.5 -> 1.4 
Grant Ingersoll via Karl Wettin

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694393 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 18:23:18 +00:00
Michael McCandless d31fb4556c fix wordnet's Syns2Index to not fiddle with mergeFactor & maxBufferedDocs (the latter was hitting an exception)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694222 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 12:11:03 +00:00
Michael McCandless 4218996230 LUCENE-1366: rename Field.Index.* options
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694004 13f79535-47bb-0310-9956-ffa450edef68
2008-09-10 21:38:52 +00:00
Grant Ingersoll dadaebc290 LUCENE-1243: Added new benchmark tasks
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693495 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 15:56:41 +00:00
Mark Robert Miller 5147f0563f LUCENE-1357: SpanScorer does not respect ConstantScoreRangeQuery setting
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693432 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 10:52:50 +00:00
Michael McCandless d5a40278bc LUCENE-1369: switch from Hashtable to HashMap and from Vector to List, when possible
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@692921 13f79535-47bb-0310-9956-ffa450edef68
2008-09-07 19:22:40 +00:00
Michael McCandless 120d01bb09 remove DOCTYPE declaration in testEncoding (it's causing intermittent false failures in nightly build when xerces has trouble loading the DTD from w3.org)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@690914 13f79535-47bb-0310-9956-ffa450edef68
2008-09-01 09:07:16 +00:00
Karl-Johan Wettin bf238a5743 Javadocs fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@690779 13f79535-47bb-0310-9956-ffa450edef68
2008-08-31 20:46:47 +00:00
Karl-Johan Wettin 82c70c018e LUCENE-1016 : TermVectorAccessor, transparent vector space access via stored vectors or by resolving the inverted index.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688745 13f79535-47bb-0310-9956-ffa450edef68
2008-08-25 15:02:20 +00:00
Karl-Johan Wettin 3034575f66 LUCENE-1142 : Updated Snowball package, org.tartarus distribution revision 500.
Introducing Hungarian, Turkish and Romanian support, updated older stemmers and optimized (reflectionless) SnowballFilter.

IMPORTANT NOTICE ON BACKWARDS COMPATIBILITY: an index created using version 2.3.2 (or older) might not be compatible with these updated classes, as some algorithms have changed.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688420 13f79535-47bb-0310-9956-ffa450edef68
2008-08-23 22:02:47 +00:00
Karl-Johan Wettin 9fe7a35378 Contrib level CHANGES.txt. I forgot to add this some time ago.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688370 13f79535-47bb-0310-9956-ffa450edef68
2008-08-23 17:12:57 +00:00
Michael McCandless 003a853cc8 LUCENE-1333: don't use LuceneTestCase in contrib until we can fix the build dependency
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687539 13f79535-47bb-0310-9956-ffa450edef68
2008-08-21 02:45:37 +00:00
Grant Ingersoll 09467ad18e LUCENE-1001: Make payloads accessible from SpanQuery's
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687379 13f79535-47bb-0310-9956-ffa450edef68
2008-08-20 16:06:36 +00:00
Michael McCandless bb6b711718 LUCENE-1333: improvements to Token reuse API and full cutover to reuse API for all core and contrib analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687357 13f79535-47bb-0310-9956-ffa450edef68
2008-08-20 14:38:07 +00:00
Mark Robert Miller 7675606908 LUCENE-1355: highlighter can incorrectly produce negative idf when index has deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687052 13f79535-47bb-0310-9956-ffa450edef68
2008-08-19 13:16:41 +00:00
Michael McCandless e31a9da835 LUCENE-1334: add Term(String fieldName) constructor that sets term text to empty string
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687014 13f79535-47bb-0310-9956-ffa450edef68
2008-08-19 10:40:39 +00:00
Doron Cohen 027307857e LUCENE-1353: Javacc target to generate precedence query parser.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@685114 13f79535-47bb-0310-9956-ffa450edef68
2008-08-12 10:49:59 +00:00
Michael McCandless 12e4e67403 add missing synchronized to PerfRunData.getQueryMaker()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@677555 13f79535-47bb-0310-9956-ffa450edef68
2008-07-17 10:50:24 +00:00
Mark Harwood 74c551fb88 Added option to allow UserQuery tag to define a different default fieldName.
Standard use case for this is where users are presented with a GUI form with multiple input boxes, each targeting a different field and allowing "lucene syntax". The XML query template behind such a form would have a <UserQuery> tag for each form field, each defined with the appropriate choice of default field name.

Added JUnit test for changing default field name, updated DTD for XML query syntax and regenerated HTML documentation.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@674975 13f79535-47bb-0310-9956-ffa450edef68
2008-07-08 21:50:06 +00:00
Karl-Johan Wettin ddc7c290d0 LUCENE-1320
ShingleMatrixFilter, a multidimensional shingle token filter.

Bug fix, did not support empty input token streams.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@674367 13f79535-47bb-0310-9956-ffa450edef68
2008-07-07 00:08:41 +00:00
Karl-Johan Wettin bca43ea3ea LUCENE-1320
ShingleMatrixFilter, a multidimensional shingle token filter.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@673549 13f79535-47bb-0310-9956-ffa450edef68
2008-07-02 23:53:51 +00:00
Mark Robert Miller 943bf37e66 LUCENE-1321: Highlight fragment does not extend to maxDocCharsToAnalyze
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@673220 13f79535-47bb-0310-9956-ffa450edef68
2008-07-01 21:44:33 +00:00
Karl-Johan Wettin 4f7ead8112 LUCENE-1318: InstantiatedIndexReader.norms(String, b[], int) didn't treat the array offset right.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@672568 13f79535-47bb-0310-9956-ffa450edef68
2008-06-28 18:52:12 +00:00
Karl-Johan Wettin b8fc54e72a LUCENE-1312: Added full support for InstantiatedIndexReader#getFieldNames() and extended the test case to assert that deleted documents behave as they should (they did).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@672556 13f79535-47bb-0310-9956-ffa450edef68
2008-06-28 17:23:35 +00:00
Erik Hatcher 3c80f97f7c MemoryIndex implements Serializable
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@671579 13f79535-47bb-0310-9956-ffa450edef68
2008-06-25 15:18:29 +00:00
Grant Ingersoll 44dcbe184f add a getter for the StringDistance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669476 13f79535-47bb-0310-9956-ffa450edef68
2008-06-19 12:52:11 +00:00
Grant Ingersoll d6dcae49dd make public
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669455 13f79535-47bb-0310-9956-ffa450edef68
2008-06-19 11:45:08 +00:00
Otis Gospodnetic 8b11f3b077 - Renamed TRStringDistance to LevensteinDistance (related to LUCENE-1297)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669086 13f79535-47bb-0310-9956-ffa450edef68
2008-06-18 05:09:11 +00:00
Otis Gospodnetic ce19aa28c0 LUCENE-1297 - Allow other string distance measures for the SpellChecker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669085 13f79535-47bb-0310-9956-ffa450edef68
2008-06-18 05:01:57 +00:00
Mark Harwood 4adc3f7ba6 Fixed bug parsing boolean attributes. Boolean.getBoolean(s) was being used by mistake; it reads the system property named s. The fix parses s as the string value "true" or "false".
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@667294 13f79535-47bb-0310-9956-ffa450edef68
2008-06-12 23:38:39 +00:00
Doron Cohen 4d9e280788 LUCENE-1090: remove relative paths from benchmark's build.xml.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@666079 13f79535-47bb-0310-9956-ffa450edef68
2008-06-10 11:58:00 +00:00
Grant Ingersoll 8c97e9b87b LUCENE-1299: Properly handle when IndexReader is not null and Field is in the spell checker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@663649 13f79535-47bb-0310-9956-ffa450edef68
2008-06-05 16:12:08 +00:00
Grant Ingersoll f89cda6dde LUCENE-1298: Allow MLT to use custom similarity
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@663054 13f79535-47bb-0310-9956-ffa450edef68
2008-06-04 10:41:41 +00:00
Grant Ingersoll 1979d75605 LUCENE-1295: Made method public and added retrieveInterestingTerms variation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@662413 13f79535-47bb-0310-9956-ffa450edef68
2008-06-02 12:29:02 +00:00
Mark Robert Miller f32b5a5698 LUCENE-1285: WeightedSpanTermExtractor incorrectly treats the same terms occurring in different query types
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659965 13f79535-47bb-0310-9956-ffa450edef68
2008-05-25 11:38:55 +00:00
Michael Busch 3e7a15e317 LUCENE-1187: Fix ChainedFilterTest.java and BooleanFilterTest.java to not reference class from test/org/apache/lucene/search/, which is not in the contrib classpath.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659742 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 01:45:12 +00:00
Michael Busch 1d6d186888 LUCENE-1187: Fix ChainedFilterTest.java and BooleanFilterTest.java to not reference class from test/org/apache/lucene/search/, which is not in the contrib classpath.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659740 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 01:16:30 +00:00
Michael Busch 14f3663f56 Set eol-style to native for BooleanFilterTest.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659732 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 00:48:12 +00:00
Michael Busch 0ebfcc663e LUCENE-1187: ChainedFilter and BooleanFilter now work with new Filter API and DocIdSetIterator-based filters.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659635 13f79535-47bb-0310-9956-ffa450edef68
2008-05-23 19:25:05 +00:00
Otis Gospodnetic ce557ee4b3 LUCENE-852: Let the SpellChecker caller specify IndexWriter mergeFactor and RAM buffer size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659021 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:47:36 +00:00
Otis Gospodnetic b4f7c37bd1 LUCENE-1046: Removed dead code in SpellChecker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659019 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:34:18 +00:00
Otis Gospodnetic f68c9544de LUCENE-1183: Optimized TRStringDistance class (in contrib/spell) that uses less memory than the previous version
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659016 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:24:55 +00:00
Otis Gospodnetic a379a67875 LUCENE-1152 Fix for calling indexDictionary after clearIndex call
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659013 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:21:17 +00:00
Otis Gospodnetic 90be0daa46 - Made PlainTextDictionary(Reader) public from protected
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657728 13f79535-47bb-0310-9956-ffa450edef68
2008-05-19 05:45:31 +00:00
Otis Gospodnetic f5df30327e - Fixed messed up indentation/tabs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657281 13f79535-47bb-0310-9956-ffa450edef68
2008-05-17 01:57:32 +00:00
Otis Gospodnetic 1d5ba345cc - Javadocs fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657280 13f79535-47bb-0310-9956-ffa450edef68
2008-05-17 01:56:46 +00:00
Otis Gospodnetic d5c708a161 - Renamed vars a bit, so test is easier to understand
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657279 13f79535-47bb-0310-9956-ffa450edef68
2008-05-17 01:55:48 +00:00
Grant Ingersoll 7a27cdcbc9 LUCENE-1166: Added token filter for decomposing compound words
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657027 13f79535-47bb-0310-9956-ffa450edef68
2008-05-16 12:22:50 +00:00
Otis Gospodnetic aa0074f5db LUCENE-1003: Don't let RussianAnalyzer drop numbers.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@656111 13f79535-47bb-0310-9956-ffa450edef68
2008-05-14 05:37:45 +00:00
Otis Gospodnetic b182881092 - Made ant compile-core and build-contrib run again
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@656110 13f79535-47bb-0310-9956-ffa450edef68
2008-05-14 05:29:57 +00:00
Otis Gospodnetic 6a8bef8c74 - Made ant compile-core and build-contrib run again
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@656109 13f79535-47bb-0310-9956-ffa450edef68
2008-05-14 05:29:16 +00:00
Michael Busch 38d1caf56f LUCENE-1277: Remove println in highlighter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@652625 13f79535-47bb-0310-9956-ffa450edef68
2008-05-01 18:59:06 +00:00
Grant Ingersoll afa040e9da LUCENE-1276: Fixed contrib highlighter build
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@652471 13f79535-47bb-0310-9956-ffa450edef68
2008-05-01 10:34:50 +00:00
Mark Harwood 35c7eb36df Commit of LUCENE-794 patch - adding phrase/span query support to highlighter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@652164 13f79535-47bb-0310-9956-ffa450edef68
2008-04-29 22:07:18 +00:00
Karl-Johan Wettin 6a4ae85775 minor javadocs change
First paragraph is used as description in javadocs index so I added the sentence from build/pom. And it gave me an excuse to test my karma.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@649312 13f79535-47bb-0310-9956-ffa450edef68
2008-04-17 22:19:41 +00:00
Michael McCandless 44daebab7c LUCENE-1150: put back public tokenImage/TOKEN_TYPES in StandardTokenizer and WikipediaTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@648183 13f79535-47bb-0310-9956-ffa450edef68
2008-04-15 08:48:41 +00:00
Chris M. Hostetter 21b22f8515 some javadoc building cleanup for contrib/instantiated -- notably moving docs to doc-files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@643170 13f79535-47bb-0310-9956-ffa450edef68
2008-03-31 22:00:57 +00:00
Grant Ingersoll cc955c9748 LUCENE-400: Added ShingleFilter (token based ngram)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@642612 13f79535-47bb-0310-9956-ffa450edef68
2008-03-29 21:11:33 +00:00
Mark Harwood c172010da0 Applied trejkaz's patch from https://issues.apache.org/jira/browse/LUCENE-1240 to optimise TermFilter.java and included new JUnit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@638631 13f79535-47bb-0310-9956-ffa450edef68
2008-03-18 23:01:00 +00:00
Grant Ingersoll 9ac963952f LUCENE-1236: Added some more javadocs. Also removed @author tags
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@637449 13f79535-47bb-0310-9956-ffa450edef68
2008-03-15 18:05:10 +00:00
Chris M. Hostetter 32f1054d88 LUCENE-1202 - tweaks to build system to make clover work better with contribs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@637344 13f79535-47bb-0310-9956-ffa450edef68
2008-03-15 03:16:43 +00:00
Grant Ingersoll f4cc6e2269 LUCENE-550: Added RAMDirectory alternative as a contrib. Similar to MemoryIndex, but can hold more than one document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@636745 13f79535-47bb-0310-9956-ffa450edef68
2008-03-13 12:34:30 +00:00
Michael McCandless b36d9f9be3 download bdb zip from Oracle's servers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@636673 13f79535-47bb-0310-9956-ffa450edef68
2008-03-13 09:06:31 +00:00
Doron Cohen 25f80c71c9 LUCENE-1209: Fixed DocMaker settings by round. Prior to this fix, DocMaker settings of the first round were used in all rounds (e.g. term vectors).

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635280 13f79535-47bb-0310-9956-ffa450edef68
2008-03-09 16:43:32 +00:00
Doron Cohen ce0309ed5f Javadoc clarification for getRecall().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635195 13f79535-47bb-0310-9956-ffa450edef68
2008-03-09 08:30:28 +00:00
Doron Cohen 8b2f12851b make the name of multi-value property long enough for the values.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635194 13f79535-47bb-0310-9956-ffa450edef68
2008-03-09 08:28:53 +00:00
Mark Harwood 37a060c15c Additional thread safety around filter creation - old code could create duplicate CachingWrapperFilter if thread 1 gets a cache miss and thread 2 has a cache miss before thread 1 populates the cache with a new CachingWrapperFilter.
Synchronization cost around whole method is OK here because Filter object construction should be a lightweight call.
Note: CachingWrapperFilter currently has a similar bug in bits() method but adding "synchronized" around that whole method would not be a solution there because of the cost of evaluating filter.bits and the unnecessary blocking effect this would have on threads using different readers to the thread with the lock.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@628921 13f79535-47bb-0310-9956-ffa450edef68
2008-02-18 23:39:12 +00:00
Mark Harwood caed55ef5e Added thread-safety around use of core's QueryParser.
Old XML parser constructors use a mode which will synchronize on use of the user-supplied QueryParser. 
New constructors offer alternative option of passing "defaultField" String which is used to create a new single-use QueryParser for each parse operation.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@628568 13f79535-47bb-0310-9956-ffa450edef68
2008-02-17 22:32:58 +00:00
Michael McCandless fc4bc1f5af LUCENE-1084: fix more test cases to use user-defined MaxFieldLength ctor for IndexWriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619637 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 21:07:35 +00:00
Grant Ingersoll 00941055fd benchmark autocommit
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619457 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 15:33:02 +00:00
Grant Ingersoll 0dc6c59ac1 LUCENE-494: Added QueryAutoStopWordAnalyzer in a new query subpackage
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619420 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 14:13:38 +00:00
Michael McCandless 7ea8bd35f4 LUCENE-1084: fix the remaining deprecated calls to IndexWriter ctors
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@618764 13f79535-47bb-0310-9956-ffa450edef68
2008-02-05 20:07:29 +00:00
Erik Hatcher 140a867933 Correct javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@618275 13f79535-47bb-0310-9956-ffa450edef68
2008-02-04 13:31:02 +00:00
Michael McCandless afb58ca0d2 when reinit() is called, reset the start time (used only when printing # docs indexed)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@617991 13f79535-47bb-0310-9956-ffa450edef68
2008-02-03 14:24:30 +00:00
Michael Busch 18b61286fa LUCENE-584: Changed Filter API to return a DocIdSet instead of a java.util.BitSet. This allows using more efficient data structures for Filters and makes them more flexible.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@617859 13f79535-47bb-0310-9956-ffa450edef68
2008-02-02 19:04:03 +00:00
Grant Ingersoll 3db9002efb LUCENE-1156: see CHANGES.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616934 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 22:47:52 +00:00
Grant Ingersoll 61f2aa3078 allow TrecDocMaker to be more easily extended by making values protected
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616882 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 20:03:39 +00:00
Grant Ingersoll 4fc8654bc1 Made some member variables protected so they can be accessed by extending classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616806 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 16:29:35 +00:00
Grant Ingersoll b95e9a5154 LUCENE-1132: Updated documentation and deprecated bytes references in favor of character references
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616305 13f79535-47bb-0310-9956-ffa450edef68
2008-01-29 13:35:13 +00:00
Michael McCandless 38f8d4e837 LUCENE-1150: make StandardAnalyzer tokenizer constants public again (public access was accidentally removed with LUCENE-966)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616248 13f79535-47bb-0310-9956-ffa450edef68
2008-01-29 10:51:44 +00:00
Grant Ingersoll 7dd34ef888 LUCENE-1153: added lib/junit and build support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@615329 13f79535-47bb-0310-9956-ffa450edef68
2008-01-25 20:27:55 +00:00
Michael McCandless ee835ccf21 LUCENE-1084: change all IW ctors to require up-front specification of max field length; deprecated existing ones
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@615168 13f79535-47bb-0310-9956-ffa450edef68
2008-01-25 10:21:15 +00:00
Doron Cohen 8ace4103a7 LUCENE-1136: add ability to not count sub-task doLogic increment to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614956 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 18:46:57 +00:00
Grant Ingersoll 7f09c9f00e Added javadocs about UNTOKENIZED_TOKEN_FLAG conundrum
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614905 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 15:42:50 +00:00
Grant Ingersoll 305c47f500 LUCENE-1133: Adds ability to keep certain strings as single tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614895 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 15:05:53 +00:00
Grant Ingersoll 1183763dbe LUCENE-1128 and 1129: Add highlighting support to benchmarking, plus fix minor traversalSize bug in ReadTask, also added a few new algorithms to try out
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614885 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 14:39:44 +00:00
Grant Ingersoll f75f490eb9 LUCENE-1127: added couple of convenience methods to TokenSources
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614884 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 14:36:46 +00:00
Michael McCandless dd0265469a LUCENE-1139: fix IndexWriter settings (compound, merge.factor, etc.) to work again
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614022 13f79535-47bb-0310-9956-ffa450edef68
2008-01-21 21:42:43 +00:00
Michael McCandless 354a3175d5 LUCENE-1139: various additions/fixes to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@613536 13f79535-47bb-0310-9956-ffa450edef68
2008-01-20 11:31:38 +00:00
Grant Ingersoll ed9fdba601 LUCENE-1050 and LUCENE-1138 fixes for lock problem
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@612868 13f79535-47bb-0310-9956-ffa450edef68
2008-01-17 17:00:43 +00:00
Grant Ingersoll 0a54ca1920 LUCENE-1135. Added Javadoc marking the WikipediaTokenizer as experimental
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@612244 13f79535-47bb-0310-9956-ffa450edef68
2008-01-15 22:01:21 +00:00
Michael Busch e5da9db486 Rename README files to uppercase letters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610961 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 22:38:38 +00:00
Michael Busch 49dd305f69 Rename README files to uppercase letters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610960 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 22:37:27 +00:00
Michael McCandless ea951785eb LUCENE-1117: fix EnwikiDocMaker to not hang when the producer thread hits exception
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610738 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 10:12:18 +00:00
Michael McCandless f0d5002066 LUCENE-1117: fix intermittent thread safety issue w/ EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@609080 13f79535-47bb-0310-9956-ffa450edef68
2008-01-05 01:51:53 +00:00
Grant Ingersoll 79e09db401 LUCENE-1103: Internal links should increment as all tokens do, since the first token is valid too
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608989 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 20:36:23 +00:00
Grant Ingersoll b18f6ae959 LUCENE-1103: The link is now incremented by 1, but then the next token in the link is not incremented. This way, the link is not associated with the previous term. Instead it is associated with the next term in the link, which would be the display tokens. If there are no display tokens, then it will take its proper place in the token chain.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608978 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 20:15:22 +00:00
Grant Ingersoll f715fc6031 LUCENE-1103
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608852 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 14:29:15 +00:00
Michael McCandless f12862426a fix typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608523 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 15:20:41 +00:00
Doron Cohen 9e65cd554f LUCENE-1116: contrib/benchmark quality package improvements (MRR, Trec1MQ)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608370 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 07:44:40 +00:00
Michael McCandless 263244312d LUCENE-1115: some small fixes to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608306 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 01:48:18 +00:00
Grant Ingersoll ed893f770c LUCENE-1114: Updated example
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608126 13f79535-47bb-0310-9956-ffa450edef68
2008-01-02 15:30:40 +00:00
Michael Busch 75473edb02 LUCENE-746: Fix error message in AnalyzingQueryParser.getPrefixQuery.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607832 13f79535-47bb-0310-9956-ffa450edef68
2008-01-01 12:49:44 +00:00
Grant Ingersoll 90a735441f LUCENE-1102: EnwikiDocMaker now adds a docid field
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607732 13f79535-47bb-0310-9956-ffa450edef68
2007-12-31 13:07:14 +00:00
Doron Cohen ece8361ab5 LUCENE-749: ChainedFilter behavior fixed when logic of first filter is ANDNOT.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607606 13f79535-47bb-0310-9956-ffa450edef68
2007-12-30 22:47:59 +00:00
Grant Ingersoll bd340a896d
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607291 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:08:26 +00:00
Grant Ingersoll 5c81934465
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607290 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:08:16 +00:00
Grant Ingersoll cb94c6aed4
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607288 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:07:33 +00:00
Grant Ingersoll 40d85a7781 Switch to using the EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607281 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 16:29:03 +00:00
Grant Ingersoll ca821526b0 removed bad chars at end of file
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605966 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:33:16 +00:00
Grant Ingersoll 36b1206ad8 Restoring ExtractWikipedia, as it is still a handy class to have around. Splitting the documents is useful for debugging purposes when you know you want to look at a specific document instead of grepping through a really large file.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605959 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:14:24 +00:00
Grant Ingersoll 55d0c3a2f8 LUCENE-1077: refactored to have a common PayloadHelper class. Also added TokenOffsetPayloadTokenFilter, which encodes the Token offset into the payloads
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@604870 13f79535-47bb-0310-9956-ffa450edef68
2007-12-17 13:55:46 +00:00
Doron Cohen b7e167ac8d LUCENE-1086: DocMakers setup for the "docs.dir" property fails when passing an absolute path.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603856 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 08:58:52 +00:00
Doron Cohen 73f9e7ebc0 fix potential thread-safety issue in contrib/benchmark's TrecDocMaker.
(follow-up to http://svn.apache.org/viewvc?view=rev&revision=602475)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603846 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 07:26:58 +00:00
Michael McCandless 86ca6f86d7 fix intermittent thread-safety failure in contrib/benchmark unit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602475 13f79535-47bb-0310-9956-ffa450edef68
2007-12-08 14:17:07 +00:00
Michael McCandless b0d2b1c90e LUCENE-1044: revert the doSync option to FSDirectory
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602165 13f79535-47bb-0310-9956-ffa450edef68
2007-12-07 17:42:33 +00:00
Grant Ingersoll f9b2e971f2 LUCENE-1077 new sinks and payloads analysis packages
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602081 13f79535-47bb-0310-9956-ffa450edef68
2007-12-07 12:21:49 +00:00
Michael McCandless 6be2c0765c LUCENE-1044: also re-default doSync back to false in contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596874 13f79535-47bb-0310-9956-ffa450edef68
2007-11-20 23:17:44 +00:00
Michael Busch b04703fe8f LUCENE-1055: Remove gdata from trunk.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596501 13f79535-47bb-0310-9956-ffa450edef68
2007-11-20 00:46:27 +00:00
Michael Busch 1abb04580f Disable verbose standard output in MemoryIndexTest by default.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596000 13f79535-47bb-0310-9956-ffa450edef68
2007-11-17 20:19:17 +00:00
Mark Harwood 04ae927f38 Added toString implementation on BooleanFilter.java, provided by Jason Calabrese
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@595996 13f79535-47bb-0310-9956-ffa450edef68
2007-11-17 20:08:06 +00:00
Michael Busch bb37d2bcff LUCENE-1051: Generate separate javadocs for core, demo and contrib classes, as well as a unified view.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@595013 13f79535-47bb-0310-9956-ffa450edef68
2007-11-14 19:16:19 +00:00
Michael McCandless 439ba586fc LUCENE-1044: add doSync option to FSDirectory.getDirectory, defaulting to true, to sync() each file descriptor before close()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@593799 13f79535-47bb-0310-9956-ffa450edef68
2007-11-10 17:51:00 +00:00
Daniel Naber 2f5507bfc9 fix returning unbalanced quotes in describeParams()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@586918 13f79535-47bb-0310-9956-ffa450edef68
2007-10-21 17:26:16 +00:00
Grant Ingersoll a614f0d99a Added some more algorithms for testing things out, implemented basic TREC query driver based on the sample in the javadocs.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@585677 13f79535-47bb-0310-9956-ffa450edef68
2007-10-17 20:36:20 +00:00
Grant Ingersoll b7253a06b7 LUCENE-1027: Added better formatting of doubles, added wikipedia-flush-by-RAM for comparison
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583792 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 12:10:31 +00:00
Grant Ingersoll 9c9ebe5cf4 LUCENE-1027: Added support for doubles to Config, also added copies of standard and micro-standard algorithms that flush by RAM
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583771 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 11:05:40 +00:00
Mark Harwood 21a07ee41e Provided DTDs for core and contrib XML query syntax. The "docs" directory contains detailed documentation generated by DTDdoc from the DTDs. The ant script used to generate these docs is also included but not hooked up to the main build process due to license issues with DTDdoc.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583307 13f79535-47bb-0310-9956-ffa450edef68
2007-10-09 21:45:27 +00:00
Mark Harwood 3872d3bfcc Updated hashcode/equals to test all fields
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583305 13f79535-47bb-0310-9956-ffa450edef68
2007-10-09 21:40:45 +00:00
Chris M. Hostetter 243861715b cleaning up a ton of javadoc warnings from gdata. most of these fixes related to either: clarifying packages for @link tags; changing @link or @see tags that pointed at classes/methods that didn't exist (by picking classes with very similar names that do exist); or removing incomplete stub javadocs (that added no information beyond the signature)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@582380 13f79535-47bb-0310-9956-ffa450edef68
2007-10-05 20:30:59 +00:00
Mark Harwood 62fa7b4b82 Added new DuplicateFilter functionality to filter documents sharing a field value (e.g. primary key/url)
Also includes Junit test and XML Query support

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@581426 13f79535-47bb-0310-9956-ffa450edef68
2007-10-02 22:56:46 +00:00
Grant Ingersoll dce47c6401 LUCENE-1005, apply GMT timeZone to the data formatter so it outputs properly formatted dates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579766 13f79535-47bb-0310-9956-ffa450edef68
2007-09-26 19:15:26 +00:00
Grant Ingersoll bcfad28d69 LUCENE-1005, apply GMT timeZone to the data formatter so it outputs properly formatted dates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579765 13f79535-47bb-0310-9956-ffa450edef68
2007-09-26 19:12:59 +00:00
Michael McCandless a28eb4d978 LUCENE-994: change defaults in IndexWriter to maximize 'out of the box' indexing speed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579360 13f79535-47bb-0310-9956-ffa450edef68
2007-09-25 20:02:07 +00:00
Michael McCandless 511406ecbe remove temporary print for GData unit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577892 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 19:41:42 +00:00
Chris M. Hostetter 3f517bff75 don't just write date to stdout, include date string in failure message
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577873 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 18:52:42 +00:00
Michael McCandless fada31fa7f adding temporary print to figure out why this gdata-server test is failing on build machine
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577861 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 18:38:23 +00:00
Doron Cohen 40f0adb507 LUCENE-941: (leftover - add info in benchmark/CHANGES.txt entry)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576790 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:13:15 +00:00
Doron Cohen 9e51c30349 LUCENE-941: benchmark: infinite loop for alg: {[AddDoc(4000)]: 4} : *
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576786 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:05:06 +00:00
Michael Busch 9c2a036db3 - LUCENE-908: Improvements and simplifications for how the MANIFEST file and the META-INF dir are created.
- LUCENE-935: Various improvements for the maven artifacts. Now the artifacts also include the sources as .jar files. 

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@568766 13f79535-47bb-0310-9956-ffa450edef68
2007-08-22 23:16:48 +00:00
Grant Ingersoll c67fd79a83 LUCENE-981 and LUCENE-980: Added new AnalyzerTask and fixed issue with long strings in Format.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567262 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 12:24:21 +00:00
Grant Ingersoll d1f90c7825 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566454 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:49:32 +00:00
Grant Ingersoll 9192b16643 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566435 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:23:06 +00:00
Grant Ingersoll 477c4e0efe Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566433 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:22:46 +00:00
Michael McCandless d42de32984 LUCENE-969: deprecate Token.termText() & optimize core tokenizers by re-using tokens & TokenStreams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564715 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 18:34:33 +00:00
Grant Ingersoll 82eb074afd LUCENE-974: Removed Author tags from all existing code
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564236 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 15:21:19 +00:00
Michael McCandless d1422ebd6b LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 08:57:26 +00:00
Michael McCandless 2d954694dc LUCENE-966: sizable (~6X faster) speedups to StandardTokenizer by using JFlex instead of JavaCC
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564036 13f79535-47bb-0310-9956-ffa450edef68
2007-08-08 22:26:44 +00:00
Michael McCandless 0fd867732e LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 18:54:43 +00:00
Doron Cohen f3b9c9407a for LUCENE-836 sort reuters files by name (otherwise TestQualityRun can fail on some OSs).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560428 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 23:56:48 +00:00
Doron Cohen 98fa2d898d LUCENE-836: Add support for search quality benchmarking.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:24:52 +00:00
Michael McCandless 02dd452026 LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68
2007-07-25 08:54:58 +00:00
Grant Ingersoll e97d5830ce LUCENE-868: New Term Vector access mechanism. Allows for applications to define how they access term vector information instead of having to pack/unpack the TV info returned by the old way.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558592 13f79535-47bb-0310-9956-ffa450edef68
2007-07-23 03:17:25 +00:00
Michael McCandless 96ea45d193 LUCENE-952: force synchronized access to writer instance variable to fix infinite spin loop in TestGdataIndexer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@554189 13f79535-47bb-0310-9956-ffa450edef68
2007-07-07 12:28:04 +00:00
Michael McCandless cff5767e44 LUCENE-843: speed up IndexWriter performance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553236 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 15:16:38 +00:00
Michael McCandless c03f45a62d LUCENE-848: make sure we use the right version (2.9.0) of XercesJ
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 01:06:07 +00:00
Grant Ingersoll bc7c586468 LUCENE-848. Add Wikipedia benchmarking support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552229 13f79535-47bb-0310-9956-ffa450edef68
2007-07-01 02:19:10 +00:00
Doron Cohen 9ff9bf8142 fix javadoc unknown tag warning.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552111 13f79535-47bb-0310-9956-ffa450edef68
2007-06-30 07:04:27 +00:00
Doron Cohen ba90fb06c1 LUCENE-946: replace text from an online collection (used in
few test cases) with text that is surely 100% free.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552009 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 19:43:19 +00:00
Otis Gospodnetic 71f2c1da8b - LUCENE-906: Elision filter for French.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551744 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 00:36:09 +00:00
Doron Cohen d9b6aa9c0b LUCENE-945: tests failed to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551077 13f79535-47bb-0310-9956-ffa450edef68
2007-06-27 06:49:38 +00:00
Doron Cohen e6c659269a LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat;
logging for addDoc/deleteDoc tasks;


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 18:27:21 +00:00
Michael Busch d955a970b6 LUCENE-622: Add ant target and pom.xml files for building maven artifacts of the Lucene core and the contrib modules.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@547860 13f79535-47bb-0310-9956-ffa450edef68
2007-06-16 04:45:13 +00:00
Chris M. Hostetter 373af70a20 LUCENE-930: Various contrib building improvements to ensure contrib dependencies are met, and test compilation errors fail the build.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@546226 13f79535-47bb-0310-9956-ffa450edef68
2007-06-11 19:29:41 +00:00
Michael Busch df0a188415 LUCENE-931: adding missing license headers to various files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545696 13f79535-47bb-0310-9956-ffa450edef68
2007-06-09 06:09:46 +00:00
Mark Harwood 1f1528e3ca Exposed the MoreLikeThis "minDocFreq" property for use in MoreLikeThisQuery.java and in XML queries
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545333 13f79535-47bb-0310-9956-ffa450edef68
2007-06-07 22:13:37 +00:00
Michael Busch 731fb3a228 LUCENE-908:
- Improves content of MANIFEST file and makes it customizable for the contribs.
- Adds SNOWBALL-LICENSE.txt to META-INF of the snowball jar 
- lucli jar now contains LICENSE.txt and NOTICE.txt.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545010 13f79535-47bb-0310-9956-ffa450edef68
2007-06-07 00:48:32 +00:00
Michael Busch 4355588972 LUCENE-740: Added SNOWBALL-LICENSE.txt to the snowball package and a remark about the license to NOTICE.TXT.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@544703 13f79535-47bb-0310-9956-ffa450edef68
2007-06-06 01:33:11 +00:00
Chris M. Hostetter 1a3de38a0c LUCENE-885: fixing bug with compilation on java 1.4, introduced by an earlier commit for this issue
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@543257 13f79535-47bb-0310-9956-ffa450edef68
2007-05-31 20:58:55 +00:00
Daniel Naber 6e98fa61d4 LUCENE-763: LuceneDictionary skips first word in enumeration
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@543220 13f79535-47bb-0310-9956-ffa450edef68
2007-05-31 19:13:36 +00:00
Daniel Naber 42c3d11f07 small cleanup only: imports; no need for try...catch(...) { fail(); } in a test case
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@543216 13f79535-47bb-0310-9956-ffa450edef68
2007-05-31 19:00:30 +00:00
Chris M. Hostetter cb38e0c06c LUCENE-885: include contribs in 'ant test' ... includes fixes for several contribs to make tests run cleanly regardless of what working directory they are run in
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@542769 13f79535-47bb-0310-9956-ffa450edef68
2007-05-30 07:18:29 +00:00
Michael Busch 8caf74c26e LUCENE-895: Disable tests "testSimple", "testWildcard" and "testPrecedence" in TestPrecedenceQueryParser.java to make this test case pass.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@542735 13f79535-47bb-0310-9956-ffa450edef68
2007-05-30 03:00:51 +00:00
Daniel Naber 8e9dd085ab LUCENE-886: cleanup: javadoc improvements; don't print exceptions to stderr but re-throw them; new test case
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@541885 13f79535-47bb-0310-9956-ffa450edef68
2007-05-26 11:06:38 +00:00
Mark Harwood 132c5fd899 Added equals/hashcode implementations to enable caching
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@541816 13f79535-47bb-0310-9956-ffa450edef68
2007-05-25 23:23:07 +00:00
Daniel Naber 3c4dc0484b LUCENE-883: consecutive calls to Spellchecker.indexDictionary() won't insert terms twice anymore. Fixes the test case failure.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@539732 13f79535-47bb-0310-9956-ffa450edef68
2007-05-19 11:27:54 +00:00
Daniel Naber 52acd10f63 LUCENE-882: Spellchecker doesn't need to store ngrams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@539727 13f79535-47bb-0310-9956-ffa450edef68
2007-05-19 11:04:38 +00:00
Grant Ingersoll b5e09521b5 javadoc fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@538783 13f79535-47bb-0310-9956-ffa450edef68
2007-05-17 01:13:40 +00:00
Mark Harwood 336d3e7c58 Added hashcode and equals implementations to enable caching
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@537123 13f79535-47bb-0310-9956-ffa450edef68
2007-05-11 07:14:18 +00:00
Doron Cohen a0437dab03 clean few javadoc warnings that showed up as javadocs are built for a few more contrib paths. (LUCENE-875 related).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@536668 13f79535-47bb-0310-9956-ffa450edef68
2007-05-09 21:49:01 +00:00
Doron Cohen 8ed11daae0 javadoc warnings and errors solved (except for gdata).
build.xml "javadocs" target will now fail if there are 
javadoc warnings/errors under jdk1.4.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@535448 13f79535-47bb-0310-9956-ffa450edef68
2007-05-05 05:36:49 +00:00
Doron Cohen bcd57a9af0 implement isPayloadAvailable (just return false, payloads unimplemented here), b/c it doesn't compile without it.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@535361 13f79535-47bb-0310-9956-ffa450edef68
2007-05-04 19:40:14 +00:00
Otis Gospodnetic 5b63c00106 - LUCENE-863: Deprecated StandardBenchmaker in favour of byTask benchmark tasks.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529790 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 22:11:09 +00:00
Mark Harwood 2333dd080b Resolved bug LUCENE-862 reported by Antony Bowesman
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529512 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 07:13:05 +00:00
Doron Cohen 33694c7d86 LUCENE-864: set svn:eol-style=native.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529511 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 07:11:04 +00:00
Mark Harwood 1877f9e3dc Fixed Junit failure due to recent change in QueryParser default behaviour. Needed to explicitly require old style of RangeQuery implementation rather than new default of RangeFilters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529417 13f79535-47bb-0310-9956-ffa450edef68
2007-04-16 22:04:07 +00:00
Doron Cohen 934a56e55f contrib/benchmark: better error handling and javadocs around "exhaustive" doc making.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@528617 13f79535-47bb-0310-9956-ffa450edef68
2007-04-13 19:30:03 +00:00
Otis Gospodnetic 5650a7bf92 - find . -name \*java -type f | xargs perl -pi -e 's/(\r\n|\n|\r)/\n/g' \
&& find . -name \*java -type f | xargs svn propset svn:eol-style native



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524969 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 23:37:14 +00:00
Otis Gospodnetic 3a1652a95b - Added the missing ASL 2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524958 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:47:10 +00:00
Otis Gospodnetic af77bc03c1 - Fixed javadoc typos.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524952 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:27:56 +00:00
Otis Gospodnetic e158736d91 - Import cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524950 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:27:05 +00:00
Otis Gospodnetic ec0493167c - Don't use the deprecated FSDirectory.getDirectory(indexDir, boolean)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524949 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:26:44 +00:00
Otis Gospodnetic 023ffe2c29 - Cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524914 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 20:27:34 +00:00
Doron Cohen 031f50c4e7 LUCENE-849: configurable HTML Parser; external classes; exhaustive doc maker - '*';
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@522569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-26 16:46:33 +00:00
Doron Cohen 346b484473 Documentation updates following LUCENE-837.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521830 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 17:56:09 +00:00
Grant Ingersoll 251c2b328a LUCENE-837:
Fixed issues related to Doron's comments on Mar. 22, 07.

Put in tests for ***FieldSelectorTask.  Moved the byte field handling from ReutersDocMaker to BasicDocMaker.

Added in tests for new Task

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521717 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 13:13:46 +00:00
Grant Ingersoll cfff81d1dc LUCENE-837:
fix copyright

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521698 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 12:26:08 +00:00
Grant Ingersoll f4fffb3491 LUCENE-837:
Added optional bytes field to store on the Document.  Enabled ReutersDocMaker w/ the ability to store byte data in a field.  If the param is set (see the javadocs) it will store the contents of the body as a UTF-8 byte array.

Then, the SearchTravRetLoadFieldSelectorTask (whew) can take in parameters specifying what fields to load (others are ignored by default)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 03:48:12 +00:00
Doron Cohen 34b560603a LUCENE-840: benchmarking code correctness tests were added.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521526 13f79535-47bb-0310-9956-ffa450edef68
2007-03-22 23:13:48 +00:00
Grant Ingersoll e09189a830 LUCENE-837 applied
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@520890 13f79535-47bb-0310-9956-ffa450edef68
2007-03-21 13:52:34 +00:00
Doron Cohen 9a31a8828f correct names of dynamically loaded classes (analyzers, doc/query makers)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@519567 13f79535-47bb-0310-9956-ffa450edef68
2007-03-18 06:51:06 +00:00
Doron Cohen 977e729685 LUCENE-829: close done readers in contrib/benchmark.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@518822 13f79535-47bb-0310-9956-ffa450edef68
2007-03-16 01:16:56 +00:00
Michael Busch b9edb2867f Make contrib/db and contrib/memory compile with changes from LUCENE-755
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@518792 13f79535-47bb-0310-9956-ffa450edef68
2007-03-15 22:35:52 +00:00
Doron Cohen ab763bbdae contrib.benchmark.byTask javadocs corrections
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@517566 13f79535-47bb-0310-9956-ffa450edef68
2007-03-13 06:54:03 +00:00
Otis Gospodnetic 1a48e218d6 - Committing forgotten classes for LUCENE-759
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@517477 13f79535-47bb-0310-9956-ffa450edef68
2007-03-13 00:30:13 +00:00
Grant Ingersoll f4490ac789 LUCENE 791, INFRA-1173:
Migrated references from Jakarta Lucene Wiki to Lucene Java Wiki

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@516586 13f79535-47bb-0310-9956-ffa450edef68
2007-03-09 22:29:00 +00:00
Erik Hatcher b972b13174 convert interfaces to public visibility that already should have had it
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@515431 13f79535-47bb-0310-9956-ffa450edef68
2007-03-07 02:39:24 +00:00
Otis Gospodnetic d7e203c8b4 - LUCENE-786: Javadoc improvements from Karl Wettin and an extra searcher.close() from me
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513878 13f79535-47bb-0310-9956-ffa450edef68
2007-03-02 18:28:55 +00:00
Otis Gospodnetic 534be1599d - LUCENE-759: Two n-gram producing TokenFilters (using them for the spellchecker in SOLR-81)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513876 13f79535-47bb-0310-9956-ffa450edef68
2007-03-02 18:19:53 +00:00
Otis Gospodnetic 6636d88def - 2-char indentation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513866 13f79535-47bb-0310-9956-ffa450edef68
2007-03-02 17:54:27 +00:00
Otis Gospodnetic 7b570fc8b2 - LUCENE-759: Made the tokenizer capable of creating n-grams of varying sizes - from min to max characters per n-gram. Patch from Adam Hiatt.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513344 13f79535-47bb-0310-9956-ffa450edef68
2007-03-01 14:22:57 +00:00
Grant Ingersoll ee12ced8e7 LUCENE-801: build lucene core and demo first, change classpath to use the build classes instead of the jar
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@507260 13f79535-47bb-0310-9956-ffa450edef68
2007-02-13 22:17:24 +00:00
Grant Ingersoll 13eaccfd56 Lucene 790
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@506441 13f79535-47bb-0310-9956-ffa450edef68
2007-02-12 13:32:20 +00:00
Grant Ingersoll bb66099414 Applied 788 and 790 from Doron Cohen. Ran both the micro-standard and the task runs and results look reasonable.
Thanks, Doron

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@506093 13f79535-47bb-0310-9956-ffa450edef68
2007-02-11 18:59:22 +00:00
Wolfgang Hoschek 0695150949 javadoc cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@504229 13f79535-47bb-0310-9956-ffa450edef68
2007-02-06 18:09:19 +00:00
Mark Harwood 0149b5f82f Changes to improve server-side speed and thread safety by caching reusable, compiled stylesheets.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@503500 13f79535-47bb-0310-9956-ffa450edef68
2007-02-04 22:51:28 +00:00
Mark Harwood 387d95417b Added missing Apache license comments - thanks for the reminder, Otis.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@500417 13f79535-47bb-0310-9956-ffa450edef68
2007-01-26 23:51:38 +00:00
Mark Harwood c02aed3b5e Added QueryTemplateManager.java to aid construction of XML queries from form input by using XSL templates. A Junit test provides examples of use. This approach offers a convenient way of externalizing and changing how user input is turned into Lucene queries. Database applications often adopt similar practices by externalizing SQL in template files that can be easily changed/optimized by a DBA.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@500053 13f79535-47bb-0310-9956-ffa450edef68
2007-01-25 23:31:02 +00:00
Mark Harwood 9251a63e01 Fixed bug when BooleanFilter class is used in conjunction with cached filters - the anding of filters' bitsets was changing the original rather than taking a copy of the contents. Changed to use BitSet.clone before performing any destructive operation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@499592 13f79535-47bb-0310-9956-ffa450edef68
2007-01-24 22:19:05 +00:00
Otis Gospodnetic 8cafdd9b64 - Removed isEmpty() Java 6 method, so Andrzej can compile Luke
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@496628 13f79535-47bb-0310-9956-ffa450edef68
2007-01-16 09:07:01 +00:00
Otis Gospodnetic 74e68c9287 - Removed extra ;
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@496283 13f79535-47bb-0310-9956-ffa450edef68
2007-01-15 11:45:04 +00:00
Grant Ingersoll db238e787f Lucene 675: Initial commit of Doron Cohen's byTask benchmarking contribution. Thanks Doron!
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@495834 13f79535-47bb-0310-9956-ffa450edef68
2007-01-13 04:08:23 +00:00
Mark Harwood 18ece160e3 Fix to apply any custom boost setting when the Query is rewritten
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@492830 13f79535-47bb-0310-9956-ffa450edef68
2007-01-05 00:18:32 +00:00
Mark Harwood 998908257b Added new "CachedFilter" feature to XML syntax enabling any queries or filters to be cached for better repeat performance. Added JUnit test and example XML file. Also fixed ClassCastException in DOMUtils which occurred when getAttributeWithInheritance reached the root of a document without finding the required attribute.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@492823 13f79535-47bb-0310-9956-ffa450edef68
2007-01-05 00:05:17 +00:00
Grant Ingersoll 8cffe45c2a replaced string concatenation with chained appends
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@492383 13f79535-47bb-0310-9956-ffa450edef68
2007-01-04 02:17:58 +00:00
Daniel Naber 67399fdd8d ignore case in the names of commands (Search == search etc.)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@490513 13f79535-47bb-0310-9956-ffa450edef68
2006-12-27 15:22:22 +00:00
Otis Gospodnetic 2cf113a022 - Javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@489847 13f79535-47bb-0310-9956-ffa450edef68
2006-12-23 03:36:34 +00:00
Otis Gospodnetic 8b7f6e4ef6 - LUCENE-759: New n-gram-capable tokenizers and their unit tests.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@489802 13f79535-47bb-0310-9956-ffa450edef68
2006-12-22 23:43:17 +00:00
Otis Gospodnetic 17315a9be7 - LUCENE-741: command-line utility for modifying or removing field norms
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@489213 13f79535-47bb-0310-9956-ffa450edef68
2006-12-20 22:32:13 +00:00
Michael McCandless 7c4b667f71 LUCENE-748: add javadoc around semantics of Exception during IndexWriter.close()
LUCENE-129: fix finalizers to always call super.finalize
LUCENE-301: add new constructors IndexWriter({String,File,Directory}, Analyzer) that create index if it's not already there, else append
LUCENE-701: found two cases of "open IndexWriter for create while reader is open, on Windows" that I didn't properly fix; added new test cases
LUCENE-702: corrected some small javadoc issues


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@488640 13f79535-47bb-0310-9956-ffa450edef68
2006-12-19 11:31:27 +00:00
Simon Willnauer 550b50fdd8 redefined compile-core and compile-test to exclude db4o-dependent classes if db4o jar is not present
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@487315 13f79535-47bb-0310-9956-ffa450edef68
2006-12-14 19:26:27 +00:00
Simon Willnauer fb0c30d061 defined default task to use inherited ant tasks in sub folders
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@487313 13f79535-47bb-0310-9956-ffa450edef68
2006-12-14 19:25:43 +00:00
Simon Willnauer ad3ccde3a0 fixed some path errors due to the new structure
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486811 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 19:36:49 +00:00
Simon Willnauer 4374e2e88a created internal "subproject" structure to reuse the common-build.xml ant tasks
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486798 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 19:04:23 +00:00
Simon Willnauer 5aabf7a0b9 Fixed compile error in GDataIndexWriterTest as IndexWriter no longer has a commitTimeout setter.
Fixed misspelled path in gom-build.xml


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486631 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:55:43 +00:00
Simon Willnauer d775297874 modifications added to changes.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486629 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:46:13 +00:00
Simon Willnauer 4711af4ef3 Removed unnecessary UML metadata
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486628 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:41:32 +00:00
Simon Willnauer 3bd4a16711 Initial Revision GData Object Model
GOM is a GData object representation for rendering, parsing and manipulating GData feeds (RSS/ATOM)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486627 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:39:54 +00:00
Simon Willnauer 9cc3210dd0 deleted hivemind related test from core test folder.
This source has been copied/moved to ./src/hivemind/test

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486625 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:37:16 +00:00
Simon Willnauer e3e2ed3801 Applied new structure of the gdata-server project.
All source and test-sources are located under the ./src/ folder.
Created GData Object Model (GOM) folder ./src/gom, Core-Folder ./src/core and ./src/hivemind
Sources and tests are now split into their "sub" projects.

Fixed compile error in GDataIndexWriter as IndexWriter no longer has a commitTimeout setter.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486622 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:32:08 +00:00
Otis Gospodnetic 00250552f8 - Cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@485884 13f79535-47bb-0310-9956-ffa450edef68
2006-12-11 21:42:32 +00:00
Otis Gospodnetic 65cd9b86bf - Reindented
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@483579 13f79535-47bb-0310-9956-ffa450edef68
2006-12-07 18:07:09 +00:00
Andreas Vajda 63d2e582ad added support for Berkeley DB 4.5.20
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@480800 13f79535-47bb-0310-9956-ffa450edef68
2006-11-30 01:28:11 +00:00
Wolfgang Hoschek f620d56554 find/replace tab --> space
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@480343 13f79535-47bb-0310-9956-ffa450edef68
2006-11-29 02:35:43 +00:00
Wolfgang Hoschek 06a9b2e22a better estimation of memory consumption
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@480339 13f79535-47bb-0310-9956-ffa450edef68
2006-11-29 02:29:19 +00:00
Otis Gospodnetic 57ce49d49a - LUCENE-728: Removed duplicate/old MoreLikeThis and SimilarityQueries classes,
as their new home is under contrib/queries.
  This means contrib/similarity is now completely empty.  I am leaving it in SVN
  for now, as it seems like it could be a nice warm home for custom Similarity
  implementations.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@480200 13f79535-47bb-0310-9956-ffa450edef68
2006-11-28 20:46:42 +00:00
Wolfgang Hoschek 8dc26ad30e javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479895 13f79535-47bb-0310-9956-ffa450edef68
2006-11-28 04:36:41 +00:00
Wolfgang Hoschek 64d4421321 cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479755 13f79535-47bb-0310-9956-ffa450edef68
2006-11-27 20:36:29 +00:00
Wolfgang Hoschek 06b36cb004 For repeatable exceptions, store into cache only if child.tokenStream(fieldName, reader) does not throw an exception
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479754 13f79535-47bb-0310-9956-ffa450edef68
2006-11-27 20:34:45 +00:00
Wolfgang Hoschek deb74c1dfb javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479751 13f79535-47bb-0310-9956-ffa450edef68
2006-11-27 20:30:52 +00:00
Wolfgang Hoschek 8ab762aef2 Reverting back because of:
"Actually, now I'm considering reverting back to the version without a public clear() method. The rationale is that this would be less complex and more consistent with the AnalyzerUtil design (simple methods generating simple anonymous analyzer wrappers). If desired, you can still (re)use a single static "child" analyzer instance. It's cheap and easy to create a new caching analyzer on top of the static analyzer, and to do so before each document. The old one will simply be gc'd."


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479749 13f79535-47bb-0310-9956-ffa450edef68
2006-11-27 20:25:32 +00:00
Wolfgang Hoschek ad49369d3d added clear() method to TokenCachingAnalyzer, changed anonymous class to public class
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479699 13f79535-47bb-0310-9956-ffa450edef68
2006-11-27 17:37:26 +00:00
Wolfgang Hoschek f76157f560 javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478407 13f79535-47bb-0310-9956-ffa450edef68
2006-11-23 00:27:36 +00:00
Wolfgang Hoschek b8387ec241 javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478406 13f79535-47bb-0310-9956-ffa450edef68
2006-11-23 00:25:53 +00:00
Wolfgang Hoschek 8ccee0c870 hashing on a pair isn't necessary and overly complicates semantics. So taking Pair logic out.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478378 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 23:29:42 +00:00
Wolfgang Hoschek d33f53c109 javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478377 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 23:22:56 +00:00
Wolfgang Hoschek ae6846798d cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478368 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 23:12:19 +00:00
Wolfgang Hoschek e7cde622d9 cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478366 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 23:10:12 +00:00
Wolfgang Hoschek aeb9723407 javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478361 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 22:57:59 +00:00
Wolfgang Hoschek cdb3440645 added getTokenCachingAnalyzer()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478360 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 22:55:01 +00:00
Daniel Naber a1966d93c7 import and whitespace cleanup only
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478355 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 22:45:23 +00:00
Daniel Naber dab153b256 LUCENE-722: DEFAULT spelled DEFALT in MoreLikeThis.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478354 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 22:41:33 +00:00
Wolfgang Hoschek 4002514600 reverting update to non-deprecated method as it was not backwards compatible with lucene-2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478303 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 19:48:24 +00:00
Wolfgang Hoschek e7ed9203c0 clarified javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478302 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 19:42:25 +00:00
Wolfgang Hoschek b797db9788 cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478102 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 09:48:09 +00:00
Wolfgang Hoschek 758a772fcb cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478082 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 08:20:55 +00:00
Wolfgang Hoschek 286c390cb2 added option to measure indexing or query performance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478072 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 08:01:31 +00:00
Wolfgang Hoschek d9cb737f5a cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478054 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 07:00:43 +00:00
Wolfgang Hoschek 248ff4e963 cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478052 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 06:28:02 +00:00
Wolfgang Hoschek ad93748d49 example shouldn't use deprecated method
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478045 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 05:56:08 +00:00
Daniel Naber 5cf957527f LUCENE-722: DEFAULT spelled DEFALT in MoreLikeThis.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@477932 13f79535-47bb-0310-9956-ffa450edef68
2006-11-21 22:26:45 +00:00
Daniel Naber d98c449203 default to "jar" so the correct manifest is used (not the common one)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@477877 13f79535-47bb-0310-9956-ffa450edef68
2006-11-21 20:54:06 +00:00
Wolfgang Hoschek 2b1dd87818 better memory estimate
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@476065 13f79535-47bb-0310-9956-ffa450edef68
2006-11-17 07:29:44 +00:00
Wolfgang Hoschek fb8fd8b8ce cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@476060 13f79535-47bb-0310-9956-ffa450edef68
2006-11-17 06:34:13 +00:00
Wolfgang Hoschek 2d5f3b0c87 better memory estimate
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@476056 13f79535-47bb-0310-9956-ffa450edef68
2006-11-17 06:17:59 +00:00
Wolfgang Hoschek 34e917af03 better memory estimate
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@475844 13f79535-47bb-0310-9956-ffa450edef68
2006-11-16 18:16:27 +00:00
Wolfgang Hoschek 9a116a8b99 better memory estimate
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@475843 13f79535-47bb-0310-9956-ffa450edef68
2006-11-16 18:13:56 +00:00
Grant Ingersoll 423c1b4c9c Commit benchmark code. Thanks to Doron Cohen for updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@475222 13f79535-47bb-0310-9956-ffa450edef68
2006-11-15 13:09:33 +00:00
Simon Willnauer 5627c6201c Webarchive packaging prepared for including IoC metadata descriptors
- including configured IoC descriptors in war distribution
	- improved dependency fetching (on demand)
	- fixed some spelling errors

Hessian Webservice support
	- created hessian servlet as a generic http interface for provided services
	- Interface def., Impl. and Tests
	- services can be exposed via HessianSkeletonProvider by defining a mapping key, Interface and an implementation instance.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@474244 13f79535-47bb-0310-9956-ffa450edef68
2006-11-13 10:51:04 +00:00
Simon Willnauer ebf6f78719 Class could not be resolved due to wrong spelling (after refactoring)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@474243 13f79535-47bb-0310-9956-ffa450edef68
2006-11-13 10:47:33 +00:00
Simon Willnauer b09de376e4 Changed Build management to create jar files for core and IoC Container.
Added Changes.txt file to document changes in build management, api, history and bug fixes

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@473722 13f79535-47bb-0310-9956-ffa450edef68
2006-11-11 15:03:13 +00:00
Yonik Seeley 7ca20ee19f new ASF licenses header
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@472959 13f79535-47bb-0310-9956-ffa450edef68
2006-11-09 16:21:50 +00:00
Simon Willnauer d2d7d3eca6 Third party jars will be fetched by the dependency fetching ant tasks during the build process
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@472609 13f79535-47bb-0310-9956-ffa450edef68
2006-11-08 19:44:50 +00:00
Simon Willnauer fe4025e0b6 Externalized some Ant-Tasks and Macros to fetch third party libs via http from http://www.ibiblio.org/
-- Refactoring GData-Server -- Apache Hivemind integration

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@472602 13f79535-47bb-0310-9956-ffa450edef68
2006-11-08 19:25:49 +00:00
Simon Willnauer daedf1c3bb Fetching third party libs via http from http://www.ibiblio.org/
-- Refactoring GData-Server -- Apache Hivemind integration

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@472601 13f79535-47bb-0310-9956-ffa450edef68
2006-11-08 19:24:55 +00:00
Simon Willnauer d7a514f722 Trivial changes: fixed misspelled class name
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@471887 13f79535-47bb-0310-9956-ffa450edef68
2006-11-06 21:40:19 +00:00
Grant Ingersoll 861a964ce4 initial checkin of build and dependencies
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@471623 13f79535-47bb-0310-9956-ffa450edef68
2006-11-06 03:05:23 +00:00
Otis Gospodnetic a0339b4ea6 - Create the index if it doesn't exist; CamelCased spellindex var
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@468580 13f79535-47bb-0310-9956-ffa450edef68
2006-10-27 23:00:36 +00:00
Wolfgang Hoschek 574f3d3083 some simple queries for benchmarking
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@451715 13f79535-47bb-0310-9956-ffa450edef68
2006-10-01 05:22:33 +00:00
Wolfgang Hoschek 7719af7c94 javadoc typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@451465 13f79535-47bb-0310-9956-ffa450edef68
2006-09-29 21:46:59 +00:00
Wolfgang Hoschek 652d91f982 make files relative to top level directory
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@450964 13f79535-47bb-0310-9956-ffa450edef68
2006-09-28 18:41:03 +00:00
Wolfgang Hoschek af08199090 fixed array length bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@450725 13f79535-47bb-0310-9956-ffa450edef68
2006-09-28 07:11:39 +00:00
Wolfgang Hoschek 5c89547898 added support for per field boost factor
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@450724 13f79535-47bb-0310-9956-ffa450edef68
2006-09-28 07:10:30 +00:00
Mark Harwood 6c53bc3c49 Fixed bug where BooleanQueryBuilder.java and BooleanFilterBuilder.java were mistakenly adding all child (and grandchild) clauses to the top node. A call to Element.getElementsByTagName was the offending code - this descends full depth rather than getting only immediate children which is the desired behaviour. Thanks to Jingkei Ly for spotting this bug.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@441667 13f79535-47bb-0310-9956-ffa450edef68
2006-09-08 21:45:14 +00:00
Yonik Seeley 8c0d242750 GData html render preview: LUCENE-660
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@433051 13f79535-47bb-0310-9956-ffa450edef68
2006-08-20 21:20:08 +00:00
Mark Harwood d516bf50d8 Fix for http://issues.apache.org/jira/browse/LUCENE-645 with added Junit tests for this bug and related problem where last fragment can be huge if highlighting huge documents.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@432042 13f79535-47bb-0310-9956-ffa450edef68
2006-08-16 21:42:18 +00:00
Yonik Seeley d3629f25eb gdata fixes: LUCENE-653, LUCENE-654
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@432040 13f79535-47bb-0310-9956-ffa450edef68
2006-08-16 21:21:26 +00:00
Yonik Seeley aa32350b33 gdata 2006-08-10 checkin
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@430568 13f79535-47bb-0310-9956-ffa450edef68
2006-08-10 22:32:40 +00:00
Yonik Seeley 729ae8f039 gdata update
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@425563 13f79535-47bb-0310-9956-ffa450edef68
2006-07-26 00:20:00 +00:00
Yonik Seeley 5e61ac9bf1 gdata update
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@425538 13f79535-47bb-0310-9956-ffa450edef68
2006-07-25 22:23:01 +00:00
Otis Gospodnetic 895f763d21 - Renamed some variables and made minScore private
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@424239 13f79535-47bb-0310-9956-ffa450edef68
2006-07-21 09:10:52 +00:00
Otis Gospodnetic c0128f900b - Removed a useless tearDown()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@424232 13f79535-47bb-0310-9956-ffa450edef68
2006-07-21 08:39:32 +00:00
Otis Gospodnetic 8723cd5585 - Moved IndexSearcher creation to setSpellIndex(Dictionary) method,
so IndexSearcher is not opened/closed for every suggestion
- Fixed typo in method name: setAccuraty -> setAccuracy
- Cosmetics, comments and other cleanups



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@424230 13f79535-47bb-0310-9956-ffa450edef68
2006-07-21 08:39:02 +00:00
Otis Gospodnetic 9bc6ba0863 - Cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@424229 13f79535-47bb-0310-9956-ffa450edef68
2006-07-21 08:36:29 +00:00
Otis Gospodnetic de09962750 - Reformatted
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@424183 13f79535-47bb-0310-9956-ffa450edef68
2006-07-21 05:14:59 +00:00
Mark Harwood 482ad148f9 Added Yonik's patch for overlapping token support See http://issues.apache.org/jira/browse/LUCENE-627?page=comments#action_12421332
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@422302 13f79535-47bb-0310-9956-ffa450edef68
2006-07-15 22:20:29 +00:00
Mark Harwood 5f7214b023 Added Yonik's patch to deal with overlapping tokens - see http://issues.apache.org/jira/browse/LUCENE-627?page=comments#action_12421332
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@422301 13f79535-47bb-0310-9956-ffa450edef68
2006-07-15 22:19:51 +00:00
Mark Harwood 69299c5663 Exposed support for stop words facility through LikeThisQueryBuilder and updated example LikeThisQuery.xml to illustrate usage
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@421413 13f79535-47bb-0310-9956-ffa450edef68
2006-07-12 21:09:15 +00:00
Mark Harwood 72af4e813d Exposed underlying support for stop words in the MoreLikeThis class
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@421411 13f79535-47bb-0310-9956-ffa450edef68
2006-07-12 21:02:49 +00:00
Otis Gospodnetic ddae8f0c57 - LUCENE-593, inner Iterator fixed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@418368 13f79535-47bb-0310-9956-ffa450edef68
2006-06-30 20:14:48 +00:00
Chris M. Hostetter d13def5f21 patch_29_06_06.diff from LUCENE-620
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@418363 13f79535-47bb-0310-9956-ffa450edef68
2006-06-30 19:31:32 +00:00
Mark Harwood b2dd60bd4b Added new TermsQueryBuilder to simply build boolean queries from text without having concern over clashing reserved words/special characters/legal syntax that is demanded by normal query parser syntax. Added new xml attributes to BooleanQueryBuilder to control disableCoord and minimumNumberShouldMatch.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@417593 13f79535-47bb-0310-9956-ffa450edef68
2006-06-27 21:56:47 +00:00
Chris M. Hostetter 0e23b32894 LUCENE-618: committing GData server patch 27.06.06.diff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@417565 13f79535-47bb-0310-9956-ffa450edef68
2006-06-27 19:31:20 +00:00
Yonik Seeley 4707eea96c gdata dev update from Simon 2006-06-06
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@417265 13f79535-47bb-0310-9956-ffa450edef68
2006-06-26 18:14:53 +00:00
Otis Gospodnetic a7151c0767 - LUCENE-593 partial fix (field name interned)
- Made vars and LuceneIterator ctor private
- Fixed comments


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@415097 13f79535-47bb-0310-9956-ffa450edef68
2006-06-18 05:47:26 +00:00
Otis Gospodnetic 7bbc7b1544 - Cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@415093 13f79535-47bb-0310-9956-ffa450edef68
2006-06-18 05:32:54 +00:00
Otis Gospodnetic 924914d82b - Removed Jakarta reference
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@414743 13f79535-47bb-0310-9956-ffa450edef68
2006-06-16 05:29:53 +00:00
Mark Harwood e4ffe0679e Added optimization to ignore duplicate terms
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@413732 13f79535-47bb-0310-9956-ffa450edef68
2006-06-12 19:38:36 +00:00
Otis Gospodnetic f0bfc02d4d - perl -pi -e 's/\t/ /g'
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@413584 13f79535-47bb-0310-9956-ffa450edef68
2006-06-12 05:46:16 +00:00
Grant Ingersoll d4cc19d03e Implementation of Issue 545. Introduction of new Fieldable interface (extracted from Field) which is now used where Field used to be used. Field now implements Fieldable.
Added new method to IndexReader and derived classes for working with the new FieldSelector interface.  The FieldSelector interface defines a mechanism for doing lazy loading, amongst other things.  Implemented Lazy loading of fields in the FieldsReader class.  Added test case to TestFieldsReader.java

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@413201 13f79535-47bb-0310-9956-ffa450edef68
2006-06-10 01:23:22 +00:00
Daniel Naber 2b9effb894 deprecate the analysis.nl.WordlistLoader class because it's not robust (fails silently) and use analysis.WordlistLoader instead
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@413180 13f79535-47bb-0310-9956-ffa450edef68
2006-06-09 22:15:47 +00:00
Doug Cutting 631585bf5a LUCENE-578. First version of gdata-server. Contributed by Simon Willnauer.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@412574 13f79535-47bb-0310-9956-ffa450edef68
2006-06-07 21:56:25 +00:00
Chris M. Hostetter 2123b476df LUCENE-503: New ThaiAnalyzer and ThaiWordFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@411863 13f79535-47bb-0310-9956-ffa450edef68
2006-06-05 17:29:01 +00:00
Yonik Seeley 3afc9192b9 SpellChecker min score is incorrectly changed by suggestSimilar: LUCENE-575
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@410971 13f79535-47bb-0310-9956-ffa450edef68
2006-06-01 21:36:13 +00:00
Mark Harwood 1ef156da34
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@409994 13f79535-47bb-0310-9956-ffa450edef68
2006-05-28 20:24:42 +00:00
Doug Cutting 39852b3b59 Add package documentation for a few contrib modules.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@409694 13f79535-47bb-0310-9956-ffa450edef68
2006-05-26 16:29:02 +00:00
Chris M. Hostetter d2b63d328b LUCENE-577: initial checkin of SweetSpotSimilarity
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@409472 13f79535-47bb-0310-9956-ffa450edef68
2006-05-25 21:21:29 +00:00
Chris M. Hostetter d96e03e914 Grrr... unbelievably bad compilation failure typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@409429 13f79535-47bb-0310-9956-ffa450edef68
2006-05-25 18:54:50 +00:00
Chris M. Hostetter dd3d979905 LUCENE-496 - Command line tool for modifying the field norms of an existing index
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@409428 13f79535-47bb-0310-9956-ffa450edef68
2006-05-25 18:49:04 +00:00
Daniel Naber cb15338e26 small javadoc cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@400190 13f79535-47bb-0310-9956-ffa450edef68
2006-05-05 21:59:07 +00:00
Daniel Naber 18f330f6a6 add missing license header
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@398112 13f79535-47bb-0310-9956-ffa450edef68
2006-04-29 09:54:16 +00:00
Andreas Vajda 9a451c8011 applied fix for issue 536 to bdb code as well
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@394218 13f79535-47bb-0310-9956-ffa450edef68
2006-04-14 22:38:22 +00:00
Andreas Vajda 597b011b27 resolved JIRA issue 536
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@394214 13f79535-47bb-0310-9956-ffa450edef68
2006-04-14 22:23:20 +00:00
Yonik Seeley d490983a24 (LUCENE-539) Fix for deprecations in contrib/surround
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@391763 13f79535-47bb-0310-9956-ffa450edef68
2006-04-05 19:55:30 +00:00
Mark Harwood 286f4f5f07 Simplified QueryTermExtractor.java to make use of Query.extractTerms method (especially now that all the SpanQuery classes implement this correctly).
Added tests in Junit test to demonstrate new support for other Queries (FilteredQuery) now that we use the standard extractTerms feature of Query objects.
Also deprecated highlighter getBestFragments method that hard-coded choice of fieldname and introduced new variation that takes an additional fieldName argument

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@389888 13f79535-47bb-0310-9956-ffa450edef68
2006-03-29 21:01:40 +00:00
Mark Harwood 0fa0e25de8 Added BooleanFilter with JUnit test, added BooleanFilter support to XMLQueryParser, Changed TermsFilterBuilder to only build a filter for a single choice of field - multiples can be combined using new BooleanFilter clauses. Added missing MatchAllDocsQueryBuilder source.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@389037 13f79535-47bb-0310-9956-ffa450edef68
2006-03-27 07:23:37 +00:00
Mark Harwood fa0516f6ee Added support for MatchAllDocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@388296 13f79535-47bb-0310-9956-ffa450edef68
2006-03-23 23:30:33 +00:00
Wolfgang Hoschek c954a1c51a removed methods deprecated in lucene-1.9
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@387672 13f79535-47bb-0310-9956-ffa450edef68
2006-03-22 00:02:26 +00:00
Yonik Seeley f65db2faf2 deprecation fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@387581 13f79535-47bb-0310-9956-ffa450edef68
2006-03-21 16:56:11 +00:00
Yonik Seeley 2d7dbf8131 more deprecation fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@387575 13f79535-47bb-0310-9956-ffa450edef68
2006-03-21 16:26:10 +00:00
Yonik Seeley 3666a166a1 remove deprecations
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@387550 13f79535-47bb-0310-9956-ffa450edef68
2006-03-21 15:36:32 +00:00
Mark Harwood 9078b805d3 Fixed bug which ignored user's choice of QueryParser. Thanks to Terry Steichen for spotting it.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@385762 13f79535-47bb-0310-9956-ffa450edef68
2006-03-14 08:49:43 +00:00
Erik Hatcher 04ca37fb49 Modified lastmodified field to use DateTools instead of the deprecated DateField, and added an antlib.xml file to make Ant integration easier
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@384336 13f79535-47bb-0310-9956-ffa450edef68
2006-03-08 21:32:04 +00:00
Mark Harwood 7dab9545be Added more helper methods to DOMUtils and cleaned up error handling in builders - thanks Chris.
Moved FilteredQueryBuilder.java to "builders" package with all other builders

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@382170 13f79535-47bb-0310-9956-ffa450edef68
2006-03-01 21:49:17 +00:00
Daniel Naber 85f9706b6b trivial import cleanup to avoid compiler warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@382156 13f79535-47bb-0310-9956-ffa450edef68
2006-03-01 20:59:54 +00:00
Daniel Naber e5de36e653 clean up imports to avoid warnings; fix typo in usage message
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@382154 13f79535-47bb-0310-9956-ffa450edef68
2006-03-01 20:54:59 +00:00
Daniel Naber 4189b329f8 stop using deprecated methods; make more fields private
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@382151 13f79535-47bb-0310-9956-ffa450edef68
2006-03-01 20:47:32 +00:00
Erik Hatcher 297d1429eb Many javadoc additions, and adding ASL to each file
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@381108 13f79535-47bb-0310-9956-ffa450edef68
2006-02-26 15:38:30 +00:00
Erik Hatcher 51d7a31725 add regexp LICENSE file
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@381098 13f79535-47bb-0310-9956-ffa450edef68
2006-02-26 14:48:17 +00:00
Erik Hatcher 31a7b1987a modified build file to pick up the proper manifest file. more work is needed to have the manifest pick up the right Lucene JAR name though
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@381095 13f79535-47bb-0310-9956-ffa450edef68
2006-02-26 14:24:54 +00:00
Otis Gospodnetic b9fdf6d4e5 - Package description from Paul Elschot
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@380989 13f79535-47bb-0310-9956-ffa450edef68
2006-02-25 20:43:32 +00:00
Mark Harwood 87768c51c6 Added XML-query-parser module for new extensible query parser that handles queries expressed as XML. Is dependent on new "queries" contrib module.
Added "queries" contrib module for various new query/filter classes. This area is also intended to consolidate existing query classes so have moved a copy of MoreLikeThis into here. Probably need to remove "similarity" module as a result, if no one objects.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@380874 13f79535-47bb-0310-9956-ffa450edef68
2006-02-25 00:39:18 +00:00
Mark Harwood 77625ced9e Removed use of deprecated APIs ready for 2.0 Lucene release
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@379586 13f79535-47bb-0310-9956-ffa450edef68
2006-02-21 21:05:18 +00:00
Doug Cutting f9f3161f57 Minor javadoc improvements.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@379189 13f79535-47bb-0310-9956-ffa450edef68
2006-02-20 18:11:02 +00:00
Otis Gospodnetic 93657ca63a - Limit to an optional field; reindented (die tabs, die), ASF License 2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@376393 13f79535-47bb-0310-9956-ffa450edef68
2006-02-09 19:17:14 +00:00
Daniel Naber 9db7eaa951 Don't suggest a word as similar for itself. Thanks to Michael Harhen for the fix.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@373185 13f79535-47bb-0310-9956-ffa450edef68
2006-01-28 17:22:04 +00:00
Wolfgang Hoschek 75ee463870 cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@372696 13f79535-47bb-0310-9956-ffa450edef68
2006-01-27 01:32:41 +00:00
Otis Gospodnetic ce3d7a331e - ASL 2.0 instead of 1.1
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@366561 13f79535-47bb-0310-9956-ffa450edef68
2006-01-06 20:23:24 +00:00
Andreas Vajda c312ac9d52 - integrated Aaron Donovan's port to Berkeley DB Java Edition
- reorganized directory tree to accommodate multiple implementations


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@366041 13f79535-47bb-0310-9956-ffa450edef68
2006-01-05 01:04:17 +00:00
Erik Hatcher ca6b9ee2e5 add .equals/.hashCode
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@359537 13f79535-47bb-0310-9956-ffa450edef68
2005-12-28 14:25:34 +00:00
Erik Hatcher 396229f18d Remove (Span)RegexQuery from core. Add completely refactored version to contrib/regex allowing pluggable regex implementations. contrib/regex is still a work in progress, and documentation is forthcoming
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@359526 13f79535-47bb-0310-9956-ffa450edef68
2005-12-28 13:57:40 +00:00
Erik Hatcher aa07efc076 minor HTML encoding issue in javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@359523 13f79535-47bb-0310-9956-ffa450edef68
2005-12-28 13:53:09 +00:00
Daniel Naber 9ad46499cf print usage if called with wrong number of arguments; be more verbose
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@358655 13f79535-47bb-0310-9956-ffa450edef68
2005-12-22 23:39:49 +00:00
Otis Gospodnetic 1fe7d48e25 - Handy command line tool for merging Lucene indices
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@358156 13f79535-47bb-0310-9956-ffa450edef68
2005-12-21 00:05:18 +00:00
Andreas Vajda f962bd4dae made build.xml accept db.version property argument
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@355282 13f79535-47bb-0310-9956-ffa450edef68
2005-12-08 23:49:39 +00:00
Wolfgang Hoschek 63ce2859a7 javadoc cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@354001 13f79535-47bb-0310-9956-ffa450edef68
2005-12-05 07:11:02 +00:00
Wolfgang Hoschek 4ff370aaa3 javadoc cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@354000 13f79535-47bb-0310-9956-ffa450edef68
2005-12-05 07:10:24 +00:00
Erik Hatcher 7a3103fac0 Applied patch for LUCENE-324, correcting token offsets returned by ChineseTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@353930 13f79535-47bb-0310-9956-ffa450edef68
2005-12-04 23:07:42 +00:00
Wolfgang Hoschek ebe44ace90
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351896 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 05:44:16 +00:00
Wolfgang Hoschek a155416b4d tentative add: Various fulltext analysis utilities avoiding redundant code in several classes.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351895 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 05:42:59 +00:00
Wolfgang Hoschek 860733f32e indentation fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351893 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 05:27:50 +00:00
Wolfgang Hoschek f42d7a1e9b indentation fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351892 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 05:26:16 +00:00
Wolfgang Hoschek e28541354d some performance improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351891 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 05:24:31 +00:00
Wolfgang Hoschek efa4d10fa1 some performance improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351890 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 05:22:08 +00:00
Wolfgang Hoschek 317f3f77e9 just a SVN test - please ignore
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351887 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 04:32:53 +00:00
Wolfgang Hoschek ec49618824 just a SVN test - please ignore
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351886 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 04:31:52 +00:00
Mark Harwood 2da431d139 Added support for field-specific highlighting which respects the fieldnames found in queries. Pass a field name to the QueryScorer in order to only select those field's query terms for highlighting. Updated JUnit tests too.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351504 13f79535-47bb-0310-9956-ffa450edef68
2005-12-01 22:18:33 +00:00
Andreas Vajda fa24e67d6d - changed build to use version 4.3.29 of the C Berkeley DB Java API
- updated copyright notice year ranges to include 2005


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@350095 13f79535-47bb-0310-9956-ffa450edef68
2005-12-01 01:43:07 +00:00
Erik Hatcher a4c714d9d5 no longer needed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@348059 13f79535-47bb-0310-9956-ffa450edef68
2005-11-22 01:40:32 +00:00
Daniel Naber bfde3257dc moving the non-language specific analyzers to core, this is where most users will probably expect them
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@347991 13f79535-47bb-0310-9956-ffa450edef68
2005-11-21 21:35:24 +00:00
Daniel Naber 31c271c84b import cleanup to avoid Eclipse warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@344474 13f79535-47bb-0310-9956-ffa450edef68
2005-11-15 23:21:44 +00:00
Daniel Naber 4fd74d2554 Rename *Test files which are not unit tests to make "ant test" work. See LUCENE-465.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@344471 13f79535-47bb-0310-9956-ffa450edef68
2005-11-15 23:18:22 +00:00
Daniel Naber 7e079d2950 avoid compiler/Eclipse warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@344468 13f79535-47bb-0310-9956-ffa450edef68
2005-11-15 23:15:53 +00:00
Erik Hatcher 1687a79648 Add NullFragmenter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@332696 13f79535-47bb-0310-9956-ffa450edef68
2005-11-12 01:08:01 +00:00
Erik Hatcher 32fb624ebc LUCENE-437 - Add position increment pass through on SnowballFilter tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@290943 13f79535-47bb-0310-9956-ffa450edef68
2005-09-22 13:38:58 +00:00
Mark Harwood c00b260ecf Added fix to QueryScorer - if a query has multiple WeightedTerms with different weights for the same term, the highest weight is used for scoring that term (previously the last weight in the list was selected)
SimpleHTMLEncoder now encodes characters outside of ASCII range as character entities as per suggestion here: http://issues.apache.org/bugzilla/show_bug.cgi?id=36333


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@279088 13f79535-47bb-0310-9956-ffa450edef68
2005-09-06 20:19:50 +00:00
Mark Harwood f6b07dabe8 Changed TokenGroup.isDistinct after problems reported with JapaneseAnalyzer (no gaps between tokens)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@279078 13f79535-47bb-0310-9956-ffa450edef68
2005-09-06 19:38:12 +00:00
Daniel Naber dd5c74112f a query parser by Ronnie Kolehmainen that also sends PrefixQuerys etc. through the analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@231523 13f79535-47bb-0310-9956-ffa450edef68
2005-08-11 21:28:58 +00:00
Erik Hatcher 6e9c0b6f45 remove unused file
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@209184 13f79535-47bb-0310-9956-ffa450edef68
2005-07-05 02:29:34 +00:00
Erik Hatcher 9d70229506 #34331 - Add Paul Elschot's Surround query language parser
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@209183 13f79535-47bb-0310-9956-ffa450edef68
2005-07-05 02:29:03 +00:00
Mark Harwood 0062898ada Updated version of MemoryIndex - reliant on new Term.createTerm() method in Trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@208688 13f79535-47bb-0310-9956-ffa450edef68
2005-06-30 21:40:05 +00:00
Mark Harwood 7894a0c0c0 Added (simple) SpanQuery support - matches any terms declared in Spans - proper impl should check for distances
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@208673 13f79535-47bb-0310-9956-ffa450edef68
2005-06-30 20:09:58 +00:00
Daniel Naber 6da2ef197d update to Apache Software License 2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@189623 13f79535-47bb-0310-9956-ffa450edef68
2005-06-08 19:48:19 +00:00
Mark Harwood 07cee0b287
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@179637 13f79535-47bb-0310-9956-ffa450edef68
2005-06-02 20:27:06 +00:00
Daniel Naber a3f99b1f43 small javadoc improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178893 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 22:58:17 +00:00
Daniel Naber 27597a5c71 small javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178892 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 22:40:36 +00:00
Daniel Naber fe52019614 javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178880 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 19:21:49 +00:00
Daniel Naber 06bb3230ff make this non-public, as it's not documented properly and has a confusing name
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178878 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 19:05:54 +00:00
Daniel Naber bd2345d856 small javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178839 13f79535-47bb-0310-9956-ffa450edef68
2005-05-27 23:07:00 +00:00
Daniel Naber 952cfd54be small javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178833 13f79535-47bb-0310-9956-ffa450edef68
2005-05-27 23:02:07 +00:00
Daniel Naber 816f370c0e small javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178832 13f79535-47bb-0310-9956-ffa450edef68
2005-05-27 23:00:49 +00:00
Daniel Naber 9d2d4ead75 use entity for umlaut
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178239 13f79535-47bb-0310-9956-ffa450edef68
2005-05-24 18:44:20 +00:00
Daniel Naber 69b1f490df javadoc: fix typo and use HTML entity so generated HTML is correct
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169681 13f79535-47bb-0310-9956-ffa450edef68
2005-05-11 19:33:12 +00:00
David Spencer 1d68f8c88d Logic to ignore stop words was in an early version of this code, but it was taken out in the belief that there
was no point in explicitly looking for them as the scoring algorithm would effectively ignore them.

I did a test and indexed 700 pages on a corporate web site and then ran the MoreLikeThis code on them
and 1/2 of the docs had stop words identified as interesting.

So - I added code in to ignore stop words, but make it backward compatible so that by default this code
is not used.




git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169512 13f79535-47bb-0310-9956-ffa450edef68
2005-05-10 19:29:56 +00:00
David Spencer 81087e8bb6 Touchup javadoc.
Make retrieveInterestingTerms only return the top terms, not all terms.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169511 13f79535-47bb-0310-9956-ffa450edef68
2005-05-10 19:10:28 +00:00
David Spencer 175cf8a9fd [1] Added comments to retrieveTerms() to document the return value.
[2] Added convenience routine retrieveInterestingTerms() which makes it easier to get at the "interesting words" in a document.




git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169508 13f79535-47bb-0310-9956-ffa450edef68
2005-05-10 18:49:43 +00:00
David Spencer c696188668 don't print out summary unless it's present
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169366 13f79535-47bb-0310-9956-ffa450edef68
2005-05-09 21:37:50 +00:00
David Spencer 7f8bf69311 cleanup deprecated warnings so it compiles cleanly w/ the current lucene code, lucene-core-1.9-rc1-dev.jar
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169365 13f79535-47bb-0310-9956-ffa450edef68
2005-05-09 21:36:22 +00:00
David Spencer c680751f63 test checkin of README, just to verify my permissions
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169349 13f79535-47bb-0310-9956-ffa450edef68
2005-05-09 19:25:40 +00:00
Erik Hatcher 78dbe41805 prefix all JARs with lucene-
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168986 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 23:43:54 +00:00
Erik Hatcher e8c90fb050 rename WordNet to wordnet, required intermediate move due to OS case insensitivity
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168480 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:32:00 +00:00
Erik Hatcher 5fd5169a6f temporary move to lowercase WordNet
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168479 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:31:11 +00:00
Erik Hatcher dd472377dd adjust code to fix compile/javadoc errors on JDK 1.5
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168478 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:26:08 +00:00
Erik Hatcher a12dac37b4 adjust project names for consistency
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168476 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:24:18 +00:00
Mark Harwood 12a91b4395 Fixed bug where docs larger than maxDocBytesToAnalyze would cause last fragment to be sized as remainder of doc (which could be huge).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168452 13f79535-47bb-0310-9956-ffa450edef68
2005-05-05 22:40:45 +00:00
Erik Hatcher 8f70c09b9b Wolfgang is non-stop with the additions. Easy enough to paste in, so here it is with a Collection-based TokenStream
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168029 13f79535-47bb-0310-9956-ffa450edef68
2005-05-04 00:24:17 +00:00
Erik Hatcher f94ebdb41e applied norm caching patch from Wolfgang
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@167958 13f79535-47bb-0310-9956-ffa450edef68
2005-05-03 19:01:58 +00:00
Erik Hatcher 2a37a3e820 Apply Wolfgang's fix to the tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@167835 13f79535-47bb-0310-9956-ffa450edef68
2005-05-03 00:33:27 +00:00
Andreas Vajda 572633f8c4 - reworked store I/O to use new IndexInput and IndexOutput classes
- reworked store I/O to avoid upstream buffering giving better txn control
- added DbStoreTest unit test adapted from StoreTest


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165674 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 20:06:00 +00:00
Erik Hatcher 8f9e2a15e7 Enhancement #34585 - high-performance in-memory index contributed by Wolfgang Hoschek
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165606 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 09:04:07 +00:00
Erik Hatcher c3847f26ea overhaul of build system to facilitate building and packaging of contrib sub-projects. some work still to be done, but core Lucene build still working fine
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165566 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 00:11:11 +00:00
Erik Hatcher 21431112fe adjust license headers to be ASL 2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165565 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 00:08:04 +00:00
Erik Hatcher df52ba1ec6 standardizing source layout
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165562 13f79535-47bb-0310-9956-ffa450edef68
2005-05-01 23:57:31 +00:00
Erik Hatcher f56d33e2d4 Add ASL header - sorry for the oversight on this.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165559 13f79535-47bb-0310-9956-ffa450edef68
2005-05-01 22:57:39 +00:00
Andreas Vajda 77130721ce - replaced db.jar with db-4.3.27.jar
- downloading db-4.3.27.jar from http://downloads.osafoundation.org/db


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165319 13f79535-47bb-0310-9956-ffa450edef68
2005-04-29 17:33:27 +00:00
Erik Hatcher d9042b00d8 move PrecedenceQueryParser to contrib/misc until the kinks are worked out
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164964 13f79535-47bb-0310-9956-ffa450edef68
2005-04-27 09:32:33 +00:00
Erik Hatcher 7b8f43ec7c move misc over to official contrib area
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164963 13f79535-47bb-0310-9956-ffa450edef68
2005-04-27 09:16:31 +00:00
Erik Hatcher 5c9ccb2442 Add Lucene's test classes to contrib test classpath, some tests rely on the utility methods in the core tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164937 13f79535-47bb-0310-9956-ffa450edef68
2005-04-27 01:52:17 +00:00
Erik Hatcher 790dfc1490 javadoc fixup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164742 13f79535-47bb-0310-9956-ffa450edef68
2005-04-26 04:41:54 +00:00
Erik Hatcher 26aab23901 add ignores
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164698 13f79535-47bb-0310-9956-ffa450edef68
2005-04-26 00:30:08 +00:00
Erik Hatcher d650384d4b add GreekAnalyzer, contributed by Panagiotis Astithas (past@ebs.gr)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164686 13f79535-47bb-0310-9956-ffa450edef68
2005-04-25 23:23:37 +00:00
Erik Hatcher 2fe0a80189 rename misspelled indexDictionnary method
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160988 13f79535-47bb-0310-9956-ffa450edef68
2005-04-12 00:11:33 +00:00
Erik Hatcher ec522fc1c8 Fixed deprecation issues, adjusted test cases to use assertEquals better, reformatted style
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160987 13f79535-47bb-0310-9956-ffa450edef68
2005-04-11 23:48:02 +00:00
Erik Hatcher 0c99b57cc1 Fixed issue with ctor parameter being ignored
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160984 13f79535-47bb-0310-9956-ffa450edef68
2005-04-11 23:43:57 +00:00
Erik Hatcher e88213a2d9 refactor build to use common contrib build system
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160983 13f79535-47bb-0310-9956-ffa450edef68
2005-04-11 23:42:26 +00:00
Daniel Naber c4f1ee70a9 use lowercase method names; remove javadoc that's inherited anyway
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160070 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 17:50:38 +00:00
Daniel Naber 04ea892fbe import cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160065 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 17:45:36 +00:00
Erik Hatcher 6f5f23444c enhanced test contributed by Sven. Encoding tweaks
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160034 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 12:25:16 +00:00
Erik Hatcher 0ff227ff0a switch dotted u character to use unicode value reference
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160023 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 10:16:37 +00:00
Erik Hatcher 4e580e221e Issue deprecation warnings when building test cases. Fixed deprecation warnings on TestKeywordAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160012 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 09:10:59 +00:00
Erik Hatcher 3be3e8ab5d Add accent character normalizer filter contributed by Sven Duzont. Also created simple test case.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160011 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 09:10:05 +00:00
Daniel Naber 69380a1815 adapt to use of jline
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@158852 13f79535-47bb-0310-9956-ffa450edef68
2005-03-23 23:49:08 +00:00
Daniel Naber 84db65bfde adapt to use of jline
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@158851 13f79535-47bb-0310-9956-ffa450edef68
2005-03-23 23:42:23 +00:00
Daniel Naber 5a59714f4a use jline instead of java-readline. jline can be added to SVN thanks to its BSD license. plus some small cleanup.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@158850 13f79535-47bb-0310-9956-ffa450edef68
2005-03-23 23:40:50 +00:00
Erik Hatcher b54f22aaab Fix max word length issue (though don't know why anyone would limit long words in a more-like-this query).
Also, modified to take into account all values of a field rather than just the first one.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@158076 13f79535-47bb-0310-9956-ffa450edef68
2005-03-18 15:03:00 +00:00
Erik Hatcher 1cb674fc04 regenerated from latest Snowball CVS
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@157834 13f79535-47bb-0310-9956-ffa450edef68
2005-03-17 00:41:31 +00:00
Erik Hatcher 9621a0985c added title to documentation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@156593 13f79535-47bb-0310-9956-ffa450edef68
2005-03-09 01:59:14 +00:00
Erik Hatcher 9824226394 Contribution of slick Swing models to enable on-the-fly searching of
tables and lists.  Created by Jonathan Simon.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@156591 13f79535-47bb-0310-9956-ffa450edef68
2005-03-09 01:52:13 +00:00
Mark Harwood fdf05bd088 Fixed missing fieldname in API
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@154447 13f79535-47bb-0310-9956-ffa450edef68
2005-02-19 19:51:04 +00:00
Daniel Naber 05d0335dcd offer additional methods that take analyzer + text instead of tokenstream; fix some unused imports and variables
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@154444 13f79535-47bb-0310-9956-ffa450edef68
2005-02-19 19:08:52 +00:00
Daniel Naber 335c1567d8 remove empty "@return" tags so javadoc stops complaining; small whitespace cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@154083 13f79535-47bb-0310-9956-ffa450edef68
2005-02-16 20:37:57 +00:00
Daniel Naber 45864d1c9c clean up imports, remove unused variables and remove the declaration of an Exception that was never thrown
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@154080 13f79535-47bb-0310-9956-ffa450edef68
2005-02-16 20:20:15 +00:00
Erik Hatcher 28e712b2ee update docs to account for TLP migration
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@153802 13f79535-47bb-0310-9956-ffa450edef68
2005-02-14 16:48:47 +00:00
Erik Hatcher 373e613341 remove unnecessary import
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@153430 13f79535-47bb-0310-9956-ffa450edef68
2005-02-11 18:11:37 +00:00
Erik Hatcher 2ac412f6b7 move similarity and spellchecker to new contrib area
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@153429 13f79535-47bb-0310-9956-ffa450edef68
2005-02-11 18:11:05 +00:00
Erik Hatcher f375d09898 add customizable buffer size
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@153412 13f79535-47bb-0310-9956-ffa450edef68
2005-02-11 15:30:14 +00:00
Erik Hatcher cd0d0937e1 split keyword tokenizer out of KeywordAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@153398 13f79535-47bb-0310-9956-ffa450edef68
2005-02-11 13:50:37 +00:00
Erik Hatcher 826fef7f6a KeywordAnalyzer contribution - adapted from _Lucene in Action_ code
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@152921 13f79535-47bb-0310-9956-ffa450edef68
2005-02-08 19:13:05 +00:00
Mark Harwood 276ab079f5 Added Nicko Cadell's Encoder contribution
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151622 13f79535-47bb-0310-9956-ffa450edef68
2005-02-06 21:31:54 +00:00
Mark Harwood b1555b0bbf Test SVN Commit
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151615 13f79535-47bb-0310-9956-ffa450edef68
2005-02-06 18:12:57 +00:00
Erik Hatcher 0ee1728e6d move two more projects over to contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151590 13f79535-47bb-0310-9956-ffa450edef68
2005-02-06 15:35:12 +00:00
Erik Hatcher 646f0f0434 Switch ant project to conventional src/java directory structure
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151589 13f79535-47bb-0310-9956-ffa450edef68
2005-02-06 14:51:59 +00:00
Erik Hatcher 767312d611 add convenient TODO file to keep track of sandbox -> contrib move
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151469 13f79535-47bb-0310-9956-ffa450edef68
2005-02-05 02:23:19 +00:00
Erik Hatcher 10904d02f6 fix most deprecation warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151468 13f79535-47bb-0310-9956-ffa450edef68
2005-02-05 02:21:39 +00:00
Erik Hatcher 0955eef89f move parts of the sandbox over to contrib area
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151459 13f79535-47bb-0310-9956-ffa450edef68
2005-02-05 01:25:43 +00:00