Commit Graph

374 Commits

Author SHA1 Message Date
Robert Muir ef56f5d551 LUCENE-3361: port url+email tokenizer to standardtokenizerinterface, fix combining marks bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154936 13f79535-47bb-0310-9956-ffa450edef68
2011-08-08 11:57:59 +00:00
Martijn van Groningen 0bc43983c4 SOLR-2665: Added post group faceting.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154676 13f79535-47bb-0310-9956-ffa450edef68
2011-08-07 09:02:33 +00:00
Robert Muir 2dda5bc35f LUCENE-3358: StandardTokenizer wrongly discarded combining marks attached to Han/Hiragana
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154005 13f79535-47bb-0310-9956-ffa450edef68
2011-08-04 20:49:47 +00:00
Adriano Crestani Campos cffd01664b -committing patch from LUCENE-3343 and LUCENE-3338 to trunk
-fixed javadoc reference to QueryNode in StandardSyntaxParser.jj

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1152892 13f79535-47bb-0310-9956-ffa450edef68
2011-08-01 18:57:05 +00:00
Robert Muir 31e3b272c5 LUCENE-3351: DirectSpellChecker throws NPE if field doesn't exist
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1152669 13f79535-47bb-0310-9956-ffa450edef68
2011-08-01 00:55:10 +00:00
Koji Sekiguchi 149d3de96d LUCENE-3129: fix javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1152417 13f79535-47bb-0310-9956-ffa450edef68
2011-07-30 02:46:21 +00:00
Steven Rowe 23d22e4d47 LUCENE-3337: avoid building jar files unless necessary in build
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1151720 13f79535-47bb-0310-9956-ffa450edef68
2011-07-28 04:02:09 +00:00
Martijn van Groningen 198ec93b4e LUCENE-3097: Fixed documentation error. FixedBitSet is now used instead of OpenBitSet.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150794 13f79535-47bb-0310-9956-ffa450edef68
2011-07-25 17:22:46 +00:00
Uwe Schindler b6c1db99d3 LUCENE-1768: Javadocs (week-8)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150671 13f79535-47bb-0310-9956-ffa450edef68
2011-07-25 12:45:41 +00:00
Martijn van Groningen de074e9535 LUCENE-3097: Fixed documentation error. FixedBitSet is now used instead of OpenBitSet.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150663 13f79535-47bb-0310-9956-ffa450edef68
2011-07-25 12:22:42 +00:00
Martijn van Groningen 708101f91e LUCENE-3097: Fix for build failure.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150613 13f79535-47bb-0310-9956-ffa450edef68
2011-07-25 10:01:56 +00:00
Uwe Schindler 014dee7cf5 revert accidental commit
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150488 13f79535-47bb-0310-9956-ffa450edef68
2011-07-24 21:22:07 +00:00
Uwe Schindler 9c73f9d03b LUCENE-3336: Speed up javadocs-all builds by minimizing compile costs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150486 13f79535-47bb-0310-9956-ffa450edef68
2011-07-24 21:19:42 +00:00
Martijn van Groningen 49075985fb LUCENE-3097: Added a new grouping collector that can be used to retrieve all most relevant documents per group.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150470 13f79535-47bb-0310-9956-ffa450edef68
2011-07-24 19:06:51 +00:00
Robert Muir 3626220146 use a different character for test, one that is still enclosing mark in 6.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150091 13f79535-47bb-0310-9956-ffa450edef68
2011-07-23 12:18:48 +00:00
Robert Muir 4af23c6d84 fix javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1147887 13f79535-47bb-0310-9956-ffa450edef68
2011-07-18 14:15:10 +00:00
Robert Muir 07bfe50eda LUCENE-3326: MoreLikeThis reuses a reader after it has already closed it
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1147881 13f79535-47bb-0310-9956-ffa450edef68
2011-07-18 13:56:49 +00:00
Shai Erera 561723eca6 LUCENE-3307: don't require an analyzer, if all fields are NOT_ANALYZED
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1147530 13f79535-47bb-0310-9956-ffa450edef68
2011-07-17 05:30:50 +00:00
Uwe Schindler 2e14617e19 LUCENE-1768: Fix DateFormat handling, don't lowercase numeric terms. Improve test.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1146984 13f79535-47bb-0310-9956-ffa450edef68
2011-07-15 07:08:37 +00:00
Christopher John Male a684f6d497 LUCENE-3285: Moved flexible queryparser to queryparser module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1146548 13f79535-47bb-0310-9956-ffa450edef68
2011-07-14 02:56:54 +00:00
Christopher John Male 6b700e1183 LUCENE-3271: Moved MoreLikeThis to queries module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1146542 13f79535-47bb-0310-9956-ffa450edef68
2011-07-14 02:07:11 +00:00
Robert Muir bbd85816c8 LUCENE-3306: disable positions for spellchecker ngram fields
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145957 13f79535-47bb-0310-9956-ffa450edef68
2011-07-13 11:22:31 +00:00
Robert Muir 1c646d24c9 LUCENE-2048: omit positions but keep term freq
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145594 13f79535-47bb-0310-9956-ffa450edef68
2011-07-12 13:31:22 +00:00
Christopher John Male b3671752f2 LUCENE-3285: Moved analyzing, complexphrase, etx and surround queryparsers to qp module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145430 13f79535-47bb-0310-9956-ffa450edef68
2011-07-12 02:22:24 +00:00
Michael McCandless fbf9f4ccad LUCENE-3289: add options to FST Builder to tradeoff RAM/CPU used during build vs how small the resulting FST is
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145292 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 18:53:13 +00:00
Michael McCandless a17df303e9 LUCENE-3280: add FixedBitSet and cutover most of Lucene
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145239 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 16:52:11 +00:00
Robert Muir 015ecfa0a0 LUCENE-3233: improve ram/perf of SynonymFilter, add wordnet parsing, nuke contrib/wordnet
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145158 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 12:58:52 +00:00
Koji Sekiguchi 6b1e514df9 fix javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145133 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 12:07:13 +00:00
Christopher John Male 5f30bedccc LUCENE-3283: Moved core QueryParsers to queryparser module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145016 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 03:37:00 +00:00
Shai Erera b23bafce93 enforce in-order merges
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1144781 13f79535-47bb-0310-9956-ffa450edef68
2011-07-10 05:47:55 +00:00
Steven Rowe 9e020991ef Merged with trunk up to r1144714
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/solr2452@1144715 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 18:50:54 +00:00
Christopher John Male f16f395a30 LUCENE-3284: Decoupled remaining module/contrib tests from QueryParser
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1144566 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 01:11:18 +00:00
Steven Rowe 88fe5d121f Merged with trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/solr2452@1144510 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 21:02:20 +00:00
Simon Willnauer 6c5621f16c fixed dead store variable
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1144269 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 11:26:03 +00:00
Simon Willnauer feb9739bf7 merged with trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/LUCENE2793@1144189 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 08:09:32 +00:00
Steven Rowe 4505c08643 SOLR-2452: merged with trunk up r1144161; applied the svn movement script and the latest version of the post-svn-movement patch
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/solr2452@1144174 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 06:41:23 +00:00
Robert Muir ea67cd8b2c LUCENE-2392: decouple vector space scoring from Query/Weight/Scorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1144158 13f79535-47bb-0310-9956-ffa450edef68
2011-07-08 05:08:05 +00:00
Simon Willnauer 4be688e68d merged with trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/LUCENE2793@1143719 13f79535-47bb-0310-9956-ffa450edef68
2011-07-07 09:03:58 +00:00
Michael McCandless b55eeb510d LUCENE-3246: invert getDelDocs to getLiveDocs as pre-cursor for LUCENE-1536
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1143415 13f79535-47bb-0310-9956-ffa450edef68
2011-07-06 13:54:38 +00:00
Christopher John Male 36330b50b4 LUCENE-3274: Collapsed common module into lucene core util, removed module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1143234 13f79535-47bb-0310-9956-ffa450edef68
2011-07-06 00:28:34 +00:00
Robert Muir 6e25bef3ef LUCENE-3269: speed up top-k sampling tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1143122 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 16:13:15 +00:00
Yonik Seeley 06a3778905 SOLR-1932: add sumtotaltermfreq
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1143091 13f79535-47bb-0310-9956-ffa450edef68
2011-07-05 15:06:24 +00:00
Shai Erera d04561c703 LUCENE-3268: make TestScoredDocIDsUtils more robust to document deletions
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1142676 13f79535-47bb-0310-9956-ffa450edef68
2011-07-04 14:06:38 +00:00
Yonik Seeley 3f5c0eca5b SOLR-1932: add totaltermfreq to relevancy functions
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1142481 13f79535-47bb-0310-9956-ffa450edef68
2011-07-03 19:02:57 +00:00
Shai Erera b1afef02f9 remove references to old bug tracking system numbers from facet code
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1142388 13f79535-47bb-0310-9956-ffa450edef68
2011-07-03 07:49:58 +00:00
Shai Erera e92db313d4 LUCENE-3268: fixed TestScoredDocIDsUtils to use log-merge policy
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1142386 13f79535-47bb-0310-9956-ffa450edef68
2011-07-03 07:44:01 +00:00
Christopher John Male 08b8b9b789 LUCENE-3267: Consolidated Lucene core's function impls into new module. Removed remainder
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141747 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 23:20:31 +00:00
Dawid Weiss 796fa6def3 JavaDoc warnings squashed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141689 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 19:52:31 +00:00
Dawid Weiss dab351a096 Changing the licensing from CC-SA (approved by Apache anyway, but we don't want any issues) to MPL (we've got an agreement from Marcin Milkowski; the license statement has been updated in Morfologik's repository too.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141673 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 19:16:32 +00:00
Dawid Weiss 29b09032d3 LUCENE-2341: integrating morfologik (Polish stemming/ morphosyntactic dictionary) as an analysis module.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141671 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 19:12:54 +00:00
Robert Muir cec86dbc06 LUCENE-3264: crank up faceting module tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141629 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 17:16:36 +00:00
Michael McCandless 9285e08bce LUCENE-3260: fix wrong result from MultiTermsEnum.next() after seekExact
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141593 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 16:05:42 +00:00
Uwe Schindler 988b53e499 Facets: Remove stupid array->List method and use Array.asList in example
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141518 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 12:53:13 +00:00
Robert Muir 295706f9f8 LUCENE-3239: remove use of slow Arrays.copyOf
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141510 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 12:42:17 +00:00
Simon Willnauer b843dfe839 LUCENE-3239: Drop Java 5 support on trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141501 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 12:14:36 +00:00
Doron Cohen e1d37f5e09 LUCENE-3142: benchmark/stats package is obsolete and unused - remove it
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141465 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 10:00:52 +00:00
Steven Rowe cfcf3a249d Put join in its place in the alphabetically sorted module lists.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141396 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 05:03:59 +00:00
Christopher John Male 2266c70ce4 LUCENE-3256: Consolidated CustomScoreQuery and BoostedQuery into Queries module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141366 13f79535-47bb-0310-9956-ffa450edef68
2011-06-30 02:31:22 +00:00
Robert Muir 4291f52c70 LUCENE-3079: faceting module (port to trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141246 13f79535-47bb-0310-9956-ffa450edef68
2011-06-29 21:02:07 +00:00
Michael McCandless 2b8ef42a5c LUCENE-3171: fixup CHANGES, add missing @Override
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141170 13f79535-47bb-0310-9956-ffa450edef68
2011-06-29 17:14:13 +00:00
Dawid Weiss f85c4e7c88 Reverting 1141022 (needs to wait for 1.6 support).
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141032 13f79535-47bb-0310-9956-ffa450edef68
2011-06-29 10:00:36 +00:00
Dawid Weiss d188d3df90 LUCENE-2341: integrating morfologik (Polish stemming/ morphosyntactic dictionary) as an analysis module.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1141022 13f79535-47bb-0310-9956-ffa450edef68
2011-06-29 09:24:14 +00:00
Christopher John Male a993ae9032 LUCENE-3249: Moved FunctionQuery impls to new Queries module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1140906 13f79535-47bb-0310-9956-ffa450edef68
2011-06-28 23:32:27 +00:00
Michael McCandless 4a3b510739 LUCENE-3171: add modules/join to enable joining parent + child documents when indexed as a doc block
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1140851 13f79535-47bb-0310-9956-ffa450edef68
2011-06-28 21:20:18 +00:00
Robert Muir dc328b0714 add placeholder to stop eclipse whining
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1140658 13f79535-47bb-0310-9956-ffa450edef68
2011-06-28 14:45:55 +00:00
Robert Muir 6f6b3cbc5a LUCENE-3250: remove contrib/misc,wordnet,suggest dependencies on modules/analysis
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1140394 13f79535-47bb-0310-9956-ffa450edef68
2011-06-28 01:44:15 +00:00
Christopher John Male b3f0ae9b7a LUCENE-3240: Moved FunctionQuery, ValueSource and DocValues to new queries module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1140379 13f79535-47bb-0310-9956-ffa450edef68
2011-06-27 23:45:18 +00:00
Robert Muir b5f676b992 add empty tests dir, to stop eclipse whining
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1139985 13f79535-47bb-0310-9956-ffa450edef68
2011-06-27 00:55:55 +00:00
Michael McCandless cf0e5ec467 LUCENE-3225: add TermsEnum.seekExact(BytesRef); renamne existing methods to seekCeil/seekExact
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1139788 13f79535-47bb-0310-9956-ffa450edef68
2011-06-26 13:33:58 +00:00
Christopher John Male 06653e0e62 LUCENE-3232: Fixed duplicate ant project name
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1139489 13f79535-47bb-0310-9956-ffa450edef68
2011-06-25 04:08:55 +00:00
Christopher John Male f37c1aaead LUCENE-3232: Added missing overview.html
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1139480 13f79535-47bb-0310-9956-ffa450edef68
2011-06-25 02:11:31 +00:00
Christopher John Male 9bf4c3fd8f LUCENE-3232: Moved MutableValues to common module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1139467 13f79535-47bb-0310-9956-ffa450edef68
2011-06-25 00:03:35 +00:00
Yonik Seeley a7a7108ec8 SOLR-2564: add pollLast to deal with performance regression due to Java5/Java6 difference between modules and Solr
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1138735 13f79535-47bb-0310-9956-ffa450edef68
2011-06-23 05:05:54 +00:00
Christopher John Male f9ed2c19cd LUCENE-3219: Moved SortField types to Enum
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1138276 13f79535-47bb-0310-9956-ffa450edef68
2011-06-22 01:48:45 +00:00
Michael McCandless c039e210b0 LUCENE-2548: don't intern field names
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1138058 13f79535-47bb-0310-9956-ffa450edef68
2011-06-21 15:42:54 +00:00
Christopher John Male 8db3c5af74 LUCENE-3223: Fixed if else bug preventing sorting by doc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1137882 13f79535-47bb-0310-9956-ffa450edef68
2011-06-21 06:56:02 +00:00
Uwe Schindler 1ca3506f6d LUCENE-3208: Missed change in grouping module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1136605 13f79535-47bb-0310-9956-ffa450edef68
2011-06-16 19:03:35 +00:00
Michael McCandless fe7593efee LUCENE-3191: simplify API; return TopFieldDocs from TopDocs.merge if sort != null; add some missing compareValue to FC impls
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1136467 13f79535-47bb-0310-9956-ffa450edef68
2011-06-16 14:52:56 +00:00
Michael McCandless a1e74fd6c9 LUCENE-3191: add TopDocs.merge, TopGroups.merge, SearchGroup.merge, to make sharding easier
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1136080 13f79535-47bb-0310-9956-ffa450edef68
2011-06-15 15:12:48 +00:00
Dawid Weiss 7614bedc9d LUCENE-3202: Add DataInput/DataOutput subclasses that delegate to an InputStream/OutputStream.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1135506 13f79535-47bb-0310-9956-ffa450edef68
2011-06-14 12:03:56 +00:00
Robert Muir eca56e0564 LUCENE-152: minor optimization to avoid some char[]/String creation
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1134328 13f79535-47bb-0310-9956-ffa450edef68
2011-06-10 14:00:32 +00:00
Simon Willnauer 2007a4b4e0 Remove @Version tags from JavaDoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1133805 13f79535-47bb-0310-9956-ffa450edef68
2011-06-09 11:43:35 +00:00
Michael McCandless 0f91b38fae LUCENE-3129: fix false TestGrouping failure
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1131401 13f79535-47bb-0310-9956-ffa450edef68
2011-06-04 13:58:37 +00:00
Michael McCandless 01c8469ab3 LUCENE-3129: BlockGroupingCollector wasn't tracking scores correctly; fang'd up TestGrouping to reveal the bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1131158 13f79535-47bb-0310-9956-ffa450edef68
2011-06-03 19:31:51 +00:00
Michael McCandless 27d8311ffc rename to test to match the class it's testing
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1131150 13f79535-47bb-0310-9956-ffa450edef68
2011-06-03 19:17:40 +00:00
Michael McCandless 526d6c937e LUCENE-3099: fix AllGroupsCollectorTest
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130859 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 22:42:27 +00:00
Michael McCandless d1548ca30a LUCENE-3099: allow subclasses to determine the group value
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130858 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 22:35:17 +00:00
Steven Rowe d2cc7f1330 LUCENE-3149: Switch ICU4J dependency to mavenized version
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130718 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 18:42:26 +00:00
Steven Rowe 8428aa9c0d LUCENE-3149: Updated ICU4J notice
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130676 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 17:34:25 +00:00
Michael McCandless b680eaf1af LUCENE-3129: move CHANGES entry into 3.x not 3.2
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130663 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 17:18:50 +00:00
Michael McCandless 375c1abdbf LUCENE-3129: add single pass grouping collector, BlockGroupingCollector
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130648 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 16:46:56 +00:00
Robert Muir fb4c480591 add missing LICENSE/NOTICE for modules
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130620 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 16:00:04 +00:00
Ryan McKinley 50fb06de1a LUCENE-3149 -- fix maven-dist
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130547 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 13:42:55 +00:00
Robert Muir b7277878e8 LUCENE-152: add KStem
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130527 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 12:58:22 +00:00
Robert Muir 5fff60467f LUCENE-3149: upgrade icu to 4.8
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1130439 13f79535-47bb-0310-9956-ffa450edef68
2011-06-02 08:58:34 +00:00
Robert Muir 063d18e280 LUCENE-3163: add link to jira versions information to CHANGES.txt files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129656 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 13:03:40 +00:00
Robert Muir 294bd36c8a LUCENE-3159: lucene benchmark has some unnecessary files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129456 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 01:41:46 +00:00
Robert Muir 7bf34183d0 LUCENE-3158: put licensing information in the LICENSE.txt file where it belongs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129450 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 01:22:03 +00:00
Robert Muir c8101d708c LUCENE-3160: lucene source build doesn't work correctly by itself
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129445 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 00:08:14 +00:00
Michael McCandless 150d8b3089 LUCENE-3140: move fst to .. on trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1128871 13f79535-47bb-0310-9956-ffa450edef68
2011-05-29 13:12:32 +00:00
Shai Erera c5f5f67b25 LUCENE-3147: MockDirectoryWrapper should track open file handles of IndexOutput
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1128830 13f79535-47bb-0310-9956-ffa450edef68
2011-05-29 05:48:36 +00:00
Steven Rowe d4aaf0ce3d LUCENE-2995: specialize the dist-maven target for the new suggest module so that the generate-maven-artifacts target can succeed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1128121 13f79535-47bb-0310-9956-ffa450edef68
2011-05-26 23:32:56 +00:00
Doron Cohen f24c33f4fe LUCENE-3137: Benchmark's ExtractReuters created its temp dir wrongly if provided out-dir param ended by slash
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1127436 13f79535-47bb-0310-9956-ffa450edef68
2011-05-25 08:50:16 +00:00
Simon Willnauer 68a840c2b7 SOLR-2530: Remove Noggit CharArr from FieldType
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1127326 13f79535-47bb-0310-9956-ffa450edef68
2011-05-24 22:44:36 +00:00
Michael McCandless 0775014bb9 LUCENE-3098: don't double-lookup the doc's ord
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1126761 13f79535-47bb-0310-9956-ffa450edef68
2011-05-23 21:31:57 +00:00
Robert Muir f5048293b5 LUCENE-2995: factor out a shared spellchecking module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1126642 13f79535-47bb-0310-9956-ffa450edef68
2011-05-23 18:33:53 +00:00
Koji Sekiguchi 49fe358760 use prettify in grouping package javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1126275 13f79535-47bb-0310-9956-ffa450edef68
2011-05-22 23:56:43 +00:00
Shai Erera fa5da66404 LUCENE-3102: add no-wrap ability to CachingCollector
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1124379 13f79535-47bb-0310-9956-ffa450edef68
2011-05-18 18:51:39 +00:00
Michael McCandless 92b00742ee fix javadoc warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1124298 13f79535-47bb-0310-9956-ffa450edef68
2011-05-18 15:17:51 +00:00
Shai Erera fa23344e40 LUCENE-3102: add factory method to CachingCollector
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1104683 13f79535-47bb-0310-9956-ffa450edef68
2011-05-18 04:00:48 +00:00
Robert Muir f1a1844fb9 LUCENE-3113: fix analyzer bugs found by MockTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1104519 13f79535-47bb-0310-9956-ffa450edef68
2011-05-17 20:16:40 +00:00
Michael McCandless 1c464e6dcc LUCENE-3098: add AllGroupsCollector
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1104421 13f79535-47bb-0310-9956-ffa450edef68
2011-05-17 17:20:54 +00:00
Shai Erera f5fdea8dda LUCENE-3102: first cut - some refactoring, bug fixes, add test, move to core (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103872 13f79535-47bb-0310-9956-ffa450edef68
2011-05-16 20:29:10 +00:00
Shai Erera 680bb70e3a use assertArrayEquals instead of the deprecated version
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103601 13f79535-47bb-0310-9956-ffa450edef68
2011-05-16 03:17:55 +00:00
Steven Rowe acb3c8be3a LUCENE-1421: Specialize the dist-maven target for the new grouping module, so that generate-maven-artifacts can succeed
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103227 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 21:24:50 +00:00
Michael McCandless c7db217230 LUCENE-1421: moves CHANGES entry up to contrib's 3.x CHANGES
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103155 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 16:44:04 +00:00
Michael McCandless a8993ca817 LUCENE-1421: fixup javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103150 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 16:17:58 +00:00
Robert Muir 4c85f799eb LUCENE-1421: make class public
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103102 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 13:26:26 +00:00
Robert Muir 2b3e066518 LUCENE-1421: add workaround to unbreak the build
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103048 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 11:25:56 +00:00
Michael McCandless 9083cedb92 LUCENE-1421: factor out shared grouping module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103024 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 10:48:29 +00:00
Robert Muir a871b29ed6 LUCENE-3086: add ElisionFilter to ItalianAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1102120 13f79535-47bb-0310-9956-ffa450edef68
2011-05-11 22:43:54 +00:00
Ryan McKinley 96878534a0 LUCENE-3071: Add ReversePathHierarchyTokenizer and enable skip on PathHierarchyTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1099999 13f79535-47bb-0310-9956-ffa450edef68
2011-05-05 23:30:05 +00:00
Robert Muir 4455345c6e LUCENE-3063: factor CharTokenizer/CharacterUtils into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098871 13f79535-47bb-0310-9956-ffa450edef68
2011-05-03 00:29:47 +00:00
Robert Muir a75e5282c7 collation tests: try to find less jre bugs and just test thread safety
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098532 13f79535-47bb-0310-9956-ffa450edef68
2011-05-02 12:03:14 +00:00
Robert Muir 1f67321074 missing svn:eol-style
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1097216 13f79535-47bb-0310-9956-ffa450edef68
2011-04-27 19:40:18 +00:00
Robert Muir 44ba0859db LUCENE-2560: stress tests for icu integration
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096339 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 16:07:16 +00:00
Robert Muir 593d7a54ea LUCENE-3044: ThaiWordFilter uses AttributeSource.copyTo incorrectly
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096334 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 15:45:45 +00:00
Robert Muir 7db98455e7 LUCENE-3043: GermanStemmer threw IOOBE on zero-length tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096194 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:48:17 +00:00
Robert Muir c0c695053c LUCENE-2560: remove copy/paste unused import
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096183 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:16:51 +00:00
Robert Muir 68061ef921 LUCENE-2560: add basic stress tests for analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096178 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 16:55:15 +00:00
Robert Muir c3f6331639 LUCENE-3016: add analyzer for Latvian
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092396 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 17:07:10 +00:00
Robert Muir ecd795c585 LUCENE-3026: SmartChineseAnalyzer's WordTokenFilter threw NullPointerException on sentences longer than 32,767 characters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092328 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 15:15:31 +00:00
Robert Muir 52b54262dc LUCENE-3020: don't reflect mockanalyzer, it has no no-arg ctor anymore
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091159 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 18:15:50 +00:00
Robert Muir 7d07d206b5 LUCENE-3020: better payload testing with mockanalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091132 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 17:20:31 +00:00
Steven Rowe c613d642a0 LUCENE-3006: specialized definition of javadoc.classpath is not required for building ICU analysis module's javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1088063 13f79535-47bb-0310-9956-ffa450edef68
2011-04-02 16:47:24 +00:00
Michael McCandless f10d92398b LUCENE-1076: new TieredMergePolicy
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1088051 13f79535-47bb-0310-9956-ffa450edef68
2011-04-02 15:47:12 +00:00
Steven Rowe 14eb02ffa4 LUCENE-3006: die javadoc warnings die (modules/ edition)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087830 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 17:43:24 +00:00
Robert Muir d940c24c03 fix benchmark collation test to match reality
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087548 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 01:58:35 +00:00
Robert Muir 74a065a57f fix collation benchmark to use byte terms
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087529 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 00:47:16 +00:00
Steven Rowe 7402c50058 fix typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087475 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:53:58 +00:00
Steven Rowe 085d30ecf3 changes entries for recent commits
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087474 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:53:21 +00:00
Steven Rowe 1caaea77b2 ReadTokensTask now converts tokens to their indexed forms (char[]->byte[]), just as the indexer does. This allows measurement of the conversion process, which is important for analysis components that customize it, e.g. (ICU)CollationKeyFilter.
NB: as a result, benchmarks that incorporate this task will no longer be directly comparable between 3.X and 4.0

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087471 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:44:20 +00:00
Steven Rowe 9cefe60a4b Removed special case for looking up KeywordAnalyzer, which is *not* alone among analyzers occupying package o.a.l.analysis.core.
Instead, now attempting to instantiate no-package analyzers as core analyzers, then falling back to the previous default package ("org.apache.lucene.analysis.") if that fails.  Also, made the same changes in NewShingleAnalyzerTask.

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087468 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:34:46 +00:00
Steven Rowe 3bbfa450e4 Updated to the new method for obtaining a top-level deleted docs bitset. Also checking the bitset for null, when there are no deleted docs.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087435 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 21:03:18 +00:00
Steven Rowe 56c2994f66 Added a special case for looking up KeywordAnalyzer, which alone among analyzers occupies package o.a.l.analysis.core.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087431 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 20:16:10 +00:00
Doron Cohen 8d0c1b62af LUCENE-2977: WriteLineDocTask should write gzip/bzip2/txt according to the extension of specified output file name.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084929 13f79535-47bb-0310-9956-ffa450edef68
2011-03-24 12:22:13 +00:00
Doron Cohen c6f3dd5cc7 LUCENE-2980: Benchmark's ContentSource made insensitive to letter case of file suffix - fix CHANGES entry.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084549 13f79535-47bb-0310-9956-ffa450edef68
2011-03-23 11:47:18 +00:00
Doron Cohen d123b8a224 LUCENE-2980: Benchmark's ContentSource made insensitive to letter case of file suffix.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084544 13f79535-47bb-0310-9956-ffa450edef68
2011-03-23 11:38:54 +00:00
Grant Ingersoll ed20a24d22 LUCENE-2952: restore src/tools and move validation there
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084274 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 18:03:57 +00:00
Doron Cohen 97909a908e fix mis-spelled assert comment (again)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084273 13f79535-47bb-0310-9956-ffa450edef68
2011-03-22 18:03:00 +00:00