178 Commits

Author SHA1 Message Date
Robert Muir 294bd36c8a LUCENE-3159: lucene benchmark has some unnecessary files
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129456 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 01:41:46 +00:00
Robert Muir 7bf34183d0 LUCENE-3158: put licensing information in the LICENSE.txt file where it belongs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129450 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 01:22:03 +00:00
Robert Muir c8101d708c LUCENE-3160: lucene source build doesn't work correctly by itself
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1129445 13f79535-47bb-0310-9956-ffa450edef68
2011-05-31 00:08:14 +00:00
Michael McCandless 150d8b3089 LUCENE-3140: move fst to .. on trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1128871 13f79535-47bb-0310-9956-ffa450edef68
2011-05-29 13:12:32 +00:00
Shai Erera c5f5f67b25 LUCENE-3147: MockDirectoryWrapper should track open file handles of IndexOutput
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1128830 13f79535-47bb-0310-9956-ffa450edef68
2011-05-29 05:48:36 +00:00
Steven Rowe d4aaf0ce3d LUCENE-2995: specialize the dist-maven target for the new suggest module so that the generate-maven-artifacts target can succeed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1128121 13f79535-47bb-0310-9956-ffa450edef68
2011-05-26 23:32:56 +00:00
Doron Cohen f24c33f4fe LUCENE-3137: Benchmark's ExtractReuters created its temp dir wrongly if provided out-dir param ended by slash
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1127436 13f79535-47bb-0310-9956-ffa450edef68
2011-05-25 08:50:16 +00:00
Simon Willnauer 68a840c2b7 SOLR-2530: Remove Noggit CharArr from FieldType
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1127326 13f79535-47bb-0310-9956-ffa450edef68
2011-05-24 22:44:36 +00:00
Michael McCandless 0775014bb9 LUCENE-3098: don't double-lookup the doc's ord
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1126761 13f79535-47bb-0310-9956-ffa450edef68
2011-05-23 21:31:57 +00:00
Robert Muir f5048293b5 LUCENE-2995: factor out a shared spellchecking module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1126642 13f79535-47bb-0310-9956-ffa450edef68
2011-05-23 18:33:53 +00:00
Koji Sekiguchi 49fe358760 use prettify in grouping package javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1126275 13f79535-47bb-0310-9956-ffa450edef68
2011-05-22 23:56:43 +00:00
Shai Erera fa5da66404 LUCENE-3102: add no-wrap ability to CachingCollector
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1124379 13f79535-47bb-0310-9956-ffa450edef68
2011-05-18 18:51:39 +00:00
Michael McCandless 92b00742ee fix javadoc warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1124298 13f79535-47bb-0310-9956-ffa450edef68
2011-05-18 15:17:51 +00:00
Shai Erera fa23344e40 LUCENE-3102: add factory method to CachingCollector
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1104683 13f79535-47bb-0310-9956-ffa450edef68
2011-05-18 04:00:48 +00:00
Robert Muir f1a1844fb9 LUCENE-3113: fix analyzer bugs found by MockTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1104519 13f79535-47bb-0310-9956-ffa450edef68
2011-05-17 20:16:40 +00:00
Michael McCandless 1c464e6dcc LUCENE-3098: add AllGroupsCollector
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1104421 13f79535-47bb-0310-9956-ffa450edef68
2011-05-17 17:20:54 +00:00
Shai Erera f5fdea8dda LUCENE-3102: first cut - some refactoring, bug fixes, add test, move to core (trunk)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103872 13f79535-47bb-0310-9956-ffa450edef68
2011-05-16 20:29:10 +00:00
Shai Erera 680bb70e3a use assertArrayEquals instead of the deprecated version
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103601 13f79535-47bb-0310-9956-ffa450edef68
2011-05-16 03:17:55 +00:00
Steven Rowe acb3c8be3a LUCENE-1421: Specialize the dist-maven target for the new grouping module, so that generate-maven-artifacts can succeed
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103227 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 21:24:50 +00:00
Michael McCandless c7db217230 LUCENE-1421: moves CHANGES entry up to contrib's 3.x CHANGES
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103155 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 16:44:04 +00:00
Michael McCandless a8993ca817 LUCENE-1421: fixup javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103150 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 16:17:58 +00:00
Robert Muir 4c85f799eb LUCENE-1421: make class public
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103102 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 13:26:26 +00:00
Robert Muir 2b3e066518 LUCENE-1421: add workaround to unbreak the build
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103048 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 11:25:56 +00:00
Michael McCandless 9083cedb92 LUCENE-1421: factor out shared grouping module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1103024 13f79535-47bb-0310-9956-ffa450edef68
2011-05-14 10:48:29 +00:00
Robert Muir a871b29ed6 LUCENE-3086: add ElisionFilter to ItalianAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1102120 13f79535-47bb-0310-9956-ffa450edef68
2011-05-11 22:43:54 +00:00
Ryan McKinley 96878534a0 LUCENE-3071: Add ReversePathHierarchyTokenizer and enable skip on PathHierarchyTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1099999 13f79535-47bb-0310-9956-ffa450edef68
2011-05-05 23:30:05 +00:00
Robert Muir 4455345c6e LUCENE-3063: factor CharTokenizer/CharacterUtils into analyzers module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098871 13f79535-47bb-0310-9956-ffa450edef68
2011-05-03 00:29:47 +00:00
Robert Muir a75e5282c7 collation tests: try to find less jre bugs and just test thread safety
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1098532 13f79535-47bb-0310-9956-ffa450edef68
2011-05-02 12:03:14 +00:00
Robert Muir 1f67321074 missing svn:eol-style
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1097216 13f79535-47bb-0310-9956-ffa450edef68
2011-04-27 19:40:18 +00:00
Robert Muir 44ba0859db LUCENE-2560: stress tests for icu integration
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096339 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 16:07:16 +00:00
Robert Muir 593d7a54ea LUCENE-3044: ThaiWordFilter uses AttributeSource.copyTo incorrectly
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096334 13f79535-47bb-0310-9956-ffa450edef68
2011-04-24 15:45:45 +00:00
Robert Muir 7db98455e7 LUCENE-3043: GermanStemmer threw IOOBE on zero-length tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096194 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:48:17 +00:00
Robert Muir c0c695053c LUCENE-2560: remove copy/paste unused import
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096183 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 17:16:51 +00:00
Robert Muir 68061ef921 LUCENE-2560: add basic stress tests for analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1096178 13f79535-47bb-0310-9956-ffa450edef68
2011-04-23 16:55:15 +00:00
Robert Muir c3f6331639 LUCENE-3016: add analyzer for Latvian
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092396 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 17:07:10 +00:00
Robert Muir ecd795c585 LUCENE-3026: SmartChineseAnalyzer's WordTokenFilter threw NullPointerException on sentences longer than 32,767 characters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1092328 13f79535-47bb-0310-9956-ffa450edef68
2011-04-14 15:15:31 +00:00
Robert Muir 52b54262dc LUCENE-3020: don't reflect mockanalyzer, it has no no-arg ctor anymore
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091159 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 18:15:50 +00:00
Robert Muir 7d07d206b5 LUCENE-3020: better payload testing with mockanalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1091132 13f79535-47bb-0310-9956-ffa450edef68
2011-04-11 17:20:31 +00:00
Steven Rowe c613d642a0 LUCENE-3006: specialized definition of javadoc.classpath is not required for building ICU analysis module's javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1088063 13f79535-47bb-0310-9956-ffa450edef68
2011-04-02 16:47:24 +00:00
Michael McCandless f10d92398b LUCENE-1076: new TieredMergePolicy
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1088051 13f79535-47bb-0310-9956-ffa450edef68
2011-04-02 15:47:12 +00:00
Steven Rowe 14eb02ffa4 LUCENE-3006: die javadoc warnings die (modules/ edition)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087830 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 17:43:24 +00:00
Robert Muir d940c24c03 fix benchmark collation test to match reality
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087548 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 01:58:35 +00:00
Robert Muir 74a065a57f fix collation benchmark to use byte terms
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087529 13f79535-47bb-0310-9956-ffa450edef68
2011-04-01 00:47:16 +00:00
Steven Rowe 7402c50058 fix typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087475 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:53:58 +00:00
Steven Rowe 085d30ecf3 changes entries for recent commits
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087474 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:53:21 +00:00
Steven Rowe 1caaea77b2 ReadTokensTask now converts tokens to their indexed forms (char[]->byte[]), just as the indexer does. This allows measurement of the conversion process, which is important for analysis components that customize it, e.g. (ICU)CollationKeyFilter.
NB: as a result, benchmarks that incorporate this task will no longer be directly comparable between 3.X and 4.0

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087471 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:44:20 +00:00
Steven Rowe 9cefe60a4b Removed special case for looking up KeywordAnalyzer, which is *not* alone among analyzers occupying package o.a.l.analysis.core.
Instead, now attempting to instantiate no-package analyzers as core analyzers, then falling back to the previous default package ("org.apache.lucene.analysis.") if that fails.  Also, made the same changes in NewShingleAnalyzerTask.

git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087468 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 22:34:46 +00:00
Steven Rowe 3bbfa450e4 Updated to the new method for obtaining a top-level deleted docs bitset. Also checking the bitset for null, when there are no deleted docs.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087435 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 21:03:18 +00:00
Steven Rowe 56c2994f66 Added a special case for looking up KeywordAnalyzer, which alone among analyzers occupies package o.a.l.analysis.core.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1087431 13f79535-47bb-0310-9956-ffa450edef68
2011-03-31 20:16:10 +00:00
Doron Cohen 8d0c1b62af LUCENE-2977: WriteLineDocTask should write gzip/bzip2/txt according to the extension of specified output file name.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1084929 13f79535-47bb-0310-9956-ffa450edef68
2011-03-24 12:22:13 +00:00