Commit Graph

184 Commits

Author SHA1 Message Date
Chris M. Hostetter 3ed5106920 SOLR-2819: Improved speed of parsing hex entities in HTMLStripCharFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1208032 13f79535-47bb-0310-9956-ffa450edef68
2011-11-29 19:15:54 +00:00
Robert Muir 7f766cf603 LUCENE-3590: nuke BytesRef.utf8ToChars
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206174 13f79535-47bb-0310-9956-ffa450edef68
2011-11-25 13:55:41 +00:00
Robert Muir 3b6da22aa7 LUCENE-3590: clearly mark bogus deep-copying apis in BytesRef
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1206143 13f79535-47bb-0310-9956-ffa450edef68
2011-11-25 12:50:13 +00:00
Robert Muir 873f199924 LUCENE-2621: move TermVectors,FieldInfos,SegmentInfos to codec
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1202842 13f79535-47bb-0310-9956-ffa450edef68
2011-11-16 19:09:35 +00:00
Robert Muir 598920d7bd LUCENE-3571: nuke IndexSearcher(Directory)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1202657 13f79535-47bb-0310-9956-ffa450edef68
2011-11-16 12:19:41 +00:00
Simon Willnauer ee293e7e7d fix javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1200111 13f79535-47bb-0310-9956-ffa450edef68
2011-11-10 03:32:33 +00:00
Simon Willnauer c0a7abbec0 LUCENE-2564: Cut over WordListLoader to CharArrayMap/Set and use CharSetDecoder to detect encoding problems early
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1200091 13f79535-47bb-0310-9956-ffa450edef68
2011-11-10 01:52:48 +00:00
Simon Willnauer dc6b4b6533 LUCENE-2564: Cut over WordListLoader to CharArrayMap/Set and use CharSetDecoder to detect encoding problems early
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1200080 13f79535-47bb-0310-9956-ffa450edef68
2011-11-10 01:21:25 +00:00
Robert Muir fa6500fa6c LUCENE-3490: restructure codec hierarchy
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1197603 13f79535-47bb-0310-9956-ffa450edef68
2011-11-04 15:43:35 +00:00
Robert Muir d5601eb371 SOLR-2276: Support for cologne phonetic
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1195082 13f79535-47bb-0310-9956-ffa450edef68
2011-10-30 01:00:06 +00:00
Uwe Schindler a91efbedd1 LUCENE-3530: Remove deprecated methods in CompoundTokenFilters
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1188613 13f79535-47bb-0310-9956-ffa450edef68
2011-10-25 11:31:16 +00:00
Uwe Schindler ec186e7280 LUCENE-3508: Decompounders based on CompoundWordTokenFilterBase can now be used with custom attributes. All those attributes are preserved and set on all added decompounded tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1188597 13f79535-47bb-0310-9956-ffa450edef68
2011-10-25 10:44:36 +00:00
Robert Muir f21ac2f58c LUCENE-3301: add workaround for jre breakiterator bugs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1187900 13f79535-47bb-0310-9956-ffa450edef68
2011-10-23 14:55:25 +00:00
Robert Muir 7af9fbd16d LUCENE-3521: upgrade icu jar to 4.8.1.1 / remove lucenetestcase hack
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1183738 13f79535-47bb-0310-9956-ffa450edef68
2011-10-15 21:48:50 +00:00
Robert Muir 9ba4ce2ed5 javadocs fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1182505 13f79535-47bb-0310-9956-ffa450edef68
2011-10-12 18:20:41 +00:00
Jan Høydahl 22dcd39d9e SOLR-2792: Allow case insensitive Hunspell stemming
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1179459 13f79535-47bb-0310-9956-ffa450edef68
2011-10-05 22:08:55 +00:00
Michael McCandless ec2b654231 LUCENE-3477: add explicit breaks in jflex sources so we don't hit compiler warnings; fix a couple other warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1177723 13f79535-47bb-0310-9956-ffa450edef68
2011-09-30 16:23:24 +00:00
Christopher John Male 8d28270460 LUCENE-3470: Changed Field constructor signatures order to value, fieldtype
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1176773 13f79535-47bb-0310-9956-ffa450edef68
2011-09-28 08:07:16 +00:00
Christopher John Male 67c13bd2fe LUCENE-3455: Renamed Analyzer.reusableTokenStream to Analyzer.tokenStream
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1176728 13f79535-47bb-0310-9956-ffa450edef68
2011-09-28 05:26:54 +00:00
Christopher John Male 0bed3142bb LUCENE-3455: Test Analysis consumers now use reusableTokenStream
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1175670 13f79535-47bb-0310-9956-ffa450edef68
2011-09-26 04:58:48 +00:00
Christopher John Male 4ff0b2f82c LUCENE-3396: Collapsing Analyzer and ReusableAnalyzerBase together, mandating use of TokenStreamComponents
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1175297 13f79535-47bb-0310-9956-ffa450edef68
2011-09-25 05:10:25 +00:00
Christopher John Male 318911200d LUCENE-3434: Removed state changing setters in ShingleAnalyzerWrapper and PerFieldAnalyzerWrapper
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1170942 13f79535-47bb-0310-9956-ffa450edef68
2011-09-15 03:21:17 +00:00
Christopher John Male 94028fe11a LUCENE-3431: Removed deprecated addStopwords methods in QueryAutoStopWordAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1170424 13f79535-47bb-0310-9956-ffa450edef68
2011-09-14 03:33:50 +00:00
Christopher John Male 3597bc4bf4 LUCENE-3396: Converted simple Analyzers which got lost in merging
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1169654 13f79535-47bb-0310-9956-ffa450edef68
2011-09-12 09:00:42 +00:00
Christopher John Male 4c5606ee29 LUCENE-3396: Converted most Analyzers over to using ReusableAnalyzerBase
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1169607 13f79535-47bb-0310-9956-ffa450edef68
2011-09-12 05:50:26 +00:00
Robert Muir a027a35583 nocommit -> TODO
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1169474 13f79535-47bb-0310-9956-ffa450edef68
2011-09-11 16:39:59 +00:00
Christopher John Male e3172b9239 LUCENE-3414: Added Hunspell for Lucene
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1167467 13f79535-47bb-0310-9956-ffa450edef68
2011-09-10 06:00:39 +00:00
Robert Muir 128aaf8387 LUCENE-3410: move changes to 3.5 and nuke deprecated code in trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1166770 13f79535-47bb-0310-9956-ffa450edef68
2011-09-08 15:56:01 +00:00
Robert Muir b265d499f2 LUCENE-3417: DictionaryCompoundWordFilter did not properly add tokens from the end compound word
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1166728 13f79535-47bb-0310-9956-ffa450edef68
2011-09-08 14:59:15 +00:00
Christopher John Male 4b44bd7d83 LUCENE-3410: Deprecated multi-int constructors in WordDelimiterFilter. Now uses int bitfield
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1165995 13f79535-47bb-0310-9956-ffa450edef68
2011-09-07 04:43:10 +00:00
Michael McCandless 4dad0ba89f LUCENE-2308: cutover to FieldType
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1162347 13f79535-47bb-0310-9956-ffa450edef68
2011-08-27 13:27:01 +00:00
Christopher John Male 0f2d7ad556 LUCENE-3397: Cleaned up remaining test TSs and PatterAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1161986 13f79535-47bb-0310-9956-ffa450edef68
2011-08-26 04:16:19 +00:00
Christopher John Male 1057d24e7f LUCENE-3400: Removed DutchAnalyzer.setStemDictionary
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1161484 13f79535-47bb-0310-9956-ffa450edef68
2011-08-25 10:32:21 +00:00
Christopher John Male 0ef9c3c25f LUCENE-3376: Moved ReusableAnalyzerBase to core
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1160117 13f79535-47bb-0310-9956-ffa450edef68
2011-08-22 06:01:31 +00:00
Robert Muir a5d2d78cec LUCENE-3378: nuke another useless custom test-classpath
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1158857 13f79535-47bb-0310-9956-ffa450edef68
2011-08-17 18:21:41 +00:00
Robert Muir efbdae6dd2 LUCENE-3378: remove unneeded special test-classpaths in build
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1158821 13f79535-47bb-0310-9956-ffa450edef68
2011-08-17 16:45:37 +00:00
Robert Muir 99ac972281 LUCENE-3378: move collationtestbase to tests-framework
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1158819 13f79535-47bb-0310-9956-ffa450edef68
2011-08-17 16:43:13 +00:00
Robert Muir 8a0578dfe2 LUCENE-3378: move VocabularyAssert to test-framework
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1158730 13f79535-47bb-0310-9956-ffa450edef68
2011-08-17 14:19:15 +00:00
Robert Muir 7eab19aff7 LUCENE-3375: fix synonyms bug where keepOrig=false would discard unmatched inputs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1158342 13f79535-47bb-0310-9956-ffa450edef68
2011-08-16 16:01:05 +00:00
Robert Muir f7237cb165 LUCENE-3361: remove api deprecations in trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154943 13f79535-47bb-0310-9956-ffa450edef68
2011-08-08 12:17:33 +00:00
Robert Muir ef56f5d551 LUCENE-3361: port url+email tokenizer to standardtokenizerinterface, fix combining marks bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154936 13f79535-47bb-0310-9956-ffa450edef68
2011-08-08 11:57:59 +00:00
Robert Muir 2dda5bc35f LUCENE-3358: StandardTokenizer wrongly discarded combining marks attached to Han/Hiragana
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1154005 13f79535-47bb-0310-9956-ffa450edef68
2011-08-04 20:49:47 +00:00
Steven Rowe 23d22e4d47 LUCENE-3337: avoid building jar files unless necessary in build
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1151720 13f79535-47bb-0310-9956-ffa450edef68
2011-07-28 04:02:09 +00:00
Uwe Schindler 014dee7cf5 revert accidental commit
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150488 13f79535-47bb-0310-9956-ffa450edef68
2011-07-24 21:22:07 +00:00
Uwe Schindler 9c73f9d03b LUCENE-3336: Speed up javadocs-all builds by minimizing compile costs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150486 13f79535-47bb-0310-9956-ffa450edef68
2011-07-24 21:19:42 +00:00
Robert Muir 3626220146 use a different character for test, one that is still enclosing mark in 6.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1150091 13f79535-47bb-0310-9956-ffa450edef68
2011-07-23 12:18:48 +00:00
Michael McCandless fbf9f4ccad LUCENE-3289: add options to FST Builder to tradeoff RAM/CPU used during build vs how small the resulting FST is
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145292 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 18:53:13 +00:00
Robert Muir 015ecfa0a0 LUCENE-3233: improve ram/perf of SynonymFilter, add wordnet parsing, nuke contrib/wordnet
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145158 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 12:58:52 +00:00
Christopher John Male 5f30bedccc LUCENE-3283: Moved core QueryParsers to queryparser module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1145016 13f79535-47bb-0310-9956-ffa450edef68
2011-07-11 03:37:00 +00:00
Steven Rowe 9e020991ef Merged with trunk up to r1144714
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/branches/solr2452@1144715 13f79535-47bb-0310-9956-ffa450edef68
2011-07-09 18:50:54 +00:00