Shai Erera
|
640741783d
|
minor jdoc fixes for PerfTask
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235714 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-25 10:42:00 +00:00 |
Robert Muir
|
6edfe4f157
|
LUCENE-3717: add tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235199 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-24 10:40:46 +00:00 |
Robert Muir
|
35a73d5f55
|
LUCENE-3717: fix broken offsets in ngramtokenizers, and check return value of Reader.read
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1235187 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-24 09:50:21 +00:00 |
Robert Muir
|
7fafdd3576
|
LUCENE-3717: add checkRandomData to more analyzers and fix more offsets bugs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234850 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-23 15:19:58 +00:00 |
Steven Rowe
|
059410d424
|
LUCENE-3690: fix handling of unpaired numeric character entity UTF-16 surrogates to output U+FFFD REPLACEMENT CHARACTER; and add handling of properly paired numeric character entity UTF-16 surrogates, to output the corresponding pair of code units.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234687 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-23 07:36:38 +00:00 |
Robert Muir
|
c754c1c9c8
|
LUCENE-3717: add better offsets testing to BaseTokenStreamTestCase, fix offsets bugs in ThaiWordFilter and ICUTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234652 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-23 00:08:52 +00:00 |
Robert Muir
|
a7cfee6b07
|
SOLR-2891: fix CompoundWordTokenFilter to not create invalid offsets when the length of the text was changed by a previous filter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234546 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-22 16:41:06 +00:00 |
Steven Rowe
|
f3a363708f
|
LUCENE-3690: Re-implemented HTMLStripCharFilter as a JFlex-generated scanner. Fixes LUCENE-2208, SOLR-882, and SOLR-42.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234452 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-22 05:20:46 +00:00 |
Shai Erera
|
17fe719bb5
|
LUCENE-3703: Calling DirectoryTaxonomyReader.refresh() could mess up reference counting
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234451 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-22 05:14:45 +00:00 |
Uwe Schindler
|
af9b4d816f
|
LUCENE-3671: Add TypeTokenFilter that filters tokens based on their TypeAttribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1234396 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-21 19:02:44 +00:00 |
Martijn van Groningen
|
ac8befe5c3
|
LUCENE-3602: Fixed some typos.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1233918 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-20 14:15:42 +00:00 |
Robert Muir
|
dea4b911da
|
LUCENE-3646: throw exception for fieldcache on non-atomic reader
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232444 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-17 15:34:43 +00:00 |
Robert Muir
|
e869b1fbf7
|
LUCENE-3700: give enough ram so that you can build naist-jdic with java 5
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232274 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-17 02:27:31 +00:00 |
Robert Muir
|
f562a8a0dc
|
LUCENE-3700: optionally support naist-jdic for kuromoji
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232268 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-17 02:20:24 +00:00 |
Robert Muir
|
48c01e5a2b
|
LUCENE-3699: share baseform with surface and flag if the reading can be computed from surface
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232265 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-17 02:12:27 +00:00 |
Robert Muir
|
c902f63125
|
unbreak clover/nightly builds until we do this right
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232254 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-17 01:37:28 +00:00 |
Martijn van Groningen
|
ac6679a7b6
|
- Added licence and notice files to join module.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232228 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-16 23:31:57 +00:00 |
Martijn van Groningen
|
6d4f6b0978
|
- Removed changes.txt from grouping module and moved the entries to the changes.txt in Lucene contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232227 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-16 23:30:16 +00:00 |
Martijn van Groningen
|
7cdb8028c1
|
LUCENE-3602: Added query time joining.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232223 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-16 23:16:35 +00:00 |
Robert Muir
|
12c9b8b4bf
|
LUCENE-3699: simplify dictionary access and reduce tokeninfodictionary 1.5MB
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232120 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-16 19:19:48 +00:00 |
Robert Muir
|
354a3be78f
|
LUCENE-3696: fix dictionary construction to work on java5
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1232012 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-16 14:50:09 +00:00 |
Michael McCandless
|
87bc4521c2
|
LUCENE-3695: move some confusing FST sugar out
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231795 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-15 23:25:38 +00:00 |
Michael McCandless
|
11f33ee521
|
LUCENE-3684: add offsets to postings APIs
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231794 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-15 23:17:45 +00:00 |
Michael McCandless
|
9de01b56eb
|
LUCENE-3453: simplify DocValues/Field API
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231791 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-15 23:05:13 +00:00 |
Robert Muir
|
fbd34b4390
|
cleanups to 4.x CHANGES
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231552 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-14 18:24:48 +00:00 |
Yonik Seeley
|
b2a0040e98
|
tests: silliness
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231526 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-14 16:59:36 +00:00 |
Michael McCandless
|
5ca66287ea
|
woops
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231513 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-14 15:20:58 +00:00 |
Michael McCandless
|
d584f6361d
|
LUCENE-3685: add ToChildBlockJoinQuery, to join from parent to child
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231512 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-14 15:17:04 +00:00 |
Robert Muir
|
8b8c2b4dee
|
LUCENE-3690: since this filter handles all kinds of bad partial and weird input, this should be fine to enable
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231272 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-13 19:46:30 +00:00 |
Robert Muir
|
2ff4bdb04f
|
enable assertions when executing various tools
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1231013 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-13 11:36:50 +00:00 |
Robert Muir
|
05a65507af
|
LUCENE-3305: optimization, don't retrieve the base form twice in this filter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1230769 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-12 20:36:58 +00:00 |
Robert Muir
|
4ebdc0872a
|
LUCENE-3305: sorry Mike (thanks for the help with the FST optimization)
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1230756 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-12 20:24:40 +00:00 |
Robert Muir
|
cd372bdc83
|
LUCENE-3305: add Kuromoji Japanese morphological analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1230748 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-12 20:10:48 +00:00 |
Shai Erera
|
d74db06ddf
|
LUCENE-3686: CategoryEnhancement must override Object.equals()
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1230431 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-12 07:43:42 +00:00 |
Simon Willnauer
|
3b8458f6de
|
use TEST_VERSION_CURRENT instead of 4_0 in test
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1229523 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-10 12:46:38 +00:00 |
Simon Willnauer
|
f19317d318
|
SOLR-3020: Add KeywordAttribute support to HunspellStemFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1229519 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-10 12:33:29 +00:00 |
Michael McCandless
|
cdb2ee8a7b
|
LUCENE-3679: replace IR.getFieldNames with IR.getFieldInfos
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1229401 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-09 22:29:40 +00:00 |
Michael McCandless
|
defd51a11b
|
fix syn test bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1228704 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-07 19:28:07 +00:00 |
Michael McCandless
|
ed9f0fd5ef
|
LUCENE-3668: if there's only 1 output for a synonym rule then set start/endOffset to match the full span of the input tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1228650 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-07 16:26:15 +00:00 |
Robert Muir
|
bbfc831d52
|
LUCENE-3677: remove old byte[] norms api from IndexReader
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1227755 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-05 18:55:20 +00:00 |
Michael McCandless
|
ec3c3f81d9
|
LUCENE-3634: remove some old static main methods and other dead code
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1226871 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-03 17:08:19 +00:00 |
Shai Erera
|
d5932e1149
|
LUCENE-3649: port changes from 3x to trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1226236 13f79535-47bb-0310-9956-ffa450edef68
|
2012-01-01 11:41:30 +00:00 |
Robert Muir
|
f5d1e967db
|
LUCENE-3667: make tests reasonable if you don't have a supercomputer
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225920 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-30 21:14:15 +00:00 |
Steven Rowe
|
76d1662cb7
|
- Added license declaration
- Removed unused 'length' param to combine()
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225615 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-29 18:53:10 +00:00 |
Robert Muir
|
b2970db4bc
|
LUCENE-2906: filter to process output of Standard/ICUTokenizer and create overlapping bigrams for CJK
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225433 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-29 05:04:49 +00:00 |
Robert Muir
|
e2f81e84f2
|
SOLR-2982: workaround bug in sun javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225228 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-28 16:51:23 +00:00 |
Robert Muir
|
f3869ef3ce
|
SOLR-2982: add Beider-Morse phonetic filter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1225211 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-28 16:00:52 +00:00 |
Robert Muir
|
7c7c7bd077
|
LUCENE-3662: add note
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1224823 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-26 21:36:16 +00:00 |
Robert Muir
|
b096910a82
|
LUCENE-3662: extend LevenshteinAutomata to support transpositions as a primitive edit
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1224817 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-26 21:18:36 +00:00 |
Robert Muir
|
17438c9eac
|
don't allocate 128MB of ram in this assert
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1223047 13f79535-47bb-0310-9956-ffa450edef68
|
2011-12-25 04:24:06 +00:00 |