Simon Willnauer
61a68fe46f
Added the changes in LUCENE-1728 to the "Build" section of contrib/CHANGES.TXT
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806536 13f79535-47bb-0310-9956-ffa450edef68
2009-08-21 12:59:07 +00:00
Michael Busch
82c8a2ffc2
Fix if clause in MultiTermRewriteMethodProcessor
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806456 13f79535-47bb-0310-9956-ffa450edef68
2009-08-21 08:36:31 +00:00
Michael Busch
5a40767d67
LUCENE-1828: Fix MemoryIndex to call TokenStream.reset() and TokenStream.end().
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806444 13f79535-47bb-0310-9956-ffa450edef68
2009-08-21 08:02:40 +00:00
Mark Robert Miller
66c2217a55
improve Highlighter javadocs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805923 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 18:43:43 +00:00
Robert Muir
1d9a96c2fc
LUCENE-1813: Add option to ReverseStringFilter to mark reversed tokens
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805769 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 12:07:15 +00:00
Robert Muir
58cd4a04d7
LUCENE-1794: Ensure analyzer options are applied immediately when using reusable token streams
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805766 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 11:56:31 +00:00
Mark Robert Miller
39ada16b9c
fix some javadoc @returns
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805650 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 02:42:24 +00:00
Mark Robert Miller
124af950a6
Fix duplicate code and one version of getWeightedSpanTerms - should accept a TokenStream now rather than a CachingTokenStream.
...
Also fixes getWeightedSpanTermsWithScores to intern the fieldname.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805645 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 01:16:23 +00:00
Michael Busch
e488bb0d5f
LUCENE-1792: remove obsolete files.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805639 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 00:32:05 +00:00
Michael Busch
5864c5795c
LUCENE-1792: remove obsolete files.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805638 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 00:31:05 +00:00
Michael Busch
565a6d22b5
LUCENE-1792: Fix new query parser to set rewrite method for multi-term queries.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805633 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 00:09:41 +00:00
Robert Muir
3887cf9419
LUCENE-1692: Additional tests and javadocs for contrib/analyzers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805400 13f79535-47bb-0310-9956-ffa450edef68
2009-08-18 12:55:26 +00:00
Mark Robert Miller
20505fd0d5
LUCENE-1522 fix + test
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804786 13f79535-47bb-0310-9956-ffa450edef68
2009-08-16 21:56:51 +00:00
Robert Muir
d2af6ef0bd
LUCENE-1794: Implement TokenStream reuse for contrib Analyzers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804680 13f79535-47bb-0310-9956-ffa450edef68
2009-08-16 12:37:05 +00:00
Mark Robert Miller
395c65ad08
change PositionCheckingMap to use an entry iterator in putAll rather than keyset - faster
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804516 13f79535-47bb-0310-9956-ffa450edef68
2009-08-15 19:09:34 +00:00
Michael McCandless
316ce42d99
LUCENE-1809: make highlihter vs vector highlighter comparison more fair
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804447 13f79535-47bb-0310-9956-ffa450edef68
2009-08-15 10:01:50 +00:00
Uwe Schindler
b16e0aa31b
LUCENE-1801: All Tokenizers/TokenStreams that are source of tokens call AttributeSource.clearAttributes() first. Made Token.clear() consistent to AttributeImpl (clear everything)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804392 13f79535-47bb-0310-9956-ffa450edef68
2009-08-14 22:01:42 +00:00
Michael McCandless
e166d54f87
LUCENE-1802: undeprecate current QueryParser
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804016 13f79535-47bb-0310-9956-ffa450edef68
2009-08-13 20:08:58 +00:00
Chris M. Hostetter
2558c06120
LUCENE-1749: Addition of FieldCacheSanityChecker utility, and hooks to use it in all existing Lucene Tests.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@803676 13f79535-47bb-0310-9956-ffa450edef68
2009-08-12 19:31:38 +00:00
Mark Robert Miller
7fff5a7ea7
LUCENE-1771: QueryWight back to Weight, but as abstract class rather than interface - explain now takes a Searcher and passes the sub reader that contains the doc if a top level reader is a multi reader.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@803339 13f79535-47bb-0310-9956-ffa450edef68
2009-08-12 01:22:30 +00:00
Michael McCandless
7ac73cb0a6
LUCENE-1792: properly default rewrite method to CONSTANT_SCORE_AUTO for wildcard & prefix queries
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@803053 13f79535-47bb-0310-9956-ffa450edef68
2009-08-11 10:14:31 +00:00
Robert Muir
43a5bd6c19
LUCENE-1628: Add Persian Analyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802955 13f79535-47bb-0310-9956-ffa450edef68
2009-08-10 23:29:27 +00:00
Michael McCandless
3aed0fd54b
LUCENE-1797: don't over-increment positions when creating MultiPhraseQuery
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802893 13f79535-47bb-0310-9956-ffa450edef68
2009-08-10 18:46:39 +00:00
Michael McCandless
08c69caca6
LUCENE-1795: disable leading wildcard by default in new QueryParser, matching 2.4
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802794 13f79535-47bb-0310-9956-ffa450edef68
2009-08-10 13:54:06 +00:00
Michael McCandless
35ea5c1350
LUCENE-1786: make the patternsFileContent static, so we only load it once, not 4 times, when running this test
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802767 13f79535-47bb-0310-9956-ffa450edef68
2009-08-10 12:47:52 +00:00
Michael McCandless
2a195d3622
move QueryParserMessages.properties under src/resources
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802722 13f79535-47bb-0310-9956-ffa450edef68
2009-08-10 10:24:35 +00:00
Uwe Schindler
911df49bcb
LUCENE-1607: Change some more String.intern() in contrib
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802095 13f79535-47bb-0310-9956-ffa450edef68
2009-08-07 17:19:53 +00:00
Michael McCandless
ab0de2448a
fix smartcn to be JDK 1.4 only
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802085 13f79535-47bb-0310-9956-ffa450edef68
2009-08-07 16:48:09 +00:00
Michael McCandless
0b6769aa11
LUCENE-1782: rename original -> standard in the new QueryParser
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801934 13f79535-47bb-0310-9956-ffa450edef68
2009-08-07 10:24:28 +00:00
Mark Robert Miller
835efd0d6c
LUCENE-1788: cleanup Highlighter test class
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801856 13f79535-47bb-0310-9956-ffa450edef68
2009-08-07 01:30:45 +00:00
Michael McCandless
fc369cdf2f
add getDistanceFilter() since you need that in order to sort by the exact distances (previously distanceFilter attr was public)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801471 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 23:57:54 +00:00
Robert Muir
820620f3a7
LUCENE-1758: Update ArabicAnalyzer to light10 stemming, stopwords improvements, lowercase non-arabic text
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801348 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 18:22:22 +00:00
Yonik Seeley
7f1319fdf0
LUCENE-1607: String.intern() cache StringHelper.intern()
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801344 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 18:05:06 +00:00
Grant Ingersoll
ab276a5ab9
Javadoc updates
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801219 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 13:17:20 +00:00
Grant Ingersoll
c0d86a4e30
Javadoc updates
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801218 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 13:17:11 +00:00
Michael McCandless
1ab8d5ee6d
LUCENE-1779: improve javadocs & remove unused args from unreleased sorting APIs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801207 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 13:08:58 +00:00
Michael McCandless
e82cc0079d
remove extra print
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801166 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 12:13:02 +00:00
Michael McCandless
c471e8e83c
LUCENE-1773: fix highlighter test case; add contrib/memory as dependency to contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801160 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 12:09:48 +00:00
Mark Robert Miller
34ab27401c
LUCENE-1770: Add EnwikiQueryMaker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801043 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 01:39:22 +00:00
Michael McCandless
d3fcafc53e
LUCENE-1773: add FastVectorHighlighter tasks to contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800947 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 20:24:02 +00:00
Mark Robert Miller
5af69469e2
LUCENE-1748: LUCENE-1001 introduced PayloadSpans, but this was a back compat break and caused custom SpanQuery implementations to fail at runtime in a variety of ways. This issue attempts to remedy things by causing a compile time break on custom SpanQuery implementations and removing the PayloadSpans class, with its functionality now moved to Spans.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800925 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 19:10:02 +00:00
Michael McCandless
d79a369414
LUCENE-1504: switch to DocIdSet API and FilteredDocIdSet for contrib/spatial
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800892 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 18:01:57 +00:00
Mark Robert Miller
a4c2eae1d4
update javadoc for init(TokenStream) return
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800849 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 15:38:00 +00:00
Mark Robert Miller
f0e54e31e6
LUCENE-1406 belongs in contrib CHANGES
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800832 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 15:05:34 +00:00
Mark Robert Miller
b44ed588ac
LUCENE-1685 should be in API changes, not new features
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800821 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 14:33:58 +00:00
Mark Robert Miller
10b41d2dce
LUCENE-1685: The position aware SpanScorer has become the default scorer for Highlighting. The SpanScorer implementation has replaced QueryScorer and the old term highlighting QueryScorer has been renamed to QueryTermScorer. Multi-term queries are also now expanded by default. If you were previously rewritting the query for multi-term query highlighting, you should no longer do that (unless you switch to using QueryTermScorer). The SpanScorer API (now QueryScorer) has also been improved to more closely match the API of the previous QueryScorer implementation.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800796 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 13:56:11 +00:00
Mark Robert Miller
b155258203
LUCENE-1778: Add support for log.step setting per task type. Perviously, if you included a log.step line in the .alg file, it had been applied to all tasks. Now, you can include a log.step.AddDoc, or log.step.DeleteDoc (for example) to control logging for just these tasks. If you want to ommit logging for any other task, include log.step=-1. The syntax is "log.step." together with the Task's 'short' name (i.e., without the 'Task' part).
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800768 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 13:05:03 +00:00
Mark Robert Miller
4054b4ebf3
move the web based xml demo from core changes to contrib changes - also fixes skipping # 34 in features
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800661 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 02:57:00 +00:00
Michael Busch
c91651e4f2
LUCENE-1775: Change contrib tee/sink filters to use new TokenStream API.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800606 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 22:45:27 +00:00
Michael Busch
457c29d31e
LUCENE-1775: Change remaining contrib TokenFilters (shingle, prefix-suffix) to use the new TokenStream API.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800195 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 04:33:10 +00:00
Mark Robert Miller
5aaf5b0167
LUCENE-1486: Move ComplexPhraseQueryParser to contrib
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800193 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 04:06:22 +00:00
Michael Busch
343992fcbb
LUCENE-1567: New flexible QueryParser framework.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800191 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 03:38:44 +00:00
Michael McCandless
487278cf92
LUCENE-1522 turn on unit tests that had relied on LUCENE-1448 and LUCENE-1759
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800120 13f79535-47bb-0310-9956-ffa450edef68
2009-08-02 18:07:43 +00:00
Michael Busch
b91f993a0e
LUCENE-1460: Additional cleanup in two contrib junit tests.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799973 13f79535-47bb-0310-9956-ffa450edef68
2009-08-02 02:57:30 +00:00
Michael Busch
537aeb24e0
LUCENE-1759: Set final offset correctly in contrib TokenStreams.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799968 13f79535-47bb-0310-9956-ffa450edef68
2009-08-02 02:10:46 +00:00
Michael Busch
1743081b07
LUCENE-1460: Changed TokenStreams/TokenFilters in contrib to use the new TokenStream API.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799953 13f79535-47bb-0310-9956-ffa450edef68
2009-08-01 22:52:32 +00:00
Mark Robert Miller
3e869d9336
remove system.out and unnecessary next() in tokenstream
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799865 13f79535-47bb-0310-9956-ffa450edef68
2009-08-01 14:18:19 +00:00
Michael McCandless
175e8b546d
LUCENE-1763: require IndexWriter be passed up front to the MergePolicy
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799818 13f79535-47bb-0310-9956-ffa450edef68
2009-08-01 09:22:25 +00:00
Michael McCandless
bbcab117d9
LUCENE-1683: fixed JavaUtilRegexCapabilities (used by RegexQuery) to match entire string not just prefix
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799678 13f79535-47bb-0310-9956-ffa450edef68
2009-07-31 18:02:56 +00:00
Michael McCandless
0b0d13dffe
LUCENE-1745: allow passing matching flags to the underlying regexp engine
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799667 13f79535-47bb-0310-9956-ffa450edef68
2009-07-31 17:41:04 +00:00
Uwe Schindler
f8b2f0122c
Use the empty docidset provided by DocIdSet.EMPTY_DOCIDSET
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799582 13f79535-47bb-0310-9956-ffa450edef68
2009-07-31 11:32:37 +00:00
Mark Robert Miller
f73a4f4324
LUCENE-1695: Update the Highlighter to use the new TokenStream API. This issue breaks backwards compatibility with some public classes. If you have implemented custom Fregmenters or Scorers, you will need to adjust them to work with the new TokenStream API. Rather than getting passed a Token at a time, you will be given a TokenStream to init your impl with - store the Attributes you are interested in locally and access them on each call to the method that used to pass a new Token. Look at the included updated impls for examples.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799455 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 22:00:47 +00:00
Mark Robert Miller
7ecaa8c990
wikipedia-flush-by-RAM.alg should use content.source
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799379 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:35:10 +00:00
Mark Robert Miller
e505413fae
wikipedia.alg should use content.source
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799377 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:34:28 +00:00
Mark Robert Miller
afb517e832
LUCENE-1752: Missing highlights when terms were repeated in separate, nested, boolean or disjunction queries.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798976 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 16:47:05 +00:00
Michael McCandless
dbff1fc9b5
LUCENE-1754: just use EMPTY_DOCIDSET.iterator() instead of new EmptyDocIdSetIterator
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798102 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 11:12:36 +00:00
Michael McCandless
094c674c4d
LUCENE-1595: don't use SortField.AUTO; deprecate LineDocMaker & EnwikiDocMaker
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798096 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:15:03 +00:00
Michael McCandless
26a2c427d1
LUCENE-1754: BooleanQuery detects up front if it won't match any docs and returns null from its scorer() instead of NonMatchingScorer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798086 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 09:50:02 +00:00
Michael McCandless
228888a882
LUCENE-1644: fix highlighter to rewrite MTQ whenever it's not already a SCORING_BOOLEAN_QUERY
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797728 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 09:31:17 +00:00
Michael McCandless
be66120dff
LUCENE-1644: enable different rewrite methods for MultiTermQuery
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797694 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 00:03:33 +00:00
Otis Gospodnetic
f758b4d259
- Typo
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797310 13f79535-47bb-0310-9956-ffa450edef68
2009-07-24 02:43:58 +00:00
Simon Willnauer
999f6157c7
LUCENE-1728: Splitted contrib/analyzers into common and smartcn. Smartcn depends on a large dictionary that causes the analyzers jar to grow up to 3MB compressed size.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797150 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 17:11:22 +00:00
Mark Robert Miller
3adc61c3ac
LUCENE-1755: Fix WriteLineDocTask to output a document if it contains either a title or body (or both).
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 12:19:06 +00:00
Grant Ingersoll
63402f49c7
Javadoc updates
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795553 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 15:06:57 +00:00
Michael McCandless
c79f54975e
LUCENE-1505: switch local lucene to use trie's NumericUtils for mapping doubles to strings
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794721 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 15:38:06 +00:00
Mark Robert Miller
add56f5e66
LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 22:52:58 +00:00
Mark Robert Miller
ea7e4ad344
LUCENE-1688: Deprecate static final String stop word array in and StopAnalzyer and replace it with an immutable implementation of CharArraySet.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794078 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 21:39:22 +00:00
Otis Gospodnetic
b393e4d0af
LUCENE-1491 - EdgeNGramTokenFilter no longer stops on tokens shorter than minimum n-gram size.
...
- line, and those below, will be ignored--
M CHANGES.txt
M analyzers/src/test/org/apache/lucene/analysis/ngram/EdgeNGramTokenFilterTest.java
M analyzers/src/test/org/apache/lucene/analysis/ngram/NGramTokenFilterTest.java
M analyzers/src/java/org/apache/lucene/analysis/ngram/EdgeNGramTokenFilter.java
M analyzers/src/java/org/apache/lucene/analysis/ngram/NGramTokenFilter.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794034 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 19:44:52 +00:00
Michael McCandless
65494af827
LUCENE-1272: add MoreLikeThis.set/getBoost
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793973 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:56:16 +00:00
Michael McCandless
91aedd6685
LUCENE-1740: add 'analyzer' command to Lucli, to change analyzer from the default StandardAnalyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793526 13f79535-47bb-0310-9956-ffa450edef68
2009-07-13 10:06:01 +00:00
Michael McCandless
9cbe5f4ff4
LUCENE-1522: adding new Fast Vector Highlighter contrib
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792542 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 13:06:51 +00:00
Michael McCandless
333e77a431
LUCENE-1704: allow specifying the Tidy configuration file when parsing HTML docs with contrib/ant
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791587 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:55:05 +00:00
Mark Robert Miller
28d65ceee7
remove java 1.5 dependency
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791576 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:18:19 +00:00
Mark Robert Miller
f780f77366
LUCENE-1730: Fix TrecContentSource to use ISO-8859-1 when reading the TREC files, unless a different encoding is specified. Additionally, ContentSource now supports a content.source.encoding parameter in the configuration file.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791528 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 15:56:39 +00:00
Uwe Schindler
705f099238
Convert and cleanup the test files to UTF-8. What is still broken is the incorrect usage of KOI8 and CP1251 encodings. Added svn:eol-style=native to all files again.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791483 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 13:50:17 +00:00
Mark Robert Miller
e04abc52e7
LUCENE-1599: Add clone support for SpanQuerys. SpanRegexQuery counts on this functionality and does not work correctly without it.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791280 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 17:16:16 +00:00
Mark Robert Miller
9789089343
reader should be closed after use
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791246 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 14:01:14 +00:00
Uwe Schindler
fed4bba63d
LUCENE-1713: Rename RangeQuery -> TermRangeQuery (part 1)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791175 13f79535-47bb-0310-9956-ffa450edef68
2009-07-04 20:14:12 +00:00
Simon Willnauer
410afb98bf
LUCENE-1719: Add javadoc notes about ICUCollationKeyFilter's advantages over CollationKeyFilter (Steven Row via Simon Willnauer)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790262 13f79535-47bb-0310-9956-ffa450edef68
2009-07-01 16:50:47 +00:00
Simon Willnauer
5265dc1bb2
LUCENE-1722: SmartChineseAnalyzer JavaDoc improvements - Replacing Chinese JavaDoc with English version. Robert Muir via Simon Willnauer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790102 13f79535-47bb-0310-9956-ffa450edef68
2009-07-01 10:32:23 +00:00
Michael McCandless
c7f865a4c7
LCUENE-1716: allow control over storage of norms (body norms), info stream and whether docs properties should be indexed as fields
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@788777 13f79535-47bb-0310-9956-ffa450edef68
2009-06-26 17:26:54 +00:00
Uwe Schindler
42dcc00374
Build an index.html on the top-level Javadocs folder (e.g. hudson will use it as entry point)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@788039 13f79535-47bb-0310-9956-ffa450edef68
2009-06-24 14:34:09 +00:00
Michael McCandless
87de0c9688
LUCENE-1466: added chainable CharFilter stage before Tokenizer to allow mapping of characters before tokenization
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787795 13f79535-47bb-0310-9956-ffa450edef68
2009-06-23 19:15:31 +00:00
Michael McCandless
f03d77b558
LUCENE-1630: switch from Weight (interface) to QueryWeight (abstract class); mate in/out-of docID order scoring between Collector & Scorer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787772 13f79535-47bb-0310-9956-ffa450edef68
2009-06-23 18:11:42 +00:00
Michael McCandless
5f72065d0f
LUCENE-1714: fix WriteLineDocTask to also replace \r, \n (in addition to \t) with space so those chars don't create mal-formed lines
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787750 13f79535-47bb-0310-9956-ffa450edef68
2009-06-23 16:46:17 +00:00
Michael McCandless
ec8088654d
bulk fix svn:eol-style to native for text files
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787424 13f79535-47bb-0310-9956-ffa450edef68
2009-06-22 22:18:56 +00:00
Erik Hatcher
65131ca7b9
LUCENE-1405: Added support for Ant resource collections in contrib/ant <index> task.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786610 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 18:24:19 +00:00
Michael McCandless
19234f12bd
LUCENE-1692: add new contrib analyzer tests
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786606 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 18:02:12 +00:00
Michael McCandless
2f2cd20828
LUCENE-1692: add tests for Thai & SmartChinese analyzers; fix wrong endOffset bug in ThaiWordFilter; use stop words by default with SmartChineseAnalyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786560 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 15:52:36 +00:00
Uwe Schindler
0b5cbca110
LUCENE-1673: Move TrieRange to core (part 2: removing from contrib/queries)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786474 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 12:16:52 +00:00
Uwe Schindler
7b34ab8f30
LUCENE-1673: Move TrieRange to core (part 1: addition to core)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786470 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 12:09:52 +00:00
Mark Robert Miller
d7d455246f
LUCENE-1595: Separate DocMaker into DocMaker and ContentSource.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786233 13f79535-47bb-0310-9956-ffa450edef68
2009-06-18 19:58:59 +00:00
Michael McCandless
835c405be0
LUCENE-973: add test case for CJKAnalyzer; fix trailing empty string bug
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@785287 13f79535-47bb-0310-9956-ffa450edef68
2009-06-16 16:38:39 +00:00
Michael Busch
f2a5f395d8
Fix pom.xml.template of remote contrib to have the correct artifactId
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784674 13f79535-47bb-0310-9956-ffa450edef68
2009-06-15 07:33:57 +00:00
Michael McCandless
5b472b2d62
LUCENE-1539: add DeleteByPercent, FlushReader tasks, and ability to open reader on a labelled commit point
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784587 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 17:07:55 +00:00
Michael McCandless
26b9dea62b
LUCENE-1571: fix LatLongDistanceFilter to respect deleted docs
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784576 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 15:47:15 +00:00
Michael McCandless
6671c43458
LUCENE-979: remove a few more old benchmark things
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784568 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 14:52:02 +00:00
Michael McCandless
e3c4ff8ac3
revert accidental commit
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784562 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 14:06:52 +00:00
Michael McCandless
419b065957
LUCENE-1677: deprecate gcj specializations, and the system properties that let you specify which SegmentReader impl class to use
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784541 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 11:20:55 +00:00
Michael McCandless
392424ec11
LUCENE-1407: move RemoteSearchable out of core into contrib/remote
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784540 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 11:13:04 +00:00
Karl-Johan Wettin
9e5cb08461
LUCENE-1578: Support for loading unoptimized readers to the constructor of InstantiatedIndex. (Karl Wettin)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784482 13f79535-47bb-0310-9956-ffa450edef68
2009-06-13 22:05:14 +00:00
Karl-Johan Wettin
196428ec39
LUCENE-1578: Support for loading unoptimized readers to the constructor of InstantiatedIndex. (Karl Wettin)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784481 13f79535-47bb-0310-9956-ffa450edef68
2009-06-13 21:54:07 +00:00
Grant Ingersoll
1511ec5e31
LUCENE-1676: in-stream payload support
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784297 13f79535-47bb-0310-9956-ffa450edef68
2009-06-12 22:26:01 +00:00
Michael McCandless
159573f7eb
LUCENE-979: remove deprecated non-Task benchmark classes
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@783794 13f79535-47bb-0310-9956-ffa450edef68
2009-06-11 14:26:32 +00:00
Mark Robert Miller
85f5a9c740
LUCENE-1455: close FileInputStream after parsing
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@783595 13f79535-47bb-0310-9956-ffa450edef68
2009-06-11 01:13:51 +00:00
Patrick O'Leary
5509306e40
Fixed doc id's to bitset id for sorting with a multi segmented index FieldSourceComparator still uses bitset id's
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@783517 13f79535-47bb-0310-9956-ffa450edef68
2009-06-10 20:59:26 +00:00
Uwe Schindler
406454d30b
LUCENE-1672: Deprecate all String/File ctors/opens in IndexReader/IndexWriter/IndexSearcher
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@782469 13f79535-47bb-0310-9956-ffa450edef68
2009-06-07 21:52:41 +00:00
Michael McCandless
a93e2c41b0
LUCENE-1614: switch next -> nextDoc, skipTo -> advance, doc -> docID in DISI
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@782410 13f79535-47bb-0310-9956-ffa450edef68
2009-06-07 16:58:41 +00:00
Michael McCandless
8a4a3eeed1
LUCENE-1387: fix numDocs -> maxDoc
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781136 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 19:37:59 +00:00
Otis Gospodnetic
475b5dc08f
LUCENE-898 - removed contrib/javascript (unpackaged, unmaintained, most likely not working due to QueryParser changes since 2005)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781057 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 15:18:51 +00:00
Otis Gospodnetic
ea155ae99b
- Removed empty dir
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781053 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 15:11:03 +00:00
Uwe Schindler
cb9090da45
Fix incorrect example: setValue() is not a method of TokenStream, need more specific type. Thanks Mike
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781029 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 13:12:22 +00:00
Michael McCandless
3c82bdadb3
fix SSS unit test
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@780404 13f79535-47bb-0310-9956-ffa450edef68
2009-05-31 10:37:09 +00:00
Otis Gospodnetic
9275233f71
- Removed an ancient deprecated method and fixed misspellings
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@780363 13f79535-47bb-0310-9956-ffa450edef68
2009-05-31 04:10:35 +00:00
Michael McCandless
f953c8883e
use 2 digits precision when reporting rec/s
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@779569 13f79535-47bb-0310-9956-ffa450edef68
2009-05-28 11:31:44 +00:00
Uwe Schindler
eba450d10b
LUCENE-1591: Update commons-compress to the released version 1.0
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@777458 13f79535-47bb-0310-9956-ffa450edef68
2009-05-22 10:38:21 +00:00
Grant Ingersoll
27fc5cd348
LUCENE-1550: Added new ngram spell checking distance
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@776704 13f79535-47bb-0310-9956-ffa450edef68
2009-05-20 14:07:08 +00:00
Michael McCandless
2dd7d33e86
LUCENE-1643: use reusable RawCollationKey for better performance
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@776252 13f79535-47bb-0310-9956-ffa450edef68
2009-05-19 09:50:24 +00:00
Michael McCandless
af550281cb
LUCENE-1629: remove unnecessary source files
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@775468 13f79535-47bb-0310-9956-ffa450edef68
2009-05-16 14:08:38 +00:00
Michael McCandless
f81f6796a2
LUCENE-1629: correct ASF source headers
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@775444 13f79535-47bb-0310-9956-ffa450edef68
2009-05-16 09:55:34 +00:00
Michael McCandless
be0a47b7e3
LUCENE-1629: move CHANGES entry to contrib; add TestArabicAnalyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@774727 13f79535-47bb-0310-9956-ffa450edef68
2009-05-14 10:50:52 +00:00
Michael McCandless
e01aad89fe
LUCENE-1629: adding new contrib analyzer SmartChineseAnalyzer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@774718 13f79535-47bb-0310-9956-ffa450edef68
2009-05-14 10:09:22 +00:00
Michael McCandless
d4f9809520
LUCENE-1593: optimize core Collectors for in-docID-order scorer case
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@773194 13f79535-47bb-0310-9956-ffa450edef68
2009-05-09 10:36:41 +00:00
Mark Robert Miller
6bcd89a916
fix for LUCENE-1425
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@770993 13f79535-47bb-0310-9956-ffa450edef68
2009-05-02 22:46:26 +00:00
Uwe Schindler
673b90e651
Correct JavaDocs in TrieUtils. This also adds a missing test for equals() and hashcode() of the TrieRangeQueries.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@770602 13f79535-47bb-0310-9956-ffa450edef68
2009-05-01 09:55:01 +00:00
Michael McCandless
9ba59706b0
try to prevent false failure of testTimedSearchTask
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@770151 13f79535-47bb-0310-9956-ffa450edef68
2009-04-30 10:32:56 +00:00
Patrick O'Leary
46619cfa23
Fixed bug caused by multiSegmentIndexReader
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@769639 13f79535-47bb-0310-9956-ffa450edef68
2009-04-29 03:53:21 +00:00
Michael McCandless
72d1873533
LUCENE-1604: allow IndexReader.norms to return null if the field has no norms; this is off by default but will be hardwired to on in 3.0
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@769524 13f79535-47bb-0310-9956-ffa450edef68
2009-04-28 20:38:47 +00:00
Patrick O'Leary
9242c2b6e1
Fix class cast exception in SerialChainFilter
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@769470 13f79535-47bb-0310-9956-ffa450edef68
2009-04-28 17:14:09 +00:00
Uwe Schindler
5807d11b70
LUCENE-1602: Fix incomplete toString(), hashcode() and equals() because boost was not involved there (copy'n'paste error from the Filter, which had no boost)
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@767982 13f79535-47bb-0310-9956-ffa450edef68
2009-04-23 17:07:36 +00:00
Michael McCandless
e0c2792a57
Java 1.4 build of Xerces JAR
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@766300 13f79535-47bb-0310-9956-ffa450edef68
2009-04-18 11:16:27 +00:00
Uwe Schindler
92c6484161
LUCENE-1602: Rewrite TrieRange to use MultiTermQuery
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@765618 13f79535-47bb-0310-9956-ffa450edef68
2009-04-16 12:58:28 +00:00
Michael McCandless
022d104c38
LUCENE-1591: add commons-compress JAR
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@765580 13f79535-47bb-0310-9956-ffa450edef68
2009-04-16 11:28:46 +00:00
Michael McCandless
8c4fff6e21
LUCENE-1591: add bzip2 compression/decompress to contrib/benchmark
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@765543 13f79535-47bb-0310-9956-ffa450edef68
2009-04-16 09:46:30 +00:00
Uwe Schindler
87e10db563
Add a missing check for inverse ranges in splitRange() and corresponding tests for inverse ranges and 0-length-ranges.
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@764829 13f79535-47bb-0310-9956-ffa450edef68
2009-04-14 15:28:06 +00:00
Uwe Schindler
3ef76ae033
Small documentation updates, test fixed according to attributes documentation
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@764654 13f79535-47bb-0310-9956-ffa450edef68
2009-04-14 00:36:02 +00:00
Michael McCandless
ad79d914e4
LUCENE-1575: switch to new Collector API
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@764551 13f79535-47bb-0310-9956-ffa450edef68
2009-04-13 18:33:56 +00:00
Mark Robert Miller
413cabef1f
Contrib queries package Query implementations do not override equals()
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763857 13f79535-47bb-0310-9956-ffa450edef68
2009-04-10 02:21:04 +00:00
Mark Robert Miller
a1b3fd7240
Add ConstantScore highlighting support to SpanScorer
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763856 13f79535-47bb-0310-9956-ffa450edef68
2009-04-10 02:09:46 +00:00
Uwe Schindler
ca6451958b
Add extra check to TokenStream tests, only get attribute not create it
...
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763793 13f79535-47bb-0310-9956-ffa450edef68
2009-04-09 20:32:35 +00:00