Commit Graph

921 Commits

Author SHA1 Message Date
Mark Robert Miller a4c2eae1d4 update javadoc for init(TokenStream) return
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800849 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 15:38:00 +00:00
Mark Robert Miller f0e54e31e6 LUCENE-1406 belongs in contrib CHANGES
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800832 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 15:05:34 +00:00
Mark Robert Miller b44ed588ac LUCENE-1685 should be in API changes, not new features
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800821 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 14:33:58 +00:00
Mark Robert Miller 10b41d2dce LUCENE-1685: The position aware SpanScorer has become the default scorer for Highlighting. The SpanScorer implementation has replaced QueryScorer and the old term highlighting QueryScorer has been renamed to QueryTermScorer. Multi-term queries are also now expanded by default. If you were previously rewritting the query for multi-term query highlighting, you should no longer do that (unless you switch to using QueryTermScorer). The SpanScorer API (now QueryScorer) has also been improved to more closely match the API of the previous QueryScorer implementation.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800796 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 13:56:11 +00:00
Mark Robert Miller b155258203 LUCENE-1778: Add support for log.step setting per task type. Perviously, if you included a log.step line in the .alg file, it had been applied to all tasks. Now, you can include a log.step.AddDoc, or log.step.DeleteDoc (for example) to control logging for just these tasks. If you want to ommit logging for any other task, include log.step=-1. The syntax is "log.step." together with the Task's 'short' name (i.e., without the 'Task' part).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800768 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 13:05:03 +00:00
Mark Robert Miller 4054b4ebf3 move the web based xml demo from core changes to contrib changes - also fixes skipping # 34 in features
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800661 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 02:57:00 +00:00
Michael Busch c91651e4f2 LUCENE-1775: Change contrib tee/sink filters to use new TokenStream API.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800606 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 22:45:27 +00:00
Michael Busch 457c29d31e LUCENE-1775: Change remaining contrib TokenFilters (shingle, prefix-suffix) to use the new TokenStream API.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800195 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 04:33:10 +00:00
Mark Robert Miller 5aaf5b0167 LUCENE-1486: Move ComplexPhraseQueryParser to contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800193 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 04:06:22 +00:00
Michael Busch 343992fcbb LUCENE-1567: New flexible QueryParser framework.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800191 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 03:38:44 +00:00
Michael McCandless 487278cf92 LUCENE-1522 turn on unit tests that had relied on LUCENE-1448 and LUCENE-1759
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800120 13f79535-47bb-0310-9956-ffa450edef68
2009-08-02 18:07:43 +00:00
Michael Busch b91f993a0e LUCENE-1460: Additional cleanup in two contrib junit tests.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799973 13f79535-47bb-0310-9956-ffa450edef68
2009-08-02 02:57:30 +00:00
Michael Busch 537aeb24e0 LUCENE-1759: Set final offset correctly in contrib TokenStreams.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799968 13f79535-47bb-0310-9956-ffa450edef68
2009-08-02 02:10:46 +00:00
Michael Busch 1743081b07 LUCENE-1460: Changed TokenStreams/TokenFilters in contrib to use the new TokenStream API.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799953 13f79535-47bb-0310-9956-ffa450edef68
2009-08-01 22:52:32 +00:00
Mark Robert Miller 3e869d9336 remove system.out and unnecessary next() in tokenstream
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799865 13f79535-47bb-0310-9956-ffa450edef68
2009-08-01 14:18:19 +00:00
Michael McCandless 175e8b546d LUCENE-1763: require IndexWriter be passed up front to the MergePolicy
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799818 13f79535-47bb-0310-9956-ffa450edef68
2009-08-01 09:22:25 +00:00
Michael McCandless bbcab117d9 LUCENE-1683: fixed JavaUtilRegexCapabilities (used by RegexQuery) to match entire string not just prefix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799678 13f79535-47bb-0310-9956-ffa450edef68
2009-07-31 18:02:56 +00:00
Michael McCandless 0b0d13dffe LUCENE-1745: allow passing matching flags to the underlying regexp engine
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799667 13f79535-47bb-0310-9956-ffa450edef68
2009-07-31 17:41:04 +00:00
Uwe Schindler f8b2f0122c Use the empty docidset provided by DocIdSet.EMPTY_DOCIDSET
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799582 13f79535-47bb-0310-9956-ffa450edef68
2009-07-31 11:32:37 +00:00
Mark Robert Miller f73a4f4324 LUCENE-1695: Update the Highlighter to use the new TokenStream API. This issue breaks backwards compatibility with some public classes. If you have implemented custom Fregmenters or Scorers, you will need to adjust them to work with the new TokenStream API. Rather than getting passed a Token at a time, you will be given a TokenStream to init your impl with - store the Attributes you are interested in locally and access them on each call to the method that used to pass a new Token. Look at the included updated impls for examples.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799455 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 22:00:47 +00:00
Mark Robert Miller 7ecaa8c990 wikipedia-flush-by-RAM.alg should use content.source
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799379 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:35:10 +00:00
Mark Robert Miller e505413fae wikipedia.alg should use content.source
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799377 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:34:28 +00:00
Mark Robert Miller afb517e832 LUCENE-1752: Missing highlights when terms were repeated in separate, nested, boolean or disjunction queries.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798976 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 16:47:05 +00:00
Michael McCandless dbff1fc9b5 LUCENE-1754: just use EMPTY_DOCIDSET.iterator() instead of new EmptyDocIdSetIterator
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798102 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 11:12:36 +00:00
Michael McCandless 094c674c4d LUCENE-1595: don't use SortField.AUTO; deprecate LineDocMaker & EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798096 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:15:03 +00:00
Michael McCandless 26a2c427d1 LUCENE-1754: BooleanQuery detects up front if it won't match any docs and returns null from its scorer() instead of NonMatchingScorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798086 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 09:50:02 +00:00
Michael McCandless 228888a882 LUCENE-1644: fix highlighter to rewrite MTQ whenever it's not already a SCORING_BOOLEAN_QUERY
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797728 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 09:31:17 +00:00
Michael McCandless be66120dff LUCENE-1644: enable different rewrite methods for MultiTermQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797694 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 00:03:33 +00:00
Otis Gospodnetic f758b4d259 - Typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797310 13f79535-47bb-0310-9956-ffa450edef68
2009-07-24 02:43:58 +00:00
Simon Willnauer 999f6157c7 LUCENE-1728: Splitted contrib/analyzers into common and smartcn. Smartcn depends on a large dictionary that causes the analyzers jar to grow up to 3MB compressed size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797150 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 17:11:22 +00:00
Mark Robert Miller 3adc61c3ac LUCENE-1755: Fix WriteLineDocTask to output a document if it contains either a title or body (or both).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 12:19:06 +00:00
Grant Ingersoll 63402f49c7 Javadoc updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795553 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 15:06:57 +00:00
Michael McCandless c79f54975e LUCENE-1505: switch local lucene to use trie's NumericUtils for mapping doubles to strings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794721 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 15:38:06 +00:00
Mark Robert Miller add56f5e66 LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 22:52:58 +00:00
Mark Robert Miller ea7e4ad344 LUCENE-1688: Deprecate static final String stop word array in and StopAnalzyer and replace it with an immutable implementation of CharArraySet.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794078 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 21:39:22 +00:00
Otis Gospodnetic b393e4d0af LUCENE-1491 - EdgeNGramTokenFilter no longer stops on tokens shorter than minimum n-gram size.
- line, and those below, will be ignored--

M    CHANGES.txt
M    analyzers/src/test/org/apache/lucene/analysis/ngram/EdgeNGramTokenFilterTest.java
M    analyzers/src/test/org/apache/lucene/analysis/ngram/NGramTokenFilterTest.java
M    analyzers/src/java/org/apache/lucene/analysis/ngram/EdgeNGramTokenFilter.java
M    analyzers/src/java/org/apache/lucene/analysis/ngram/NGramTokenFilter.java


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794034 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 19:44:52 +00:00
Michael McCandless 65494af827 LUCENE-1272: add MoreLikeThis.set/getBoost
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793973 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:56:16 +00:00
Michael McCandless 91aedd6685 LUCENE-1740: add 'analyzer' command to Lucli, to change analyzer from the default StandardAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793526 13f79535-47bb-0310-9956-ffa450edef68
2009-07-13 10:06:01 +00:00
Michael McCandless 9cbe5f4ff4 LUCENE-1522: adding new Fast Vector Highlighter contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792542 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 13:06:51 +00:00
Michael McCandless 333e77a431 LUCENE-1704: allow specifying the Tidy configuration file when parsing HTML docs with contrib/ant
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791587 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:55:05 +00:00
Mark Robert Miller 28d65ceee7 remove java 1.5 dependency
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791576 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:18:19 +00:00
Mark Robert Miller f780f77366 LUCENE-1730: Fix TrecContentSource to use ISO-8859-1 when reading the TREC files, unless a different encoding is specified. Additionally, ContentSource now supports a content.source.encoding parameter in the configuration file.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791528 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 15:56:39 +00:00
Uwe Schindler 705f099238 Convert and cleanup the test files to UTF-8. What is still broken is the incorrect usage of KOI8 and CP1251 encodings. Added svn:eol-style=native to all files again.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791483 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 13:50:17 +00:00
Mark Robert Miller e04abc52e7 LUCENE-1599: Add clone support for SpanQuerys. SpanRegexQuery counts on this functionality and does not work correctly without it.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791280 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 17:16:16 +00:00
Mark Robert Miller 9789089343 reader should be closed after use
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791246 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 14:01:14 +00:00
Uwe Schindler fed4bba63d LUCENE-1713: Rename RangeQuery -> TermRangeQuery (part 1)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791175 13f79535-47bb-0310-9956-ffa450edef68
2009-07-04 20:14:12 +00:00
Simon Willnauer 410afb98bf LUCENE-1719: Add javadoc notes about ICUCollationKeyFilter's advantages over CollationKeyFilter (Steven Row via Simon Willnauer)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790262 13f79535-47bb-0310-9956-ffa450edef68
2009-07-01 16:50:47 +00:00
Simon Willnauer 5265dc1bb2 LUCENE-1722: SmartChineseAnalyzer JavaDoc improvements - Replacing Chinese JavaDoc with English version. Robert Muir via Simon Willnauer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790102 13f79535-47bb-0310-9956-ffa450edef68
2009-07-01 10:32:23 +00:00
Michael McCandless c7f865a4c7 LCUENE-1716: allow control over storage of norms (body norms), info stream and whether docs properties should be indexed as fields
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@788777 13f79535-47bb-0310-9956-ffa450edef68
2009-06-26 17:26:54 +00:00
Uwe Schindler 42dcc00374 Build an index.html on the top-level Javadocs folder (e.g. hudson will use it as entry point)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@788039 13f79535-47bb-0310-9956-ffa450edef68
2009-06-24 14:34:09 +00:00
Michael McCandless 87de0c9688 LUCENE-1466: added chainable CharFilter stage before Tokenizer to allow mapping of characters before tokenization
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787795 13f79535-47bb-0310-9956-ffa450edef68
2009-06-23 19:15:31 +00:00
Michael McCandless f03d77b558 LUCENE-1630: switch from Weight (interface) to QueryWeight (abstract class); mate in/out-of docID order scoring between Collector & Scorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787772 13f79535-47bb-0310-9956-ffa450edef68
2009-06-23 18:11:42 +00:00
Michael McCandless 5f72065d0f LUCENE-1714: fix WriteLineDocTask to also replace \r, \n (in addition to \t) with space so those chars don't create mal-formed lines
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787750 13f79535-47bb-0310-9956-ffa450edef68
2009-06-23 16:46:17 +00:00
Michael McCandless ec8088654d bulk fix svn:eol-style to native for text files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787424 13f79535-47bb-0310-9956-ffa450edef68
2009-06-22 22:18:56 +00:00
Erik Hatcher 65131ca7b9 LUCENE-1405: Added support for Ant resource collections in contrib/ant <index> task.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786610 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 18:24:19 +00:00
Michael McCandless 19234f12bd LUCENE-1692: add new contrib analyzer tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786606 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 18:02:12 +00:00
Michael McCandless 2f2cd20828 LUCENE-1692: add tests for Thai & SmartChinese analyzers; fix wrong endOffset bug in ThaiWordFilter; use stop words by default with SmartChineseAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786560 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 15:52:36 +00:00
Uwe Schindler 0b5cbca110 LUCENE-1673: Move TrieRange to core (part 2: removing from contrib/queries)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786474 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 12:16:52 +00:00
Uwe Schindler 7b34ab8f30 LUCENE-1673: Move TrieRange to core (part 1: addition to core)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786470 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 12:09:52 +00:00
Mark Robert Miller d7d455246f LUCENE-1595: Separate DocMaker into DocMaker and ContentSource.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786233 13f79535-47bb-0310-9956-ffa450edef68
2009-06-18 19:58:59 +00:00
Michael McCandless 835c405be0 LUCENE-973: add test case for CJKAnalyzer; fix trailing empty string bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@785287 13f79535-47bb-0310-9956-ffa450edef68
2009-06-16 16:38:39 +00:00
Michael Busch f2a5f395d8 Fix pom.xml.template of remote contrib to have the correct artifactId
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784674 13f79535-47bb-0310-9956-ffa450edef68
2009-06-15 07:33:57 +00:00
Michael McCandless 5b472b2d62 LUCENE-1539: add DeleteByPercent, FlushReader tasks, and ability to open reader on a labelled commit point
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784587 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 17:07:55 +00:00
Michael McCandless 26b9dea62b LUCENE-1571: fix LatLongDistanceFilter to respect deleted docs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784576 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 15:47:15 +00:00
Michael McCandless 6671c43458 LUCENE-979: remove a few more old benchmark things
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784568 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 14:52:02 +00:00
Michael McCandless e3c4ff8ac3 revert accidental commit
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784562 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 14:06:52 +00:00
Michael McCandless 419b065957 LUCENE-1677: deprecate gcj specializations, and the system properties that let you specify which SegmentReader impl class to use
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784541 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 11:20:55 +00:00
Michael McCandless 392424ec11 LUCENE-1407: move RemoteSearchable out of core into contrib/remote
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784540 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 11:13:04 +00:00
Karl-Johan Wettin 9e5cb08461 LUCENE-1578: Support for loading unoptimized readers to the constructor of InstantiatedIndex. (Karl Wettin)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784482 13f79535-47bb-0310-9956-ffa450edef68
2009-06-13 22:05:14 +00:00
Karl-Johan Wettin 196428ec39 LUCENE-1578: Support for loading unoptimized readers to the constructor of InstantiatedIndex. (Karl Wettin)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784481 13f79535-47bb-0310-9956-ffa450edef68
2009-06-13 21:54:07 +00:00
Grant Ingersoll 1511ec5e31 LUCENE-1676: in-stream payload support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784297 13f79535-47bb-0310-9956-ffa450edef68
2009-06-12 22:26:01 +00:00
Michael McCandless 159573f7eb LUCENE-979: remove deprecated non-Task benchmark classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@783794 13f79535-47bb-0310-9956-ffa450edef68
2009-06-11 14:26:32 +00:00
Mark Robert Miller 85f5a9c740 LUCENE-1455: close FileInputStream after parsing
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@783595 13f79535-47bb-0310-9956-ffa450edef68
2009-06-11 01:13:51 +00:00
Patrick O'Leary 5509306e40 Fixed doc id's to bitset id for sorting with a multi segmented index FieldSourceComparator still uses bitset id's
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@783517 13f79535-47bb-0310-9956-ffa450edef68
2009-06-10 20:59:26 +00:00
Uwe Schindler 406454d30b LUCENE-1672: Deprecate all String/File ctors/opens in IndexReader/IndexWriter/IndexSearcher
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@782469 13f79535-47bb-0310-9956-ffa450edef68
2009-06-07 21:52:41 +00:00
Michael McCandless a93e2c41b0 LUCENE-1614: switch next -> nextDoc, skipTo -> advance, doc -> docID in DISI
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@782410 13f79535-47bb-0310-9956-ffa450edef68
2009-06-07 16:58:41 +00:00
Michael McCandless 8a4a3eeed1 LUCENE-1387: fix numDocs -> maxDoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781136 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 19:37:59 +00:00
Otis Gospodnetic 475b5dc08f LUCENE-898 - removed contrib/javascript (unpackaged, unmaintained, most likely not working due to QueryParser changes since 2005)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781057 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 15:18:51 +00:00
Otis Gospodnetic ea155ae99b - Removed empty dir
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781053 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 15:11:03 +00:00
Uwe Schindler cb9090da45 Fix incorrect example: setValue() is not a method of TokenStream, need more specific type. Thanks Mike
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781029 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 13:12:22 +00:00
Michael McCandless 3c82bdadb3 fix SSS unit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@780404 13f79535-47bb-0310-9956-ffa450edef68
2009-05-31 10:37:09 +00:00
Otis Gospodnetic 9275233f71 - Removed an ancient deprecated method and fixed misspellings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@780363 13f79535-47bb-0310-9956-ffa450edef68
2009-05-31 04:10:35 +00:00
Michael McCandless f953c8883e use 2 digits precision when reporting rec/s
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@779569 13f79535-47bb-0310-9956-ffa450edef68
2009-05-28 11:31:44 +00:00
Uwe Schindler eba450d10b LUCENE-1591: Update commons-compress to the released version 1.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@777458 13f79535-47bb-0310-9956-ffa450edef68
2009-05-22 10:38:21 +00:00
Grant Ingersoll 27fc5cd348 LUCENE-1550: Added new ngram spell checking distance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@776704 13f79535-47bb-0310-9956-ffa450edef68
2009-05-20 14:07:08 +00:00
Michael McCandless 2dd7d33e86 LUCENE-1643: use reusable RawCollationKey for better performance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@776252 13f79535-47bb-0310-9956-ffa450edef68
2009-05-19 09:50:24 +00:00
Michael McCandless af550281cb LUCENE-1629: remove unnecessary source files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@775468 13f79535-47bb-0310-9956-ffa450edef68
2009-05-16 14:08:38 +00:00
Michael McCandless f81f6796a2 LUCENE-1629: correct ASF source headers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@775444 13f79535-47bb-0310-9956-ffa450edef68
2009-05-16 09:55:34 +00:00
Michael McCandless be0a47b7e3 LUCENE-1629: move CHANGES entry to contrib; add TestArabicAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@774727 13f79535-47bb-0310-9956-ffa450edef68
2009-05-14 10:50:52 +00:00
Michael McCandless e01aad89fe LUCENE-1629: adding new contrib analyzer SmartChineseAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@774718 13f79535-47bb-0310-9956-ffa450edef68
2009-05-14 10:09:22 +00:00
Michael McCandless d4f9809520 LUCENE-1593: optimize core Collectors for in-docID-order scorer case
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@773194 13f79535-47bb-0310-9956-ffa450edef68
2009-05-09 10:36:41 +00:00
Mark Robert Miller 6bcd89a916 fix for LUCENE-1425
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@770993 13f79535-47bb-0310-9956-ffa450edef68
2009-05-02 22:46:26 +00:00
Uwe Schindler 673b90e651 Correct JavaDocs in TrieUtils. This also adds a missing test for equals() and hashcode() of the TrieRangeQueries.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@770602 13f79535-47bb-0310-9956-ffa450edef68
2009-05-01 09:55:01 +00:00
Michael McCandless 9ba59706b0 try to prevent false failure of testTimedSearchTask
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@770151 13f79535-47bb-0310-9956-ffa450edef68
2009-04-30 10:32:56 +00:00
Patrick O'Leary 46619cfa23 Fixed bug caused by multiSegmentIndexReader
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@769639 13f79535-47bb-0310-9956-ffa450edef68
2009-04-29 03:53:21 +00:00
Michael McCandless 72d1873533 LUCENE-1604: allow IndexReader.norms to return null if the field has no norms; this is off by default but will be hardwired to on in 3.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@769524 13f79535-47bb-0310-9956-ffa450edef68
2009-04-28 20:38:47 +00:00
Patrick O'Leary 9242c2b6e1 Fix class cast exception in SerialChainFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@769470 13f79535-47bb-0310-9956-ffa450edef68
2009-04-28 17:14:09 +00:00
Uwe Schindler 5807d11b70 LUCENE-1602: Fix incomplete toString(), hashcode() and equals() because boost was not involved there (copy'n'paste error from the Filter, which had no boost)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@767982 13f79535-47bb-0310-9956-ffa450edef68
2009-04-23 17:07:36 +00:00
Michael McCandless e0c2792a57 Java 1.4 build of Xerces JAR
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@766300 13f79535-47bb-0310-9956-ffa450edef68
2009-04-18 11:16:27 +00:00
Uwe Schindler 92c6484161 LUCENE-1602: Rewrite TrieRange to use MultiTermQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@765618 13f79535-47bb-0310-9956-ffa450edef68
2009-04-16 12:58:28 +00:00
Michael McCandless 022d104c38 LUCENE-1591: add commons-compress JAR
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@765580 13f79535-47bb-0310-9956-ffa450edef68
2009-04-16 11:28:46 +00:00
Michael McCandless 8c4fff6e21 LUCENE-1591: add bzip2 compression/decompress to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@765543 13f79535-47bb-0310-9956-ffa450edef68
2009-04-16 09:46:30 +00:00
Uwe Schindler 87e10db563 Add a missing check for inverse ranges in splitRange() and corresponding tests for inverse ranges and 0-length-ranges.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@764829 13f79535-47bb-0310-9956-ffa450edef68
2009-04-14 15:28:06 +00:00
Uwe Schindler 3ef76ae033 Small documentation updates, test fixed according to attributes documentation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@764654 13f79535-47bb-0310-9956-ffa450edef68
2009-04-14 00:36:02 +00:00
Michael McCandless ad79d914e4 LUCENE-1575: switch to new Collector API
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@764551 13f79535-47bb-0310-9956-ffa450edef68
2009-04-13 18:33:56 +00:00
Mark Robert Miller 413cabef1f Contrib queries package Query implementations do not override equals()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763857 13f79535-47bb-0310-9956-ffa450edef68
2009-04-10 02:21:04 +00:00
Mark Robert Miller a1b3fd7240 Add ConstantScore highlighting support to SpanScorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763856 13f79535-47bb-0310-9956-ffa450edef68
2009-04-10 02:09:46 +00:00
Uwe Schindler ca6451958b Add extra check to TokenStream tests, only get attribute not create it
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763793 13f79535-47bb-0310-9956-ffa450edef68
2009-04-09 20:32:35 +00:00
Uwe Schindler b5c4f8e67d Add field name to term statistics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763778 13f79535-47bb-0310-9956-ffa450edef68
2009-04-09 19:31:42 +00:00
Michael McCandless 3c91517add LUCENE-1516: add near real-time search to IndexWriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763737 13f79535-47bb-0310-9956-ffa450edef68
2009-04-09 17:17:46 +00:00
Michael McCandless 6b4831fd1d LUCENE-1591: workaround bug in xerces so we can process Wikipedia's XML
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763416 13f79535-47bb-0310-9956-ffa450edef68
2009-04-08 21:50:33 +00:00
Uwe Schindler c7ac1fdec3 LUCENE-1582: Fix missing public accessor in ShiftAttribute
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@763004 13f79535-47bb-0310-9956-ffa450edef68
2009-04-07 22:25:46 +00:00
Patrick O'Leary 42783c57e5 Commit LUCENE-1588 DistanceFieldComparatorSource
deprecate BoundaryBoxFilter

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@762801 13f79535-47bb-0310-9956-ffa450edef68
2009-04-07 14:54:50 +00:00
Uwe Schindler 13ae26b7fe LUCENE-1582: Make TrieRange completely independent from Document/Field with TokenStream of prefix encoded values
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@762710 13f79535-47bb-0310-9956-ffa450edef68
2009-04-07 11:47:35 +00:00
Michael McCandless bddac65336 share single IndexSearcher instance in ReadTask
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@762307 13f79535-47bb-0310-9956-ffa450edef68
2009-04-06 12:20:42 +00:00
Michael McCandless c73712d1bb LUCENE-1576: fix BrazilianAnalyzer to downcase before filtering stop words
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@759307 13f79535-47bb-0310-9956-ffa450edef68
2009-03-27 19:04:25 +00:00
Michael McCandless 0f17904f1e remove slow download host; only download zip file once for all tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@759061 13f79535-47bb-0310-9956-ffa450edef68
2009-03-27 08:44:33 +00:00
Michael McCandless 198aa70b35 LUCENE-1500: fix compilation errors in contrib/benchmark by allowing doHighlight method to throw InvalidTokenOfsetsException
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@758575 13f79535-47bb-0310-9956-ffa450edef68
2009-03-26 10:03:34 +00:00
Mark Harwood 73a02ec6fe Fix for Lucene-1500 - new exception added to Highlighter API to handle TokenStreams with Tokens that exceed given text length
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@758460 13f79535-47bb-0310-9956-ffa450edef68
2009-03-25 23:05:14 +00:00
Mark Robert Miller 126f4b18d3 LUCENE-1563:Add example test case for surround query language
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@757762 13f79535-47bb-0310-9956-ffa450edef68
2009-03-24 12:28:13 +00:00
Patrick O'Leary 4f7ef62e1f Fixed NPE for minimal bounding box search
LUCENE-1568

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@756849 13f79535-47bb-0310-9956-ffa450edef68
2009-03-21 00:46:27 +00:00
Michael McCandless c443dcc662 LUCENE-1561: rename omitTf -> omitTermFreqAndPositions in Field*
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@756663 13f79535-47bb-0310-9956-ffa450edef68
2009-03-20 17:26:01 +00:00
Michael McCandless 6bf0e6e09b LUCENE-1435: add contrib/collation (CollationKeyFilter), to convert tokens into indexable CollationKeys
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@755914 13f79535-47bb-0310-9956-ffa450edef68
2009-03-19 10:51:55 +00:00
Michael McCandless e44e6b0603 LUCENE-1490: forgot CHANGES.txt update
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@755746 13f79535-47bb-0310-9956-ffa450edef68
2009-03-18 21:42:17 +00:00
Michael McCandless 96863198a5 LUCENE-1490: fix latin1 conversion of HALFWIDTH_AND_FULLWIDTH_FORMS characters to only apply to the correct subset
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@755666 13f79535-47bb-0310-9956-ffa450edef68
2009-03-18 17:28:53 +00:00
Michael McCandless 68f4d8b1a8 revert accidental changes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@755652 13f79535-47bb-0310-9956-ffa450edef68
2009-03-18 17:06:46 +00:00
Michael McCandless cd5028b79a improve IndexWriter.rollback javadoc src/java/org/apache/lucene/index/IndexWriter.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@755535 13f79535-47bb-0310-9956-ffa450edef68
2009-03-18 10:22:51 +00:00
Mark Robert Miller e256f679e3 typo in javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@749625 13f79535-47bb-0310-9956-ffa450edef68
2009-03-03 15:00:02 +00:00
Michael McCandless 6248e14515 LUCENE-1548: fix distance normalization in LevenshteinDistance to not produce negative distances
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@748534 13f79535-47bb-0310-9956-ffa450edef68
2009-02-27 14:07:12 +00:00
Yonik Seeley 6c176eb016 LUCENE-1398: Add ReverseStringFilter to contrib/analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@747915 13f79535-47bb-0310-9956-ffa450edef68
2009-02-25 20:44:05 +00:00
Chris M. Hostetter b97ee3a1a5 some javadoc cleanup, and deprecation note for older version of norm modification tool
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@747627 13f79535-47bb-0310-9956-ffa450edef68
2009-02-25 00:29:17 +00:00
Uwe Schindler 174e8237b9 Javadoc fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@746792 13f79535-47bb-0310-9956-ffa450edef68
2009-02-22 18:29:42 +00:00
Uwe Schindler c314ad000f LUCENE-1470: API change in RangeBuilder
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@746790 13f79535-47bb-0310-9956-ffa450edef68
2009-02-22 18:17:13 +00:00
Uwe Schindler 03db7c81c1 LUCENE-1470: Remove the recursion from splitRange and implement with loop
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@745533 13f79535-47bb-0310-9956-ffa450edef68
2009-02-18 15:08:02 +00:00
Uwe Schindler 8b79fa51a5 LUCENE-1470: New implementation using encoding of TrieUtils and TrieRangeFilter that now also supports 32bit and 64bit fields
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@744207 13f79535-47bb-0310-9956-ffa450edef68
2009-02-13 18:27:01 +00:00
Karl-Johan Wettin 6e692d38ec LUCENE-1531
Added support for BoostingTermQuery to XML query parser.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@742411 13f79535-47bb-0310-9956-ffa450edef68
2009-02-09 11:49:33 +00:00
Michael McCandless dbe0898d63 LUCENE-1535: use random seed for randomness, but print the seed on failure for reproducibility
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@741311 13f79535-47bb-0310-9956-ffa450edef68
2009-02-05 21:53:40 +00:00
Uwe Schindler 80b471b0cc one Math.max too much, sorry
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@740444 13f79535-47bb-0310-9956-ffa450edef68
2009-02-03 21:07:11 +00:00
Uwe Schindler b2c04094ef Fix jUnit test problem with exclusive range if upper==lower (thanks to Michael McCandless):
[junit] junit.framework.AssertionFailedError: Returned count of  
range query must be equal to exclusive range length expected:<0> but  
was:<-1>
at org.apache.lucene.search.trie.TestTrieRangeQuery.testRangeSplit(TestTrieRangeQuery.java:203)


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@740442 13f79535-47bb-0310-9956-ffa450edef68
2009-02-03 21:05:29 +00:00
Uwe Schindler e7edb24bc0 fix javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@739441 13f79535-47bb-0310-9956-ffa450edef68
2009-01-30 22:44:20 +00:00
Uwe Schindler cb55b1671b Add some extra check for validity of c'tor parameters in TrieRangeFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@739439 13f79535-47bb-0310-9956-ffa450edef68
2009-01-30 22:38:04 +00:00
Michael McCandless 20753f2e31 LUCENE-1507: add DocIdSet.EMPTY_DOCIDSET
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@738895 13f79535-47bb-0310-9956-ffa450edef68
2009-01-29 15:39:11 +00:00
Michael McCandless 96e0aaf49e LUCENE-1483: switch to newly added MultiReaderHitCollector for all core collectors, that is aware of segment transitions during searching, to improve performance of searching and warming
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@738219 13f79535-47bb-0310-9956-ffa450edef68
2009-01-27 20:15:21 +00:00
Uwe Schindler f1ab1aee64 Implement a shortcut, when range has min>max. In this case a static empty SortedVIntList is returned.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@738182 13f79535-47bb-0310-9956-ffa450edef68
2009-01-27 18:12:34 +00:00
Uwe Schindler 153acfd064 LUCENE-1530: Support inclusive/exclusive for TrieRangeQuery/-Filter, remove default trie variant setters/getters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@738109 13f79535-47bb-0310-9956-ffa450edef68
2009-01-27 15:15:24 +00:00
Michael McCandless c6f6f01643 LUCENE-1316: don't call synchronized IndexReader.isDeleted when scoring MatchAllDocsQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@737513 13f79535-47bb-0310-9956-ffa450edef68
2009-01-25 14:38:43 +00:00
Michael McCandless 4764105ff4 LUCENE-1527: open reader non-readOnly for doing deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@737175 13f79535-47bb-0310-9956-ffa450edef68
2009-01-23 20:33:51 +00:00
Uwe Schindler 7ef048e01c Optimize the test index as preparation for LUCENE-1483, because the statistics on visited terms in TrieRangeQuery/-Filter only work correctly for optimized indexes (as the count is reset on each query execution. So the new multisegment-search must have a one-segment index to only execute the query one time)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@737079 13f79535-47bb-0310-9956-ffa450edef68
2009-01-23 16:44:17 +00:00
Michael McCandless d545a46df5 LUCENE-1525: add missing maven dependencies for contrib/{benchmark,highlighter}; fix svn eolstyle to native
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@736340 13f79535-47bb-0310-9956-ffa450edef68
2009-01-21 16:59:39 +00:00
Uwe Schindler 88e19ab531 Trie encoded fields do not need norms (as they are only used for filtering with TrieRangeFilter or for sorting)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@734414 13f79535-47bb-0310-9956-ffa450edef68
2009-01-14 15:09:25 +00:00
Michael McCandless 03d6b2aa24 LUCENE-1479: if date is missing, don't skip the doc; just don't add 'docdate' field to the doc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733697 13f79535-47bb-0310-9956-ffa450edef68
2009-01-12 11:37:23 +00:00
Uwe Schindler f0300a643e LUCENE-1517: Change superclass of TrieRangeQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733519 13f79535-47bb-0310-9956-ffa450edef68
2009-01-11 21:15:24 +00:00
Uwe Schindler 4905d4404c Small optimization because an Object is not needed here
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733449 13f79535-47bb-0310-9956-ffa450edef68
2009-01-11 11:04:56 +00:00
Uwe Schindler c2b5e91ddd As suggested by Otis: Convert TABs to spaces, remove @author javadoc tags
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733443 13f79535-47bb-0310-9956-ffa450edef68
2009-01-11 10:25:14 +00:00
Karl-Johan Wettin d7376608b2 LUCENE-1514
ShingleMatrixFilter#next(Token) easily throws a StackOverflowException due to recursive invocation. (Karl Wettin)


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733064 13f79535-47bb-0310-9956-ffa450edef68
2009-01-09 15:34:52 +00:00
Michael McCandless 8180700de8 LUCENE-1497: minor code cleanups to SimpleHTMLFormatter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732739 13f79535-47bb-0310-9956-ffa450edef68
2009-01-08 16:11:19 +00:00
Karl-Johan Wettin f991524da8 LUCENE-1510
InstantiatedIndexReader#norms methods throws NullPointerException on empty index.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732661 13f79535-47bb-0310-9956-ffa450edef68
2009-01-08 09:28:42 +00:00
Ryan McKinley 3f72bc813f remove unused imports and @author tag
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732410 13f79535-47bb-0310-9956-ffa450edef68
2009-01-07 17:47:26 +00:00
Ryan McKinley 977c9acc71 LUCENE-1512 -- adding GeoHash implementaion
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732400 13f79535-47bb-0310-9956-ffa450edef68
2009-01-07 17:10:58 +00:00
Ryan McKinley 43609e891d LUCENE-1504 -- SerialChainFilter should use DocSet API
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732031 13f79535-47bb-0310-9956-ffa450edef68
2009-01-06 18:13:06 +00:00
Ryan McKinley 4978391e93 LUCENE-1508 -- using the tierPrefix constructors by default
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730662 13f79535-47bb-0310-9956-ffa450edef68
2009-01-02 03:21:23 +00:00
Ryan McKinley 0958d83354 LUCENE-1508 -- make the field prefix configurable
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730658 13f79535-47bb-0310-9956-ffa450edef68
2009-01-02 02:52:30 +00:00
Ryan McKinley 1f585f0357 LUCENE-1387 -- expose getLat/getLng from DistanceQueryBuilder
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730653 13f79535-47bb-0310-9956-ffa450edef68
2009-01-02 00:51:26 +00:00
Michael Busch c9e7642bae Fix pom.xml.template of spatial search contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730552 13f79535-47bb-0310-9956-ffa450edef68
2009-01-01 15:02:48 +00:00
Otis Gospodnetic 72725a0b58 - Small documentation mods.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730207 13f79535-47bb-0310-9956-ffa450edef68
2008-12-30 18:20:43 +00:00
Ryan McKinley 0afd451f24 LUCENE-1503 -- refactor spatial Query/Filter classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730205 13f79535-47bb-0310-9956-ffa450edef68
2008-12-30 18:00:56 +00:00
Ryan McKinley 581c82ccbe LUCENE-1387 -- adding locallucene as new spatial contrib package.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730067 13f79535-47bb-0310-9956-ffa450edef68
2008-12-30 07:37:17 +00:00
Michael McCandless eb64661216 LUCENE-1495: fix TaskSequence.toString to show time-limited configuration
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728751 13f79535-47bb-0310-9956-ffa450edef68
2008-12-22 19:37:58 +00:00
Michael McCandless 51f9b759fa LUCENE-1495: fix time-based test to reduce change of false failure
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728425 13f79535-47bb-0310-9956-ffa450edef68
2008-12-21 11:07:28 +00:00
Michael McCandless 7abe0311bc LUCENE-1444: fix broken site links in javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728262 13f79535-47bb-0310-9956-ffa450edef68
2008-12-20 09:43:06 +00:00
Michael McCandless bdc42e5342 LUCENE-1495: allow TaskSequence to run for certain time
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728252 13f79535-47bb-0310-9956-ffa450edef68
2008-12-20 08:45:09 +00:00
Michael McCandless 74e097f8eb LUCENE-1493: allow setting top number of hits to collect with search.num.hits
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727063 13f79535-47bb-0310-9956-ffa450edef68
2008-12-16 15:09:46 +00:00
Michael McCandless 09f661c48c LUCENE-1492: add optional readOnly param to OpenReader task
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727029 13f79535-47bb-0310-9956-ffa450edef68
2008-12-16 11:44:01 +00:00
Michael McCandless 1f33d7c66a fix off-by-one bug in getting field sort type
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@726213 13f79535-47bb-0310-9956-ffa450edef68
2008-12-13 14:43:32 +00:00
Karl-Johan Wettin 812abd20dd LUCENE-1462
InstantiatedIndexWriter did not reset pre analyzed TokenStreams the same way IndexWriter does. 
Parts of InstantiatedIndex was not Serializable.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@726030 13f79535-47bb-0310-9956-ffa450edef68
2008-12-12 15:12:31 +00:00
Karl-Johan Wettin 219a20a945 LUCENE-1462
InstantiatedIndexWriter did not reset pre analyzed TokenStreams the same way IndexWriter does. 
Parts of InstantiatedIndex was not Serializable.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@725837 13f79535-47bb-0310-9956-ffa450edef68
2008-12-11 22:08:45 +00:00
Grant Ingersoll 2225462178 LUCENE-1380: Add PositionFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@725691 13f79535-47bb-0310-9956-ffa450edef68
2008-12-11 14:17:44 +00:00
Michael McCandless efc985cbf4 LUCENE-1478: allow SortField to use a custom numeric FieldCache parser
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@724484 13f79535-47bb-0310-9956-ffa450edef68
2008-12-08 21:07:45 +00:00
Grant Ingersoll 702ea32da7 make constructors public
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@724059 13f79535-47bb-0310-9956-ffa450edef68
2008-12-07 00:39:35 +00:00
Michael McCandless 4a0a5664dd LUCENE-1468: switch Directory.list() to Directory.listAll(), which does no filtering of returned array
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723789 13f79535-47bb-0310-9956-ffa450edef68
2008-12-05 17:03:13 +00:00
Michael McCandless 5e3cd6711a LUCENE-1470: describe TrieRangeFilter/Query in README
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723701 13f79535-47bb-0310-9956-ffa450edef68
2008-12-05 10:16:50 +00:00
Michael McCandless 240897988d LUCENE-1470: switch TestTrieRangeQuery to LuceneTestCase as well
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723313 13f79535-47bb-0310-9956-ffa450edef68
2008-12-04 13:17:23 +00:00
Michael McCandless 05bff1833f LUCENE-1470: add proper dependency to contrib/build.xml so contrib tests can subclass LuceneTestCase; fixed small bug TestTrieUtils test case
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723287 13f79535-47bb-0310-9956-ffa450edef68
2008-12-04 11:06:59 +00:00
Michael McCandless 481f8080ab LUCENE-1470: add TrieRangeQuery, a much more efficient implementation of RangeQuery at the expense of added space consumed in the index
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723031 13f79535-47bb-0310-9956-ffa450edef68
2008-12-03 19:38:31 +00:00
Michael McCandless 278dc0fc04 LUCENE-1296: add protected method CachingWrapperFilter.docIdSetToCache
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@722174 13f79535-47bb-0310-9956-ffa450edef68
2008-12-01 18:50:45 +00:00
Mark Robert Miller c2f7d554a3 LUCENE-1389: SimpleSpanFragmenter can create very short fragments
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@713569 13f79535-47bb-0310-9956-ffa450edef68
2008-11-13 00:01:43 +00:00
Yonik Seeley 8e8e8ddec4 set svn:eol-style to native on java files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@712922 13f79535-47bb-0310-9956-ffa450edef68
2008-11-11 02:35:46 +00:00
Michael McCandless abe6ea0828 LUCENE-1424: enable MultiTermQuery to do constant scoring; refactor Range, Wildcard, Prefix to expose this
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@712890 13f79535-47bb-0310-9956-ffa450edef68
2008-11-11 00:40:00 +00:00
Mark Harwood de7ab5b898 Added new web application demo for contrib's XmlQueryParser.
This change involves:
* Adding Tomcat's Servlet jar into the lib directory and appropriate entry in NOTICE.txt following the lead from Solr's packaging
* Adding new "demo" directory to XmlQueryParser src directory
* Changing XMLQueryParser's build file to create a demo War file
* Changing the main build to include the demo War file (and any other future contrib/*/war files) in the binary distributions

The packaged source distribution has NOT been changed currently to add a lib directory with the servlet.jar so building from a cut-down src distro as opposed to the full subversion /trunk directory will not currently build the war file (the xml query parser build file detects the absence of servlet.jar). Not sure if this is a problem currently.

TODO: 
Now that the servlet jar is available in Subversion I would recommend that the other existing WAR file,"luceneweb.war", is changed to move much of the java code which is currently embedded in JSP files into servlet .java files. This would ensure that the build system will check that the code in this application compiles cleanly with the latest Lucene APIs - otherwise any issue will only become apparent when a user tries to run a JSP.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@712318 13f79535-47bb-0310-9956-ffa450edef68
2008-11-08 00:20:32 +00:00
Michael McCandless 24d76157a0 LUCENE-1420: let Similarity.computeNorm compute the norm; add option to discount overlap tokens when computing lengthNorm
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@710117 13f79535-47bb-0310-9956-ffa450edef68
2008-11-03 18:03:58 +00:00
Michael McCandless f2c988ae2b don't include contrib/memory class files in highlighter's jar
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@707834 13f79535-47bb-0310-9956-ffa450edef68
2008-10-25 10:19:30 +00:00
Grant Ingersoll 8dfe073760 LUCENE-1406. Added Arabic stemming and normalization. Also added new method to WordListLoader to allow for comments in word lists.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@706342 13f79535-47bb-0310-9956-ffa450edef68
2008-10-20 17:19:29 +00:00
Michael McCandless da78e20bd0 LUCENE-1416: don't fail contrib/ant's unit test when file path contains non-URL safe characters like space
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@706160 13f79535-47bb-0310-9956-ffa450edef68
2008-10-20 09:12:46 +00:00
Karl-Johan Wettin 456b10fdf9 LUCENE-1423
InstantiatedTermEnum#skipTo(Term) throws ArrayIndexOutOfBoundsException on an empty index.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@705893 13f79535-47bb-0310-9956-ffa450edef68
2008-10-18 16:29:53 +00:00
Daniel Naber 20ddb46fec minor javadoc fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@703717 13f79535-47bb-0310-9956-ffa450edef68
2008-10-11 19:45:16 +00:00
Michael McCandless 3f27b17a89 fix non-1.4-compatible throws clause
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@701827 13f79535-47bb-0310-9956-ffa450edef68
2008-10-05 16:40:59 +00:00
Mark Harwood 92d6a0ae7b Fix for a potential null-pointer-introducing bug which came about as part of the DocIdSet changes. TermsFilter no longer implemented bits(IndexReader) and the Filter base class' version of this was changed to return null.
When dropping 2.4 Lucene in as a direct replacement for 2.3.2 my client code was getting NullPointer errors - returning null was never part of the Filter.bits contract and so this could be a problem for others using this class. 

Fix is for TermsFilter to implement bits(IndexReader) - this can safely be removed in later versions because it is a deprecated method going forward.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@700815 13f79535-47bb-0310-9956-ffa450edef68
2008-10-01 15:37:05 +00:00
Michael McCandless 98e1129a14 break off contrib/CHANGES.txt's 2.4.0 release section
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@700743 13f79535-47bb-0310-9956-ffa450edef68
2008-10-01 11:22:58 +00:00
Andreas Vajda 88283a7027 - upgraded to bdb-je 3.3.69
- re-enabled testBytes() and testArrays() (per Aaron, Michael's instructions)


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699961 13f79535-47bb-0310-9956-ffa450edef68
2008-09-29 02:21:51 +00:00
Andreas Vajda 96809b1238 upgraded db lib to version 4.7.25
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699882 13f79535-47bb-0310-9956-ffa450edef68
2008-09-28 19:07:39 +00:00
Mark Harwood ba6344a4d7 Fixed bug in FuzzyLikeThisQuery.java. Queries that contain a term with no fuzzy variants caused the query construction logic to exit loop early, producing no fuzzy variants for all subsequent terms in the query string.
Junit test added which recreates the problem conditions and added fix to FuzzyLikeThisQuery that solves the issue.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699512 13f79535-47bb-0310-9956-ffa450edef68
2008-09-26 21:44:55 +00:00
Michael McCandless 0e708a21fe LUCENE-1401: fix accidentally lost 'create=true' in CreateIndexTask
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@698975 13f79535-47bb-0310-9956-ffa450edef68
2008-09-25 14:11:41 +00:00
Michael McCandless a40530e4a4 LUCENE-1401: remove new deprecated IndexWriter ctors; default autoCommit=false for new ctors
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@698932 13f79535-47bb-0310-9956-ffa450edef68
2008-09-25 12:04:38 +00:00
Michael McCandless b866828bab add graph to package.html for contrib/instantiated showing queries per sec vs number of docs indexed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@697126 13f79535-47bb-0310-9956-ffa450edef68
2008-09-19 15:15:36 +00:00
Otis Gospodnetic 0195fcd03d LUCENE-1378 - Removed the remaining 199 @author references
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@695514 13f79535-47bb-0310-9956-ffa450edef68
2008-09-15 15:42:11 +00:00
Karl-Johan Wettin c6e5618674 Added a bunch of <p> in the Javadocs to make the rendered html view look better.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694396 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 18:26:46 +00:00
Karl-Johan Wettin 71f2d8199b LUCENE-1320
ShingleMatrixFilter JDK downgrade 1.5 -> 1.4 
Grant Ingersoll via Karl Wettin

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694393 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 18:23:18 +00:00
Michael McCandless d31fb4556c fix wordnet's Syns2Index to not fiddle with mergeFactor & maxBuffereDocs (the latter was hitting an exception)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694222 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 12:11:03 +00:00
Michael McCandless 4218996230 LUCENE-1366: rename Field.Index.* options
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694004 13f79535-47bb-0310-9956-ffa450edef68
2008-09-10 21:38:52 +00:00
Grant Ingersoll dadaebc290 LUCENE-1243: Added new benchmark tasks
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693495 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 15:56:41 +00:00
Mark Robert Miller 5147f0563f LUCENE-1357: SpanScorer does not respect ConstantScoreRangeQuery setting
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693432 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 10:52:50 +00:00
Michael McCandless d5a40278bc LUCENE-1369: switch from Hashtable to HashMap and from Vector to List, when possible
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@692921 13f79535-47bb-0310-9956-ffa450edef68
2008-09-07 19:22:40 +00:00
Michael McCandless 120d01bb09 remove DOCTYPE declaration in testEncoding (it's causing intermittant false failures in nightly build when xerces has trouble loading the DTD from w3.org)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@690914 13f79535-47bb-0310-9956-ffa450edef68
2008-09-01 09:07:16 +00:00
Karl-Johan Wettin bf238a5743 Javadocs fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@690779 13f79535-47bb-0310-9956-ffa450edef68
2008-08-31 20:46:47 +00:00
Karl-Johan Wettin 82c70c018e LUCENE-1016 : TermVectorAccessor, transparent vector space access via stored vectors or by resolving the inverted index.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688745 13f79535-47bb-0310-9956-ffa450edef68
2008-08-25 15:02:20 +00:00
Karl-Johan Wettin 3034575f66 LUCENE-1142 : Updated Snowball package, org.tartarus distribution revision 500.
Introducing Hungarian, Turkish and Romanian support, updated older stemmers and optimized (reflectionless) SnowballFilter.

IMPORTANT NOTICE ON BACKWARDS COMPATIBILITY: an index created using the 2.3.2 (or older) might not be compatible with these updated classes as some algorithms have changed.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688420 13f79535-47bb-0310-9956-ffa450edef68
2008-08-23 22:02:47 +00:00
Karl-Johan Wettin 9fe7a35378 Contrib level CHANGES.txt. I forgot to add this some time ago.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688370 13f79535-47bb-0310-9956-ffa450edef68
2008-08-23 17:12:57 +00:00
Michael McCandless 003a853cc8 LUCENE-1333: don't use LuceneTestCase in contrib until we can fix the build dependency
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687539 13f79535-47bb-0310-9956-ffa450edef68
2008-08-21 02:45:37 +00:00
Grant Ingersoll 09467ad18e LUCENE-1001: Make payloads accessible from SpanQuery's
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687379 13f79535-47bb-0310-9956-ffa450edef68
2008-08-20 16:06:36 +00:00
Michael McCandless bb6b711718 LUCENE-1333: improvements to Token reuse API and full cutover to reuse API for all core and contrib analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687357 13f79535-47bb-0310-9956-ffa450edef68
2008-08-20 14:38:07 +00:00
Mark Robert Miller 7675606908 LUCENE-1355: highlighter can incorrectly produce negative idf when index has deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687052 13f79535-47bb-0310-9956-ffa450edef68
2008-08-19 13:16:41 +00:00
Michael McCandless e31a9da835 LUCENE-1334: add Term(String fieldName) constructor that sets term text to empty string
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687014 13f79535-47bb-0310-9956-ffa450edef68
2008-08-19 10:40:39 +00:00
Doron Cohen 027307857e LUCENE-1353: Javacc target to generate precedence query parser.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@685114 13f79535-47bb-0310-9956-ffa450edef68
2008-08-12 10:49:59 +00:00
Michael McCandless 12e4e67403 add missing synchronized to PerfRunData.getQueryMaker()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@677555 13f79535-47bb-0310-9956-ffa450edef68
2008-07-17 10:50:24 +00:00
Mark Harwood 74c551fb88 Added option to allow UserQuery tag to define a different default fieldName.
Standard use case for this is where users are presented with a GUI form with multiple input boxes, each targetting a different field and allowing "lucene syntax". The XML query template behind such a form would have a <UserQuery> tag for each form field, each defined with the appropriate choice of default field name.

Added Junit test for changing default field name, updated DTD for XML query syntax and regenerated HTML documentation.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@674975 13f79535-47bb-0310-9956-ffa450edef68
2008-07-08 21:50:06 +00:00
Karl-Johan Wettin ddc7c290d0 LUCENE-1320
ShingleMatrixFilter, a multidimensional shingle token filter.

Bug fix, did not support empty input token streams.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@674367 13f79535-47bb-0310-9956-ffa450edef68
2008-07-07 00:08:41 +00:00
Karl-Johan Wettin bca43ea3ea LUCENE-1320
ShingleMatrixFilter, a multidimensional shingle token filter.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@673549 13f79535-47bb-0310-9956-ffa450edef68
2008-07-02 23:53:51 +00:00
Mark Robert Miller 943bf37e66 LUCENE-1321: Highlight fragment does not extend to maxDocCharsToAnalyze
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@673220 13f79535-47bb-0310-9956-ffa450edef68
2008-07-01 21:44:33 +00:00
Karl-Johan Wettin 4f7ead8112 LUCENE-1318: InstantiatedIndexReader.norms(String, b[], int) didn't treat the array offset right.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@672568 13f79535-47bb-0310-9956-ffa450edef68
2008-06-28 18:52:12 +00:00
Karl-Johan Wettin b8fc54e72a LUCENE-1312: Added full support for InstantiatedIndexReader#getFieldNames() and extended the test case to assert deleted documents behaves as they should (they did).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@672556 13f79535-47bb-0310-9956-ffa450edef68
2008-06-28 17:23:35 +00:00
Erik Hatcher 3c80f97f7c MemoryIndex implements Serializable
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@671579 13f79535-47bb-0310-9956-ffa450edef68
2008-06-25 15:18:29 +00:00
Grant Ingersoll 44dcbe184f add a getter for the StringDistance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669476 13f79535-47bb-0310-9956-ffa450edef68
2008-06-19 12:52:11 +00:00
Grant Ingersoll d6dcae49dd make public
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669455 13f79535-47bb-0310-9956-ffa450edef68
2008-06-19 11:45:08 +00:00
Otis Gospodnetic 8b11f3b077 - Renamed TRStringDistance to LevensteinDistance (related to LUCENE-1297)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669086 13f79535-47bb-0310-9956-ffa450edef68
2008-06-18 05:09:11 +00:00
Otis Gospodnetic ce19aa28c0 LUCENE-1297 - Allow other string distance measures for the SpellChecker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669085 13f79535-47bb-0310-9956-ffa450edef68
2008-06-18 05:01:57 +00:00
Mark Harwood 4adc3f7ba6 Fixed bug parsing boolean attributes. Boolean.getBoolean(s) was being used by mistake which reads a system property with name of s - instead need to parse s as string value of "true" or "false".
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@667294 13f79535-47bb-0310-9956-ffa450edef68
2008-06-12 23:38:39 +00:00
Doron Cohen 4d9e280788 LUCENE-1090: remove relative paths from benchmark's build.xml.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@666079 13f79535-47bb-0310-9956-ffa450edef68
2008-06-10 11:58:00 +00:00
Grant Ingersoll 8c97e9b87b LUCENE-1299: Properly handle when IndexReader is not null and Field is in the spell checker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@663649 13f79535-47bb-0310-9956-ffa450edef68
2008-06-05 16:12:08 +00:00
Grant Ingersoll f89cda6dde LUCENE-1298: Allow MLT to use custom similarity
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@663054 13f79535-47bb-0310-9956-ffa450edef68
2008-06-04 10:41:41 +00:00
Grant Ingersoll 1979d75605 LUCENE-1295: Made method public and added retrieveInterestingTerms variation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@662413 13f79535-47bb-0310-9956-ffa450edef68
2008-06-02 12:29:02 +00:00
Mark Robert Miller f32b5a5698 LUCENE-1285: WeightedSpanTermExtractor incorrectly treats the same terms occurring in different query types
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659965 13f79535-47bb-0310-9956-ffa450edef68
2008-05-25 11:38:55 +00:00
Michael Busch 3e7a15e317 LUCENE-1187: Fix ChainedFilterTest.java and BooleanFilterTest.java to not reference class from test/org/apache/lucene/search/, which is not in the contrib classpath.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659742 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 01:45:12 +00:00
Michael Busch 1d6d186888 LUCENE-1187: Fix ChainedFilterTest.java and BooleanFilterTest.java to not reference class from test/org/apache/lucene/search/, which is not in the contrib classpath.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659740 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 01:16:30 +00:00
Michael Busch 14f3663f56 Set eol-style to native for BooleanFilterTest.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659732 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 00:48:12 +00:00
Michael Busch 0ebfcc663e LUCENE-1187: ChainedFilter and BooleanFilter now work with new Filter API and DocIdSetIterator-based filters.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659635 13f79535-47bb-0310-9956-ffa450edef68
2008-05-23 19:25:05 +00:00
Otis Gospodnetic ce557ee4b3 LUCENE-852: Let the SpellChecker caller specify IndexWriter mergeFactor and RAM buffer size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659021 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:47:36 +00:00
Otis Gospodnetic b4f7c37bd1 LUCENE-1046: Removed dead code in SpellChecker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659019 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:34:18 +00:00
Otis Gospodnetic f68c9544de LUCENE-1183: Optimized TRStringDistance class (in contrib/spell) that uses less memory than the previous version
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659016 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:24:55 +00:00
Otis Gospodnetic a379a67875 LUCENE-1152 Fix for calling indexDictionary after clearIndex call
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659013 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:21:17 +00:00
Otis Gospodnetic 90be0daa46 - Made PlainTextDictionary(Reader) public from protected
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657728 13f79535-47bb-0310-9956-ffa450edef68
2008-05-19 05:45:31 +00:00
Otis Gospodnetic f5df30327e - Fixed messed up indentation/tabs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657281 13f79535-47bb-0310-9956-ffa450edef68
2008-05-17 01:57:32 +00:00
Otis Gospodnetic 1d5ba345cc - Javadocs fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657280 13f79535-47bb-0310-9956-ffa450edef68
2008-05-17 01:56:46 +00:00
Otis Gospodnetic d5c708a161 - Renamed vars a bit, so test is easier to understand
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657279 13f79535-47bb-0310-9956-ffa450edef68
2008-05-17 01:55:48 +00:00
Grant Ingersoll 7a27cdcbc9 LUCENE-1166: Added token filter for decomposing compound words
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657027 13f79535-47bb-0310-9956-ffa450edef68
2008-05-16 12:22:50 +00:00
Otis Gospodnetic aa0074f5db LUCENE-1003: Don't let RussianAnalyzer drop numbers.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@656111 13f79535-47bb-0310-9956-ffa450edef68
2008-05-14 05:37:45 +00:00
Otis Gospodnetic b182881092 - Made ant compile-core and build-contrib run again
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@656110 13f79535-47bb-0310-9956-ffa450edef68
2008-05-14 05:29:57 +00:00
Otis Gospodnetic 6a8bef8c74 - Made ant compile-core and build-contrib run again
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@656109 13f79535-47bb-0310-9956-ffa450edef68
2008-05-14 05:29:16 +00:00
Michael Busch 38d1caf56f LUCENE-1277: Remove println in highlighter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@652625 13f79535-47bb-0310-9956-ffa450edef68
2008-05-01 18:59:06 +00:00
Grant Ingersoll afa040e9da LUCENE-1276: Fixed contrib highlighter build
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@652471 13f79535-47bb-0310-9956-ffa450edef68
2008-05-01 10:34:50 +00:00
Mark Harwood 35c7eb36df Commit of LUCENE-794 patch - adding phrase/span query support to highlighter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@652164 13f79535-47bb-0310-9956-ffa450edef68
2008-04-29 22:07:18 +00:00
Karl-Johan Wettin 6a4ae85775 minor javadocs change
First paragraph is used as description in javadocs index so I added the sentance from build/pom. And it gave me an excuse to test my karma.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@649312 13f79535-47bb-0310-9956-ffa450edef68
2008-04-17 22:19:41 +00:00
Michael McCandless 44daebab7c LUCENE-1150: put back public tokenImage/TOKEN_TYPES in StandardTokenizer and WikipediaTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@648183 13f79535-47bb-0310-9956-ffa450edef68
2008-04-15 08:48:41 +00:00
Chris M. Hostetter 21b22f8515 some javadoc building cleanup for contrib/instantiated -- notably moving docs to doc-files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@643170 13f79535-47bb-0310-9956-ffa450edef68
2008-03-31 22:00:57 +00:00
Grant Ingersoll cc955c9748 LUCENE-400: Added ShingleFilter (token based ngram)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@642612 13f79535-47bb-0310-9956-ffa450edef68
2008-03-29 21:11:33 +00:00
Mark Harwood c172010da0 Applied trejkaz's patch from https://issues.apache.org/jira/browse/LUCENE-1240 to optimise TermFilter.java and included new JUnit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@638631 13f79535-47bb-0310-9956-ffa450edef68
2008-03-18 23:01:00 +00:00
Grant Ingersoll 9ac963952f LUCENE-1236: Added some more javadocs. Also removed @author tags
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@637449 13f79535-47bb-0310-9956-ffa450edef68
2008-03-15 18:05:10 +00:00
Chris M. Hostetter 32f1054d88 LUCENE-1202 - tweaks to build system to make clover work better with contribs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@637344 13f79535-47bb-0310-9956-ffa450edef68
2008-03-15 03:16:43 +00:00
Grant Ingersoll f4cc6e2269 LUCENE-550: Added RAMDirectory alternative as a contrib. Similar to MemoryIndex, but can hold more than one document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@636745 13f79535-47bb-0310-9956-ffa450edef68
2008-03-13 12:34:30 +00:00
Michael McCandless b36d9f9be3 download bdb zip from Oracle's servers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@636673 13f79535-47bb-0310-9956-ffa450edef68
2008-03-13 09:06:31 +00:00
Doron Cohen 25f80c71c9 LUCENE-1209: Fixed DocMaker settings by round. Prior to this fix, DocMaker settings of
first round were used in all rounds.  (E.g. term vectors.)


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635280 13f79535-47bb-0310-9956-ffa450edef68
2008-03-09 16:43:32 +00:00
Doron Cohen ce0309ed5f Javadoc clarrification for getRecall().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635195 13f79535-47bb-0310-9956-ffa450edef68
2008-03-09 08:30:28 +00:00
Doron Cohen 8b2f12851b make the name of multi-value property long enough for the values.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635194 13f79535-47bb-0310-9956-ffa450edef68
2008-03-09 08:28:53 +00:00
Mark Harwood 37a060c15c Additional thread safety around filter creation - old code could create duplicate CachingWrapperFilter if thread1 gets cache miss and thread 2 has a cache miss before thread1 populates cache with new CachingWrapperFilter.
Synchronization cost around whole method is OK here because Filter object construction should be a lightweight call. 
Note: CachingWrapperFilter currently has a similar bug in bits() method but adding "synchronized" around that whole method would not be a solution there because of the cost of evaluating filter.bits and the unnecessary blocking effect this would have on threads using different readers to the thread with the lock.

 

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@628921 13f79535-47bb-0310-9956-ffa450edef68
2008-02-18 23:39:12 +00:00
Mark Harwood caed55ef5e Added thread-safety around use of core's QueryParser.
Old XML parser constructors use a mode which will synchronize on use of the user-supplied QueryParser. 
New constructors offer alternative option of passing "defaultField" String which is used to create a new single-use QueryParser for each parse operation.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@628568 13f79535-47bb-0310-9956-ffa450edef68
2008-02-17 22:32:58 +00:00
Michael McCandless fc4bc1f5af LUCENE-1084: fix more test cases to use user-defined MaxFieldLength ctor for IndexWriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619637 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 21:07:35 +00:00
Grant Ingersoll 00941055fd benchmark autocommit
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619457 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 15:33:02 +00:00
Grant Ingersoll 0dc6c59ac1 LUCENE-494: Added QueryAutoStopWordAnalyzer in a new query subpackage
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619420 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 14:13:38 +00:00
Michael McCandless 7ea8bd35f4 LUCENE-1084: fix the remaining deprecated calls to IndexWriter ctors
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@618764 13f79535-47bb-0310-9956-ffa450edef68
2008-02-05 20:07:29 +00:00
Erik Hatcher 140a867933 Correct javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@618275 13f79535-47bb-0310-9956-ffa450edef68
2008-02-04 13:31:02 +00:00
Michael McCandless afb58ca0d2 when reinit() is called, reset the start time (used only when printing # docs indexed)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@617991 13f79535-47bb-0310-9956-ffa450edef68
2008-02-03 14:24:30 +00:00
Michael Busch 18b61286fa LUCENE-584: Changed Filter API to return a DocIdSet instead of a java.util.BitSet. This allows using more efficient data structures for Filters and makes them more flexible.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@617859 13f79535-47bb-0310-9956-ffa450edef68
2008-02-02 19:04:03 +00:00
Grant Ingersoll 3db9002efb LUCENE-1156: see CHANGES.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616934 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 22:47:52 +00:00
Grant Ingersoll 61f2aa3078 allow TrecDocMaker to be more easily extended by making values protected
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616882 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 20:03:39 +00:00
Grant Ingersoll 4fc8654bc1 Made some member variables protected so they can accessed by extending classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616806 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 16:29:35 +00:00
Grant Ingersoll b95e9a5154 LUCENE-1132: Updated documentation and deprecated bytes references in favor of character references
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616305 13f79535-47bb-0310-9956-ffa450edef68
2008-01-29 13:35:13 +00:00
Michael McCandless 38f8d4e837 LUCENE-1150: make StandardAnalyzer tokenizer constants public again (public access was accidentally removed with LUCENE-966)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616248 13f79535-47bb-0310-9956-ffa450edef68
2008-01-29 10:51:44 +00:00
Grant Ingersoll 7dd34ef888 LUCENE-1153: added lib/junit and build support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@615329 13f79535-47bb-0310-9956-ffa450edef68
2008-01-25 20:27:55 +00:00
Michael McCandless ee835ccf21 LUCENE-1084: change all IW ctors to require up-front specification of max field length; deprecated existing ones
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@615168 13f79535-47bb-0310-9956-ffa450edef68
2008-01-25 10:21:15 +00:00
Doron Cohen 8ace4103a7 LUCENE-1136: add ability to not count sub-task doLogic increment to contri/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614956 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 18:46:57 +00:00
Grant Ingersoll 7f09c9f00e Added javadocs about UNTOKENIZED_TOKEN_FLAG conundrum
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614905 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 15:42:50 +00:00
Grant Ingersoll 305c47f500 LUCENE-1133: Adds ability to keep certain strings as single tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614895 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 15:05:53 +00:00
Grant Ingersoll 1183763dbe LUCENE-1128 and 1129: Add highlighting support to benchmarking, plus fix minor traversalSize bug in ReadTask, also added a few new algorithms to try out
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614885 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 14:39:44 +00:00
Grant Ingersoll f75f490eb9 LUCENE-1127: added couple of convenience methods to TokenSources
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614884 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 14:36:46 +00:00
Michael McCandless dd0265469a LUCENE-1139: fix IndexWriter settings (compound, merge.factor, etc.) to work again
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614022 13f79535-47bb-0310-9956-ffa450edef68
2008-01-21 21:42:43 +00:00
Michael McCandless 354a3175d5 LUCENE-1139: various additions/fixes to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@613536 13f79535-47bb-0310-9956-ffa450edef68
2008-01-20 11:31:38 +00:00
Grant Ingersoll ed9fdba601 LUCENE-1050 and LUCENE-1138 fixes for lock problem
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@612868 13f79535-47bb-0310-9956-ffa450edef68
2008-01-17 17:00:43 +00:00
Grant Ingersoll 0a54ca1920 LUCENE-1135. Added Javadoc marking the WikipediaTokenizer as experimental
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@612244 13f79535-47bb-0310-9956-ffa450edef68
2008-01-15 22:01:21 +00:00
Michael Busch e5da9db486 Rename README files to uppercase letters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610961 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 22:38:38 +00:00
Michael Busch 49dd305f69 Rename README files to uppercase letters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610960 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 22:37:27 +00:00
Michael McCandless ea951785eb LUCENE-1117: fix EnwikiDocMaker to not hang when the producer thread hits exception
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610738 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 10:12:18 +00:00