828 Commits

Author SHA1 Message Date
Michael Busch 1f15246336 LUCENE-1851: Change 'javacc' and 'clean-javacc' targets to build and clean contrib/surround files.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@807835 13f79535-47bb-0310-9956-ffa450edef68
2009-08-25 22:14:07 +00:00
Mark Robert Miller 1e4c3cdf86 {@link AttributeFactory} should be {@link AttributeSource.AttributeFactory}
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@807809 13f79535-47bb-0310-9956-ffa450edef68
2009-08-25 21:26:35 +00:00
Simon Willnauer a6eecd50c6 LUCENE-1845: Skip compile, test and jar if the dbd.jar download fails in a nightly build.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@807772 13f79535-47bb-0310-9956-ffa450edef68
2009-08-25 19:22:01 +00:00
Michael Busch e8caefdb7d LUCENE-1852: Fix localization test failures.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@807499 13f79535-47bb-0310-9956-ffa450edef68
2009-08-25 06:48:36 +00:00
Michael Busch 4b6e01654c LUCENE-1836: Fix localization bug in the new query parser and add new LocalizedTestCase as base class for localization junit tests.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@807494 13f79535-47bb-0310-9956-ffa450edef68
2009-08-25 06:19:14 +00:00
Uwe Schindler 367b35f0cb LUCENE-1843: Update contrib tests to conform to onlyUseNewAPI; refactored assertAnalyzesTo and others into the new BaseTokenStreamTestCase class; Rewrote TestMappingCharFilter to use the new assert functions, too; performance improvements of Token.copyTo(); new impl of SingleTokenTokenStream
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@807190 13f79535-47bb-0310-9956-ffa450edef68
2009-08-24 12:44:13 +00:00
Uwe Schindler 5dd1810b0c LUCENE-1846: Fix more Locale problems
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@807117 13f79535-47bb-0310-9956-ffa450edef68
2009-08-24 08:31:34 +00:00
Uwe Schindler 4745c8db05 LUCENE-1825: Another one :(
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806990 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 16:35:50 +00:00
Uwe Schindler c2f95d474b LUCENE-1825: Additional incorrect getAttribute usage
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806986 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 16:17:08 +00:00
Robert Muir 6847c0e2bd LUCENE-1826: the new tokenizer constructors should not allow deprecated charsets
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806961 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 12:39:28 +00:00
Michael Busch 64ed5f39a5 LUCENE-1826: Add constructors that take AttributeSource and AttributeFactory to all Tokenizer implementations.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806942 13f79535-47bb-0310-9956-ffa450edef68
2009-08-23 08:34:22 +00:00
Robert Muir 8e4e7e17d0 LUCENE-1834: Remove unused code in SmartChineseAnalyzer hmm pkg
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806891 13f79535-47bb-0310-9956-ffa450edef68
2009-08-22 21:03:08 +00:00
Robert Muir 1ebbe2abd1 LUCENE-1793: Deprecate custom encoding support in Greek and Russian analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806886 13f79535-47bb-0310-9956-ffa450edef68
2009-08-22 20:36:06 +00:00
Uwe Schindler 23d976abbd LUCENE-1825: Incorrect usage of AttributeSource.addAttribute/getAttribute leads to failures when onlyUseNewAPI=true
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806844 13f79535-47bb-0310-9956-ffa450edef68
2009-08-22 12:04:35 +00:00
Michael Busch f409117a6f LUCENE-1829: Fix contrib query parser to properly create javacc files.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806755 13f79535-47bb-0310-9956-ffa450edef68
2009-08-21 23:19:26 +00:00
Simon Willnauer 61a68fe46f Added the changes in LUCENE-1728 to the "Build" section of contrib/CHANGES.TXT
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806536 13f79535-47bb-0310-9956-ffa450edef68
2009-08-21 12:59:07 +00:00
Michael Busch 82c8a2ffc2 Fix if clause in MultiTermRewriteMethodProcessor
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806456 13f79535-47bb-0310-9956-ffa450edef68
2009-08-21 08:36:31 +00:00
Michael Busch 5a40767d67 LUCENE-1828: Fix MemoryIndex to call TokenStream.reset() and TokenStream.end().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@806444 13f79535-47bb-0310-9956-ffa450edef68
2009-08-21 08:02:40 +00:00
Mark Robert Miller 66c2217a55 improve Highlighter javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805923 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 18:43:43 +00:00
Robert Muir 1d9a96c2fc LUCENE-1813: Add option to ReverseStringFilter to mark reversed tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805769 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 12:07:15 +00:00
Robert Muir 58cd4a04d7 LUCENE-1794: Ensure analyzer options are applied immediately when using reusable token streams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805766 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 11:56:31 +00:00
Mark Robert Miller 39ada16b9c fix some javadoc @returns
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805650 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 02:42:24 +00:00
Mark Robert Miller 124af950a6 Fix duplicate code and one version of getWeightedSpanTerms - should accept a TokenStream now rather than a CachingTokenStream.
Also fixes getWeightedSpanTermsWithScores to intern the fieldname.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805645 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 01:16:23 +00:00
Michael Busch e488bb0d5f LUCENE-1792: remove obsolete files.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805639 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 00:32:05 +00:00
Michael Busch 5864c5795c LUCENE-1792: remove obsolete files.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805638 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 00:31:05 +00:00
Michael Busch 565a6d22b5 LUCENE-1792: Fix new query parser to set rewrite method for multi-term queries.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805633 13f79535-47bb-0310-9956-ffa450edef68
2009-08-19 00:09:41 +00:00
Robert Muir 3887cf9419 LUCENE-1692: Additional tests and javadocs for contrib/analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@805400 13f79535-47bb-0310-9956-ffa450edef68
2009-08-18 12:55:26 +00:00
Mark Robert Miller 20505fd0d5 LUCENE-1522 fix + test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804786 13f79535-47bb-0310-9956-ffa450edef68
2009-08-16 21:56:51 +00:00
Robert Muir d2af6ef0bd LUCENE-1794: Implement TokenStream reuse for contrib Analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804680 13f79535-47bb-0310-9956-ffa450edef68
2009-08-16 12:37:05 +00:00
Mark Robert Miller 395c65ad08 change PositionCheckingMap to use an entry iterator in putAll rather than keyset - faster
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804516 13f79535-47bb-0310-9956-ffa450edef68
2009-08-15 19:09:34 +00:00
Michael McCandless 316ce42d99 LUCENE-1809: make highlighter vs vector highlighter comparison more fair
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804447 13f79535-47bb-0310-9956-ffa450edef68
2009-08-15 10:01:50 +00:00
Uwe Schindler b16e0aa31b LUCENE-1801: All Tokenizers/TokenStreams that are source of tokens call AttributeSource.clearAttributes() first. Made Token.clear() consistent with AttributeImpl (clear everything)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804392 13f79535-47bb-0310-9956-ffa450edef68
2009-08-14 22:01:42 +00:00
Michael McCandless e166d54f87 LUCENE-1802: undeprecate current QueryParser
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@804016 13f79535-47bb-0310-9956-ffa450edef68
2009-08-13 20:08:58 +00:00
Chris M. Hostetter 2558c06120 LUCENE-1749: Addition of FieldCacheSanityChecker utility, and hooks to use it in all existing Lucene Tests.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@803676 13f79535-47bb-0310-9956-ffa450edef68
2009-08-12 19:31:38 +00:00
Mark Robert Miller 7fff5a7ea7 LUCENE-1771: QueryWeight back to Weight, but as abstract class rather than interface - explain now takes a Searcher and passes the sub reader that contains the doc if a top level reader is a multi reader.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@803339 13f79535-47bb-0310-9956-ffa450edef68
2009-08-12 01:22:30 +00:00
Michael McCandless 7ac73cb0a6 LUCENE-1792: properly default rewrite method to CONSTANT_SCORE_AUTO for wildcard & prefix queries
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@803053 13f79535-47bb-0310-9956-ffa450edef68
2009-08-11 10:14:31 +00:00
Robert Muir 43a5bd6c19 LUCENE-1628: Add Persian Analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802955 13f79535-47bb-0310-9956-ffa450edef68
2009-08-10 23:29:27 +00:00
Michael McCandless 3aed0fd54b LUCENE-1797: don't over-increment positions when creating MultiPhraseQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802893 13f79535-47bb-0310-9956-ffa450edef68
2009-08-10 18:46:39 +00:00
Michael McCandless 08c69caca6 LUCENE-1795: disable leading wildcard by default in new QueryParser, matching 2.4
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802794 13f79535-47bb-0310-9956-ffa450edef68
2009-08-10 13:54:06 +00:00
Michael McCandless 35ea5c1350 LUCENE-1786: make the patternsFileContent static, so we only load it once, not 4 times, when running this test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802767 13f79535-47bb-0310-9956-ffa450edef68
2009-08-10 12:47:52 +00:00
Michael McCandless 2a195d3622 move QueryParserMessages.properties under src/resources
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802722 13f79535-47bb-0310-9956-ffa450edef68
2009-08-10 10:24:35 +00:00
Uwe Schindler 911df49bcb LUCENE-1607: Change some more String.intern() in contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802095 13f79535-47bb-0310-9956-ffa450edef68
2009-08-07 17:19:53 +00:00
Michael McCandless ab0de2448a fix smartcn to be JDK 1.4 only
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@802085 13f79535-47bb-0310-9956-ffa450edef68
2009-08-07 16:48:09 +00:00
Michael McCandless 0b6769aa11 LUCENE-1782: rename original -> standard in the new QueryParser
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801934 13f79535-47bb-0310-9956-ffa450edef68
2009-08-07 10:24:28 +00:00
Mark Robert Miller 835efd0d6c LUCENE-1788: cleanup Highlighter test class
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801856 13f79535-47bb-0310-9956-ffa450edef68
2009-08-07 01:30:45 +00:00
Michael McCandless fc369cdf2f add getDistanceFilter() since you need that in order to sort by the exact distances (previously distanceFilter attr was public)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801471 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 23:57:54 +00:00
Robert Muir 820620f3a7 LUCENE-1758: Update ArabicAnalyzer to light10 stemming, stopwords improvements, lowercase non-arabic text
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801348 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 18:22:22 +00:00
Yonik Seeley 7f1319fdf0 LUCENE-1607: String.intern() cache StringHelper.intern()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801344 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 18:05:06 +00:00
Grant Ingersoll ab276a5ab9 Javadoc updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801219 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 13:17:20 +00:00
Grant Ingersoll c0d86a4e30 Javadoc updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801218 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 13:17:11 +00:00
Michael McCandless 1ab8d5ee6d LUCENE-1779: improve javadocs & remove unused args from unreleased sorting APIs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801207 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 13:08:58 +00:00
Michael McCandless e82cc0079d remove extra print
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801166 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 12:13:02 +00:00
Michael McCandless c471e8e83c LUCENE-1773: fix highlighter test case; add contrib/memory as dependency to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801160 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 12:09:48 +00:00
Mark Robert Miller 34ab27401c LUCENE-1770: Add EnwikiQueryMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@801043 13f79535-47bb-0310-9956-ffa450edef68
2009-08-05 01:39:22 +00:00
Michael McCandless d3fcafc53e LUCENE-1773: add FastVectorHighlighter tasks to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800947 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 20:24:02 +00:00
Mark Robert Miller 5af69469e2 LUCENE-1748: LUCENE-1001 introduced PayloadSpans, but this was a back compat break and caused custom SpanQuery implementations to fail at runtime in a variety of ways. This issue attempts to remedy things by causing a compile time break on custom SpanQuery implementations and removing the PayloadSpans class, with its functionality now moved to Spans.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800925 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 19:10:02 +00:00
Michael McCandless d79a369414 LUCENE-1504: switch to DocIdSet API and FilteredDocIdSet for contrib/spatial
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800892 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 18:01:57 +00:00
Mark Robert Miller a4c2eae1d4 update javadoc for init(TokenStream) return
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800849 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 15:38:00 +00:00
Mark Robert Miller f0e54e31e6 LUCENE-1406 belongs in contrib CHANGES
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800832 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 15:05:34 +00:00
Mark Robert Miller b44ed588ac LUCENE-1685 should be in API changes, not new features
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800821 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 14:33:58 +00:00
Mark Robert Miller 10b41d2dce LUCENE-1685: The position aware SpanScorer has become the default scorer for Highlighting. The SpanScorer implementation has replaced QueryScorer and the old term highlighting QueryScorer has been renamed to QueryTermScorer. Multi-term queries are also now expanded by default. If you were previously rewriting the query for multi-term query highlighting, you should no longer do that (unless you switch to using QueryTermScorer). The SpanScorer API (now QueryScorer) has also been improved to more closely match the API of the previous QueryScorer implementation.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800796 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 13:56:11 +00:00
Mark Robert Miller b155258203 LUCENE-1778: Add support for log.step setting per task type. Previously, if you included a log.step line in the .alg file, it was applied to all tasks. Now, you can include a log.step.AddDoc, or log.step.DeleteDoc (for example) to control logging for just these tasks. If you want to omit logging for any other task, include log.step=-1. The syntax is "log.step." together with the Task's 'short' name (i.e., without the 'Task' part).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800768 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 13:05:03 +00:00
Mark Robert Miller 4054b4ebf3 move the web based xml demo from core changes to contrib changes - also fixes skipping # 34 in features
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800661 13f79535-47bb-0310-9956-ffa450edef68
2009-08-04 02:57:00 +00:00
Michael Busch c91651e4f2 LUCENE-1775: Change contrib tee/sink filters to use new TokenStream API.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800606 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 22:45:27 +00:00
Michael Busch 457c29d31e LUCENE-1775: Change remaining contrib TokenFilters (shingle, prefix-suffix) to use the new TokenStream API.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800195 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 04:33:10 +00:00
Mark Robert Miller 5aaf5b0167 LUCENE-1486: Move ComplexPhraseQueryParser to contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800193 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 04:06:22 +00:00
Michael Busch 343992fcbb LUCENE-1567: New flexible QueryParser framework.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800191 13f79535-47bb-0310-9956-ffa450edef68
2009-08-03 03:38:44 +00:00
Michael McCandless 487278cf92 LUCENE-1522 turn on unit tests that had relied on LUCENE-1448 and LUCENE-1759
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@800120 13f79535-47bb-0310-9956-ffa450edef68
2009-08-02 18:07:43 +00:00
Michael Busch b91f993a0e LUCENE-1460: Additional cleanup in two contrib junit tests.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799973 13f79535-47bb-0310-9956-ffa450edef68
2009-08-02 02:57:30 +00:00
Michael Busch 537aeb24e0 LUCENE-1759: Set final offset correctly in contrib TokenStreams.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799968 13f79535-47bb-0310-9956-ffa450edef68
2009-08-02 02:10:46 +00:00
Michael Busch 1743081b07 LUCENE-1460: Changed TokenStreams/TokenFilters in contrib to use the new TokenStream API.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799953 13f79535-47bb-0310-9956-ffa450edef68
2009-08-01 22:52:32 +00:00
Mark Robert Miller 3e869d9336 remove system.out and unnecessary next() in tokenstream
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799865 13f79535-47bb-0310-9956-ffa450edef68
2009-08-01 14:18:19 +00:00
Michael McCandless 175e8b546d LUCENE-1763: require IndexWriter be passed up front to the MergePolicy
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799818 13f79535-47bb-0310-9956-ffa450edef68
2009-08-01 09:22:25 +00:00
Michael McCandless bbcab117d9 LUCENE-1683: fixed JavaUtilRegexCapabilities (used by RegexQuery) to match entire string not just prefix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799678 13f79535-47bb-0310-9956-ffa450edef68
2009-07-31 18:02:56 +00:00
Michael McCandless 0b0d13dffe LUCENE-1745: allow passing matching flags to the underlying regexp engine
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799667 13f79535-47bb-0310-9956-ffa450edef68
2009-07-31 17:41:04 +00:00
Uwe Schindler f8b2f0122c Use the empty docidset provided by DocIdSet.EMPTY_DOCIDSET
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799582 13f79535-47bb-0310-9956-ffa450edef68
2009-07-31 11:32:37 +00:00
Mark Robert Miller f73a4f4324 LUCENE-1695: Update the Highlighter to use the new TokenStream API. This issue breaks backwards compatibility with some public classes. If you have implemented custom Fragmenters or Scorers, you will need to adjust them to work with the new TokenStream API. Rather than getting passed a Token at a time, you will be given a TokenStream to init your impl with - store the Attributes you are interested in locally and access them on each call to the method that used to pass a new Token. Look at the included updated impls for examples.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799455 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 22:00:47 +00:00
Mark Robert Miller 7ecaa8c990 wikipedia-flush-by-RAM.alg should use content.source
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799379 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:35:10 +00:00
Mark Robert Miller e505413fae wikipedia.alg should use content.source
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@799377 13f79535-47bb-0310-9956-ffa450edef68
2009-07-30 17:34:28 +00:00
Mark Robert Miller afb517e832 LUCENE-1752: Missing highlights when terms were repeated in separate, nested, boolean or disjunction queries.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798976 13f79535-47bb-0310-9956-ffa450edef68
2009-07-29 16:47:05 +00:00
Michael McCandless dbff1fc9b5 LUCENE-1754: just use EMPTY_DOCIDSET.iterator() instead of new EmptyDocIdSetIterator
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798102 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 11:12:36 +00:00
Michael McCandless 094c674c4d LUCENE-1595: don't use SortField.AUTO; deprecate LineDocMaker & EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798096 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 10:15:03 +00:00
Michael McCandless 26a2c427d1 LUCENE-1754: BooleanQuery detects up front if it won't match any docs and returns null from its scorer() instead of NonMatchingScorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@798086 13f79535-47bb-0310-9956-ffa450edef68
2009-07-27 09:50:02 +00:00
Michael McCandless 228888a882 LUCENE-1644: fix highlighter to rewrite MTQ whenever it's not already a SCORING_BOOLEAN_QUERY
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797728 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 09:31:17 +00:00
Michael McCandless be66120dff LUCENE-1644: enable different rewrite methods for MultiTermQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797694 13f79535-47bb-0310-9956-ffa450edef68
2009-07-25 00:03:33 +00:00
Otis Gospodnetic f758b4d259 - Typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797310 13f79535-47bb-0310-9956-ffa450edef68
2009-07-24 02:43:58 +00:00
Simon Willnauer 999f6157c7 LUCENE-1728: Split contrib/analyzers into common and smartcn. Smartcn depends on a large dictionary that causes the analyzers jar to grow up to 3MB compressed size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@797150 13f79535-47bb-0310-9956-ffa450edef68
2009-07-23 17:11:22 +00:00
Mark Robert Miller 3adc61c3ac LUCENE-1755: Fix WriteLineDocTask to output a document if it contains either a title or body (or both).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795792 13f79535-47bb-0310-9956-ffa450edef68
2009-07-20 12:19:06 +00:00
Grant Ingersoll 63402f49c7 Javadoc updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@795553 13f79535-47bb-0310-9956-ffa450edef68
2009-07-19 15:06:57 +00:00
Michael McCandless c79f54975e LUCENE-1505: switch local lucene to use trie's NumericUtils for mapping doubles to strings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794721 13f79535-47bb-0310-9956-ffa450edef68
2009-07-16 15:38:06 +00:00
Mark Robert Miller add56f5e66 LUCENE-1725: Fix the example Sort algorithm - auto is now deprecated and no longer works with Benchmark. Benchmark will now throw an exception if you specify sort fields without a type. The example sort algorithm is now typed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794109 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 22:52:58 +00:00
Mark Robert Miller ea7e4ad344 LUCENE-1688: Deprecate static final String stop word array in StopAnalyzer and replace it with an immutable implementation of CharArraySet.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794078 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 21:39:22 +00:00
Otis Gospodnetic b393e4d0af LUCENE-1491 - EdgeNGramTokenFilter no longer stops on tokens shorter than minimum n-gram size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@794034 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 19:44:52 +00:00
Michael McCandless 65494af827 LUCENE-1272: add MoreLikeThis.set/getBoost
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793973 13f79535-47bb-0310-9956-ffa450edef68
2009-07-14 16:56:16 +00:00
Michael McCandless 91aedd6685 LUCENE-1740: add 'analyzer' command to Lucli, to change analyzer from the default StandardAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@793526 13f79535-47bb-0310-9956-ffa450edef68
2009-07-13 10:06:01 +00:00
Michael McCandless 9cbe5f4ff4 LUCENE-1522: adding new Fast Vector Highlighter contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@792542 13f79535-47bb-0310-9956-ffa450edef68
2009-07-09 13:06:51 +00:00
Michael McCandless 333e77a431 LUCENE-1704: allow specifying the Tidy configuration file when parsing HTML docs with contrib/ant
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791587 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:55:05 +00:00
Mark Robert Miller 28d65ceee7 remove java 1.5 dependency
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791576 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 19:18:19 +00:00
Mark Robert Miller f780f77366 LUCENE-1730: Fix TrecContentSource to use ISO-8859-1 when reading the TREC files, unless a different encoding is specified. Additionally, ContentSource now supports a content.source.encoding parameter in the configuration file.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791528 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 15:56:39 +00:00
Uwe Schindler 705f099238 Convert and cleanup the test files to UTF-8. What is still broken is the incorrect usage of KOI8 and CP1251 encodings. Added svn:eol-style=native to all files again.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791483 13f79535-47bb-0310-9956-ffa450edef68
2009-07-06 13:50:17 +00:00
Mark Robert Miller e04abc52e7 LUCENE-1599: Add clone support for SpanQuerys. SpanRegexQuery counts on this functionality and does not work correctly without it.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791280 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 17:16:16 +00:00
Mark Robert Miller 9789089343 reader should be closed after use
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791246 13f79535-47bb-0310-9956-ffa450edef68
2009-07-05 14:01:14 +00:00
Uwe Schindler fed4bba63d LUCENE-1713: Rename RangeQuery -> TermRangeQuery (part 1)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@791175 13f79535-47bb-0310-9956-ffa450edef68
2009-07-04 20:14:12 +00:00
Simon Willnauer 410afb98bf LUCENE-1719: Add javadoc notes about ICUCollationKeyFilter's advantages over CollationKeyFilter (Steven Rowe via Simon Willnauer)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790262 13f79535-47bb-0310-9956-ffa450edef68
2009-07-01 16:50:47 +00:00
Simon Willnauer 5265dc1bb2 LUCENE-1722: SmartChineseAnalyzer JavaDoc improvements - Replacing Chinese JavaDoc with English version. Robert Muir via Simon Willnauer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@790102 13f79535-47bb-0310-9956-ffa450edef68
2009-07-01 10:32:23 +00:00
Michael McCandless c7f865a4c7 LUCENE-1716: allow control over storage of norms (body norms), info stream and whether docs properties should be indexed as fields
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@788777 13f79535-47bb-0310-9956-ffa450edef68
2009-06-26 17:26:54 +00:00
Uwe Schindler 42dcc00374 Build an index.html on the top-level Javadocs folder (e.g. hudson will use it as entry point)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@788039 13f79535-47bb-0310-9956-ffa450edef68
2009-06-24 14:34:09 +00:00
Michael McCandless 87de0c9688 LUCENE-1466: added chainable CharFilter stage before Tokenizer to allow mapping of characters before tokenization
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787795 13f79535-47bb-0310-9956-ffa450edef68
2009-06-23 19:15:31 +00:00
Michael McCandless f03d77b558 LUCENE-1630: switch from Weight (interface) to QueryWeight (abstract class); mate in/out-of docID order scoring between Collector & Scorer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787772 13f79535-47bb-0310-9956-ffa450edef68
2009-06-23 18:11:42 +00:00
Michael McCandless 5f72065d0f LUCENE-1714: fix WriteLineDocTask to also replace \r, \n (in addition to \t) with space so those chars don't create mal-formed lines
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787750 13f79535-47bb-0310-9956-ffa450edef68
2009-06-23 16:46:17 +00:00
Michael McCandless ec8088654d bulk fix svn:eol-style to native for text files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@787424 13f79535-47bb-0310-9956-ffa450edef68
2009-06-22 22:18:56 +00:00
Erik Hatcher 65131ca7b9 LUCENE-1405: Added support for Ant resource collections in contrib/ant <index> task.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786610 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 18:24:19 +00:00
Michael McCandless 19234f12bd LUCENE-1692: add new contrib analyzer tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786606 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 18:02:12 +00:00
Michael McCandless 2f2cd20828 LUCENE-1692: add tests for Thai & SmartChinese analyzers; fix wrong endOffset bug in ThaiWordFilter; use stop words by default with SmartChineseAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786560 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 15:52:36 +00:00
Uwe Schindler 0b5cbca110 LUCENE-1673: Move TrieRange to core (part 2: removing from contrib/queries)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786474 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 12:16:52 +00:00
Uwe Schindler 7b34ab8f30 LUCENE-1673: Move TrieRange to core (part 1: addition to core)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786470 13f79535-47bb-0310-9956-ffa450edef68
2009-06-19 12:09:52 +00:00
Mark Robert Miller d7d455246f LUCENE-1595: Separate DocMaker into DocMaker and ContentSource.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@786233 13f79535-47bb-0310-9956-ffa450edef68
2009-06-18 19:58:59 +00:00
Michael McCandless 835c405be0 LUCENE-973: add test case for CJKAnalyzer; fix trailing empty string bug
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@785287 13f79535-47bb-0310-9956-ffa450edef68
2009-06-16 16:38:39 +00:00
Michael Busch f2a5f395d8 Fix pom.xml.template of remote contrib to have the correct artifactId
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784674 13f79535-47bb-0310-9956-ffa450edef68
2009-06-15 07:33:57 +00:00
Michael McCandless 5b472b2d62 LUCENE-1539: add DeleteByPercent, FlushReader tasks, and ability to open reader on a labelled commit point
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784587 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 17:07:55 +00:00
Michael McCandless 26b9dea62b LUCENE-1571: fix LatLongDistanceFilter to respect deleted docs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784576 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 15:47:15 +00:00
Michael McCandless 6671c43458 LUCENE-979: remove a few more old benchmark things
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784568 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 14:52:02 +00:00
Michael McCandless e3c4ff8ac3 revert accidental commit
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784562 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 14:06:52 +00:00
Michael McCandless 419b065957 LUCENE-1677: deprecate gcj specializations, and the system properties that let you specify which SegmentReader impl class to use
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784541 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 11:20:55 +00:00
Michael McCandless 392424ec11 LUCENE-1407: move RemoteSearchable out of core into contrib/remote
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784540 13f79535-47bb-0310-9956-ffa450edef68
2009-06-14 11:13:04 +00:00
Karl-Johan Wettin 9e5cb08461 LUCENE-1578: Support for loading unoptimized readers to the constructor of InstantiatedIndex. (Karl Wettin)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784482 13f79535-47bb-0310-9956-ffa450edef68
2009-06-13 22:05:14 +00:00
Karl-Johan Wettin 196428ec39 LUCENE-1578: Support for loading unoptimized readers to the constructor of InstantiatedIndex. (Karl Wettin)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784481 13f79535-47bb-0310-9956-ffa450edef68
2009-06-13 21:54:07 +00:00
Grant Ingersoll 1511ec5e31 LUCENE-1676: in-stream payload support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@784297 13f79535-47bb-0310-9956-ffa450edef68
2009-06-12 22:26:01 +00:00
Michael McCandless 159573f7eb LUCENE-979: remove deprecated non-Task benchmark classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@783794 13f79535-47bb-0310-9956-ffa450edef68
2009-06-11 14:26:32 +00:00
Mark Robert Miller 85f5a9c740 LUCENE-1455: close FileInputStream after parsing
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@783595 13f79535-47bb-0310-9956-ffa450edef68
2009-06-11 01:13:51 +00:00
Patrick O'Leary 5509306e40 Fixed doc id's to bitset id for sorting with a multi segmented index FieldSourceComparator still uses bitset id's
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@783517 13f79535-47bb-0310-9956-ffa450edef68
2009-06-10 20:59:26 +00:00
Uwe Schindler 406454d30b LUCENE-1672: Deprecate all String/File ctors/opens in IndexReader/IndexWriter/IndexSearcher
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@782469 13f79535-47bb-0310-9956-ffa450edef68
2009-06-07 21:52:41 +00:00
Michael McCandless a93e2c41b0 LUCENE-1614: switch next -> nextDoc, skipTo -> advance, doc -> docID in DISI
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@782410 13f79535-47bb-0310-9956-ffa450edef68
2009-06-07 16:58:41 +00:00
Michael McCandless 8a4a3eeed1 LUCENE-1387: fix numDocs -> maxDoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781136 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 19:37:59 +00:00
Otis Gospodnetic 475b5dc08f LUCENE-898 - removed contrib/javascript (unpackaged, unmaintained, most likely not working due to QueryParser changes since 2005)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781057 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 15:18:51 +00:00
Otis Gospodnetic ea155ae99b - Removed empty dir
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781053 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 15:11:03 +00:00
Uwe Schindler cb9090da45 Fix incorrect example: setValue() is not a method of TokenStream, need more specific type. Thanks Mike
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@781029 13f79535-47bb-0310-9956-ffa450edef68
2009-06-02 13:12:22 +00:00
Michael McCandless 3c82bdadb3 fix SSS unit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@780404 13f79535-47bb-0310-9956-ffa450edef68
2009-05-31 10:37:09 +00:00
Otis Gospodnetic 9275233f71 - Removed an ancient deprecated method and fixed misspellings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@780363 13f79535-47bb-0310-9956-ffa450edef68
2009-05-31 04:10:35 +00:00
Michael McCandless f953c8883e use 2 digits precision when reporting rec/s
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@779569 13f79535-47bb-0310-9956-ffa450edef68
2009-05-28 11:31:44 +00:00
Uwe Schindler eba450d10b LUCENE-1591: Update commons-compress to the released version 1.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@777458 13f79535-47bb-0310-9956-ffa450edef68
2009-05-22 10:38:21 +00:00
Grant Ingersoll 27fc5cd348 LUCENE-1550: Added new ngram spell checking distance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@776704 13f79535-47bb-0310-9956-ffa450edef68
2009-05-20 14:07:08 +00:00
Michael McCandless 2dd7d33e86 LUCENE-1643: use reusable RawCollationKey for better performance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@776252 13f79535-47bb-0310-9956-ffa450edef68
2009-05-19 09:50:24 +00:00
Michael McCandless af550281cb LUCENE-1629: remove unnecessary source files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@775468 13f79535-47bb-0310-9956-ffa450edef68
2009-05-16 14:08:38 +00:00
Michael McCandless f81f6796a2 LUCENE-1629: correct ASF source headers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@775444 13f79535-47bb-0310-9956-ffa450edef68
2009-05-16 09:55:34 +00:00
Michael McCandless be0a47b7e3 LUCENE-1629: move CHANGES entry to contrib; add TestArabicAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@774727 13f79535-47bb-0310-9956-ffa450edef68
2009-05-14 10:50:52 +00:00
Michael McCandless e01aad89fe LUCENE-1629: adding new contrib analyzer SmartChineseAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@774718 13f79535-47bb-0310-9956-ffa450edef68
2009-05-14 10:09:22 +00:00
Michael McCandless d4f9809520 LUCENE-1593: optimize core Collectors for in-docID-order scorer case
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@773194 13f79535-47bb-0310-9956-ffa450edef68
2009-05-09 10:36:41 +00:00
Mark Robert Miller 6bcd89a916 fix for LUCENE-1425
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@770993 13f79535-47bb-0310-9956-ffa450edef68
2009-05-02 22:46:26 +00:00
Uwe Schindler 673b90e651 Correct JavaDocs in TrieUtils. This also adds a missing test for equals() and hashcode() of the TrieRangeQueries.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@770602 13f79535-47bb-0310-9956-ffa450edef68
2009-05-01 09:55:01 +00:00