Commit Graph

348 Commits

Author SHA1 Message Date
Mark Harwood e4ffe0679e Added optimization to ignore duplicate terms
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@413732 13f79535-47bb-0310-9956-ffa450edef68
2006-06-12 19:38:36 +00:00
Otis Gospodnetic f0bfc02d4d - perl -pi -e 's/\t/ /g'
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@413584 13f79535-47bb-0310-9956-ffa450edef68
2006-06-12 05:46:16 +00:00
Grant Ingersoll d4cc19d03e Implementation of Issue 545. Introduction of new Fieldable interface (extracted from Field) which is now used where Field used to be used. Field now implements Fieldable.
Added new method to IndexReader and derived classes for working with the new FieldSelector interface.  The FieldSelector interface defines a mechanism for doing lazy loading, amongst other things.  Implemented Lazy loading of fields in the FieldsReader class.  Added test case to TestFieldsReader.java

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@413201 13f79535-47bb-0310-9956-ffa450edef68
2006-06-10 01:23:22 +00:00
Daniel Naber 2b9effb894 deprecate the analysis.nl.WordlistLoader class because it's not robust (fails silently) and use analysis.WordlistLoader instead
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@413180 13f79535-47bb-0310-9956-ffa450edef68
2006-06-09 22:15:47 +00:00
Doug Cutting 631585bf5a LUCENE-578. First version of gdata-server. Contributed by Simon Willnauer.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@412574 13f79535-47bb-0310-9956-ffa450edef68
2006-06-07 21:56:25 +00:00
Chris M. Hostetter 2123b476df LUCENE-503: New ThaiAnalyzer and ThaiWordFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@411863 13f79535-47bb-0310-9956-ffa450edef68
2006-06-05 17:29:01 +00:00
Yonik Seeley 3afc9192b9 SpellChecker min score is incorrectly changed by suggestSimilar: LUCENE-575
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@410971 13f79535-47bb-0310-9956-ffa450edef68
2006-06-01 21:36:13 +00:00
Mark Harwood 1ef156da34 git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@409994 13f79535-47bb-0310-9956-ffa450edef68 2006-05-28 20:24:42 +00:00
Doug Cutting 39852b3b59 Add package documentation for a few contrib modules.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@409694 13f79535-47bb-0310-9956-ffa450edef68
2006-05-26 16:29:02 +00:00
Chris M. Hostetter d2b63d328b LUCENE-577: initial checkin of SweetSpotSimilarity
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@409472 13f79535-47bb-0310-9956-ffa450edef68
2006-05-25 21:21:29 +00:00
Chris M. Hostetter d96e03e914 Grrr... unbelievably bad compilation failure typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@409429 13f79535-47bb-0310-9956-ffa450edef68
2006-05-25 18:54:50 +00:00
Chris M. Hostetter dd3d979905 LUCENE-496 - Command line tool for modifying the field norms of an existing index
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@409428 13f79535-47bb-0310-9956-ffa450edef68
2006-05-25 18:49:04 +00:00
Daniel Naber cb15338e26 small javadoc cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@400190 13f79535-47bb-0310-9956-ffa450edef68
2006-05-05 21:59:07 +00:00
Daniel Naber 18f330f6a6 add missing license header
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@398112 13f79535-47bb-0310-9956-ffa450edef68
2006-04-29 09:54:16 +00:00
Andreas Vajda 9a451c8011 applied fix for issue 536 to bdb code as well
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@394218 13f79535-47bb-0310-9956-ffa450edef68
2006-04-14 22:38:22 +00:00
Andreas Vajda 597b011b27 resolved JIRA issue 536
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@394214 13f79535-47bb-0310-9956-ffa450edef68
2006-04-14 22:23:20 +00:00
Yonik Seeley d490983a24 (LUCENE-539) Fix for deprecations in contrib/surround
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@391763 13f79535-47bb-0310-9956-ffa450edef68
2006-04-05 19:55:30 +00:00
Mark Harwood 286f4f5f07 Simplified QueryTermExtractor.java to make use of Query.extractTerms method (especially now that all the SpanQuery classes implement this correctly).
Added tests in Junit test to demonstrate new support for other Queries (FilteredQuery) now that we use the standard extractTerms feature of Query objects.
Also deprecated highlighter getBestFragments method that hard-coded choice of fieldname and introduced new variation that takes an additional fieldName argument

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@389888 13f79535-47bb-0310-9956-ffa450edef68
2006-03-29 21:01:40 +00:00
Mark Harwood 0fa0e25de8 Added BooleanFilter with JUnit test, added BooleanFilter support to XMLQueryParser, Changed TermsFilterBuilder to only build a filter for a single choice of field - multiples can be combined using new BooleanFilter clauses. Added missing MatchAllDocsQueryBuilder source.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@389037 13f79535-47bb-0310-9956-ffa450edef68
2006-03-27 07:23:37 +00:00
Mark Harwood fa0516f6ee Added support for MatchAllDocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@388296 13f79535-47bb-0310-9956-ffa450edef68
2006-03-23 23:30:33 +00:00
Wolfgang Hoschek c954a1c51a removed methods deprecated in lucene-1.9
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@387672 13f79535-47bb-0310-9956-ffa450edef68
2006-03-22 00:02:26 +00:00
Yonik Seeley f65db2faf2 deprecation fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@387581 13f79535-47bb-0310-9956-ffa450edef68
2006-03-21 16:56:11 +00:00
Yonik Seeley 2d7dbf8131 more deprecation fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@387575 13f79535-47bb-0310-9956-ffa450edef68
2006-03-21 16:26:10 +00:00
Yonik Seeley 3666a166a1 remove deprecations
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@387550 13f79535-47bb-0310-9956-ffa450edef68
2006-03-21 15:36:32 +00:00
Mark Harwood 9078b805d3 Fixed bug which ignored user's choice of QueryParser. Thanks to Terry Steichen for spotting it.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@385762 13f79535-47bb-0310-9956-ffa450edef68
2006-03-14 08:49:43 +00:00
Erik Hatcher 04ca37fb49 Modified lastmodified field to use DateTools instead of the deprecated DateField, and added an antlib.xml file to make Ant integration easier
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@384336 13f79535-47bb-0310-9956-ffa450edef68
2006-03-08 21:32:04 +00:00
Mark Harwood 7dab9545be Added more helper methods to DOMUtils and cleaned up error handling in builders - thanks Chris.
Moved FilteredQueryBuilder.java to "builders" package with all other builders

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@382170 13f79535-47bb-0310-9956-ffa450edef68
2006-03-01 21:49:17 +00:00
Daniel Naber 85f9706b6b trivial import cleanup to avoid compiler warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@382156 13f79535-47bb-0310-9956-ffa450edef68
2006-03-01 20:59:54 +00:00
Daniel Naber e5de36e653 clean up imports to avoid warnings; fix typo in usage message
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@382154 13f79535-47bb-0310-9956-ffa450edef68
2006-03-01 20:54:59 +00:00
Daniel Naber 4189b329f8 stop using deprecated methods; make more fields private
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@382151 13f79535-47bb-0310-9956-ffa450edef68
2006-03-01 20:47:32 +00:00
Erik Hatcher 297d1429eb Many javadoc additions, and adding ASL to each file
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@381108 13f79535-47bb-0310-9956-ffa450edef68
2006-02-26 15:38:30 +00:00
Erik Hatcher 51d7a31725 add regexp LICENSE file
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@381098 13f79535-47bb-0310-9956-ffa450edef68
2006-02-26 14:48:17 +00:00
Erik Hatcher 31a7b1987a modified build file to pick up the proper manifest file. more work is needed to have the manifest pick up the right Lucene JAR name though
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@381095 13f79535-47bb-0310-9956-ffa450edef68
2006-02-26 14:24:54 +00:00
Otis Gospodnetic b9fdf6d4e5 - Package description from Paul Elschot
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@380989 13f79535-47bb-0310-9956-ffa450edef68
2006-02-25 20:43:32 +00:00
Mark Harwood 87768c51c6 Added XML-query-parser module for new extensible query parser that handles queries expressed as XML. Is dependent on new "queries" contrib module.
Added "queries" contrib module for various new query/filter classes. This area is also intended to consolidate existing query classes so have moved a copy of MoreLikeThis into here. Probably need to remove "similarity" module as a result, if no one objects.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@380874 13f79535-47bb-0310-9956-ffa450edef68
2006-02-25 00:39:18 +00:00
Mark Harwood 77625ced9e Removed use of deprecated APIs ready for 2.0 Lucene release
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@379586 13f79535-47bb-0310-9956-ffa450edef68
2006-02-21 21:05:18 +00:00
Doug Cutting f9f3161f57 Minor javadoc improvements.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@379189 13f79535-47bb-0310-9956-ffa450edef68
2006-02-20 18:11:02 +00:00
Otis Gospodnetic 93657ca63a - Limit to an optional field; reindented (die tabs, die), ASF License 2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@376393 13f79535-47bb-0310-9956-ffa450edef68
2006-02-09 19:17:14 +00:00
Daniel Naber 9db7eaa951 Don't suggest a word as similar for itself. Thanks to Michael Harhen for the fix.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@373185 13f79535-47bb-0310-9956-ffa450edef68
2006-01-28 17:22:04 +00:00
Wolfgang Hoschek 75ee463870 cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@372696 13f79535-47bb-0310-9956-ffa450edef68
2006-01-27 01:32:41 +00:00
Otis Gospodnetic ce3d7a331e - ASL 2.0 instead of 1.1
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@366561 13f79535-47bb-0310-9956-ffa450edef68
2006-01-06 20:23:24 +00:00
Andreas Vajda c312ac9d52 - integrated Aaron Donovan's port to Berkeley DB Java Edition
- reorganized directory tree to accomodate multiple implementations


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@366041 13f79535-47bb-0310-9956-ffa450edef68
2006-01-05 01:04:17 +00:00
Erik Hatcher ca6b9ee2e5 add .equals/.hashCode
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@359537 13f79535-47bb-0310-9956-ffa450edef68
2005-12-28 14:25:34 +00:00
Erik Hatcher 396229f18d Remove (Span)RegexQuery from core. Add completely refactored version to contrib/regex allowing pluggable regex implementations. contrib/regex is still a work in progress, and documentation is forthcoming
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@359526 13f79535-47bb-0310-9956-ffa450edef68
2005-12-28 13:57:40 +00:00
Erik Hatcher aa07efc076 minor HTML encoding issue in javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@359523 13f79535-47bb-0310-9956-ffa450edef68
2005-12-28 13:53:09 +00:00
Daniel Naber 9ad46499cf print usage if called with wrong number of arguments; be more verbose
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@358655 13f79535-47bb-0310-9956-ffa450edef68
2005-12-22 23:39:49 +00:00
Otis Gospodnetic 1fe7d48e25 - Handy command line tool for merging Lucene indices
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@358156 13f79535-47bb-0310-9956-ffa450edef68
2005-12-21 00:05:18 +00:00
Andreas Vajda f962bd4dae made build.xml accept db.version property argument
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@355282 13f79535-47bb-0310-9956-ffa450edef68
2005-12-08 23:49:39 +00:00
Wolfgang Hoschek 63ce2859a7 javadoc cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@354001 13f79535-47bb-0310-9956-ffa450edef68
2005-12-05 07:11:02 +00:00
Wolfgang Hoschek 4ff370aaa3 javadoc cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@354000 13f79535-47bb-0310-9956-ffa450edef68
2005-12-05 07:10:24 +00:00
Erik Hatcher 7a3103fac0 Applied patched for LUCENE-324, correcting token offsets returned by ChineseTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@353930 13f79535-47bb-0310-9956-ffa450edef68
2005-12-04 23:07:42 +00:00
Wolfgang Hoschek ebe44ace90 git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351896 13f79535-47bb-0310-9956-ffa450edef68 2005-12-03 05:44:16 +00:00
Wolfgang Hoschek a155416b4d tentative add: Various fulltext analysis utilities avoiding redundant code in several
* classes.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351895 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 05:42:59 +00:00
Wolfgang Hoschek 860733f32e indentation fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351893 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 05:27:50 +00:00
Wolfgang Hoschek f42d7a1e9b indentation fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351892 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 05:26:16 +00:00
Wolfgang Hoschek e28541354d some performance improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351891 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 05:24:31 +00:00
Wolfgang Hoschek efa4d10fa1 some performance improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351890 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 05:22:08 +00:00
Wolfgang Hoschek 317f3f77e9 just a SVN test - please ignore
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351887 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 04:32:53 +00:00
Wolfgang Hoschek ec49618824 just a SVN test - please ignore
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351886 13f79535-47bb-0310-9956-ffa450edef68
2005-12-03 04:31:52 +00:00
Mark Harwood 2da431d139 Added support for field-specific highlighting which respects the fieldnames found in queries. Pass a field name to the QueryScorer in order to only select those field's query terms for highlighting. Updated JUnit tests too.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@351504 13f79535-47bb-0310-9956-ffa450edef68
2005-12-01 22:18:33 +00:00
Andreas Vajda fa24e67d6d - changed build to use version 4.3.29 of the C Berkeley DB Java API
- updated copyright notice year ranges to include 2005


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@350095 13f79535-47bb-0310-9956-ffa450edef68
2005-12-01 01:43:07 +00:00
Erik Hatcher a4c714d9d5 no longer needed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@348059 13f79535-47bb-0310-9956-ffa450edef68
2005-11-22 01:40:32 +00:00
Daniel Naber bfde3257dc moving the non-language specific analyzers to core, this is where most users will probably expect them
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@347991 13f79535-47bb-0310-9956-ffa450edef68
2005-11-21 21:35:24 +00:00
Daniel Naber 31c271c84b import cleanup to avoid Eclipse warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@344474 13f79535-47bb-0310-9956-ffa450edef68
2005-11-15 23:21:44 +00:00
Daniel Naber 4fd74d2554 Rename *Test files which are no unit tests to make "ant test" work. See LUCENE-465.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@344471 13f79535-47bb-0310-9956-ffa450edef68
2005-11-15 23:18:22 +00:00
Daniel Naber 7e079d2950 avoid compiler/Eclipse warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@344468 13f79535-47bb-0310-9956-ffa450edef68
2005-11-15 23:15:53 +00:00
Erik Hatcher 1687a79648 Add NullFragmenter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@332696 13f79535-47bb-0310-9956-ffa450edef68
2005-11-12 01:08:01 +00:00
Erik Hatcher 32fb624ebc LUCENE-437 - Add position increment pass through on SnowballFilter tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@290943 13f79535-47bb-0310-9956-ffa450edef68
2005-09-22 13:38:58 +00:00
Mark Harwood c00b260ecf Added fix to QueryScorer - if a query has multiple WeightedTerms with different weights for the same term the highest weight is used for scoring that term (previously selected last weight in list)
SimpleHTMLEncoder now encodes characters outside of ASCII range as character entities as per suggestion here: http://issues.apache.org/bugzilla/show_bug.cgi?id=36333


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@279088 13f79535-47bb-0310-9956-ffa450edef68
2005-09-06 20:19:50 +00:00
Mark Harwood f6b07dabe8 Changed TokenGroup.isDistinct after problems reported with JapaneseAnalyzer (no gaps between tokens)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@279078 13f79535-47bb-0310-9956-ffa450edef68
2005-09-06 19:38:12 +00:00
Daniel Naber dd5c74112f a query parser by Ronnie Kolehmainen that also sends PrefixQuerys etc. through the analyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@231523 13f79535-47bb-0310-9956-ffa450edef68
2005-08-11 21:28:58 +00:00
Erik Hatcher 6e9c0b6f45 remove unused file
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@209184 13f79535-47bb-0310-9956-ffa450edef68
2005-07-05 02:29:34 +00:00
Erik Hatcher 9d70229506 #34331 - Add Paul Elschot's Surround query language parser
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@209183 13f79535-47bb-0310-9956-ffa450edef68
2005-07-05 02:29:03 +00:00
Mark Harwood 0062898ada Updated version of MemoryIndex - reliant on new Term.createTerm() method in Trunk
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@208688 13f79535-47bb-0310-9956-ffa450edef68
2005-06-30 21:40:05 +00:00
Mark Harwood 7894a0c0c0 Added (simple) SpanQuery support - matches any terms declared in Spans - proper impl should check for distances
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@208673 13f79535-47bb-0310-9956-ffa450edef68
2005-06-30 20:09:58 +00:00
Daniel Naber 6da2ef197d update to Apache Software License 2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@189623 13f79535-47bb-0310-9956-ffa450edef68
2005-06-08 19:48:19 +00:00
Mark Harwood 07cee0b287 git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@179637 13f79535-47bb-0310-9956-ffa450edef68 2005-06-02 20:27:06 +00:00
Daniel Naber a3f99b1f43 small javadoc improvements
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178893 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 22:58:17 +00:00
Daniel Naber 27597a5c71 small javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178892 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 22:40:36 +00:00
Daniel Naber fe52019614 javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178880 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 19:21:49 +00:00
Daniel Naber 06bb3230ff make this non-public, as it's not documented properly and has a confusing name
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178878 13f79535-47bb-0310-9956-ffa450edef68
2005-05-28 19:05:54 +00:00
Daniel Naber bd2345d856 small javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178839 13f79535-47bb-0310-9956-ffa450edef68
2005-05-27 23:07:00 +00:00
Daniel Naber 952cfd54be small javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178833 13f79535-47bb-0310-9956-ffa450edef68
2005-05-27 23:02:07 +00:00
Daniel Naber 816f370c0e small javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178832 13f79535-47bb-0310-9956-ffa450edef68
2005-05-27 23:00:49 +00:00
Daniel Naber 9d2d4ead75 use entity for umlaut
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@178239 13f79535-47bb-0310-9956-ffa450edef68
2005-05-24 18:44:20 +00:00
Daniel Naber 69b1f490df javadoc: fix typo and use HTML entity so generated HTML is correct
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169681 13f79535-47bb-0310-9956-ffa450edef68
2005-05-11 19:33:12 +00:00
David Spencer 1d68f8c88d Logic ignored stop words were in a early version of this code but it was taken out in the belief that there
was no point in explicitly looking for them as the scoring algorithm would effictively ignore them.

I did a test and indexed 700 pages on a corporate web site and then ran the MoreLikeThis code on them
and 1/2 of the docs had stop words identified as interesting.

So - I added code in to ignore stop words, but make it backward compatible so that by default this code
is not used.




git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169512 13f79535-47bb-0310-9956-ffa450edef68
2005-05-10 19:29:56 +00:00
David Spencer 81087e8bb6 Touchup javadoc.
Make retrieveInterestingTerms only return the top terms, not all terms.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169511 13f79535-47bb-0310-9956-ffa450edef68
2005-05-10 19:10:28 +00:00
David Spencer 175cf8a9fd [1] Added comments to retrieveTerms() to document the return value.
[2] Added convenience routine retrieveInterestingTerms() which makes it easier to get at the "interesting words" in a document.




git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169508 13f79535-47bb-0310-9956-ffa450edef68
2005-05-10 18:49:43 +00:00
David Spencer c696188668 don't print out summary unless it's present
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169366 13f79535-47bb-0310-9956-ffa450edef68
2005-05-09 21:37:50 +00:00
David Spencer 7f8bf69311 cleanup deprecated warnings so it compiles cleanly w/ the current lucene code, lucene-core-1.9-rc1-dev.jar
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169365 13f79535-47bb-0310-9956-ffa450edef68
2005-05-09 21:36:22 +00:00
David Spencer c680751f63 test checkin of README, just to verify my permissions
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@169349 13f79535-47bb-0310-9956-ffa450edef68
2005-05-09 19:25:40 +00:00
Erik Hatcher 78dbe41805 prefix all JARs with lucene-
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168986 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 23:43:54 +00:00
Erik Hatcher e8c90fb050 rename WordNet to wordnet, required intermediate move due to OS case insensitivity
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168480 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:32:00 +00:00
Erik Hatcher 5fd5169a6f temporary move to lowercase WordNet
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168479 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:31:11 +00:00
Erik Hatcher dd472377dd adjust code to fix compile/javadoc errors on JDK 1.5
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168478 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:26:08 +00:00
Erik Hatcher a12dac37b4 adjust project names for consistency
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168476 13f79535-47bb-0310-9956-ffa450edef68
2005-05-06 00:24:18 +00:00
Mark Harwood 12a91b4395 Fixed bug where docs larger than maxDocBytesToAnalyze would cause last fragment to be sized as remainder of doc (which could be huge).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168452 13f79535-47bb-0310-9956-ffa450edef68
2005-05-05 22:40:45 +00:00
Erik Hatcher 8f70c09b9b Wolfgang is non-stop with the additions. Easy enough to paste in, so here it is with a Collection-based TokenStream
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@168029 13f79535-47bb-0310-9956-ffa450edef68
2005-05-04 00:24:17 +00:00
Erik Hatcher f94ebdb41e applied norm caching path from Wolfgang
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@167958 13f79535-47bb-0310-9956-ffa450edef68
2005-05-03 19:01:58 +00:00
Erik Hatcher 2a37a3e820 Apply wolfgangs fix to the tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@167835 13f79535-47bb-0310-9956-ffa450edef68
2005-05-03 00:33:27 +00:00
Andreas Vajda 572633f8c4 - reworked store I/O to use new IndexInput and IndexOutput classes
- reworked store I/O to avoid upstream buffering giving better txn control
 - added DbStoreTest unit test adapted from StoreTest


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165674 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 20:06:00 +00:00
Erik Hatcher 8f9e2a15e7 Enhancement #34585 - high-performance in-memory index contributed by Wolfgang Hoschek
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165606 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 09:04:07 +00:00
Erik Hatcher c3847f26ea overhaul of build system to facilitate building and packaging of contrib sub-projects. some work still to be done, but core Lucene build still working fine
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165566 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 00:11:11 +00:00
Erik Hatcher 21431112fe adjust license headers to be ASL 2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165565 13f79535-47bb-0310-9956-ffa450edef68
2005-05-02 00:08:04 +00:00
Erik Hatcher df52ba1ec6 standardizing source layout
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165562 13f79535-47bb-0310-9956-ffa450edef68
2005-05-01 23:57:31 +00:00
Erik Hatcher f56d33e2d4 Add ASL header - sorry for the oversight on this.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165559 13f79535-47bb-0310-9956-ffa450edef68
2005-05-01 22:57:39 +00:00
Andreas Vajda 77130721ce - replaced db.jar with db-4.3.27.jar
- downloading db-4.3.27.jar from http://downloads.osafoundation.org/db


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@165319 13f79535-47bb-0310-9956-ffa450edef68
2005-04-29 17:33:27 +00:00
Erik Hatcher d9042b00d8 move PrecedenceQueryParser to contrib/misc until the kinks are worked out
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164964 13f79535-47bb-0310-9956-ffa450edef68
2005-04-27 09:32:33 +00:00
Erik Hatcher 7b8f43ec7c move misc over to official contrib area
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164963 13f79535-47bb-0310-9956-ffa450edef68
2005-04-27 09:16:31 +00:00
Erik Hatcher 5c9ccb2442 Add Lucene's test classes to contrib test classpath, some tests rely on the utility methods in the core tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164937 13f79535-47bb-0310-9956-ffa450edef68
2005-04-27 01:52:17 +00:00
Erik Hatcher 790dfc1490 javadoc fixup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164742 13f79535-47bb-0310-9956-ffa450edef68
2005-04-26 04:41:54 +00:00
Erik Hatcher 26aab23901 add ignores
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164698 13f79535-47bb-0310-9956-ffa450edef68
2005-04-26 00:30:08 +00:00
Erik Hatcher d650384d4b add GreekAnalyzer, contributed by Panagiotis Astithas (past@ebs.gr)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@164686 13f79535-47bb-0310-9956-ffa450edef68
2005-04-25 23:23:37 +00:00
Erik Hatcher 2fe0a80189 rename misspelled indexDictionnary method
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160988 13f79535-47bb-0310-9956-ffa450edef68
2005-04-12 00:11:33 +00:00
Erik Hatcher ec522fc1c8 Fixed deprecation issues, adjusted test cases to use assertEquals better, reformatted style
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160987 13f79535-47bb-0310-9956-ffa450edef68
2005-04-11 23:48:02 +00:00
Erik Hatcher 0c99b57cc1 Fixed issue with ctor parameter being ignored
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160984 13f79535-47bb-0310-9956-ffa450edef68
2005-04-11 23:43:57 +00:00
Erik Hatcher e88213a2d9 refactor build to use common contrib build system
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160983 13f79535-47bb-0310-9956-ffa450edef68
2005-04-11 23:42:26 +00:00
Daniel Naber c4f1ee70a9 use lowercase method names; remove javadoc that's inherited anyway
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160070 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 17:50:38 +00:00
Daniel Naber 04ea892fbe import cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160065 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 17:45:36 +00:00
Erik Hatcher 6f5f23444c enhanced test contributed by Sven. Encoding tweaks
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160034 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 12:25:16 +00:00
Erik Hatcher 0ff227ff0a switch dotted u character to use unicode value reference
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160023 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 10:16:37 +00:00
Erik Hatcher 4e580e221e Issue deprecation warnings when building test cases. Fixed deprecation warnings on TestKeywordAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160012 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 09:10:59 +00:00
Erik Hatcher 3be3e8ab5d Add accent character normalizer filter contributed by Sven Duzont. Also created simple test case.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@160011 13f79535-47bb-0310-9956-ffa450edef68
2005-04-04 09:10:05 +00:00
Daniel Naber 69380a1815 adapt to use of jline
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@158852 13f79535-47bb-0310-9956-ffa450edef68
2005-03-23 23:49:08 +00:00
Daniel Naber 84db65bfde adapt to use of jline
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@158851 13f79535-47bb-0310-9956-ffa450edef68
2005-03-23 23:42:23 +00:00
Daniel Naber 5a59714f4a use jline instead of java-readline. jline can be added to SVN thanks to its BSD license. plus some small cleanup.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@158850 13f79535-47bb-0310-9956-ffa450edef68
2005-03-23 23:40:50 +00:00
Erik Hatcher b54f22aaab Fix max word length issue (though don't know why anyone would limit long words in a more-like-this query).
Also, modified to take into account all values of a field rather than just the first one.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@158076 13f79535-47bb-0310-9956-ffa450edef68
2005-03-18 15:03:00 +00:00
Erik Hatcher 1cb674fc04 regenerated from latest Snowball CVS
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@157834 13f79535-47bb-0310-9956-ffa450edef68
2005-03-17 00:41:31 +00:00
Erik Hatcher 9621a0985c added title to documentation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@156593 13f79535-47bb-0310-9956-ffa450edef68
2005-03-09 01:59:14 +00:00
Erik Hatcher 9824226394 Contribution of slick Swing models to enable on-the-fly searching of
tables and lists.  Created by Jonathan Simon.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@156591 13f79535-47bb-0310-9956-ffa450edef68
2005-03-09 01:52:13 +00:00
Mark Harwood fdf05bd088 Fixed missing fieldname in API
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@154447 13f79535-47bb-0310-9956-ffa450edef68
2005-02-19 19:51:04 +00:00
Daniel Naber 05d0335dcd offer additional methods that take analyzer + text instead of tokenstream; fix some unused imports and variables
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@154444 13f79535-47bb-0310-9956-ffa450edef68
2005-02-19 19:08:52 +00:00
Daniel Naber 335c1567d8 remove empty "@return" tags so javadoc stops complaining; small whitespace cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@154083 13f79535-47bb-0310-9956-ffa450edef68
2005-02-16 20:37:57 +00:00
Daniel Naber 45864d1c9c clean up imports, remove unused variables and remove the declaration of an Exception that was never thrown
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@154080 13f79535-47bb-0310-9956-ffa450edef68
2005-02-16 20:20:15 +00:00
Erik Hatcher 28e712b2ee update docs to account for TLP migration
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@153802 13f79535-47bb-0310-9956-ffa450edef68
2005-02-14 16:48:47 +00:00
Erik Hatcher 373e613341 remove unnecessary import
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@153430 13f79535-47bb-0310-9956-ffa450edef68
2005-02-11 18:11:37 +00:00
Erik Hatcher 2ac412f6b7 move similarity and spellchecker to new contrib area
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@153429 13f79535-47bb-0310-9956-ffa450edef68
2005-02-11 18:11:05 +00:00
Erik Hatcher f375d09898 add customizable buffer size
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@153412 13f79535-47bb-0310-9956-ffa450edef68
2005-02-11 15:30:14 +00:00
Erik Hatcher cd0d0937e1 split keyword tokenizer out of KeywordAnalyzer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@153398 13f79535-47bb-0310-9956-ffa450edef68
2005-02-11 13:50:37 +00:00
Erik Hatcher 826fef7f6a KeywordAnalyzer contribution - adapted from _Lucene in Action_ code
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@152921 13f79535-47bb-0310-9956-ffa450edef68
2005-02-08 19:13:05 +00:00
Mark Harwood 276ab079f5 Added Nicko Cadell's Encoder contribution
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151622 13f79535-47bb-0310-9956-ffa450edef68
2005-02-06 21:31:54 +00:00
Mark Harwood b1555b0bbf Test SVN Commit
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151615 13f79535-47bb-0310-9956-ffa450edef68
2005-02-06 18:12:57 +00:00
Erik Hatcher 0ee1728e6d move two more projects over to contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151590 13f79535-47bb-0310-9956-ffa450edef68
2005-02-06 15:35:12 +00:00
Erik Hatcher 646f0f0434 Switch ant project to conventional src/java directory structure
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151589 13f79535-47bb-0310-9956-ffa450edef68
2005-02-06 14:51:59 +00:00
Erik Hatcher 767312d611 add convenient TODO file to keep track of sandbox -> contrib move
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151469 13f79535-47bb-0310-9956-ffa450edef68
2005-02-05 02:23:19 +00:00
Erik Hatcher 10904d02f6 fix most deprecation warnings
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151468 13f79535-47bb-0310-9956-ffa450edef68
2005-02-05 02:21:39 +00:00
Erik Hatcher 0955eef89f move parts of the sandbox over to contrib area
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@151459 13f79535-47bb-0310-9956-ffa450edef68
2005-02-05 01:25:43 +00:00