Commit Graph

476 Commits

Author SHA1 Message Date
Grant Ingersoll ed9fdba601 LUCENE-1050 and LUCENE-1138 fixes for lock problem
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@612868 13f79535-47bb-0310-9956-ffa450edef68
2008-01-17 17:00:43 +00:00
Grant Ingersoll 0a54ca1920 LUCENE-1135. Added Javadoc marking the WikipediaTokenizer as experimental
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@612244 13f79535-47bb-0310-9956-ffa450edef68
2008-01-15 22:01:21 +00:00
Michael Busch e5da9db486 Rename README files to uppercase letters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610961 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 22:38:38 +00:00
Michael Busch 49dd305f69 Rename README files to uppercase letters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610960 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 22:37:27 +00:00
Michael McCandless ea951785eb LUCENE-1117: fix EnwikiDocMaker to not hang when the producer thread hits exception
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610738 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 10:12:18 +00:00
Michael McCandless f0d5002066 LUCENE-1117: fix intermittent thread safety issue w/ EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@609080 13f79535-47bb-0310-9956-ffa450edef68
2008-01-05 01:51:53 +00:00
Grant Ingersoll 79e09db401 LUCENE-1103: Internal links should increment as all tokens do, since the first token is valid too
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608989 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 20:36:23 +00:00
Grant Ingersoll b18f6ae959 LUCENE-1103: The link is now incremented by 1, but then the next token in the link is not incremented. This way, the link is not associated with the previous term. Instead it is associated with the next term in the link, which would be the display tokens. If there are no display tokens, then it will take its proper place in the token chain.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608978 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 20:15:22 +00:00
Grant Ingersoll f715fc6031 LUCENE-1103
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608852 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 14:29:15 +00:00
Michael McCandless f12862426a fix typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608523 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 15:20:41 +00:00
Doron Cohen 9e65cd554f LUCENE-1116: contrib/benchmark quality package improvements (MRR, Trec1MQ)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608370 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 07:44:40 +00:00
Michael McCandless 263244312d LUCENE-1115: some small fixes to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608306 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 01:48:18 +00:00
Grant Ingersoll ed893f770c LUCENE-1114: Updated example
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608126 13f79535-47bb-0310-9956-ffa450edef68
2008-01-02 15:30:40 +00:00
Michael Busch 75473edb02 LUCENE-746: Fix error message in AnalyzingQueryParser.getPrefixQuery.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607832 13f79535-47bb-0310-9956-ffa450edef68
2008-01-01 12:49:44 +00:00
Grant Ingersoll 90a735441f LUCENE-1102: EnwikiDocMaker now adds a docid field
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607732 13f79535-47bb-0310-9956-ffa450edef68
2007-12-31 13:07:14 +00:00
Doron Cohen ece8361ab5 LUCENE-749: ChainedFilter behavior fixed when logic of first filter is ANDNOT.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607606 13f79535-47bb-0310-9956-ffa450edef68
2007-12-30 22:47:59 +00:00
Grant Ingersoll bd340a896d
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607291 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:08:26 +00:00
Grant Ingersoll 5c81934465
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607290 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:08:16 +00:00
Grant Ingersoll cb94c6aed4
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607288 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 17:07:33 +00:00
Grant Ingersoll 40d85a7781 Switch to using the EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607281 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 16:29:03 +00:00
Grant Ingersoll ca821526b0 removed bad chars at end of file
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605966 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:33:16 +00:00
Grant Ingersoll 36b1206ad8 Restoring ExtractWikipedia, as it is still a handy class to have around. Splitting the documents is useful for debugging purposes when you know you want to look at a specific document instead of grepping through a really large file.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605959 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:14:24 +00:00
Grant Ingersoll 55d0c3a2f8 LUCENE-1077: refactored to have a common PayloadHelper classes. Also added TokenOffsetPayloadTokenFilter, which encodes the Token offset into the payloads
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@604870 13f79535-47bb-0310-9956-ffa450edef68
2007-12-17 13:55:46 +00:00
Doron Cohen b7e167ac8d LUCENE-1086: DocMakers setup for the "docs.dir" property fails when passing an absolute path.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603856 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 08:58:52 +00:00
Doron Cohen 73f9e7ebc0 fix potential thread-safety issue in contrib/benchmark's TrecDocMaker.
(follow-up to http://svn.apache.org/viewvc?view=rev&revision=602475)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603846 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 07:26:58 +00:00
Michael McCandless 86ca6f86d7 fix intermittent thread-safety failure in contrib/benchmark unit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602475 13f79535-47bb-0310-9956-ffa450edef68
2007-12-08 14:17:07 +00:00
Michael McCandless b0d2b1c90e LUCENE-1044: revert the doSync option to FSDirectory
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602165 13f79535-47bb-0310-9956-ffa450edef68
2007-12-07 17:42:33 +00:00
Grant Ingersoll f9b2e971f2 LUCENE-1077 new sinks and payloads analysis packages
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602081 13f79535-47bb-0310-9956-ffa450edef68
2007-12-07 12:21:49 +00:00
Michael McCandless 6be2c0765c LUCENE-1044: also re-default doSync back to false in contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596874 13f79535-47bb-0310-9956-ffa450edef68
2007-11-20 23:17:44 +00:00
Michael Busch b04703fe8f LUCENE-1055: Remove gdata from trunk.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596501 13f79535-47bb-0310-9956-ffa450edef68
2007-11-20 00:46:27 +00:00
Michael Busch 1abb04580f Disable verbose standard output in MemoryIndexTest by default.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596000 13f79535-47bb-0310-9956-ffa450edef68
2007-11-17 20:19:17 +00:00
Mark Harwood 04ae927f38 Added toString implementation on BooleanFilter.java, provided by Jason Calabrese
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@595996 13f79535-47bb-0310-9956-ffa450edef68
2007-11-17 20:08:06 +00:00
Michael Busch bb37d2bcff LUCENE-1051: Generate separate javadocs for core, demo and contrib classes, as well as an unified view.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@595013 13f79535-47bb-0310-9956-ffa450edef68
2007-11-14 19:16:19 +00:00
Michael McCandless 439ba586fc LUCENE-1044: add doSync option to FSDirectory.getDirectory, defaulting to true, to sync() each file descriptor before close()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@593799 13f79535-47bb-0310-9956-ffa450edef68
2007-11-10 17:51:00 +00:00
Daniel Naber 2f5507bfc9 fix returning unbalanced quotes in describeParams()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@586918 13f79535-47bb-0310-9956-ffa450edef68
2007-10-21 17:26:16 +00:00
Grant Ingersoll a614f0d99a Added some more algorithms for testing things out, implemented basic TREC query driver based on the sample in the javadocs.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@585677 13f79535-47bb-0310-9956-ffa450edef68
2007-10-17 20:36:20 +00:00
Grant Ingersoll b7253a06b7 LUCENE-1027: Added better formatting of doubles, added wikipedia-flush-by-RAM for comparison
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583792 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 12:10:31 +00:00
Grant Ingersoll 9c9ebe5cf4 LUCENE-1027: Added support for doubles to Config, also added copies of standard and micro-standard algorithms that flush by RAM
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583771 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 11:05:40 +00:00
Mark Harwood 21a07ee41e Provided DTDs for core and contrib XML query syntax. The "docs" directory contains detailed documentation generated by DTDdoc from the DTDs. The ant script used to generate these docs is also included but not hooked up to the main build process due to license issues with DTDdoc.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583307 13f79535-47bb-0310-9956-ffa450edef68
2007-10-09 21:45:27 +00:00
Mark Harwood 3872d3bfcc Updated hashcode/equals to test all fields
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583305 13f79535-47bb-0310-9956-ffa450edef68
2007-10-09 21:40:45 +00:00
Chris M. Hostetter 243861715b cleaning up a ton of javadoc warnings from gdata. most of these fixes related to either: clarifying packages for @link tags; changing @link or @see tags that pointed at classes/methods that didn't exist (by picking classes with very similar names that do exist); or removing incomplete stub javadocs (that added no information beyond the signature)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@582380 13f79535-47bb-0310-9956-ffa450edef68
2007-10-05 20:30:59 +00:00
Mark Harwood 62fa7b4b82 Added new DuplicateFilter functionality to filter documents sharing a field value (e.g. primary key/url)
Also includes Junit test and XML Query support

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@581426 13f79535-47bb-0310-9956-ffa450edef68
2007-10-02 22:56:46 +00:00
Grant Ingersoll dce47c6401 LUCENE-1005, apply GMT timeZone to the data formatter so it outputs properly formatted dates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579766 13f79535-47bb-0310-9956-ffa450edef68
2007-09-26 19:15:26 +00:00
Grant Ingersoll bcfad28d69 LUCENE-1005, apply GMT timeZone to the data formatter so it outputs properly formatted dates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579765 13f79535-47bb-0310-9956-ffa450edef68
2007-09-26 19:12:59 +00:00
Michael McCandless a28eb4d978 LUCENE-994: change defaults in IndexWriter to maximize 'out of the box' indexing speed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579360 13f79535-47bb-0310-9956-ffa450edef68
2007-09-25 20:02:07 +00:00
Michael McCandless 511406ecbe remove temporary print for GData unit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577892 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 19:41:42 +00:00
Chris M. Hostetter 3f517bff75 don't just write date to stdout, include date string in failure message
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577873 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 18:52:42 +00:00
Michael McCandless fada31fa7f adding temporary print to figure out why this gdata-server test is failing on build machine
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577861 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 18:38:23 +00:00
Doron Cohen 40f0adb507 LUCENE-941: (leftover - add info in benchmark/CHANGES.txt entry)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576790 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:13:15 +00:00
Doron Cohen 9e51c30349 LUCENE-941: benchmark: infinite loop for alg: {[AddDoc(4000)]: 4} : *
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576786 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:05:06 +00:00
Michael Busch 9c2a036db3 - LUCENE-908: Improvements and simplifications for how the MANIFEST file and the META-INF dir are created.
- LUCENE-935: Various improvements for the maven artifacts. Now the artifacts also include the sources as .jar files. 

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@568766 13f79535-47bb-0310-9956-ffa450edef68
2007-08-22 23:16:48 +00:00
Grant Ingersoll c67fd79a83 LUCENE-981 and LUCENE-980: Added new AnalyzerTask and fixed issue with long strings in Format.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567262 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 12:24:21 +00:00
Grant Ingersoll d1f90c7825 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566454 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:49:32 +00:00
Grant Ingersoll 9192b16643 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566435 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:23:06 +00:00
Grant Ingersoll 477c4e0efe Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566433 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:22:46 +00:00
Michael McCandless d42de32984 LUCENE-969: deprecate Token.termText() & optimize core tokenizers by re-using tokens & TokenStreams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564715 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 18:34:33 +00:00
Grant Ingersoll 82eb074afd LUCENE-974: Removed Author tags from all existing code
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564236 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 15:21:19 +00:00
Michael McCandless d1422ebd6b LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 08:57:26 +00:00
Michael McCandless 2d954694dc LUCENE-966: sizable (~6X faster) speedups to StandardTokenizer by using JFlex instead of JavaCC
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564036 13f79535-47bb-0310-9956-ffa450edef68
2007-08-08 22:26:44 +00:00
Michael McCandless 0fd867732e LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 18:54:43 +00:00
Doron Cohen f3b9c9407a for LUCENE-836 sort reuters files by name (otherwise TestQualityRun can fail on some OSs).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560428 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 23:56:48 +00:00
Doron Cohen 98fa2d898d LUCENE-836: Add support for search quality benchmarking.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:24:52 +00:00
Michael McCandless 02dd452026 LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68
2007-07-25 08:54:58 +00:00
Grant Ingersoll e97d5830ce LUCENE-868: New Term Vector access mechanism. Allows for applications to define how they access term vector information instead of having to pack/unpack the TV info returned by the old way.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558592 13f79535-47bb-0310-9956-ffa450edef68
2007-07-23 03:17:25 +00:00
Michael McCandless 96ea45d193 LUCENE-952: force synchronized access to writer instance variable to fix infinite spin loop in TestGdataIndexer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@554189 13f79535-47bb-0310-9956-ffa450edef68
2007-07-07 12:28:04 +00:00
Michael McCandless cff5767e44 LUCENE-843: speed up IndexWriter performance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553236 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 15:16:38 +00:00
Michael McCandless c03f45a62d LUCENE-848: make sure we use the right version (2.9.0) of XercesJ
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 01:06:07 +00:00
Grant Ingersoll bc7c586468 LUCENE-848. Add Wikipedia benchmarking support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552229 13f79535-47bb-0310-9956-ffa450edef68
2007-07-01 02:19:10 +00:00
Doron Cohen 9ff9bf8142 fix javadoc unknown tag warning.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552111 13f79535-47bb-0310-9956-ffa450edef68
2007-06-30 07:04:27 +00:00
Doron Cohen ba90fb06c1 LUCENE-946: replace text from an online collection (used in
few test cases) with text that is surely 100% free.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552009 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 19:43:19 +00:00
Otis Gospodnetic 71f2c1da8b - LUCENE-906: Elision filter for French.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551744 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 00:36:09 +00:00
Doron Cohen d9b6aa9c0b LUCENE-945: tests failed to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551077 13f79535-47bb-0310-9956-ffa450edef68
2007-06-27 06:49:38 +00:00
Doron Cohen e6c659269a LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat;
logging for addDoc/deleteDoc tasks;

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 18:27:21 +00:00
Michael Busch d955a970b6 LUCENE-622: Add ant target and pom.xml files for building maven artifacts of the Lucene core and the contrib modules.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@547860 13f79535-47bb-0310-9956-ffa450edef68
2007-06-16 04:45:13 +00:00
Chris M. Hostetter 373af70a20 LUCENE-930: Various contrib building improvements to ensure contrib dependencies are met, and test compilation errors fail the build.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@546226 13f79535-47bb-0310-9956-ffa450edef68
2007-06-11 19:29:41 +00:00
Michael Busch df0a188415 LUCENE-931: adding missing license headers to various files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545696 13f79535-47bb-0310-9956-ffa450edef68
2007-06-09 06:09:46 +00:00
Mark Harwood 1f1528e3ca Exposed the MoreLikeThis "minDocFreq" property for use in MoreLikeThisQuery.java and in XML queries
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545333 13f79535-47bb-0310-9956-ffa450edef68
2007-06-07 22:13:37 +00:00
Michael Busch 731fb3a228 LUCENE-908:
- Improves content of MANIFEST file and makes it customizable for the contribs.
- Adds SNOWBALL-LICENSE.txt to META-INF of the snowball jar 
- lucli jar now contains LICENSE.txt and NOTICE.txt.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545010 13f79535-47bb-0310-9956-ffa450edef68
2007-06-07 00:48:32 +00:00
Michael Busch 4355588972 LUCENE-740: Added SNOWBALL-LICENSE.txt to the snowball package and a remark about the license to NOTICE.TXT.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@544703 13f79535-47bb-0310-9956-ffa450edef68
2007-06-06 01:33:11 +00:00
Chris M. Hostetter 1a3de38a0c LUCENE-885: fixing bug with compilation on java 1.4, introduced by earlier commit for this issue
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@543257 13f79535-47bb-0310-9956-ffa450edef68
2007-05-31 20:58:55 +00:00
Daniel Naber 6e98fa61d4 LUCENE-763: LuceneDictionary skips first word in enumeration
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@543220 13f79535-47bb-0310-9956-ffa450edef68
2007-05-31 19:13:36 +00:00
Daniel Naber 42c3d11f07 small cleanup only: imports; no need for try...catch(...) { fail(); } in a test case
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@543216 13f79535-47bb-0310-9956-ffa450edef68
2007-05-31 19:00:30 +00:00
Chris M. Hostetter cb38e0c06c LUCENE-885: include contribs in 'ant test' ... includes fixes for several contribs to make tests run cleanly regardless of what working directory they are run in
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@542769 13f79535-47bb-0310-9956-ffa450edef68
2007-05-30 07:18:29 +00:00
Michael Busch 8caf74c26e LUCENE-895: Disable tests "testSimple", "testWildcard" and "testPrecedence" in TestPrecedenceQueryParser.java to make this test case pass.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@542735 13f79535-47bb-0310-9956-ffa450edef68
2007-05-30 03:00:51 +00:00
Daniel Naber 8e9dd085ab LUCENE-886: cleanup: javadoc improvements; don't print exceptions to stderr but re-throw them; new test case
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@541885 13f79535-47bb-0310-9956-ffa450edef68
2007-05-26 11:06:38 +00:00
Mark Harwood 132c5fd899 Added equals/hashcode implementations to enable caching
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@541816 13f79535-47bb-0310-9956-ffa450edef68
2007-05-25 23:23:07 +00:00
Daniel Naber 3c4dc0484b LUCENE-883: consecutive calls to Spellchecker.indexDictionary() won't insert terms twice anymore. Fixes the test case failure.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@539732 13f79535-47bb-0310-9956-ffa450edef68
2007-05-19 11:27:54 +00:00
Daniel Naber 52acd10f63 LUCENE-882: Spellchecker doesn't need to store ngrams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@539727 13f79535-47bb-0310-9956-ffa450edef68
2007-05-19 11:04:38 +00:00
Grant Ingersoll b5e09521b5 javadoc fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@538783 13f79535-47bb-0310-9956-ffa450edef68
2007-05-17 01:13:40 +00:00
Mark Harwood 336d3e7c58 Added hashcode and equals implementations to enable caching
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@537123 13f79535-47bb-0310-9956-ffa450edef68
2007-05-11 07:14:18 +00:00
Doron Cohen a0437dab03 clean few javadoc warnings that showed up as javadocs are built for a few more contrib paths. (LUCENE-875 related).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@536668 13f79535-47bb-0310-9956-ffa450edef68
2007-05-09 21:49:01 +00:00
Doron Cohen 8ed11daae0 javadoc warnings and errors solved (except for gdata).
build.xml "javadocs" target will now fail if there are
javadoc warnings/errors under jdk1.4.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@535448 13f79535-47bb-0310-9956-ffa450edef68
2007-05-05 05:36:49 +00:00
Doron Cohen bcd57a9af0 implement isPayloadAvailable (just return false, payloads unimplemented here), b/c it doesn't compile without it.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@535361 13f79535-47bb-0310-9956-ffa450edef68
2007-05-04 19:40:14 +00:00
Otis Gospodnetic 5b63c00106 - LUCENE-863: Deprecated StandardBenchmaker in favour of byTask benchmark tasks.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529790 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 22:11:09 +00:00
Mark Harwood 2333dd080b Resolved bug LUCENE-862 reported by Antony Bowesman
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529512 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 07:13:05 +00:00
Doron Cohen 33694c7d86 LUCENE-864: set svn:eol-style=native.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529511 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 07:11:04 +00:00
Mark Harwood 1877f9e3dc Fixed Junit failure due to recent change in QueryParser default behaviour. Needed to explicitly require old style of RangeQuery implementation rather than new default of RangeFilters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529417 13f79535-47bb-0310-9956-ffa450edef68
2007-04-16 22:04:07 +00:00
Doron Cohen 934a56e55f contrib/benchmark: better error handling and javadocs around "exhaustive" doc making.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@528617 13f79535-47bb-0310-9956-ffa450edef68
2007-04-13 19:30:03 +00:00
Otis Gospodnetic 5650a7bf92 - find . -name \*java -type f | xargs perl -pi -e 's/(\r\n|\n|\r)/\n/g' \
&& find . -name \*java -type f | xargs svn propset svn:eol-style native

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524969 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 23:37:14 +00:00
Otis Gospodnetic 3a1652a95b - Added the missing ASL 2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524958 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:47:10 +00:00
Otis Gospodnetic af77bc03c1 - Fixed javadoc typos.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524952 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:27:56 +00:00
Otis Gospodnetic e158736d91 - Import cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524950 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:27:05 +00:00
Otis Gospodnetic ec0493167c - Don't use the deprecated FSDirectory.getDirectory(indexDir, boolean)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524949 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:26:44 +00:00
Otis Gospodnetic 023ffe2c29 - Cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524914 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 20:27:34 +00:00
Doron Cohen 031f50c4e7 LUCENE-849: configurable HTML Parser; external classes; exhaustive doc maker - '*';
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@522569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-26 16:46:33 +00:00
Doron Cohen 346b484473 Documentation updates following LUCENE-837.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521830 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 17:56:09 +00:00
Grant Ingersoll 251c2b328a LUCENE-837:
Fixed issues related to Doron's comments on Mar. 22, 07.

Put in tests for ***FieldSelectorTask. Moved the byte field handling from ReutersDocMaker to BasicDocMaker.

Added in tests for new Task

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521717 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 13:13:46 +00:00
Grant Ingersoll cfff81d1dc LUCENE-837:
fix copyright

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521698 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 12:26:08 +00:00
Grant Ingersoll f4fffb3491 LUCENE-837:
Added optional bytes field to store on the Document.  Enabled ReutersDocMaker w/ the ability to store byte data in a field.  If the param is set (see the javadocs) it will store the contents of the body as a UTF-8 byte array.

Then, the SearchTravRetLoadFieldSelectorTask (whew) can take in parameters specifying what fields to load (others are ignored by default)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 03:48:12 +00:00
Doron Cohen 34b560603a LUCENE-840: benchmarking code correctness tests were added.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521526 13f79535-47bb-0310-9956-ffa450edef68
2007-03-22 23:13:48 +00:00
Grant Ingersoll e09189a830 LUCENE-837 applied
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@520890 13f79535-47bb-0310-9956-ffa450edef68
2007-03-21 13:52:34 +00:00
Doron Cohen 9a31a8828f correct names of dynamically loaded classes (analyzers, doc/query makers)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@519567 13f79535-47bb-0310-9956-ffa450edef68
2007-03-18 06:51:06 +00:00
Doron Cohen 977e729685 LUCENE-829: close done readers in contrib/benchmark.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@518822 13f79535-47bb-0310-9956-ffa450edef68
2007-03-16 01:16:56 +00:00
Michael Busch b9edb2867f Make contrib/db and contrib/memory compile with changes from LUCENE-755
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@518792 13f79535-47bb-0310-9956-ffa450edef68
2007-03-15 22:35:52 +00:00
Doron Cohen ab763bbdae contrib.benhmark.byTask javadocs corrections
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@517566 13f79535-47bb-0310-9956-ffa450edef68
2007-03-13 06:54:03 +00:00
Otis Gospodnetic 1a48e218d6 - Committing forgotten classes for LUCENE-759
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@517477 13f79535-47bb-0310-9956-ffa450edef68
2007-03-13 00:30:13 +00:00
Grant Ingersoll f4490ac789 LUCENE 791, INFRA-1173:
Migrated references from Jakarta Lucene Wiki to Lucene Java Wiki

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@516586 13f79535-47bb-0310-9956-ffa450edef68
2007-03-09 22:29:00 +00:00
Erik Hatcher b972b13174 convert interfaces to public visibility that already should have had it
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@515431 13f79535-47bb-0310-9956-ffa450edef68
2007-03-07 02:39:24 +00:00
Otis Gospodnetic d7e203c8b4 - LUCENE-786: Javadoc improvements from Karl Wettin and an extra searcher.close() from me
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513878 13f79535-47bb-0310-9956-ffa450edef68
2007-03-02 18:28:55 +00:00
Otis Gospodnetic 534be1599d - LUCENE-759: Two n-gram producing TokenFilters (using them for the spellchecker in SOLR-81)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513876 13f79535-47bb-0310-9956-ffa450edef68
2007-03-02 18:19:53 +00:00
Otis Gospodnetic 6636d88def - 2-char indentation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513866 13f79535-47bb-0310-9956-ffa450edef68
2007-03-02 17:54:27 +00:00
Otis Gospodnetic 7b570fc8b2 - LUCENE-759: Made the tokenizer capable of creating n-grams of varying sizes - from min to max characters per n-gram. Patch from Adam Hiatt.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513344 13f79535-47bb-0310-9956-ffa450edef68
2007-03-01 14:22:57 +00:00
Grant Ingersoll ee12ced8e7 LUCENE-801: build lucene core and demo first, change classpath to use the build classes instead of the jar
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@507260 13f79535-47bb-0310-9956-ffa450edef68
2007-02-13 22:17:24 +00:00
Grant Ingersoll 13eaccfd56 Lucene 790
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@506441 13f79535-47bb-0310-9956-ffa450edef68
2007-02-12 13:32:20 +00:00
Grant Ingersoll bb66099414 Applied 788 and 790 from Doron Cohen. Ran both the micro-standard and the task runs and results look reasonable.
Thanks, Doron

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@506093 13f79535-47bb-0310-9956-ffa450edef68
2007-02-11 18:59:22 +00:00
Wolfgang Hoschek 0695150949 javadoc cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@504229 13f79535-47bb-0310-9956-ffa450edef68
2007-02-06 18:09:19 +00:00
Mark Harwood 0149b5f82f Changes to improve server-side speed and thread safety by caching reusable, compiled stylesheets.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@503500 13f79535-47bb-0310-9956-ffa450edef68
2007-02-04 22:51:28 +00:00
Mark Harwood 387d95417b Added missing Apache license comments - thanks for the reminder, Otis.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@500417 13f79535-47bb-0310-9956-ffa450edef68
2007-01-26 23:51:38 +00:00
Mark Harwood c02aed3b5e Added QueryTemplateManager.java to aid construction of XML queries from form input by using XSL templates. A Junit test provides examples of use. This approach offers a convenient way of externalizing and changing how user input is turned into Lucene queries. Database applications often adopt similar practices by externalizing SQL in template files that can be easily changed/optimized by a DBA.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@500053 13f79535-47bb-0310-9956-ffa450edef68
2007-01-25 23:31:02 +00:00
Mark Harwood 9251a63e01 Fixed bug when BooleanFilter class is used in conjunction with cached filters - the anding of filters' bitsets was changing the original rather than taking a copy of the contents. Changed to use BitSet.clone before performing any destructive operation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@499592 13f79535-47bb-0310-9956-ffa450edef68
2007-01-24 22:19:05 +00:00
Otis Gospodnetic 8cafdd9b64 - Removed isEmpty() Java 6 method, so Andrzej can compile Luke
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@496628 13f79535-47bb-0310-9956-ffa450edef68
2007-01-16 09:07:01 +00:00
Otis Gospodnetic 74e68c9287 - Removed extra ;
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@496283 13f79535-47bb-0310-9956-ffa450edef68
2007-01-15 11:45:04 +00:00
Grant Ingersoll db238e787f Lucene 675: Initial commit of Doron Cohen's byTask benchmarking contribution. Thanks Doron!
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@495834 13f79535-47bb-0310-9956-ffa450edef68
2007-01-13 04:08:23 +00:00
Mark Harwood 18ece160e3 Fix to apply any custom boost setting when the Query is rewritten
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@492830 13f79535-47bb-0310-9956-ffa450edef68
2007-01-05 00:18:32 +00:00
Mark Harwood 998908257b Added new "CachedFilter" feature to XML syntax enabling any queries or filters to be cached for better repeat performance. Added JUnit test and example XML file. Also fixed ClassCastException in DOMUtils which occurred when getAttributeWithInheritance reached the root of a document without finding the required attribute.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@492823 13f79535-47bb-0310-9956-ffa450edef68
2007-01-05 00:05:17 +00:00
Grant Ingersoll 8cffe45c2a replaced string concatenation with chained appends
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@492383 13f79535-47bb-0310-9956-ffa450edef68
2007-01-04 02:17:58 +00:00
Daniel Naber 67399fdd8d ignore case in the names of commands (Search == search etc.)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@490513 13f79535-47bb-0310-9956-ffa450edef68
2006-12-27 15:22:22 +00:00
Otis Gospodnetic 2cf113a022 - Javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@489847 13f79535-47bb-0310-9956-ffa450edef68
2006-12-23 03:36:34 +00:00
Otis Gospodnetic 8b7f6e4ef6 - LUCENE-759: New n-gram-capable tokenizers and their unit tests.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@489802 13f79535-47bb-0310-9956-ffa450edef68
2006-12-22 23:43:17 +00:00
Otis Gospodnetic 17315a9be7 - LUCENE-741: command-line utility for modifying or removing field norms
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@489213 13f79535-47bb-0310-9956-ffa450edef68
2006-12-20 22:32:13 +00:00
Michael McCandless 7c4b667f71 LUCENE-748: add javadoc around semantics of Exception during IndexWriter.close()
LUCENE-129: fix finalizers to always call super.finalize
LUCENE-301: add new constructors IndexWriter({String,File,Directory}, Analyzer) that create index if it's not already there, else append
LUCENE-701: found two cases of "open IndexWriter for create while reader is open, on Windows" that I didn't properly fix; added new test cases
LUCENE-702: corrected some small javadoc issues


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@488640 13f79535-47bb-0310-9956-ffa450edef68
2006-12-19 11:31:27 +00:00
Simon Willnauer 550b50fdd8 redefined compile-core and compile-test to exclude db4o-dependent classes if db4o jar is not present
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@487315 13f79535-47bb-0310-9956-ffa450edef68
2006-12-14 19:26:27 +00:00
Simon Willnauer fb0c30d061 defined default task to use inherited ant tasks in sub folders
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@487313 13f79535-47bb-0310-9956-ffa450edef68
2006-12-14 19:25:43 +00:00
Simon Willnauer ad3ccde3a0 fixed some path errors due to the new structure
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486811 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 19:36:49 +00:00
Simon Willnauer 4374e2e88a created internal "subproject" structure to reuse the common-build.xml ant tasks
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486798 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 19:04:23 +00:00
Simon Willnauer 5aabf7a0b9 Fixed compile error in GDataIndexWriterTest as IndexWriter no longer has a commitTimeout setter.
Fixed misspelled path in gom-build.xml


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486631 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:55:43 +00:00
Simon Willnauer d775297874 modifications added to changes.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486629 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:46:13 +00:00
Simon Willnauer 4711af4ef3 Removed unnecessary UML metadata
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486628 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:41:32 +00:00
Simon Willnauer 3bd4a16711 Initial Revision GData Object Model
GOM is a GData object representation for rendering, parsing and manipulating GData feeds (RSS/ATOM)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486627 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:39:54 +00:00
Simon Willnauer 9cc3210dd0 deleted hivemind related test from core test folder.
This source has been copied/moved to ./src/hivemind/test

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486625 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:37:16 +00:00