Commit Graph

435 Commits

Author SHA1 Message Date
Mark Harwood 62fa7b4b82 Added new DuplicateFilter functionality to filter documents sharing a field value (e.g. primary key/url)
Also includes Junit test and XML Query support

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@581426 13f79535-47bb-0310-9956-ffa450edef68
2007-10-02 22:56:46 +00:00
Grant Ingersoll dce47c6401 LUCENE-1005, apply GMT timeZone to the data formatter so it outputs properly formatted dates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579766 13f79535-47bb-0310-9956-ffa450edef68
2007-09-26 19:15:26 +00:00
Grant Ingersoll bcfad28d69 LUCENE-1005, apply GMT timeZone to the data formatter so it outputs properly formatted dates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579765 13f79535-47bb-0310-9956-ffa450edef68
2007-09-26 19:12:59 +00:00
Michael McCandless a28eb4d978 LUCENE-994: change defaults in IndexWriter to maximize 'out of the box' indexing speed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579360 13f79535-47bb-0310-9956-ffa450edef68
2007-09-25 20:02:07 +00:00
Michael McCandless 511406ecbe remove temporary print for GData unit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577892 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 19:41:42 +00:00
Chris M. Hostetter 3f517bff75 don't just write date to stdout, include date string in failure message
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577873 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 18:52:42 +00:00
Michael McCandless fada31fa7f adding temporary print to figure out why this gdata-server test is failing on build machine
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577861 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 18:38:23 +00:00
Doron Cohen 40f0adb507 LUCENE-941: (leftover - add info in benchmark/CHANGES.txt entry)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576790 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:13:15 +00:00
Doron Cohen 9e51c30349 LUCENE-941: benchmark: infinite loop for alg: {[AddDoc(4000)]: 4} : *
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576786 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:05:06 +00:00
Michael Busch 9c2a036db3 - LUCENE-908: Improvements and simplifications for how the MANIFEST file and the META-INF dir are created.
- LUCENE-935: Various improvements for the maven artifacts. Now the artifacts also include the sources as .jar files. 

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@568766 13f79535-47bb-0310-9956-ffa450edef68
2007-08-22 23:16:48 +00:00
Grant Ingersoll c67fd79a83 LUCENE-981 and LUCENE-980: Added new AnalyzerTask and fixed issue with long strings in Format.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567262 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 12:24:21 +00:00
Grant Ingersoll d1f90c7825 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566454 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:49:32 +00:00
Grant Ingersoll 9192b16643 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566435 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:23:06 +00:00
Grant Ingersoll 477c4e0efe Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566433 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:22:46 +00:00
Michael McCandless d42de32984 LUCENE-969: deprecate Token.termText() & optimize core tokenizers by re-using tokens & TokenStreams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564715 13f79535-47bb-0310-9956-ffa450edef68
2007-08-10 18:34:33 +00:00
Grant Ingersoll 82eb074afd LUCENE-974: Removed Author tags from all existing code
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564236 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 15:21:19 +00:00
Michael McCandless d1422ebd6b LUCENE-971: extract wikipedia documents as a doc maker directly from XML file without using intermediate one-file-per-document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564151 13f79535-47bb-0310-9956-ffa450edef68
2007-08-09 08:57:26 +00:00
Michael McCandless 2d954694dc LUCENE-966: sizable (~6X faster) speedups to StandardTokenizer by using JFlex instead of JavaCC
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@564036 13f79535-47bb-0310-9956-ffa450edef68
2007-08-08 22:26:44 +00:00
Michael McCandless 0fd867732e LUCENE-967: add ReadTokensTask to allow for benchmarking just tokenization
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@561908 13f79535-47bb-0310-9956-ffa450edef68
2007-08-01 18:54:43 +00:00
Doron Cohen f3b9c9407a for LUCENE-836 sort reuters files by name (otherwise TestQualityRun can fail on some OSs).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560428 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 23:56:48 +00:00
Doron Cohen 98fa2d898d LUCENE-836: Add support for search quality benchmarking.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@560372 13f79535-47bb-0310-9956-ffa450edef68
2007-07-27 20:24:52 +00:00
Michael McCandless 02dd452026 LUCENE-947: add creation of & indexing from 'one document per line' text files to minimize IO overhead of creating documents when running tests
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@559366 13f79535-47bb-0310-9956-ffa450edef68
2007-07-25 08:54:58 +00:00
Grant Ingersoll e97d5830ce LUCENE-868: New Term Vector access mechanism. Allows for applications to define how they access term vector information instead of having to pack/unpack the TV info returned by the old way.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@558592 13f79535-47bb-0310-9956-ffa450edef68
2007-07-23 03:17:25 +00:00
Michael McCandless 96ea45d193 LUCENE-952: force synchronized access to writer instance variable to fix infinite spin loop in TestGdataIndexer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@554189 13f79535-47bb-0310-9956-ffa450edef68
2007-07-07 12:28:04 +00:00
Michael McCandless cff5767e44 LUCENE-843: speed up IndexWriter performance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@553236 13f79535-47bb-0310-9956-ffa450edef68
2007-07-04 15:16:38 +00:00
Michael McCandless c03f45a62d LUCENE-848: make sure we use the right version (2.9.0) of XercesJ
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552378 13f79535-47bb-0310-9956-ffa450edef68
2007-07-02 01:06:07 +00:00
Grant Ingersoll bc7c586468 LUCENE-848. Add Wikipedia benchmarking support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552229 13f79535-47bb-0310-9956-ffa450edef68
2007-07-01 02:19:10 +00:00
Doron Cohen 9ff9bf8142 fix javadoc unknown tag warning.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552111 13f79535-47bb-0310-9956-ffa450edef68
2007-06-30 07:04:27 +00:00
Doron Cohen ba90fb06c1 LUCENE-946: replace text from an online collection (used in
few test cases) with text that is surely 100% free.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@552009 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 19:43:19 +00:00
Otis Gospodnetic 71f2c1da8b - LUCENE-906: Elision filter for French.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551744 13f79535-47bb-0310-9956-ffa450edef68
2007-06-29 00:36:09 +00:00
Doron Cohen d9b6aa9c0b LUCENE-945: tests failed to find data dirs. Added sys-prop benchmark.work.dir and cfg-prop work.dir.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@551077 13f79535-47bb-0310-9956-ffa450edef68
2007-06-27 06:49:38 +00:00
Doron Cohen e6c659269a LUCENE-940: Multi-threaded issues fixed: SimpleDateFormat;
logging for addDoc/deleteDoc tasks;


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@550905 13f79535-47bb-0310-9956-ffa450edef68
2007-06-26 18:27:21 +00:00
Michael Busch d955a970b6 LUCENE-622: Add ant target and pom.xml files for building maven artifacts of the Lucene core and the contrib modules.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@547860 13f79535-47bb-0310-9956-ffa450edef68
2007-06-16 04:45:13 +00:00
Chris M. Hostetter 373af70a20 LUCENE-930: Various contrib building improvements to ensure contrib dependencies are met, and test compilation errors fail the build.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@546226 13f79535-47bb-0310-9956-ffa450edef68
2007-06-11 19:29:41 +00:00
Michael Busch df0a188415 LUCENE-931: adding missing license headers to various files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545696 13f79535-47bb-0310-9956-ffa450edef68
2007-06-09 06:09:46 +00:00
Mark Harwood 1f1528e3ca Exposed the MoreLikeThis "minDocFreq" property for use in MoreLikeThisQuery.java and in XML queries
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545333 13f79535-47bb-0310-9956-ffa450edef68
2007-06-07 22:13:37 +00:00
Michael Busch 731fb3a228 LUCENE-908:
- Improves content of MANIFEST file and makes it customizable for the contribs.
- Adds SNOWBALL-LICENSE.txt to META-INF of the snowball jar 
- lucli jar now contains LICENSE.txt and NOTICE.txt.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@545010 13f79535-47bb-0310-9956-ffa450edef68
2007-06-07 00:48:32 +00:00
Michael Busch 4355588972 LUCENE-740: Added SNOWBALL-LICENSE.txt to the snowball package and a remark about the license to NOTICE.TXT.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@544703 13f79535-47bb-0310-9956-ffa450edef68
2007-06-06 01:33:11 +00:00
Chris M. Hostetter 1a3de38a0c LUCENE-885: fixing bug with compilation on java 1.4, introduce by earlier commit for this issue
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@543257 13f79535-47bb-0310-9956-ffa450edef68
2007-05-31 20:58:55 +00:00
Daniel Naber 6e98fa61d4 LUCENE-763: LuceneDictionary skips first word in enumeration
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@543220 13f79535-47bb-0310-9956-ffa450edef68
2007-05-31 19:13:36 +00:00
Daniel Naber 42c3d11f07 small cleanup only: imports; no need for try...catch(...) { fail(); } in a test case
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@543216 13f79535-47bb-0310-9956-ffa450edef68
2007-05-31 19:00:30 +00:00
Chris M. Hostetter cb38e0c06c LUCENE-885: include contribs in 'ant test' ... includes fixes for several contribs to make tests run cleanly regardless of what working directory they are run in
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@542769 13f79535-47bb-0310-9956-ffa450edef68
2007-05-30 07:18:29 +00:00
Michael Busch 8caf74c26e LUCENE-895: Disable tests "testSimple", "testWildcard" and "testPrecedence" in TestPrecedenceQueryParser.java to make this test case pass.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@542735 13f79535-47bb-0310-9956-ffa450edef68
2007-05-30 03:00:51 +00:00
Daniel Naber 8e9dd085ab LUCENE-886: cleanup: javadoc improvements; don't print exceptions to stderr but re-throw them; new test case
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@541885 13f79535-47bb-0310-9956-ffa450edef68
2007-05-26 11:06:38 +00:00
Mark Harwood 132c5fd899 Added equals/hashcode implementations to enable caching
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@541816 13f79535-47bb-0310-9956-ffa450edef68
2007-05-25 23:23:07 +00:00
Daniel Naber 3c4dc0484b LUCENE-883: consecutive calls to Spellchecker.indexDictionary() won't insert terms twice anymore. Fixes the test case failure.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@539732 13f79535-47bb-0310-9956-ffa450edef68
2007-05-19 11:27:54 +00:00
Daniel Naber 52acd10f63 LUCENE-882: Spellchecker doesn't need to store ngrams
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@539727 13f79535-47bb-0310-9956-ffa450edef68
2007-05-19 11:04:38 +00:00
Grant Ingersoll b5e09521b5 javadoc fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@538783 13f79535-47bb-0310-9956-ffa450edef68
2007-05-17 01:13:40 +00:00
Mark Harwood 336d3e7c58 Added hashcode and equals implementations to enable caching
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@537123 13f79535-47bb-0310-9956-ffa450edef68
2007-05-11 07:14:18 +00:00
Doron Cohen a0437dab03 clean few javadoc warnings that showed up as javadocs are built for a few more contrib paths. (LUCENE-875 related).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@536668 13f79535-47bb-0310-9956-ffa450edef68
2007-05-09 21:49:01 +00:00
Doron Cohen 8ed11daae0 javadoc warnings and errors solved (except for gdata).
build.xml "javadocs" target will now fail if there are 
javadoc warnings/errors under jdk1.4.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@535448 13f79535-47bb-0310-9956-ffa450edef68
2007-05-05 05:36:49 +00:00
Doron Cohen bcd57a9af0 implement isPayloadAvailable (just return false, payloads unimplemented here), b/c it desn't compile without it.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@535361 13f79535-47bb-0310-9956-ffa450edef68
2007-05-04 19:40:14 +00:00
Otis Gospodnetic 5b63c00106 - LUCENE-863: Deprecated StandardBenchmaker in favour of byTask benchmark tasks.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529790 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 22:11:09 +00:00
Mark Harwood 2333dd080b Resolved bug LUCENE-862 reported by Antony Bowesman
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529512 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 07:13:05 +00:00
Doron Cohen 33694c7d86 LUCENE-864: set svn:eol-style=native.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529511 13f79535-47bb-0310-9956-ffa450edef68
2007-04-17 07:11:04 +00:00
Mark Harwood 1877f9e3dc Fixed Junit failure due to recent change in QueryParser default behaviour. Needed to explicitly require old style of RangeQuery implementation rather than new default of RangeFilters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@529417 13f79535-47bb-0310-9956-ffa450edef68
2007-04-16 22:04:07 +00:00
Doron Cohen 934a56e55f contrib/benchmark: better error handling and javadocs around "exhaustive" doc making.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@528617 13f79535-47bb-0310-9956-ffa450edef68
2007-04-13 19:30:03 +00:00
Otis Gospodnetic 5650a7bf92 - find . -name \*java -type f | xargs perl -pi -e 's/(\r\n|\n|\r)/\n/g' \
&& find . -name \*java -type f | xargs svn propset svn:eol-style native



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524969 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 23:37:14 +00:00
Otis Gospodnetic 3a1652a95b - Added the missing ASL 2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524958 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:47:10 +00:00
Otis Gospodnetic af77bc03c1 - Fixed javadoc typos.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524952 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:27:56 +00:00
Otis Gospodnetic e158736d91 - Import cleanup
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524950 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:27:05 +00:00
Otis Gospodnetic ec0493167c - Don't use the deprecated FSDirectory.getDirectory(indexDir, boolean)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524949 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 22:26:44 +00:00
Otis Gospodnetic 023ffe2c29 - Cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@524914 13f79535-47bb-0310-9956-ffa450edef68
2007-04-02 20:27:34 +00:00
Doron Cohen 031f50c4e7 LUCENE-849: configurable HTML Parser; external classes; exhaustive doc maker - '*';
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@522569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-26 16:46:33 +00:00
Doron Cohen 346b484473 Documentation updates following LUCENE-837.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521830 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 17:56:09 +00:00
Grant Ingersoll 251c2b328a LUCENE-837:
Fixed issues related to Doron's comments on Mar. 22, 07.

PUt in tests for ***FieldSelectorTask.  Moved the byte field handling from ReutersDocMaker to BasicDocMaker.

Added in tests for new Task

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521717 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 13:13:46 +00:00
Grant Ingersoll cfff81d1dc LUCENE-837:
fix copyright

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521698 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 12:26:08 +00:00
Grant Ingersoll f4fffb3491 LUCENE-837:
Added optional bytes field to store on the Document.  Enabled ReutersDocMaker w/ the ability to store byte data in a field.  If the param is set (see the javadocs) it will store the contents of the body as a UTF-8 byte array.

Then, the SearchTravRetLoadFieldSelectorTask (whew) can take in parameters specifying what fields to load (others are ignored by default)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521569 13f79535-47bb-0310-9956-ffa450edef68
2007-03-23 03:48:12 +00:00
Doron Cohen 34b560603a LUCENE-840: benchmarking code correctness tests were added.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@521526 13f79535-47bb-0310-9956-ffa450edef68
2007-03-22 23:13:48 +00:00
Grant Ingersoll e09189a830 LUCENE-837 applied
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@520890 13f79535-47bb-0310-9956-ffa450edef68
2007-03-21 13:52:34 +00:00
Doron Cohen 9a31a8828f correct names of dynamically loaded classes (analyzers, doc/query makers)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@519567 13f79535-47bb-0310-9956-ffa450edef68
2007-03-18 06:51:06 +00:00
Doron Cohen 977e729685 LUCENE-829: close done readers in contrib/benchmark.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@518822 13f79535-47bb-0310-9956-ffa450edef68
2007-03-16 01:16:56 +00:00
Michael Busch b9edb2867f Make contrib/db and contrib/memory compile with changes from LUCENE-755
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@518792 13f79535-47bb-0310-9956-ffa450edef68
2007-03-15 22:35:52 +00:00
Doron Cohen ab763bbdae contrib.benhmark.byTask javadocs corrections
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@517566 13f79535-47bb-0310-9956-ffa450edef68
2007-03-13 06:54:03 +00:00
Otis Gospodnetic 1a48e218d6 - Committing forgotten classes for LUCENE-759
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@517477 13f79535-47bb-0310-9956-ffa450edef68
2007-03-13 00:30:13 +00:00
Grant Ingersoll f4490ac789 LUCENE 791, INFRA-1173:
Migrated references from Jakarta Lucene Wiki to Lucene Java Wiki

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@516586 13f79535-47bb-0310-9956-ffa450edef68
2007-03-09 22:29:00 +00:00
Erik Hatcher b972b13174 convert interfaces to public visibility that already should have had it
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@515431 13f79535-47bb-0310-9956-ffa450edef68
2007-03-07 02:39:24 +00:00
Otis Gospodnetic d7e203c8b4 - LUCENE-786: Javadoc improvements from Karl Wettin and an extra searcher.close() from me
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513878 13f79535-47bb-0310-9956-ffa450edef68
2007-03-02 18:28:55 +00:00
Otis Gospodnetic 534be1599d - LUCENE-759: Two n-gram producting TokenFilters (using them for the spellchecker in SOLR-81)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513876 13f79535-47bb-0310-9956-ffa450edef68
2007-03-02 18:19:53 +00:00
Otis Gospodnetic 6636d88def - 2-char indentation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513866 13f79535-47bb-0310-9956-ffa450edef68
2007-03-02 17:54:27 +00:00
Otis Gospodnetic 7b570fc8b2 - LUCENE-759: Made the tokenizer capable of creating n-grams of a varying sizes - from min to max characters per n-gram. Patch from Adam Hiatt.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@513344 13f79535-47bb-0310-9956-ffa450edef68
2007-03-01 14:22:57 +00:00
Grant Ingersoll ee12ced8e7 LUCENE-801: build lucene core and demo first, change classpath to use the build classes instead of the jar
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@507260 13f79535-47bb-0310-9956-ffa450edef68
2007-02-13 22:17:24 +00:00
Grant Ingersoll 13eaccfd56 Lucene 790
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@506441 13f79535-47bb-0310-9956-ffa450edef68
2007-02-12 13:32:20 +00:00
Grant Ingersoll bb66099414 Applied 788 and 790 from Doron Cohen. Ran both the micro-standard and the task runs and results look reasonable.
Thanks, Doron

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@506093 13f79535-47bb-0310-9956-ffa450edef68
2007-02-11 18:59:22 +00:00
Wolfgang Hoschek 0695150949 javadoc cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@504229 13f79535-47bb-0310-9956-ffa450edef68
2007-02-06 18:09:19 +00:00
Mark Harwood 0149b5f82f Changes to improve server-side speed and thread safety by caching reusable, compiled stylesheets.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@503500 13f79535-47bb-0310-9956-ffa450edef68
2007-02-04 22:51:28 +00:00
Mark Harwood 387d95417b Added missing Apache license comments - thanks for the reminder, Otis.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@500417 13f79535-47bb-0310-9956-ffa450edef68
2007-01-26 23:51:38 +00:00
Mark Harwood c02aed3b5e Added QueryTemplateManager.java to aid construction of XML queries from form input by using XSL templates. A Junit test provides examples of use. This approach offers a convenient way of externalizing and changing how user input is turned into Lucene queries. Database applications often adopt similar practices by externalizing SQL in template files that can be easily changed/optimized by a DBA.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@500053 13f79535-47bb-0310-9956-ffa450edef68
2007-01-25 23:31:02 +00:00
Mark Harwood 9251a63e01 Fixed bug when BooleanFilter class is used in conjunction with cached filters - the anding of filters' bitsets was changing the original rather than taking a copy of the contents. Changed to use BitSet.clone before performing any destructive operation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@499592 13f79535-47bb-0310-9956-ffa450edef68
2007-01-24 22:19:05 +00:00
Otis Gospodnetic 8cafdd9b64 - Removed isEmpty() Java 6 method, so Andrzej can compile Luke
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@496628 13f79535-47bb-0310-9956-ffa450edef68
2007-01-16 09:07:01 +00:00
Otis Gospodnetic 74e68c9287 - Removed extra ;
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@496283 13f79535-47bb-0310-9956-ffa450edef68
2007-01-15 11:45:04 +00:00
Grant Ingersoll db238e787f Lucene 675: Initial commit of Doron Cohen's byTask benchmarking contribution. Thanks Doron!
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@495834 13f79535-47bb-0310-9956-ffa450edef68
2007-01-13 04:08:23 +00:00
Mark Harwood 18ece160e3 Fix to apply any custom boost setting when the Query is rewritten
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@492830 13f79535-47bb-0310-9956-ffa450edef68
2007-01-05 00:18:32 +00:00
Mark Harwood 998908257b Added new "CachedFilter" feature to XML syntax enabling any queries or filters to be cached for better repeat performance. Added JUnit test and example XML file. Also fixed ClassCastException in DOMUtils which occured when getAttributeWithInheritance reached the root of a document without finding the required attribute.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@492823 13f79535-47bb-0310-9956-ffa450edef68
2007-01-05 00:05:17 +00:00
Grant Ingersoll 8cffe45c2a replaced string concatenation with chained appends
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@492383 13f79535-47bb-0310-9956-ffa450edef68
2007-01-04 02:17:58 +00:00
Daniel Naber 67399fdd8d ingore case in the names of commands (Search == search etc.)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@490513 13f79535-47bb-0310-9956-ffa450edef68
2006-12-27 15:22:22 +00:00
Otis Gospodnetic 2cf113a022 - Javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@489847 13f79535-47bb-0310-9956-ffa450edef68
2006-12-23 03:36:34 +00:00
Otis Gospodnetic 8b7f6e4ef6 - LUCENE-759: New n-gram-capable tokenizers and their unit tests.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@489802 13f79535-47bb-0310-9956-ffa450edef68
2006-12-22 23:43:17 +00:00
Otis Gospodnetic 17315a9be7 - LUCENE-741: command-line utility for modifying or removing field norms
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@489213 13f79535-47bb-0310-9956-ffa450edef68
2006-12-20 22:32:13 +00:00
Michael McCandless 7c4b667f71 LUCENE-748: add javadoc around semantics of Exception during IndexWriter.close()
LUCENE-129: fix finalizers to always call super.finalize
LUCENE-301: add new constructors IndexWriter({String,File,Directory}, Analyzer) that create index if it's not already there, else append
LUCENE-701: found two cases of "open IndexWriter for create while reader is open, on Windows" that I didn't properly fix; added new test cases
LUCENE-702: corrected some small javadoc issues


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@488640 13f79535-47bb-0310-9956-ffa450edef68
2006-12-19 11:31:27 +00:00
Simon Willnauer 550b50fdd8 redefined compile-core and compile-test to exclude db4o dependend classes if db4o jar is not present
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@487315 13f79535-47bb-0310-9956-ffa450edef68
2006-12-14 19:26:27 +00:00
Simon Willnauer fb0c30d061 defined default task to use inherit ant tasks in sub folders
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@487313 13f79535-47bb-0310-9956-ffa450edef68
2006-12-14 19:25:43 +00:00
Simon Willnauer ad3ccde3a0 fixed some path errors due to the new structure
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486811 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 19:36:49 +00:00
Simon Willnauer 4374e2e88a created internal "subproject" structure to reuse the common-build.xml ant tasks
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486798 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 19:04:23 +00:00
Simon Willnauer 5aabf7a0b9 Fixed compile error in GDataIndexWriterTest as IndexWriter has not commitTimeout setter anymore.
Fixed misspelled path in gom-build.xml


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486631 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:55:43 +00:00
Simon Willnauer d775297874 modifications added to changes.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486629 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:46:13 +00:00
Simon Willnauer 4711af4ef3 Removed unneccessary UML metadata
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486628 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:41:32 +00:00
Simon Willnauer 3bd4a16711 Initial Revision GData Object Model
GOM is a GData object representation for render, parse and manipulate GData Feeds (RSS/ATOM)

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486627 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:39:54 +00:00
Simon Willnauer 9cc3210dd0 deleted hivemind related test from core test folder.
This source has been copied/moved to ./src/hivemind/test

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486625 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:37:16 +00:00
Simon Willnauer e3e2ed3801 Applied new structure of the gdata-server project.
All source and test-sources are located under the ./src/ folder.
Created GData Object Model (GOM) folder ./src/gom, Core-Folder ./src/core and ./src/hivemind
Sources and Tests now splitted into their "sub" projects.

Fixed compile error in GDataIndexWriter as IndexWriter has not commitTimeout setter anymore.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@486622 13f79535-47bb-0310-9956-ffa450edef68
2006-12-13 12:32:08 +00:00
Otis Gospodnetic 00250552f8 - Cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@485884 13f79535-47bb-0310-9956-ffa450edef68
2006-12-11 21:42:32 +00:00
Otis Gospodnetic 65cd9b86bf - Reindented
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@483579 13f79535-47bb-0310-9956-ffa450edef68
2006-12-07 18:07:09 +00:00
Andreas Vajda 63d2e582ad added support for Berkeley DB 4.5.20
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@480800 13f79535-47bb-0310-9956-ffa450edef68
2006-11-30 01:28:11 +00:00
Wolfgang Hoschek f620d56554 find/replace tab --> space
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@480343 13f79535-47bb-0310-9956-ffa450edef68
2006-11-29 02:35:43 +00:00
Wolfgang Hoschek 06a9b2e22a better estimation of memory consumption
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@480339 13f79535-47bb-0310-9956-ffa450edef68
2006-11-29 02:29:19 +00:00
Otis Gospodnetic 57ce49d49a - LUCENE-728: Removed duplicate/old MoreLikeThis and SimilarityQueries classes,
as their new home is under contrib/queries.
  This means contrib/similarity is not completely empty.  I am leaving it in SVN
  for now, as it seems like it could be a nice warm home for custom Similarity
  implementations.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@480200 13f79535-47bb-0310-9956-ffa450edef68
2006-11-28 20:46:42 +00:00
Wolfgang Hoschek 8dc26ad30e javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479895 13f79535-47bb-0310-9956-ffa450edef68
2006-11-28 04:36:41 +00:00
Wolfgang Hoschek 64d4421321 cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479755 13f79535-47bb-0310-9956-ffa450edef68
2006-11-27 20:36:29 +00:00
Wolfgang Hoschek 06b36cb004 For repeatable exceptions, store into cache only if child.tokenStream(fieldName, reader) does not throw an exception
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479754 13f79535-47bb-0310-9956-ffa450edef68
2006-11-27 20:34:45 +00:00
Wolfgang Hoschek deb74c1dfb javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479751 13f79535-47bb-0310-9956-ffa450edef68
2006-11-27 20:30:52 +00:00
Wolfgang Hoschek 8ab762aef2 Reverting back because of:
"Actually, now I'm considering reverting back to the version without a public clear() method. The rationale is that this would be less complex and more consistent with the AnalyzerUtil design (simple methods generating simple anonymous analyzer wrappers). If desired, you can still (re)use a single static "child" analyzer instance. It's cheap and easy to create a new caching analyzer on top of the static analyzer, and to do so before each document. The old one will simply be gc'd."


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479749 13f79535-47bb-0310-9956-ffa450edef68
2006-11-27 20:25:32 +00:00
Wolfgang Hoschek ad49369d3d added clear() method to TokenCachingAnalyzer, changed anonymous class to public class
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@479699 13f79535-47bb-0310-9956-ffa450edef68
2006-11-27 17:37:26 +00:00
Wolfgang Hoschek f76157f560 javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478407 13f79535-47bb-0310-9956-ffa450edef68
2006-11-23 00:27:36 +00:00
Wolfgang Hoschek b8387ec241 javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478406 13f79535-47bb-0310-9956-ffa450edef68
2006-11-23 00:25:53 +00:00
Wolfgang Hoschek 8ccee0c870 hashing on a pair isn't necessary and overly complicates semantics. So taking Pair logic out.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478378 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 23:29:42 +00:00
Wolfgang Hoschek d33f53c109 javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478377 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 23:22:56 +00:00
Wolfgang Hoschek ae6846798d cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478368 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 23:12:19 +00:00
Wolfgang Hoschek e7cde622d9 cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478366 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 23:10:12 +00:00
Wolfgang Hoschek aeb9723407 javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478361 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 22:57:59 +00:00
Wolfgang Hoschek cdb3440645 added getTokenCachingAnalyzer()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478360 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 22:55:01 +00:00
Daniel Naber a1966d93c7 import and whitespace cleanup only
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478355 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 22:45:23 +00:00
Daniel Naber dab153b256 LUCENE-722: DEFAULT spelled DEFALT in MoreLikeThis.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478354 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 22:41:33 +00:00
Wolfgang Hoschek 4002514600 reverting update to non-deprecated method as it was not backwards compatible with lucene-2.0
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478303 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 19:48:24 +00:00
Wolfgang Hoschek e7ed9203c0 clarified javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478302 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 19:42:25 +00:00
Wolfgang Hoschek b797db9788 cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478102 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 09:48:09 +00:00
Wolfgang Hoschek 758a772fcb cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478082 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 08:20:55 +00:00
Wolfgang Hoschek 286c390cb2 added option to measure indexing or query performance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478072 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 08:01:31 +00:00
Wolfgang Hoschek d9cb737f5a cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478054 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 07:00:43 +00:00
Wolfgang Hoschek 248ff4e963 cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478052 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 06:28:02 +00:00
Wolfgang Hoschek ad93748d49 example shouldn't use deprecated method
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@478045 13f79535-47bb-0310-9956-ffa450edef68
2006-11-22 05:56:08 +00:00
Daniel Naber 5cf957527f LUCENE-722: DEFAULT spelled DEFALT in MoreLikeThis.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@477932 13f79535-47bb-0310-9956-ffa450edef68
2006-11-21 22:26:45 +00:00
Daniel Naber d98c449203 default to "jar" so the correct manifest is used (not the common one)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@477877 13f79535-47bb-0310-9956-ffa450edef68
2006-11-21 20:54:06 +00:00
Wolfgang Hoschek 2b1dd87818 better memory estimate
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@476065 13f79535-47bb-0310-9956-ffa450edef68
2006-11-17 07:29:44 +00:00
Wolfgang Hoschek fb8fd8b8ce cosmetics
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@476060 13f79535-47bb-0310-9956-ffa450edef68
2006-11-17 06:34:13 +00:00
Wolfgang Hoschek 2d5f3b0c87 better memory estimate
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@476056 13f79535-47bb-0310-9956-ffa450edef68
2006-11-17 06:17:59 +00:00
Wolfgang Hoschek 34e917af03 better memory estimate
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@475844 13f79535-47bb-0310-9956-ffa450edef68
2006-11-16 18:16:27 +00:00
Wolfgang Hoschek 9a116a8b99 better memory estimate
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@475843 13f79535-47bb-0310-9956-ffa450edef68
2006-11-16 18:13:56 +00:00
Grant Ingersoll 423c1b4c9c Commit benchmark code. Thanks to Doron Cohen for updates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@475222 13f79535-47bb-0310-9956-ffa450edef68
2006-11-15 13:09:33 +00:00
Simon Willnauer 5627c6201c Webarchive packageing prepared for including IoC metadata descriptors
- including configured IoC descriptors in war distribution
	- improved dependency fetching (on demand)
	- fixed some spelling errors

Hessian Webservice support
	- created hessian servlet as a generic http interface for provided services
	- Interface def., Impl. and Tests
	- services can be exposed via HessianSkeletonProvider by defining a mapping key, Interface and an implementation instance.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@474244 13f79535-47bb-0310-9956-ffa450edef68
2006-11-13 10:51:04 +00:00
Simon Willnauer ebf6f78719 Class could not be resolved due to wrong spelling (after refactoring)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@474243 13f79535-47bb-0310-9956-ffa450edef68
2006-11-13 10:47:33 +00:00