Commit Graph

671 Commits

Author SHA1 Message Date
Michael McCandless 03d6b2aa24 LUCENE-1479: if date is missing, don't skip the doc; just don't add 'docdate' field to the doc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733697 13f79535-47bb-0310-9956-ffa450edef68
2009-01-12 11:37:23 +00:00
Uwe Schindler f0300a643e LUCENE-1517: Change superclass of TrieRangeQuery
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733519 13f79535-47bb-0310-9956-ffa450edef68
2009-01-11 21:15:24 +00:00
Uwe Schindler 4905d4404c Small optimization because an Object is not needed here
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733449 13f79535-47bb-0310-9956-ffa450edef68
2009-01-11 11:04:56 +00:00
Uwe Schindler c2b5e91ddd As suggested by Otis: Convert TABs to spaces, remove @author javadoc tags
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733443 13f79535-47bb-0310-9956-ffa450edef68
2009-01-11 10:25:14 +00:00
Karl-Johan Wettin d7376608b2 LUCENE-1514
ShingleMatrixFilter#next(Token) easily throws a StackOverflowException due to recursive invocation. (Karl Wettin)


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@733064 13f79535-47bb-0310-9956-ffa450edef68
2009-01-09 15:34:52 +00:00
Michael McCandless 8180700de8 LUCENE-1497: minor code cleanups to SimpleHTMLFormatter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732739 13f79535-47bb-0310-9956-ffa450edef68
2009-01-08 16:11:19 +00:00
Karl-Johan Wettin f991524da8 LUCENE-1510
InstantiatedIndexReader#norms methods throws NullPointerException on empty index.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732661 13f79535-47bb-0310-9956-ffa450edef68
2009-01-08 09:28:42 +00:00
Ryan McKinley 3f72bc813f remove unused imports and @author tag
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732410 13f79535-47bb-0310-9956-ffa450edef68
2009-01-07 17:47:26 +00:00
Ryan McKinley 977c9acc71 LUCENE-1512 -- adding GeoHash implementaion
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732400 13f79535-47bb-0310-9956-ffa450edef68
2009-01-07 17:10:58 +00:00
Ryan McKinley 43609e891d LUCENE-1504 -- SerialChainFilter should use DocSet API
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@732031 13f79535-47bb-0310-9956-ffa450edef68
2009-01-06 18:13:06 +00:00
Ryan McKinley 4978391e93 LUCENE-1508 -- using the tierPrefix constructors by default
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730662 13f79535-47bb-0310-9956-ffa450edef68
2009-01-02 03:21:23 +00:00
Ryan McKinley 0958d83354 LUCENE-1508 -- make the field prefix configurable
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730658 13f79535-47bb-0310-9956-ffa450edef68
2009-01-02 02:52:30 +00:00
Ryan McKinley 1f585f0357 LUCENE-1387 -- expose getLat/getLng from DistanceQueryBuilder
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730653 13f79535-47bb-0310-9956-ffa450edef68
2009-01-02 00:51:26 +00:00
Michael Busch c9e7642bae Fix pom.xml.template of spatial search contrib
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730552 13f79535-47bb-0310-9956-ffa450edef68
2009-01-01 15:02:48 +00:00
Otis Gospodnetic 72725a0b58 - Small documentation mods.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730207 13f79535-47bb-0310-9956-ffa450edef68
2008-12-30 18:20:43 +00:00
Ryan McKinley 0afd451f24 LUCENE-1503 -- refactor spatial Query/Filter classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730205 13f79535-47bb-0310-9956-ffa450edef68
2008-12-30 18:00:56 +00:00
Ryan McKinley 581c82ccbe LUCENE-1387 -- adding locallucene as new spatial contrib package.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@730067 13f79535-47bb-0310-9956-ffa450edef68
2008-12-30 07:37:17 +00:00
Michael McCandless eb64661216 LUCENE-1495: fix TaskSequence.toString to show time-limited configuration
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728751 13f79535-47bb-0310-9956-ffa450edef68
2008-12-22 19:37:58 +00:00
Michael McCandless 51f9b759fa LUCENE-1495: fix time-based test to reduce change of false failure
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728425 13f79535-47bb-0310-9956-ffa450edef68
2008-12-21 11:07:28 +00:00
Michael McCandless 7abe0311bc LUCENE-1444: fix broken site links in javadocs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728262 13f79535-47bb-0310-9956-ffa450edef68
2008-12-20 09:43:06 +00:00
Michael McCandless bdc42e5342 LUCENE-1495: allow TaskSequence to run for certain time
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@728252 13f79535-47bb-0310-9956-ffa450edef68
2008-12-20 08:45:09 +00:00
Michael McCandless 74e097f8eb LUCENE-1493: allow setting top number of hits to collect with search.num.hits
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727063 13f79535-47bb-0310-9956-ffa450edef68
2008-12-16 15:09:46 +00:00
Michael McCandless 09f661c48c LUCENE-1492: add optional readOnly param to OpenReader task
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@727029 13f79535-47bb-0310-9956-ffa450edef68
2008-12-16 11:44:01 +00:00
Michael McCandless 1f33d7c66a fix off-by-one bug in getting field sort type
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@726213 13f79535-47bb-0310-9956-ffa450edef68
2008-12-13 14:43:32 +00:00
Karl-Johan Wettin 812abd20dd LUCENE-1462
InstantiatedIndexWriter did not reset pre analyzed TokenStreams the same way IndexWriter does. 
Parts of InstantiatedIndex was not Serializable.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@726030 13f79535-47bb-0310-9956-ffa450edef68
2008-12-12 15:12:31 +00:00
Karl-Johan Wettin 219a20a945 LUCENE-1462
InstantiatedIndexWriter did not reset pre analyzed TokenStreams the same way IndexWriter does. 
Parts of InstantiatedIndex was not Serializable.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@725837 13f79535-47bb-0310-9956-ffa450edef68
2008-12-11 22:08:45 +00:00
Grant Ingersoll 2225462178 LUCENE-1380: Add PositionFilter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@725691 13f79535-47bb-0310-9956-ffa450edef68
2008-12-11 14:17:44 +00:00
Michael McCandless efc985cbf4 LUCENE-1478: allow SortField to use a custom numeric FieldCache parser
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@724484 13f79535-47bb-0310-9956-ffa450edef68
2008-12-08 21:07:45 +00:00
Grant Ingersoll 702ea32da7 make constructors public
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@724059 13f79535-47bb-0310-9956-ffa450edef68
2008-12-07 00:39:35 +00:00
Michael McCandless 4a0a5664dd LUCENE-1468: switch Directory.list() to Directory.listAll(), which does no filtering of returned array
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723789 13f79535-47bb-0310-9956-ffa450edef68
2008-12-05 17:03:13 +00:00
Michael McCandless 5e3cd6711a LUCENE-1470: describe TrieRangeFilter/Query in README
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723701 13f79535-47bb-0310-9956-ffa450edef68
2008-12-05 10:16:50 +00:00
Michael McCandless 240897988d LUCENE-1470: switch TestTrieRangeQuery to LuceneTestCase as well
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723313 13f79535-47bb-0310-9956-ffa450edef68
2008-12-04 13:17:23 +00:00
Michael McCandless 05bff1833f LUCENE-1470: add proper dependency to contrib/build.xml so contrib tests can subclass LuceneTestCase; fixed small bug TestTrieUtils test case
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723287 13f79535-47bb-0310-9956-ffa450edef68
2008-12-04 11:06:59 +00:00
Michael McCandless 481f8080ab LUCENE-1470: add TrieRangeQuery, a much more efficient implementation of RangeQuery at the expense of added space consumed in the index
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@723031 13f79535-47bb-0310-9956-ffa450edef68
2008-12-03 19:38:31 +00:00
Michael McCandless 278dc0fc04 LUCENE-1296: add protected method CachingWrapperFilter.docIdSetToCache
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@722174 13f79535-47bb-0310-9956-ffa450edef68
2008-12-01 18:50:45 +00:00
Mark Robert Miller c2f7d554a3 LUCENE-1389: SimpleSpanFragmenter can create very short fragments
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@713569 13f79535-47bb-0310-9956-ffa450edef68
2008-11-13 00:01:43 +00:00
Yonik Seeley 8e8e8ddec4 set svn:eol-style to native on java files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@712922 13f79535-47bb-0310-9956-ffa450edef68
2008-11-11 02:35:46 +00:00
Michael McCandless abe6ea0828 LUCENE-1424: enable MultiTermQuery to do constant scoring; refactor Range, Wildcard, Prefix to expose this
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@712890 13f79535-47bb-0310-9956-ffa450edef68
2008-11-11 00:40:00 +00:00
Mark Harwood de7ab5b898 Added new web application demo for contrib's XmlQueryParser.
This change involves:
* Adding Tomcat's Servlet jar into the lib directory and appropriate entry in NOTICE.txt following the lead from Solr's packaging
* Adding new "demo" directory to XmlQueryParser src directory
* Changing XMLQueryParser's build file to create a demo War file
* Changing the main build to include the demo War file (and any other future contrib/*/war files) in the binary distributions

The packaged source distribution has NOT been changed currently to add a lib directory with the servlet.jar so building from a cut-down src distro as opposed to the full subversion /trunk directory will not currently build the war file (the xml query parser build file detects the absence of servlet.jar). Not sure if this is a problem currently.

TODO: 
Now that the servlet jar is available in Subversion I would recommend that the other existing WAR file,"luceneweb.war", is changed to move much of the java code which is currently embedded in JSP files into servlet .java files. This would ensure that the build system will check that the code in this application compiles cleanly with the latest Lucene APIs - otherwise any issue will only become apparent when a user tries to run a JSP.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@712318 13f79535-47bb-0310-9956-ffa450edef68
2008-11-08 00:20:32 +00:00
Michael McCandless 24d76157a0 LUCENE-1420: let Similarity.computeNorm compute the norm; add option to discount overlap tokens when computing lengthNorm
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@710117 13f79535-47bb-0310-9956-ffa450edef68
2008-11-03 18:03:58 +00:00
Michael McCandless f2c988ae2b don't include contrib/memory class files in highlighter's jar
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@707834 13f79535-47bb-0310-9956-ffa450edef68
2008-10-25 10:19:30 +00:00
Grant Ingersoll 8dfe073760 LUCENE-1406. Added Arabic stemming and normalization. Also added new method to WordListLoader to allow for comments in word lists.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@706342 13f79535-47bb-0310-9956-ffa450edef68
2008-10-20 17:19:29 +00:00
Michael McCandless da78e20bd0 LUCENE-1416: don't fail contrib/ant's unit test when file path contains non-URL safe characters like space
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@706160 13f79535-47bb-0310-9956-ffa450edef68
2008-10-20 09:12:46 +00:00
Karl-Johan Wettin 456b10fdf9 LUCENE-1423
InstantiatedTermEnum#skipTo(Term) throws ArrayIndexOutOfBoundsException on an empty index.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@705893 13f79535-47bb-0310-9956-ffa450edef68
2008-10-18 16:29:53 +00:00
Daniel Naber 20ddb46fec minor javadoc fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@703717 13f79535-47bb-0310-9956-ffa450edef68
2008-10-11 19:45:16 +00:00
Michael McCandless 3f27b17a89 fix non-1.4-compatible throws clause
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@701827 13f79535-47bb-0310-9956-ffa450edef68
2008-10-05 16:40:59 +00:00
Mark Harwood 92d6a0ae7b Fix for a potential null-pointer-introducing bug which came about as part of the DocIdSet changes. TermsFilter no longer implemented bits(IndexReader) and the Filter base class' version of this was changed to return null.
When dropping 2.4 Lucene in as a direct replacement for 2.3.2 my client code was getting NullPointer errors - returning null was never part of the Filter.bits contract and so this could be a problem for others using this class. 

Fix is for TermsFilter to implement bits(IndexReader) - this can safely be removed in later versions because it is a deprecated method going forward.


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@700815 13f79535-47bb-0310-9956-ffa450edef68
2008-10-01 15:37:05 +00:00
Michael McCandless 98e1129a14 break off contrib/CHANGES.txt's 2.4.0 release section
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@700743 13f79535-47bb-0310-9956-ffa450edef68
2008-10-01 11:22:58 +00:00
Andreas Vajda 88283a7027 - upgraded to bdb-je 3.3.69
- re-enabled testBytes() and testArrays() (per Aaron, Michael's instructions)


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699961 13f79535-47bb-0310-9956-ffa450edef68
2008-09-29 02:21:51 +00:00
Andreas Vajda 96809b1238 upgraded db lib to version 4.7.25
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699882 13f79535-47bb-0310-9956-ffa450edef68
2008-09-28 19:07:39 +00:00
Mark Harwood ba6344a4d7 Fixed bug in FuzzyLikeThisQuery.java. Queries that contain a term with no fuzzy variants caused the query construction logic to exit loop early, producing no fuzzy variants for all subsequent terms in the query string.
Junit test added which recreates the problem conditions and added fix to FuzzyLikeThisQuery that solves the issue.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@699512 13f79535-47bb-0310-9956-ffa450edef68
2008-09-26 21:44:55 +00:00
Michael McCandless 0e708a21fe LUCENE-1401: fix accidentally lost 'create=true' in CreateIndexTask
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@698975 13f79535-47bb-0310-9956-ffa450edef68
2008-09-25 14:11:41 +00:00
Michael McCandless a40530e4a4 LUCENE-1401: remove new deprecated IndexWriter ctors; default autoCommit=false for new ctors
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@698932 13f79535-47bb-0310-9956-ffa450edef68
2008-09-25 12:04:38 +00:00
Michael McCandless b866828bab add graph to package.html for contrib/instantiated showing queries per sec vs number of docs indexed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@697126 13f79535-47bb-0310-9956-ffa450edef68
2008-09-19 15:15:36 +00:00
Otis Gospodnetic 0195fcd03d LUCENE-1378 - Removed the remaining 199 @author references
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@695514 13f79535-47bb-0310-9956-ffa450edef68
2008-09-15 15:42:11 +00:00
Karl-Johan Wettin c6e5618674 Added a bunch of <p> in the Javadocs to make the rendered html view look better.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694396 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 18:26:46 +00:00
Karl-Johan Wettin 71f2d8199b LUCENE-1320
ShingleMatrixFilter JDK downgrade 1.5 -> 1.4 
Grant Ingersoll via Karl Wettin

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694393 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 18:23:18 +00:00
Michael McCandless d31fb4556c fix wordnet's Syns2Index to not fiddle with mergeFactor & maxBuffereDocs (the latter was hitting an exception)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694222 13f79535-47bb-0310-9956-ffa450edef68
2008-09-11 12:11:03 +00:00
Michael McCandless 4218996230 LUCENE-1366: rename Field.Index.* options
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@694004 13f79535-47bb-0310-9956-ffa450edef68
2008-09-10 21:38:52 +00:00
Grant Ingersoll dadaebc290 LUCENE-1243: Added new benchmark tasks
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693495 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 15:56:41 +00:00
Mark Robert Miller 5147f0563f LUCENE-1357: SpanScorer does not respect ConstantScoreRangeQuery setting
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@693432 13f79535-47bb-0310-9956-ffa450edef68
2008-09-09 10:52:50 +00:00
Michael McCandless d5a40278bc LUCENE-1369: switch from Hashtable to HashMap and from Vector to List, when possible
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@692921 13f79535-47bb-0310-9956-ffa450edef68
2008-09-07 19:22:40 +00:00
Michael McCandless 120d01bb09 remove DOCTYPE declaration in testEncoding (it's causing intermittant false failures in nightly build when xerces has trouble loading the DTD from w3.org)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@690914 13f79535-47bb-0310-9956-ffa450edef68
2008-09-01 09:07:16 +00:00
Karl-Johan Wettin bf238a5743 Javadocs fix
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@690779 13f79535-47bb-0310-9956-ffa450edef68
2008-08-31 20:46:47 +00:00
Karl-Johan Wettin 82c70c018e LUCENE-1016 : TermVectorAccessor, transparent vector space access via stored vectors or by resolving the inverted index.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688745 13f79535-47bb-0310-9956-ffa450edef68
2008-08-25 15:02:20 +00:00
Karl-Johan Wettin 3034575f66 LUCENE-1142 : Updated Snowball package, org.tartarus distribution revision 500.
Introducing Hungarian, Turkish and Romanian support, updated older stemmers and optimized (reflectionless) SnowballFilter.

IMPORTANT NOTICE ON BACKWARDS COMPATIBILITY: an index created using the 2.3.2 (or older) might not be compatible with these updated classes as some algorithms have changed.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688420 13f79535-47bb-0310-9956-ffa450edef68
2008-08-23 22:02:47 +00:00
Karl-Johan Wettin 9fe7a35378 Contrib level CHANGES.txt. I forgot to add this some time ago.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@688370 13f79535-47bb-0310-9956-ffa450edef68
2008-08-23 17:12:57 +00:00
Michael McCandless 003a853cc8 LUCENE-1333: don't use LuceneTestCase in contrib until we can fix the build dependency
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687539 13f79535-47bb-0310-9956-ffa450edef68
2008-08-21 02:45:37 +00:00
Grant Ingersoll 09467ad18e LUCENE-1001: Make payloads accessible from SpanQuery's
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687379 13f79535-47bb-0310-9956-ffa450edef68
2008-08-20 16:06:36 +00:00
Michael McCandless bb6b711718 LUCENE-1333: improvements to Token reuse API and full cutover to reuse API for all core and contrib analyzers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687357 13f79535-47bb-0310-9956-ffa450edef68
2008-08-20 14:38:07 +00:00
Mark Robert Miller 7675606908 LUCENE-1355: highlighter can incorrectly produce negative idf when index has deletes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687052 13f79535-47bb-0310-9956-ffa450edef68
2008-08-19 13:16:41 +00:00
Michael McCandless e31a9da835 LUCENE-1334: add Term(String fieldName) constructor that sets term text to empty string
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@687014 13f79535-47bb-0310-9956-ffa450edef68
2008-08-19 10:40:39 +00:00
Doron Cohen 027307857e LUCENE-1353: Javacc target to generate precedence query parser.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@685114 13f79535-47bb-0310-9956-ffa450edef68
2008-08-12 10:49:59 +00:00
Michael McCandless 12e4e67403 add missing synchronized to PerfRunData.getQueryMaker()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@677555 13f79535-47bb-0310-9956-ffa450edef68
2008-07-17 10:50:24 +00:00
Mark Harwood 74c551fb88 Added option to allow UserQuery tag to define a different default fieldName.
Standard use case for this is where users are presented with a GUI form with multiple input boxes, each targetting a different field and allowing "lucene syntax". The XML query template behind such a form would have a <UserQuery> tag for each form field, each defined with the appropriate choice of default field name.

Added Junit test for changing default field name, updated DTD for XML query syntax and regenerated HTML documentation.



git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@674975 13f79535-47bb-0310-9956-ffa450edef68
2008-07-08 21:50:06 +00:00
Karl-Johan Wettin ddc7c290d0 LUCENE-1320
ShingleMatrixFilter, a multidimensional shingle token filter.

Bug fix, did not support empty input token streams.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@674367 13f79535-47bb-0310-9956-ffa450edef68
2008-07-07 00:08:41 +00:00
Karl-Johan Wettin bca43ea3ea LUCENE-1320
ShingleMatrixFilter, a multidimensional shingle token filter.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@673549 13f79535-47bb-0310-9956-ffa450edef68
2008-07-02 23:53:51 +00:00
Mark Robert Miller 943bf37e66 LUCENE-1321: Highlight fragment does not extend to maxDocCharsToAnalyze
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@673220 13f79535-47bb-0310-9956-ffa450edef68
2008-07-01 21:44:33 +00:00
Karl-Johan Wettin 4f7ead8112 LUCENE-1318: InstantiatedIndexReader.norms(String, b[], int) didn't treat the array offset right.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@672568 13f79535-47bb-0310-9956-ffa450edef68
2008-06-28 18:52:12 +00:00
Karl-Johan Wettin b8fc54e72a LUCENE-1312: Added full support for InstantiatedIndexReader#getFieldNames() and extended the test case to assert deleted documents behaves as they should (they did).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@672556 13f79535-47bb-0310-9956-ffa450edef68
2008-06-28 17:23:35 +00:00
Erik Hatcher 3c80f97f7c MemoryIndex implements Serializable
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@671579 13f79535-47bb-0310-9956-ffa450edef68
2008-06-25 15:18:29 +00:00
Grant Ingersoll 44dcbe184f add a getter for the StringDistance
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669476 13f79535-47bb-0310-9956-ffa450edef68
2008-06-19 12:52:11 +00:00
Grant Ingersoll d6dcae49dd make public
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669455 13f79535-47bb-0310-9956-ffa450edef68
2008-06-19 11:45:08 +00:00
Otis Gospodnetic 8b11f3b077 - Renamed TRStringDistance to LevensteinDistance (related to LUCENE-1297)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669086 13f79535-47bb-0310-9956-ffa450edef68
2008-06-18 05:09:11 +00:00
Otis Gospodnetic ce19aa28c0 LUCENE-1297 - Allow other string distance measures for the SpellChecker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@669085 13f79535-47bb-0310-9956-ffa450edef68
2008-06-18 05:01:57 +00:00
Mark Harwood 4adc3f7ba6 Fixed bug parsing boolean attributes. Boolean.getBoolean(s) was being used by mistake which reads a system property with name of s - instead need to parse s as string value of "true" or "false".
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@667294 13f79535-47bb-0310-9956-ffa450edef68
2008-06-12 23:38:39 +00:00
Doron Cohen 4d9e280788 LUCENE-1090: remove relative paths from benchmark's build.xml.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@666079 13f79535-47bb-0310-9956-ffa450edef68
2008-06-10 11:58:00 +00:00
Grant Ingersoll 8c97e9b87b LUCENE-1299: Properly handle when IndexReader is not null and Field is in the spell checker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@663649 13f79535-47bb-0310-9956-ffa450edef68
2008-06-05 16:12:08 +00:00
Grant Ingersoll f89cda6dde LUCENE-1298: Allow MLT to use custom similarity
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@663054 13f79535-47bb-0310-9956-ffa450edef68
2008-06-04 10:41:41 +00:00
Grant Ingersoll 1979d75605 LUCENE-1295: Made method public and added retrieveInterestingTerms variation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@662413 13f79535-47bb-0310-9956-ffa450edef68
2008-06-02 12:29:02 +00:00
Mark Robert Miller f32b5a5698 LUCENE-1285: WeightedSpanTermExtractor incorrectly treats the same terms occurring in different query types
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659965 13f79535-47bb-0310-9956-ffa450edef68
2008-05-25 11:38:55 +00:00
Michael Busch 3e7a15e317 LUCENE-1187: Fix ChainedFilterTest.java and BooleanFilterTest.java to not reference class from test/org/apache/lucene/search/, which is not in the contrib classpath.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659742 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 01:45:12 +00:00
Michael Busch 1d6d186888 LUCENE-1187: Fix ChainedFilterTest.java and BooleanFilterTest.java to not reference class from test/org/apache/lucene/search/, which is not in the contrib classpath.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659740 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 01:16:30 +00:00
Michael Busch 14f3663f56 Set eol-style to native for BooleanFilterTest.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659732 13f79535-47bb-0310-9956-ffa450edef68
2008-05-24 00:48:12 +00:00
Michael Busch 0ebfcc663e LUCENE-1187: ChainedFilter and BooleanFilter now work with new Filter API and DocIdSetIterator-based filters.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659635 13f79535-47bb-0310-9956-ffa450edef68
2008-05-23 19:25:05 +00:00
Otis Gospodnetic ce557ee4b3 LUCENE-852: Let the SpellChecker caller specify IndexWriter mergeFactor and RAM buffer size.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659021 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:47:36 +00:00
Otis Gospodnetic b4f7c37bd1 LUCENE-1046: Removed dead code in SpellChecker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659019 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:34:18 +00:00
Otis Gospodnetic f68c9544de LUCENE-1183: Optimized TRStringDistance class (in contrib/spell) that uses less memory than the previous version
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659016 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:24:55 +00:00
Otis Gospodnetic a379a67875 LUCENE-1152 Fix for calling indexDictionary after clearIndex call
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@659013 13f79535-47bb-0310-9956-ffa450edef68
2008-05-22 06:21:17 +00:00
Otis Gospodnetic 90be0daa46 - Made PlainTextDictionary(Reader) public from protected
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657728 13f79535-47bb-0310-9956-ffa450edef68
2008-05-19 05:45:31 +00:00
Otis Gospodnetic f5df30327e - Fixed messed up indentation/tabs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657281 13f79535-47bb-0310-9956-ffa450edef68
2008-05-17 01:57:32 +00:00
Otis Gospodnetic 1d5ba345cc - Javadocs fixes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657280 13f79535-47bb-0310-9956-ffa450edef68
2008-05-17 01:56:46 +00:00
Otis Gospodnetic d5c708a161 - Renamed vars a bit, so test is easier to understand
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657279 13f79535-47bb-0310-9956-ffa450edef68
2008-05-17 01:55:48 +00:00
Grant Ingersoll 7a27cdcbc9 LUCENE-1166: Added token filter for decomposing compound words
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@657027 13f79535-47bb-0310-9956-ffa450edef68
2008-05-16 12:22:50 +00:00
Otis Gospodnetic aa0074f5db LUCENE-1003: Don't let RussianAnalyzer drop numbers.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@656111 13f79535-47bb-0310-9956-ffa450edef68
2008-05-14 05:37:45 +00:00
Otis Gospodnetic b182881092 - Made ant compile-core and build-contrib run again
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@656110 13f79535-47bb-0310-9956-ffa450edef68
2008-05-14 05:29:57 +00:00
Otis Gospodnetic 6a8bef8c74 - Made ant compile-core and build-contrib run again
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@656109 13f79535-47bb-0310-9956-ffa450edef68
2008-05-14 05:29:16 +00:00
Michael Busch 38d1caf56f LUCENE-1277: Remove println in highlighter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@652625 13f79535-47bb-0310-9956-ffa450edef68
2008-05-01 18:59:06 +00:00
Grant Ingersoll afa040e9da LUCENE-1276: Fixed contrib highlighter build
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@652471 13f79535-47bb-0310-9956-ffa450edef68
2008-05-01 10:34:50 +00:00
Mark Harwood 35c7eb36df Commit of LUCENE-794 patch - adding phrase/span query support to highlighter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@652164 13f79535-47bb-0310-9956-ffa450edef68
2008-04-29 22:07:18 +00:00
Karl-Johan Wettin 6a4ae85775 minor javadocs change
First paragraph is used as description in javadocs index so I added the sentance from build/pom. And it gave me an excuse to test my karma.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@649312 13f79535-47bb-0310-9956-ffa450edef68
2008-04-17 22:19:41 +00:00
Michael McCandless 44daebab7c LUCENE-1150: put back public tokenImage/TOKEN_TYPES in StandardTokenizer and WikipediaTokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@648183 13f79535-47bb-0310-9956-ffa450edef68
2008-04-15 08:48:41 +00:00
Chris M. Hostetter 21b22f8515 some javadoc building cleanup for contrib/instantiated -- notably moving docs to doc-files
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@643170 13f79535-47bb-0310-9956-ffa450edef68
2008-03-31 22:00:57 +00:00
Grant Ingersoll cc955c9748 LUCENE-400: Added ShingleFilter (token based ngram)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@642612 13f79535-47bb-0310-9956-ffa450edef68
2008-03-29 21:11:33 +00:00
Mark Harwood c172010da0 Applied trejkaz's patch from https://issues.apache.org/jira/browse/LUCENE-1240 to optimise TermFilter.java and included new JUnit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@638631 13f79535-47bb-0310-9956-ffa450edef68
2008-03-18 23:01:00 +00:00
Grant Ingersoll 9ac963952f LUCENE-1236: Added some more javadocs. Also removed @author tags
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@637449 13f79535-47bb-0310-9956-ffa450edef68
2008-03-15 18:05:10 +00:00
Chris M. Hostetter 32f1054d88 LUCENE-1202 - tweaks to build system to make clover work better with contribs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@637344 13f79535-47bb-0310-9956-ffa450edef68
2008-03-15 03:16:43 +00:00
Grant Ingersoll f4cc6e2269 LUCENE-550: Added RAMDirectory alternative as a contrib. Similar to MemoryIndex, but can hold more than one document
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@636745 13f79535-47bb-0310-9956-ffa450edef68
2008-03-13 12:34:30 +00:00
Michael McCandless b36d9f9be3 download bdb zip from Oracle's servers
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@636673 13f79535-47bb-0310-9956-ffa450edef68
2008-03-13 09:06:31 +00:00
Doron Cohen 25f80c71c9 LUCENE-1209: Fixed DocMaker settings by round. Prior to this fix, DocMaker settings of
first round were used in all rounds.  (E.g. term vectors.)


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635280 13f79535-47bb-0310-9956-ffa450edef68
2008-03-09 16:43:32 +00:00
Doron Cohen ce0309ed5f Javadoc clarrification for getRecall().
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635195 13f79535-47bb-0310-9956-ffa450edef68
2008-03-09 08:30:28 +00:00
Doron Cohen 8b2f12851b make the name of multi-value property long enough for the values.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@635194 13f79535-47bb-0310-9956-ffa450edef68
2008-03-09 08:28:53 +00:00
Mark Harwood 37a060c15c Additional thread safety around filter creation - old code could create duplicate CachingWrapperFilter if thread1 gets cache miss and thread 2 has a cache miss before thread1 populates cache with new CachingWrapperFilter.
Synchronization cost around whole method is OK here because Filter object construction should be a lightweight call. 
Note: CachingWrapperFilter currently has a similar bug in bits() method but adding "synchronized" around that whole method would not be a solution there because of the cost of evaluating filter.bits and the unnecessary blocking effect this would have on threads using different readers to the thread with the lock.

 

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@628921 13f79535-47bb-0310-9956-ffa450edef68
2008-02-18 23:39:12 +00:00
Mark Harwood caed55ef5e Added thread-safety around use of core's QueryParser.
Old XML parser constructors use a mode which will synchronize on use of the user-supplied QueryParser. 
New constructors offer alternative option of passing "defaultField" String which is used to create a new single-use QueryParser for each parse operation.

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@628568 13f79535-47bb-0310-9956-ffa450edef68
2008-02-17 22:32:58 +00:00
Michael McCandless fc4bc1f5af LUCENE-1084: fix more test cases to use user-defined MaxFieldLength ctor for IndexWriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619637 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 21:07:35 +00:00
Grant Ingersoll 00941055fd benchmark autocommit
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619457 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 15:33:02 +00:00
Grant Ingersoll 0dc6c59ac1 LUCENE-494: Added QueryAutoStopWordAnalyzer in a new query subpackage
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@619420 13f79535-47bb-0310-9956-ffa450edef68
2008-02-07 14:13:38 +00:00
Michael McCandless 7ea8bd35f4 LUCENE-1084: fix the remaining deprecated calls to IndexWriter ctors
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@618764 13f79535-47bb-0310-9956-ffa450edef68
2008-02-05 20:07:29 +00:00
Erik Hatcher 140a867933 Correct javadoc
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@618275 13f79535-47bb-0310-9956-ffa450edef68
2008-02-04 13:31:02 +00:00
Michael McCandless afb58ca0d2 when reinit() is called, reset the start time (used only when printing # docs indexed)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@617991 13f79535-47bb-0310-9956-ffa450edef68
2008-02-03 14:24:30 +00:00
Michael Busch 18b61286fa LUCENE-584: Changed Filter API to return a DocIdSet instead of a java.util.BitSet. This allows using more efficient data structures for Filters and makes them more flexible.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@617859 13f79535-47bb-0310-9956-ffa450edef68
2008-02-02 19:04:03 +00:00
Grant Ingersoll 3db9002efb LUCENE-1156: see CHANGES.txt
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616934 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 22:47:52 +00:00
Grant Ingersoll 61f2aa3078 allow TrecDocMaker to be more easily extended by making values protected
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616882 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 20:03:39 +00:00
Grant Ingersoll 4fc8654bc1 Made some member variables protected so they can accessed by extending classes
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616806 13f79535-47bb-0310-9956-ffa450edef68
2008-01-30 16:29:35 +00:00
Grant Ingersoll b95e9a5154 LUCENE-1132: Updated documentation and deprecated bytes references in favor of character references
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616305 13f79535-47bb-0310-9956-ffa450edef68
2008-01-29 13:35:13 +00:00
Michael McCandless 38f8d4e837 LUCENE-1150: make StandardAnalyzer tokenizer constants public again (public access was accidentally removed with LUCENE-966)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@616248 13f79535-47bb-0310-9956-ffa450edef68
2008-01-29 10:51:44 +00:00
Grant Ingersoll 7dd34ef888 LUCENE-1153: added lib/junit and build support
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@615329 13f79535-47bb-0310-9956-ffa450edef68
2008-01-25 20:27:55 +00:00
Michael McCandless ee835ccf21 LUCENE-1084: change all IW ctors to require up-front specification of max field length; deprecated existing ones
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@615168 13f79535-47bb-0310-9956-ffa450edef68
2008-01-25 10:21:15 +00:00
Doron Cohen 8ace4103a7 LUCENE-1136: add ability to not count sub-task doLogic increment to contri/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614956 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 18:46:57 +00:00
Grant Ingersoll 7f09c9f00e Added javadocs about UNTOKENIZED_TOKEN_FLAG conundrum
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614905 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 15:42:50 +00:00
Grant Ingersoll 305c47f500 LUCENE-1133: Adds ability to keep certain strings as single tokens
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614895 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 15:05:53 +00:00
Grant Ingersoll 1183763dbe LUCENE-1128 and 1129: Add highlighting support to benchmarking, plus fix minor traversalSize bug in ReadTask, also added a few new algorithms to try out
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614885 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 14:39:44 +00:00
Grant Ingersoll f75f490eb9 LUCENE-1127: added couple of convenience methods to TokenSources
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614884 13f79535-47bb-0310-9956-ffa450edef68
2008-01-24 14:36:46 +00:00
Michael McCandless dd0265469a LUCENE-1139: fix IndexWriter settings (compound, merge.factor, etc.) to work again
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@614022 13f79535-47bb-0310-9956-ffa450edef68
2008-01-21 21:42:43 +00:00
Michael McCandless 354a3175d5 LUCENE-1139: various additions/fixes to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@613536 13f79535-47bb-0310-9956-ffa450edef68
2008-01-20 11:31:38 +00:00
Grant Ingersoll ed9fdba601 LUCENE-1050 and LUCENE-1138 fixes for lock problem
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@612868 13f79535-47bb-0310-9956-ffa450edef68
2008-01-17 17:00:43 +00:00
Grant Ingersoll 0a54ca1920 LUCENE-1135. Added Javadoc marking the WikipediaTokenizer as experimental
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@612244 13f79535-47bb-0310-9956-ffa450edef68
2008-01-15 22:01:21 +00:00
Michael Busch e5da9db486 Rename README files to uppercase letters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610961 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 22:38:38 +00:00
Michael Busch 49dd305f69 Rename README files to uppercase letters
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610960 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 22:37:27 +00:00
Michael McCandless ea951785eb LUCENE-1117: fix EnwikiDocMaker to not hang when the producer thread hits exception
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@610738 13f79535-47bb-0310-9956-ffa450edef68
2008-01-10 10:12:18 +00:00
Michael McCandless f0d5002066 LUCENE-1117: fix intermittent thread safety issue w/ EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@609080 13f79535-47bb-0310-9956-ffa450edef68
2008-01-05 01:51:53 +00:00
Grant Ingersoll 79e09db401 LUCENE-1103: Internal links should increment as all tokens do, since the first token is valid too
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608989 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 20:36:23 +00:00
Grant Ingersoll b18f6ae959 LUCENE-1103: The link is now incremented 1, but then the next token in the link is not incremented. This way, the link is not associated with the previous term. Instead it associated with the next term in the link, which would be the display tokens. If there are no display tokens, then it will take it's proper place in the token chain.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608978 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 20:15:22 +00:00
Grant Ingersoll f715fc6031 LUCENE-1103
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608852 13f79535-47bb-0310-9956-ffa450edef68
2008-01-04 14:29:15 +00:00
Michael McCandless f12862426a fix typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608523 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 15:20:41 +00:00
Doron Cohen 9e65cd554f LUCENE-1116: contrib/benchmark quality package improvements (MRR, Trec1MQ)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608370 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 07:44:40 +00:00
Michael McCandless 263244312d LUCENE-1115: some small fixes to contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608306 13f79535-47bb-0310-9956-ffa450edef68
2008-01-03 01:48:18 +00:00
Grant Ingersoll ed893f770c LUCENE-1114: Updated example
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@608126 13f79535-47bb-0310-9956-ffa450edef68
2008-01-02 15:30:40 +00:00
Michael Busch 75473edb02 LUCENE-746: Fix error message in AnalyzingQueryParser.getPrefixQuery.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607832 13f79535-47bb-0310-9956-ffa450edef68
2008-01-01 12:49:44 +00:00
Grant Ingersoll 90a735441f LUCENE-1102: EnwikiDocMaker now adds a docid field
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607732 13f79535-47bb-0310-9956-ffa450edef68
2007-12-31 13:07:14 +00:00
Doron Cohen ece8361ab5 LUCENE-749: ChainedFilter behavior fixed when logic of first filter is ANDNOT.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607606 13f79535-47bb-0310-9956-ffa450edef68
2007-12-30 22:47:59 +00:00
Grant Ingersoll bd340a896d git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607291 13f79535-47bb-0310-9956-ffa450edef68 2007-12-28 17:08:26 +00:00
Grant Ingersoll 5c81934465 git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607290 13f79535-47bb-0310-9956-ffa450edef68 2007-12-28 17:08:16 +00:00
Grant Ingersoll cb94c6aed4 git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607288 13f79535-47bb-0310-9956-ffa450edef68 2007-12-28 17:07:33 +00:00
Grant Ingersoll 40d85a7781 Switch to using the EnwikiDocMaker
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@607281 13f79535-47bb-0310-9956-ffa450edef68
2007-12-28 16:29:03 +00:00
Grant Ingersoll ca821526b0 removed bad chars at end of file
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605966 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:33:16 +00:00
Grant Ingersoll 36b1206ad8 Restoring ExtractWikipedia, as it is still a handy class to have around. Splitting the documents is useful for debugging purposes when you know you want to look at a specific document instead of grepping through a really large file.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@605959 13f79535-47bb-0310-9956-ffa450edef68
2007-12-20 15:14:24 +00:00
Grant Ingersoll 55d0c3a2f8 LUCENE-1077: refactored to have a common PayloadHelper classes. Also added TokenOffsetPayloadTokenFilter, which encodes the Token offset into the payloads
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@604870 13f79535-47bb-0310-9956-ffa450edef68
2007-12-17 13:55:46 +00:00
Doron Cohen b7e167ac8d LUCENE-1086: DocMakers setup for the "docs.dir" property fails when passing an absolute path.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603856 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 08:58:52 +00:00
Doron Cohen 73f9e7ebc0 fix potential thread-safety issue in contrib/benchmark's TrecDocMaker.
(follow-up to http://svn.apache.org/viewvc?view=rev&revision=602475)


git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@603846 13f79535-47bb-0310-9956-ffa450edef68
2007-12-13 07:26:58 +00:00
Michael McCandless 86ca6f86d7 fix intermittent thread-safety failure in contrib/benchmark unit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602475 13f79535-47bb-0310-9956-ffa450edef68
2007-12-08 14:17:07 +00:00
Michael McCandless b0d2b1c90e LUCENE-1044: revert the doSync option to FSDirectory
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602165 13f79535-47bb-0310-9956-ffa450edef68
2007-12-07 17:42:33 +00:00
Grant Ingersoll f9b2e971f2 LUCENE-1077 new sinks and payloads analysis packages
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@602081 13f79535-47bb-0310-9956-ffa450edef68
2007-12-07 12:21:49 +00:00
Michael McCandless 6be2c0765c LUCENE-1044: also re-default doSync back to false in contrib/benchmark
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596874 13f79535-47bb-0310-9956-ffa450edef68
2007-11-20 23:17:44 +00:00
Michael Busch b04703fe8f LUCENE-1055: Remove gdata from trunk.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596501 13f79535-47bb-0310-9956-ffa450edef68
2007-11-20 00:46:27 +00:00
Michael Busch 1abb04580f Disable verbose standard output in MemoryIndexTest by default.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@596000 13f79535-47bb-0310-9956-ffa450edef68
2007-11-17 20:19:17 +00:00
Mark Harwood 04ae927f38 Added toString implementation on BooleanFilter.java, provided by Jason Calabrese
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@595996 13f79535-47bb-0310-9956-ffa450edef68
2007-11-17 20:08:06 +00:00
Michael Busch bb37d2bcff LUCENE-1051: Generate separate javadocs for core, demo and contrib classes, as well as an unified view.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@595013 13f79535-47bb-0310-9956-ffa450edef68
2007-11-14 19:16:19 +00:00
Michael McCandless 439ba586fc LUCENE-1044: add doSync option to FSDirectory.getDirectory, defaulting to true, to sync() each file descriptor before close()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@593799 13f79535-47bb-0310-9956-ffa450edef68
2007-11-10 17:51:00 +00:00
Daniel Naber 2f5507bfc9 fix returning unbalanced quotes in describeParams()
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@586918 13f79535-47bb-0310-9956-ffa450edef68
2007-10-21 17:26:16 +00:00
Grant Ingersoll a614f0d99a Added some more algorithms for testing things out, implemented basic TREC query driver based on the sample in the javadocs.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@585677 13f79535-47bb-0310-9956-ffa450edef68
2007-10-17 20:36:20 +00:00
Grant Ingersoll b7253a06b7 LUCENE-1027: Added better formatting of doubles, added wikipedia-flush-by-RAM for comparison
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583792 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 12:10:31 +00:00
Grant Ingersoll 9c9ebe5cf4 LUCENE-1027: Added support for doubles to Config, also added copies of standard and micro-standard algorithms that flush by RAM
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583771 13f79535-47bb-0310-9956-ffa450edef68
2007-10-11 11:05:40 +00:00
Mark Harwood 21a07ee41e Provided DTDs for core and contrib XML query syntax. The "docs" directory contains detailed documentation generated by DTDdoc from the DTDs. The ant script used to generate these docs is also included but not hooked up to the main build process due to license issues with DTDdoc.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583307 13f79535-47bb-0310-9956-ffa450edef68
2007-10-09 21:45:27 +00:00
Mark Harwood 3872d3bfcc Updated hashcode/equals to test all fields
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@583305 13f79535-47bb-0310-9956-ffa450edef68
2007-10-09 21:40:45 +00:00
Chris M. Hostetter 243861715b cleaning up a ton of javadoc warnings from gdata. most of these fixes related to either: clarifying packages for @link tags; changing @link or @see tags that pointed at classes/methods that didn't exist (by picking classes with very similar names that do exist); or removing incomplete stub javadocs (that added no information beyond the signature
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@582380 13f79535-47bb-0310-9956-ffa450edef68
2007-10-05 20:30:59 +00:00
Mark Harwood 62fa7b4b82 Added new DuplicateFilter functionality to filter documents sharing a field value (e.g. primary key/url)
Also includes Junit test and XML Query support

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@581426 13f79535-47bb-0310-9956-ffa450edef68
2007-10-02 22:56:46 +00:00
Grant Ingersoll dce47c6401 LUCENE-1005, apply GMT timeZone to the data formatter so it outputs properly formatted dates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579766 13f79535-47bb-0310-9956-ffa450edef68
2007-09-26 19:15:26 +00:00
Grant Ingersoll bcfad28d69 LUCENE-1005, apply GMT timeZone to the data formatter so it outputs properly formatted dates
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579765 13f79535-47bb-0310-9956-ffa450edef68
2007-09-26 19:12:59 +00:00
Michael McCandless a28eb4d978 LUCENE-994: change defaults in IndexWriter to maximize 'out of the box' indexing speed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@579360 13f79535-47bb-0310-9956-ffa450edef68
2007-09-25 20:02:07 +00:00
Michael McCandless 511406ecbe remove temporary print for GData unit test
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577892 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 19:41:42 +00:00
Chris M. Hostetter 3f517bff75 don't just write date to stdout, include date string in failure message
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577873 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 18:52:42 +00:00
Michael McCandless fada31fa7f adding temporary print to figure out why this gdata-server test is failing on build machine
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@577861 13f79535-47bb-0310-9956-ffa450edef68
2007-09-20 18:38:23 +00:00
Doron Cohen 40f0adb507 LUCENE-941: (leftover - add info in benchmark/CHANGES.txt entry)
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576790 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:13:15 +00:00
Doron Cohen 9e51c30349 LUCENE-941: benchmark: infinite loop for alg: {[AddDoc(4000)]: 4} : *
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@576786 13f79535-47bb-0310-9956-ffa450edef68
2007-09-18 09:05:06 +00:00
Michael Busch 9c2a036db3 - LUCENE-908: Improvements and simplifications for how the MANIFEST file and the META-INF dir are created.
- LUCENE-935: Various improvements for the maven artifacts. Now the artifacts also include the sources as .jar files. 

git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@568766 13f79535-47bb-0310-9956-ffa450edef68
2007-08-22 23:16:48 +00:00
Grant Ingersoll c67fd79a83 LUCENE-981 and LUCENE-980: Added new AnalyzerTask and fixed issue with long strings in Format.java
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@567262 13f79535-47bb-0310-9956-ffa450edef68
2007-08-18 12:24:21 +00:00
Grant Ingersoll d1f90c7825 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566454 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:49:32 +00:00
Grant Ingersoll 9192b16643 Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566435 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:23:06 +00:00
Grant Ingersoll 477c4e0efe Deprecated all the old benchmarking stuff
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@566433 13f79535-47bb-0310-9956-ffa450edef68
2007-08-16 00:22:46 +00:00