Commit Graph

51 Commits

Author SHA1 Message Date
Tomoko Uchida 40038bcc92 LUCENE-10024: remove non-existing path from history file 2021-07-17 14:30:27 +09:00
Michael Wechner 489ba3e4f9 LUCENE-10024: Catch NoSuchFileException when opening index directory 2021-07-17 13:43:12 +09:00
Dawid Weiss 3bedc0871e
LUCENE-9977: rat task corrections (proper up-to-date checks, cleanup and rewrite of the task itself). (#178) 2021-06-11 09:26:34 +02:00
Tomoko Uchida 2160d7239d Revert "LUCENE-9448: clean up unused start scripts for luke."
This reverts commit 16104090fb.
2021-05-27 19:22:29 +09:00
Tomoko Uchida 16104090fb LUCENE-9448: clean up unused start scripts for luke. 2021-05-26 23:32:52 +09:00
Michael Wechner a9522c7179
LUCENE-9954: README for Luke (#135) 2021-05-13 00:53:53 +09:00
Ignacio Vera b84e0c272b
LUCENE-9047: Directory API is now little endian 2021-05-03 07:49:56 +02:00
Mayya Sharipova d03662c48b
LUCENE-9334 Consistency of field data structures
Require consistency between data-structures on a per-field basis

A field must be indexed with the same index options and data-structures across
all documents. Thus, for example, it is not allowed to have one document
where a certain field is indexed with doc values and points, and another document 
where the same field is indexed only with points. 
But it is allowed for a document not to have a certain field at all.

As a consequence of this, doc values updates are
only applicable for fields that are indexed with doc values only.
2021-04-14 15:00:41 -04:00
Robert Muir 2971f311a2
LUCENE-9911: enable ecjLint unusedExceptionParameter (#70)
Fails the linter if an exception is swallowed (e.g. variable completely
unused).

If this is intentional for some reason, the exception can simply by
annotated with @SuppressWarnings("unused").
2021-04-08 08:19:01 -04:00
zacharymorn 79fcd99f4c
LUCENE-9883: Turn on ecj missingEnumCaseDespiteDefault setting (#56) 2021-03-31 15:50:52 +09:00
zacharymorn 3ed87c867a
LUCENE-9864: Enforce @Override annotation everywhere (#40)
Requiring the annotation is helpful because if an abstract method is removed, the concrete methods will then show up as compile errors: preventing dead code from being accidentally left behind.

Co-authored-by: Robert Muir <rmuir@apache.org>
2021-03-25 17:50:38 -04:00
Robert Muir 945b1cb872
LUCENE-9856: fail precommit on unused local variables, take two (#37)
Enable ecj unused local variable, private instance and method detection. Allow SuppressWarnings("unused") to disable unused checks (e.g. for generated code or very special tests). Fix gradlew regenerate for python 3.9 SuppressWarnings("unused") for generated javacc and jflex code. Enable a few other easy ecj checks such as Deprecated annotation, hashcode/equals, equals across different types.

Co-authored-by: Mike McCandless <mikemccand@apache.org>
2021-03-23 13:59:00 -04:00
Robert Muir e6c4956cf6
Revert "LUCENE-9856: fail precommit on unused local variables (#34)"
This reverts commit 20dba278bb.
2021-03-23 12:46:36 -04:00
Robert Muir 20dba278bb
LUCENE-9856: fail precommit on unused local variables (#34)
Enable ecj unused local variable, private instance and method detection. Allow SuppressWarnings("unused") to disable unused checks (e.g. for generated code or very special tests). Fix gradlew regenerate for python 3.9 SuppressWarnings("unused") for generated javacc and jflex code. Enable a few other easy ecj checks such as Deprecated annotation, hashcode/equals, equals across different types.

Co-authored-by: Mike McCandless <mikemccand@apache.org>
2021-03-23 11:09:24 -04:00
Robert Muir f3a284ad83
LUCENE-9796: Fix SortedDocValues to no longer extend BinaryDocValues
SortedDocValues do not have a per-document binary value, they have a
per-document numeric `ordValue()`. The ordinal can then be dereferenced
to its binary form with `lookupOrd()`, but it was a performance trap to
implement a `binaryValue()` on the SortedDocValues api that does this
behind-the-scenes on every document.

You can replace calls of `binaryValue()` with `lookupOrd(ordValue())`
as a "quick fix", but it is better to use the ordinal alone
(integer-based datastructures) for per-document access, and only call
lookupOrd() a few times at the end (e.g. for the hits you want to display).
Otherwise, if you really don't want per-document ordinals, but instead a
per-document `byte[]`, use a BinaryDocValues field.

This change only addresses the API (slow `binaryValue()` trap), but
doesn't yet fix any slow algorithms that were discovered in the process,
so it doesn't yield any performance improvements.
2021-03-14 23:07:48 -04:00
Tomoko Uchida 471f38c031 LUCENE-9834: goodbye old friend - the classic luke logo 2021-03-12 23:17:32 +09:00
Tomoko Uchida 7478b3fc17 LUCENE-9834: Adjast logo/colors in the Luke About dialog 2021-03-12 11:00:10 +09:00
Robert Muir 2892ef4ca0
LUCENE-9802: switch to new logo (#9)
Replace logo used in generated documentation
Replace logo used by luke (about box)
Add logo to README.md
2021-03-10 15:28:52 -05:00
Marcus 4bc5d51494
LUCENE-8626: Lucene standardize test naming part 3 and final (#2220) 2021-01-22 12:38:52 -05:00
Patrick Marty 227256d951
LUCENE-9646: Set BM25Similarity discountOverlaps via the constructor 2021-01-19 09:49:57 +01:00
Dawid Weiss 2cbf261032 LUCENE-9570: code reformatting [final]. 2021-01-05 13:44:05 +01:00
Mike Drob 08e38d3452
LUCENE-9488 Create Release Artifacts with Gradle (#1905)
* Build Lucene binary distribution using Gradle
* Generate SHA-512 checksums for all release artifacts
* Update documentation artifacts included in binaries
* Delete some additional Ant relics

Co-authored-by: Dawid Weiss <dawid.weiss@carrotsearch.com>
Co-authored-by: Uwe Schindler <uschindler@apache.org>
2020-10-08 14:25:51 -05:00
Dawid Weiss 65a62b04c5 Remove unused imports. 2020-09-29 10:24:17 +02:00
Tomoko Uchida 5e617ccc33
LUCENE-9317: Clean up split package in analyzers-common (#1836) 2020-09-28 16:49:28 +09:00
Adrien Grand 27aa5c5f59
LUCENE-9486: Use preset dictionaries with LZ4 for BEST_SPEED. (#1793) 2020-09-03 12:17:04 +02:00
Erick Erickson 69fa5a00fb LUCENE-9433: Remove Ant support from trunk 2020-08-28 09:31:16 -04:00
Simon Willnauer 4787042f3e LUCENE-9447: suppress DeflateWithPresetCompressingStoredFieldsData since it doesn't add any attributes 2020-08-28 10:52:01 +02:00
Erick Erickson c9c75810c2 Revert "LUCENE-9433: Remove Ant support from trunk"
This reverts commit 37cd17dc
2020-08-21 16:57:58 -04:00
Erick Erickson 37cd17dcf5 LUCENE-9433: Remove Ant support from trunk 2020-08-21 15:19:52 -04:00
Tomoko Uchida bbd21aa422 LUCENE-9448: Move README.txt to README.md; We no lonnger have txt format README on the master. 2020-08-21 10:20:37 +09:00
Dawid Weiss 3579056249
Standalone distribution assembly and 'run' task for Luke (#1742)
Co-authored-by: Tomoko Uchida <tomoko.uchida.1111@gmail.com>
2020-08-12 16:28:48 +02:00
Erick Erickson ed025741d7 LUCENE-9389: Enhance logging messages in Lucene's Luke module 2020-06-25 22:44:10 -04:00
Uwe Schindler 06df50e759
LUCENE-9321: Port markdown task to Gradle (#1477) 2020-05-17 14:46:26 +02:00
Erick Erickson 9ae05e9b4f LUCENE-7788: fail precommit on unparameterised log messages and examine for wasted work/objects 2020-04-30 19:50:31 -04:00
Simon Willnauer 113043b1ed
LUCENE-9324: Add an ID to SegmentCommitInfo (#1434)
We already have IDs in SegmentInfo, as well as on SegmentInfos which are useful to uniquely identify segments and entire commits. Having IDs on SegmentCommitInfo is be useful too in
order to compare commits for equality and make snapshots incremental on generational files.
This change adds a unique ID to SegmentCommitInfo starting from Lucene 8.6. Older segments won't have an ID until the segment receives an update or a delete even if they have been opened and / or committed by Lucene 8.6 or above.
2020-04-18 14:24:57 +02:00
Nicholas Knize 206a70e7b7 LUCENE-9149: Increase data dimension limit in BKD 2020-02-07 16:08:14 -06:00
Adrien Grand fdf5ade727 LUCENE-9147: Fix codec excludes. 2020-02-06 10:34:03 +01:00
Robert Muir 975df9ddd3
LUCENE-9182: add apache license headers to all .gradle files and enforce in rat task 2020-01-27 12:05:34 -05:00
Dawid Weiss d4a9842375 Initial gradle build layer. 2019-12-02 15:34:57 +01:00
Tomoko Uchida 42ff080c84 LUCENE-8998: Fix OverviewImplTest.testIsOptimized reproducible failure 2019-10-05 17:15:10 +09:00
Amish Shah 369df12c2c LUCENE:8945: Allow to change the output file delimiter on Luke "export terms" feature
Signed-off-by: Tomoko Uchida <tomoko@apache.org>
2019-09-18 19:53:48 +09:00
Leonardo Menezes ff7b0c9de5 LUCENE-8764: Add "export all terms" feature to Luke
Co-authored-by: Tomoko Uchida <tomoko@apache.org>
2019-08-03 18:20:26 +09:00
Tomoko Uchida 299bfe55b3 LUCENE-8874: Show SPI names instead of class names in Luke Analysis tab. 2019-06-30 13:27:24 +09:00
Tomoko Uchida 8e81f47ca6 LUCENE-8793: Luke enhanced UI for CustomAnalyzer: show detailed analysis steps.
Co-authored-by: Jun Ohtani
Co-authored-by: Tomoko Uchida
2019-06-22 16:22:26 +09:00
Tomoko Uchida 62ed046e52 LUCENE-2562: Preserve map key order for the user interface. 2019-04-22 00:36:41 +09:00
Tomoko Uchida 511efc89fe LUCENE-2562: Use Map#copyOf(), List#copyOf() and Set#copyOf() instead of Collections#unmodifiableMap(), unmodifiableList() and unmodifiableSet() 2019-04-21 18:18:23 +09:00
Tomoko Uchida 7ba8bdca8f LUCENE-2562: Suppress SimpleText codec for CommitsImplTest. 2019-04-19 11:07:01 +09:00
Adrien Grand 793635eb0e LUCENE-2562: Don't randomly enable term vectors on fields. 2019-04-18 09:27:09 +02:00
Uwe Schindler faaee86efb LUCENE-8738: Move to Java 11 as minimum Java version (merged branch: jira/LUCENE-8738)
Co-authored-by: Adrien Grand <jpountz@apache.org>
2019-04-16 14:00:09 +02:00
Tomoko Uchida f85c08224b LUCENE-2562: Luke has no Maven artifacts 2019-04-13 01:23:46 +09:00