mirror of https://github.com/apache/lucene.git
b4a8810b7a
This updates the postings format to inline skip data into postings. The new format is broadly similar to the current `Lucene99PostingsFormat`, e.g. it shares the same block encoding logic, but it has a few differences: - Skip data is inlined into postings to make the access pattern more sequential. - There are only 2 levels of skip data: one entry per block (128 docs) and one entry per 32 blocks (4,096 docs). In general, I found that inlining skip data may slightly slow down queries that don't need skip data at all (e.g. `CountOrXXX` tasks, which never advance or consult impacts) and slightly speed up queries that advance by small intervals. Because the highest level only allows skipping 4,096 docs at once, we're slower at advancing by large intervals, but the data suggests that this doesn't significantly hurt performance. |
||
---|---|---|
datasets | ||
documentation | ||
generation | ||
hacks | ||
ide | ||
java | ||
maven | ||
pylucene | ||
testing | ||
validation | ||
wrapper | ||
conventions.gradle | ||
ge.gradle | ||
globals.gradle | ||
help.gradle | ||
template.gradle.properties |
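
The two skip levels described in the commit message (one entry per 128-doc block, one entry per 32 blocks) can be illustrated with a toy in-memory model. This is only a sketch under stated assumptions: the class `TwoLevelSkipSketch`, its fields, and its `advance` method are hypothetical names made up for illustration, the skip entries are kept in separate arrays rather than inlined, and nothing here reflects the actual on-disk encoding or Lucene's API.

```java
/** Toy model of two-level skipping over sorted doc IDs (hypothetical, not Lucene code). */
final class TwoLevelSkipSketch {
    static final int BLOCK_SIZE = 128;            // docs per block, per the commit message
    static final int LEVEL1_NUM_BLOCKS = 32;      // blocks per level-1 entry (32 * 128 = 4,096 docs)
    static final int NO_MORE_DOCS = Integer.MAX_VALUE; // sentinel chosen for this sketch

    private final int[] docs;          // sorted doc IDs
    private final int[] level0LastDoc; // last doc ID of each block
    private final int[] level1LastDoc; // last doc ID of each group of 32 blocks

    TwoLevelSkipSketch(int[] sortedDocs) {
        docs = sortedDocs;
        int numBlocks = (docs.length + BLOCK_SIZE - 1) / BLOCK_SIZE;
        level0LastDoc = new int[numBlocks];
        for (int b = 0; b < numBlocks; b++) {
            int end = Math.min(docs.length, (b + 1) * BLOCK_SIZE);
            level0LastDoc[b] = docs[end - 1];
        }
        int numGroups = (numBlocks + LEVEL1_NUM_BLOCKS - 1) / LEVEL1_NUM_BLOCKS;
        level1LastDoc = new int[numGroups];
        for (int g = 0; g < numGroups; g++) {
            int lastBlock = Math.min(numBlocks, (g + 1) * LEVEL1_NUM_BLOCKS) - 1;
            level1LastDoc[g] = level0LastDoc[lastBlock];
        }
    }

    /** Returns the first doc ID greater than or equal to target, or NO_MORE_DOCS. */
    int advance(int target) {
        // Level 1: skip whole groups of 32 blocks (up to 4,096 docs per step).
        int group = 0;
        while (group < level1LastDoc.length && level1LastDoc[group] < target) {
            group++;
        }
        if (group == level1LastDoc.length) {
            return NO_MORE_DOCS;
        }
        // Level 0: skip single blocks (128 docs per step) inside that group.
        // The group's last doc is >= target, so this loop stops within the group.
        int block = group * LEVEL1_NUM_BLOCKS;
        while (level0LastDoc[block] < target) {
            block++;
        }
        // Finally, scan the one block that must contain the answer.
        int start = block * BLOCK_SIZE;
        int end = Math.min(docs.length, start + BLOCK_SIZE);
        for (int i = start; i < end; i++) {
            if (docs[i] >= target) {
                return docs[i];
            }
        }
        return NO_MORE_DOCS; // not reached for sorted input
    }

    public static void main(String[] args) {
        int[] docs = new int[10_000];
        for (int i = 0; i < docs.length; i++) {
            docs[i] = 3 * i; // every third doc ID matches
        }
        TwoLevelSkipSketch postings = new TwoLevelSkipSketch(docs);
        System.out.println(postings.advance(12_290)); // prints 12291
        System.out.println(postings.advance(29_998)); // prints 2147483647 (NO_MORE_DOCS)
    }
}
```

In this model, a long jump costs one comparison per 4,096 docs skipped, which matches the trade-off noted in the commit message: with no level above that, very large advances take more steps than they would with additional skip levels.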