Filters: Status (Open)

#15803 PR: Add ReaderUtil#partitionByLeaf to partition sorted global doc IDs by …
1.9 hours ago  34 comments  0 votes  0 watches  epotyom, github-actions[bot], gsmiller, mikemccand, sgup432, zihanx
…leaf reader Summary This PR adds a new utility method ReaderUtil.partitionByLeaf(int[] ... Motivation A common pattern when working with top-N hits is to map global doc IDs back to their ... The existing ReaderUtil.subIndex does a binary search per doc ID (O(n * log(n))), but when you need ...
    gsmiller 1.9 hours ago:  Ah, Mike had some great suggestions! Thanks for iterating. I left a few comments on things that jumped out to me on this revision.
    gsmiller 1.9 hours ago:  I don't think we should explicitly throw inside this helper method. Let's simply return false here and let the calling assert handle throwing the exception?
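
The idea described in #15803 (grouping already-sorted global doc IDs by leaf in one pass, rather than running a binary search per doc ID) can be sketched as below. This is a minimal illustration, not the PR's code: the class name `LeafPartitionSketch`, the `partition` method, and its parameters are hypothetical, and it works on plain arrays of leaf doc bases instead of Lucene's reader types. It assumes the doc IDs are sorted ascending and all fall in `[0, maxDoc)`.

```java
import java.util.ArrayList;
import java.util.List;

public class LeafPartitionSketch {
  /**
   * Groups sorted global doc IDs by leaf in a single linear pass.
   * docBases[i] is the first global doc ID of leaf i (hypothetical input,
   * standing in for each leaf's docBase); maxDoc is the total doc count.
   * Returns one int[] per leaf holding leaf-local IDs (global ID minus docBase).
   */
  static List<int[]> partition(int[] sortedDocIds, int[] docBases, int maxDoc) {
    List<int[]> result = new ArrayList<>();
    int pos = 0; // cursor into sortedDocIds; only ever moves forward
    for (int leaf = 0; leaf < docBases.length; leaf++) {
      int base = docBases[leaf];
      // Exclusive upper bound of this leaf's global doc ID range.
      int end = (leaf + 1 < docBases.length) ? docBases[leaf + 1] : maxDoc;
      int start = pos;
      while (pos < sortedDocIds.length && sortedDocIds[pos] < end) {
        pos++;
      }
      int[] local = new int[pos - start];
      for (int i = start; i < pos; i++) {
        local[i - start] = sortedDocIds[i] - base;
      }
      result.add(local);
    }
    return result;
  }
}
```

Because the cursor never moves backward, the work is proportional to the number of doc IDs plus the number of leaves.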

#15832 PR: Vectorize FixedBitSet.cardinality() via VectorizationProvider (Java 25 MRJAR)
3.2 hours ago  30 comments  0 votes  0 watches  github-actions[bot], iprithv, rmuir, uschindler
Summary This change adds SIMD acceleration for FixedBitSet.cardinality() using the Java Vector ... Java 25 SIMD implementation: PanamaBitSetUtilSupport in lucene/core/src/java25/, using ...
    iprithv 3.2 hours ago:  For the factory-level check, I can move the capability check into PanamaVectorizationProvider so it ...
    iprithv 3.2 hours ago:  Benchmarked this on ARM (which has native NEON vector popcount). The scalar loop (cardinalityScalar, no --add-modules) runs at 1.908 ops/µs at 65K bits. The Panama path runs at 4.167 ops/µs (+118%).
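
For context on what #15832 accelerates: the cardinality of a bit set is the total number of set bits across its backing words. A scalar baseline might look like the sketch below. This is an assumption about the general shape of such a loop, not the PR's `cardinalityScalar` or Lucene's `FixedBitSet` code; the class name `ScalarCardinalitySketch` is hypothetical.

```java
public class ScalarCardinalitySketch {
  /** Counts set bits across all 64-bit words, one Long.bitCount call per word. */
  static int cardinality(long[] words) {
    int count = 0;
    for (long w : words) {
      count += Long.bitCount(w);
    }
    return count;
  }
}
```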

#15732 PR: Prevent writing vectors twice during merging HNSW graphs
5.1 hours ago  35 comments  0 votes  0 watches  benwtrent, github-actions[bot], iverase, mccullocht, msokolov
This PR is just addressing the following suggestion from Adrien when merging https://github.com/ ... This version only supports un-quantised vectors because for quantised vectors the logic is a bit ... The key aspects of this implementation are: 1. KnnVectorsWriter#mergeOneField now returns an ...
    iverase 5.1 hours ago:  done in https://github.com/apache/lucene/pull/15732/commits/0f061e3103dafbc420a23e1271fa7409625b9920
    benwtrent 5.7 hours ago:  can we simplify the predicate branching and simply exit early like this if fi.scalarEncoding.isAsymm ...

#15835 PR: Fix method declared 'final' in 'final' class.
10.5 hours ago  0 comments  0 votes  0 watches  github-actions[bot], vsop-479

#15834 PR: Simplify TermCounts#sortedTermCounts in luke module
11.2 hours ago  0 comments  0 votes  0 watches  gaobinlong, github-actions[bot]
Description When addressing issues in the luke module, found that TermCounts#sortedTermCounts ...

#15818 PR: Add BM25 k3 query-term frequency saturation to classic query parser
21.6 hours ago  3 comments  0 votes  0 watches  github-actions[bot], msfroh, sgup432
Description Related issue - https://github.com/apache/lucene/issues/15768 Adds BM25 k3 ... Duplicate terms in a query string now get their boost computed as ((k3+1)*qtf)/(k3+qtf) instead of ... Defaults to -1 (disabled) — fully backward compatible.
    sgup432 21.6 hours ago:  >What if your saturation depends on the total number of query terms (or the total number of ... I have no idea what parameters are being used in the IR literature (other than query-term frequency ... Sadly I am not an expert to give an opinion on this.
    sgup432 21.9 hours ago:  >In particular, right now we should have everything related to BM25 captured in the BM25Similarity ... Hmm, yeah this does make sense. As with the current implementation, k3 logic sits inside query parser, so if someone uses non-BM25 ...
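
The saturation formula quoted in #15818, ((k3+1)*qtf)/(k3+qtf), can be tried out with the small sketch below. The class and method names are hypothetical, and the sketch covers only the enabled case (k3 >= 0); how the parser behaves when k3 is left at its default of -1 is not reproduced here.

```java
public class K3SaturationSketch {
  /**
   * Query-term frequency saturation: ((k3 + 1) * qtf) / (k3 + qtf).
   * qtf is how many times the term occurs in the query (must be >= 1).
   */
  static float saturatedBoost(int qtf, float k3) {
    if (k3 < 0 || qtf < 1) {
      throw new IllegalArgumentException("sketch covers k3 >= 0 and qtf >= 1 only");
    }
    return ((k3 + 1f) * qtf) / (k3 + qtf);
  }
}
```

A term that occurs once always gets a boost of 1, and the boost approaches k3 + 1 as qtf grows, so repeating a term in the query has diminishing effect.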

#15825 PR: Fix CJKBigramFilter inconsistent positions with outputUnigrams disabled
23.5 hours ago  7 comments  0 votes  0 watches  amatoba, github-actions[bot], herley-shaori, rmuir
Summary Fixes #15812 CJKBigramFilter produces different token positions for the same input ... This breaks phrase queries when index-time and search-time analyzers use different outputUnigrams ... Root cause In flushBigram(), when outputUnigrams=false, bigrams are emitted with the default ...
    amatoba 23.5 hours ago:  How does this impact searching of existing indices that already use this filter with outputUnigrams ... Would this cause match failures that did not exist before?
    herley-shaori 2.6 days ago:  Thanks for the review! Applied your suggestion and extended the same reasoning to the other guards: ... outputUnigrams && (your suggestion) - flushBigram(): added if (deferredPosInc > 0) ...
