Filters: Status (Open)

#15732 PR: Prevent writing vectors twice during merging HNSW graphs
2.7 hours ago  40 comments  0 votes  0 watches  benwtrent, github-actions[bot], iverase, mccullocht, msokolov
This PR is just addressing the following suggestion from Adrien when merging https://github.com/ ... This version only supports un-quantised vectors because for quantised vectors the logic is a bit ... The key aspects of this implementation are: 1. KnnVectorsWriter#mergeOneField now returns an ...
    github-actions[bot] 2.7 hours ago:  This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the dev@lucene.apache.org list. Thank you for your contribution!
    benwtrent 14.4 days ago:  I think this is looking good. But, even if it's internal, it's a large enough change to warrant CHANGES and of course some lucene ...

#15952 PR: Add afterMerge() lifecycle hook to KnnVectorsWriter and PerFieldKnnVe…
8.1 hours ago  12 comments  0 votes  0 watches  MrFlap, benwtrent, github-actions[bot], navneet1v, shatejas
Resolves https://github.com/apache/lucene/issues/15935 Problem: PerFieldKnnVectorsFormat ... This makes it impossible to cleanly release per-merge resources, leading to resource leaks. Solution: Add a protected void afterMerge() throws IOException method with a no-op default to both ...
    benwtrent 8.1 hours ago:  I am saying, that any expert extending PerFieldKnnVectorsFormat can likely do this in another way ... I am just wondering if this is the best way for Lucene.
    MrFlap 9.1 hours ago:  > On an exception with merge, all files, etc. should be closed, do we want to move finish into the ... There may be edge cases, for sure... I'm not sure I can give a completely educated call on this, but it seems like the right thing to do ...

#15950 PR: Add JMH benchmark comparing various sort algorithms specifically for sorting ScoreDoc[] of varying ...
12.1 hours ago  13 comments  0 votes  0 watches  gsmiller, mikemccand, zihanx
[Spinoff from @zihanx's recent PR (15803) adding ReaderUtil.partitionByLeaf helper to collate any ... TL;DR: see this cool benchmark results UI testing various algorithms to sort ScoreDoc[] (smaller is ... The hits from a Lucene query (ScoreDoc[]) come out sorted by something important to the user (e.g. ...
    mikemccand 12.1 hours ago:  Ooh good catch, you're right -- Lucene hits would never duplicate docid.
    mikemccand 12.1 hours ago:  Good question -- genai did this. Oh and it picked that snarky seed too, sheesh.

#15958 PR: Bulk score hnsw neighbor array
13.1 hours ago  0 comments  0 votes  0 watches  github-actions[bot], leng25
Issue 15606. Refactor NeighborArray#isWorstNonDiverse to use bulkScore instead of score, enabling ... This change follows the approach taken in 15607, though unlike that case, this function is called ... The primary motivation is consistency: aligning both call sites to use bulkScore ensures this code ...

#15961 PR: Remove implicit determinization from WildcardQuery
13.7 hours ago  17 comments  0 votes  0 watches  drempapis, github-actions[bot], rmuir, romseygeek
This change continues the determinization cleanup started for regexp queries (15939) by applying ... Previously, wildcard automata were implicitly determinized up front. After removing that implicit determinization, some code paths (especially query visiting/ ...
    rmuir 13.7 hours ago:  I'm a fan of this change: I feel like the hashcode/equals is the way it should have always worked! ... I'll give it a few days for more feedback. Thank you for doing this work @drempapis
    rmuir 16.7 hours ago:  It's good. I like how the new hashcode/equals are defined... as "same automaton". There were historical problems here around hashcode/equals being defined as "accepting same ...

#14758: Support multiple HNSW graphs backed by the same vectors
15.8 hours ago  30 comments  0 votes  0 watches  benwtrent, dungba88, github-project-automation[bot], iverase, jpountz, kaivalnp, mccullocht, mikemccand, msokolov, navneet1v, shubhamvishu
Description: For use cases of searching different subsets of vectors in the index, where a ... This could be done today by: 1. Indexing all vectors in a single field and using query-time pre-filtering, but it can become ...
    iverase 17.7 hours ago:  I think most of the information is there, the only requirement is that the field needs to have doc ... you can access doc value information during flush using the method IndexChain#getDocValuesLeafReader ... During merge you can always access the doc values from the old segments via the DocValuesConsumer ...
    msokolov 18.6 hours ago:  I kind of like the "reference another field" idea -- from a high level it makes sense, and makes it ... Implementation-wise there are some complexities -- do you have to merge the fields in a particular ... Does a merger of one field require a reader for another field and so on ... but I don't think ...

#15929 PR: Rename CollectionStatistics to FieldStats and TermStatistics to TermStats.
1.1 days ago  4 comments  0 votes  0 watches  github-actions[bot], jainankitk, rmuir, vsop-479
    vsop-479 2 days ago:  > One thing to consider, we also have NumericFieldStats class. Maybe we want to think about more consistency and shorter names? e.g.
    rmuir 2.4 days ago:  I don't remember the reasoning behind the name, but I think FieldStatistics is better. One thing to consider, we also have NumericFieldStats class. Maybe we want to think about more consistency and shorter names?
