[WIP] sparse: use staged cursor and upper_bound for WAND #970
base: main
Conversation
[APPROVALNOTIFIER] This PR is NOT APPROVED. This pull request has been approved by: sparknack. The full list of commands accepted by this bot can be found here. Needs approval from an approver in each of these files; approvers can indicate their approval by writing an approval comment.
Welcome @sparknack! It looks like this is your first PR to zilliztech/knowhere 🎉

@sparknack 🔍 Important: PR Classification Needed! For efficient project management and a seamless review process, it's essential to classify your PR correctly. Here's how:
For any PR outside the kind/improvement category, ensure you link to the associated issue using the format: “issue: #”. Thanks for your efforts and contribution to the community!

/kind improvement

issue: #967
Signed-off-by: Shawn Wang <[email protected]>

sparknack force-pushed the branch from 9ddd7a7 to f0d8927
/hold
Codecov Report: All modified and coverable lines are covered by tests ✅

Additional details and impacted files:

@@          Coverage Diff           @@
##            main     #970      +/- ##
=========================================
+ Coverage       0   74.39%   +74.39%
=========================================
  Files          0       82       +82
  Lines          0     6690     +6690
=========================================
+ Hits           0     4977     +4977
- Misses         0     1713     +1713
Use a staged cursor and `upper_bound` to avoid recalculating the score from cursor 0 on every loop iteration, and rename some variables for readability.

Simple benchmark results (time to search all queries once over the full dataset) with drop ratio build = 0.32 and drop ratio search = 0.6:

This optimization appears to improve performance on learned sparse embeddings such as SPLADE, which are denser, but causes a slight performance degradation on sparser embeddings.

Holding review of this PR until further investigation.
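For readers unfamiliar with the algorithm being optimized: WAND prunes documents by maintaining one cursor per query term, sorting cursors by their current doc id, and accumulating per-term score upper bounds to pick a "pivot" document that could still beat the current top-k threshold. Below is a minimal, self-contained Python sketch of this generic WAND idea over toy posting lists. It is NOT Knowhere's implementation (which is C++ and, per this PR, additionally caches staged cursor state and partial upper-bound sums so it need not rescan from cursor 0); all names and data here are made up for illustration.

```python
import heapq

def wand_search(query, index, max_weight, k=3):
    """Illustrative WAND top-k search.

    query:      {term: query weight}
    index:      {term: sorted list of (doc_id, posting weight)}
    max_weight: {term: max posting weight in that list} -- the upper bound
    """
    # One cursor per query term present in the index: [position, term].
    cursors = [[0, t] for t in query if index.get(t)]
    topk = []        # min-heap of (score, doc_id)
    threshold = 0.0  # score a doc must beat to enter the top-k

    while cursors:
        # Order cursors by the doc id they currently point at.
        cursors.sort(key=lambda c: index[c[1]][c[0]][0])

        # Pivot: first cursor where the accumulated upper bound
        # exceeds the threshold; earlier docs cannot qualify.
        ub, pivot = 0.0, None
        for i, (pos, term) in enumerate(cursors):
            ub += query[term] * max_weight[term]
            if ub > threshold:
                pivot = i
                break
        if pivot is None:
            break  # no remaining document can beat the threshold

        pivot_doc = index[cursors[pivot][1]][cursors[pivot][0]][0]

        if index[cursors[0][1]][cursors[0][0]][0] == pivot_doc:
            # All cursors up to the pivot sit on pivot_doc: score it fully.
            score = 0.0
            for c in cursors:
                pos, term = c
                doc, w = index[term][pos]
                if doc == pivot_doc:
                    score += query[term] * w
                    c[0] += 1  # advance past the scored document
            if len(topk) < k:
                heapq.heappush(topk, (score, pivot_doc))
            elif score > topk[0][0]:
                heapq.heapreplace(topk, (score, pivot_doc))
            if len(topk) == k:
                threshold = topk[0][0]
        else:
            # Advance the lagging cursor up to the pivot document.
            c = cursors[0]
            pos, term = c
            while pos < len(index[term]) and index[term][pos][0] < pivot_doc:
                pos += 1
            c[0] = pos

        # Drop exhausted cursors.
        cursors = [c for c in cursors if c[0] < len(index[c[1]])]

    return sorted(topk, key=lambda x: -x[0])

# Toy data: three terms, three documents.
index = {
    'a': [(1, 1.0), (3, 2.0)],
    'b': [(1, 1.0), (2, 3.0)],
    'c': [(2, 1.0), (3, 1.0)],
}
query = {'a': 1.0, 'b': 1.0, 'c': 1.0}
max_weight = {'a': 2.0, 'b': 3.0, 'c': 1.0}

results = wand_search(query, index, max_weight, k=2)
# → [(4.0, 2), (3.0, 3)]
```

The per-iteration cursor sort and upper-bound accumulation shown here is exactly the work the PR amortizes: by keeping cursors staged and tracking prefix upper-bound sums, the real implementation avoids redoing both from scratch each loop.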