[Feature]: Support for thread number limits in queries and maximum memory footprint metrics for indexing and querying #38013
Comments
We already use a thread pool to limit the number of threads used for execution. Did you find any thread leakage? Can you explain your test environment and how you ran the test?
Here is my code:
I am running Milvus 2.4 Standalone with the server and query client on the same machine (Ubuntu 20 with 128 cores and 2TB RAM). I would like to know how to:
Additionally, the recall rate decreases as the queries progress, even though I use the same query and experimental setup throughout.
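To check whether recall really drifts over time, it can help to compute recall@k per batch of queries against a fixed ground truth. The following is a minimal sketch, not taken from the original code: `recall_at_k` and its arguments are hypothetical names, and it assumes each query returns a list of IDs and that exact nearest-neighbor IDs are available for comparison.

```python
def recall_at_k(retrieved, ground_truth, k):
    """Average recall@k over a batch of queries.

    retrieved:    list of ID lists returned by the search, one per query
    ground_truth: list of exact nearest-neighbor ID lists, one per query
    """
    if not ground_truth or k <= 0:
        return 0.0
    hits = 0
    for got, truth in zip(retrieved, ground_truth):
        # Count how many of the true top-k IDs appear in the returned top-k.
        hits += len(set(got[:k]) & set(truth[:k]))
    return hits / (k * len(ground_truth))


# Hypothetical IDs for two queries with k = 3.
batch_recall = recall_at_k([[1, 2, 3], [4, 5, 6]], [[1, 2, 9], [4, 8, 7]], 3)
```

Logging this value for each consecutive batch of identical queries would show whether the drop is gradual or tied to a specific event such as a load or compaction.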
The reason is that you are using a PQ index. There is no way to separate index memory usage from data-load memory usage unless you use a separate indexnode and querynode.
Milvus limits the number of threads used for queries, so ideally you don't need to care about it. The default thread count scales linearly with the number of cores on your machine.
Thank you for your response. If there is any method to configure the thread number for query execution or other related operations, I would be grateful if you could share that with me. It would be extremely helpful for my experiment.
Is there an existing issue for this?
Is your feature request related to a problem? Please describe.
I would like to request two features that would enhance the control and observability of Milvus 2.4 during query and indexing operations:
Ability to Limit Thread Number in Queries:
Provide a configuration option or API to control the maximum number of threads utilized during query execution.
Expose Maximum Memory Footprint Metrics:
Introduce a way to fetch memory usage metrics during both indexing and querying, like:
Peak memory usage during index construction.
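Until such a metric exists, one workaround on Linux is to read the kernel's peak resident set size (`VmHWM`) for the Milvus process from `/proc/<pid>/status`. The sketch below is an assumption-laden illustration rather than a Milvus feature: the helper names are hypothetical, the PID has to be looked up separately, and in Standalone the value covers the whole process, so index building and data loading cannot be told apart.

```python
import re
from pathlib import Path


def parse_peak_rss_kib(status_text):
    """Extract VmHWM (peak resident set size, in KiB) from /proc/<pid>/status text."""
    match = re.search(r"^VmHWM:\s+(\d+)\s+kB", status_text, re.MULTILINE)
    return int(match.group(1)) if match else None


def peak_rss_kib(pid):
    """Read the peak RSS of a running process (Linux only)."""
    return parse_peak_rss_kib(Path(f"/proc/{pid}/status").read_text())


# Example with a made-up status snippet; real input comes from /proc/<pid>/status.
sample = "Name:\tmilvus\nVmHWM:\t    2048 kB\nVmRSS:\t    1024 kB\n"
sample_peak = parse_peak_rss_kib(sample)
```

Reading the value once after index construction and once after the query run gives two peaks for the same process; because `VmHWM` only grows, the second reading is the maximum over both phases.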
Describe the solution you'd like.
No response
Describe an alternate solution.
No response
Anything else? (Additional Context)
If such features are already available, how do I use them? It seems Milvus Distributed can use monitoring to track its usage, but I would like to estimate its efficiency in the Standalone or Lite version.