-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: GPU_CAGRA omm and slower as GPU_IVF_PQ #38101
Comments
#This is my python code: ` if name == 'main':
` |
/assign @Presburger |
@yongxin3344520 Hello, you don't need to modify the default |
Is there an existing issue for this?
Environment
Current Behavior
I used 5000 to query 6000000, with dimensions of 256 and topk=100.
use GPU_CAGRA index, and omm , and slower as GPU_IVF_PQ
The configuration file milvus.yaml is not effective
Setting gpu'maxMemSize is 20480 , but use 22144MiB at least
Expected Behavior
Traceback (most recent call last):
File "/soft/test_gpu2.py", line 115, in
results = collection.search(data=[
File "/root/miniconda3/lib/python3.9/site-packages/pymilvus/orm/collection.py", line 801, in search
resp = conn.search(
File "/root/miniconda3/lib/python3.9/site-packages/pymilvus/decorators.py", line 141, in handler
raise e from e
File "/root/miniconda3/lib/python3.9/site-packages/pymilvus/decorators.py", line 137, in handler
return func(*args, **kwargs)
File "/root/miniconda3/lib/python3.9/site-packages/pymilvus/decorators.py", line 176, in handler
return func(self, *args, **kwargs)
File "/root/miniconda3/lib/python3.9/site-packages/pymilvus/decorators.py", line 116, in handler
raise e from e
File "/root/miniconda3/lib/python3.9/site-packages/pymilvus/decorators.py", line 86, in handler
return func(*args, **kwargs)
File "/root/miniconda3/lib/python3.9/site-packages/pymilvus/client/grpc_handler.py", line 805, in search
return self._execute_search(request, timeout, round_decimal=round_decimal, **kwargs)
File "/root/miniconda3/lib/python3.9/site-packages/pymilvus/client/grpc_handler.py", line 746, in _execute_search
raise e from e
File "/root/miniconda3/lib/python3.9/site-packages/pymilvus/client/grpc_handler.py", line 735, in _execute_search
check_status(response.status)
File "/root/miniconda3/lib/python3.9/site-packages/pymilvus/client/utils.py", line 63, in check_status
raise MilvusException(status.code, status.reason, status.error_code)
pymilvus.exceptions.MilvusException: <MilvusException: (code=65535, message=fail to search on QueryNode 3: worker(3) query failed: Operator::GetOutput faide id: 197] : => failed to search: config={{"itopk_size":128,"k":100,"metric_type":"IP","span_id":"8b80dac94bbe6d43","trace_flags":0,"trace_id":"7edcf859 raft inner error: std::bad_alloc: out_of_memory: RMM failure at:/workspace/source/cmake_build/3rdparty_download/rmm-src/include/rmm/mr/device/pool_memorypool size exceeded at /workspace/source/internal/core/src/index/VectorMemIndex.cpp:411
Steps To Reproduce
Milvus Log
..............
Anything else?
......
The text was updated successfully, but these errors were encountered: