[Bug]: querynode often restart when loading diskann index to local #35038
Comments
You cannot use NFS for a DiskANN index. DiskANN requires at least 100K IOPS, and NFS is usually not performant enough. You need a local NVMe SSD for caching.
Can you review the error log and explain the relationship between the error and NFS in more detail?
You don't have a directory specified.
/assign @wangqia0309
Does this mean that the Milvus querynode does not actually support configuring the local storage directory as a mounted shared NFS directory?
Shared NFS is just too slow for Milvus.
You can configure it as local storage, but with NFS/NAS we usually see very long latency, or errors reported when there are I/O issues.
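To check whether the directory a querynode uses for local storage actually sits on a network filesystem, one could inspect the mount table. The following is a minimal sketch, assuming the Linux `/proc/mounts` format; the function names `fs_type_for` and `is_network_fs`, the `NETWORK_FS` set, and the example path are hypothetical and not part of Milvus.

```python
import os

# Filesystem types treated as "network" here; this list is an assumption
# and is not exhaustive.
NETWORK_FS = {"nfs", "nfs4", "cifs", "smb3", "ceph", "glusterfs"}


def fs_type_for(path, mounts_text):
    """Return (mount_point, fs_type) for the deepest mount containing path.

    mounts_text is the content of /proc/mounts, where each line reads
    "device mount_point fs_type options dump pass". Escaped characters in
    mount points (for example \\040 for a space) are not decoded here.
    """
    path = os.path.normpath(path)
    best = ("", "")
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue
        mount_point = os.path.normpath(fields[1])
        fs_type = fields[2]
        contains = (
            mount_point == "/"
            or path == mount_point
            or path.startswith(mount_point + os.sep)
        )
        # ">=" lets a later entry for the same mount point win, since a later
        # mount shadows an earlier one.
        if contains and len(mount_point) >= len(best[0]):
            best = (mount_point, fs_type)
    return best


def is_network_fs(path, mounts_text):
    """True if path resolves to a mount whose type is listed in NETWORK_FS."""
    return fs_type_for(path, mounts_text)[1] in NETWORK_FS
```

On a real host, `mounts_text` would come from `open("/proc/mounts").read()`, and `path` would be whatever directory the querynode is configured to use for local storage (for example a hypothetical `/var/lib/milvus/data`).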
@xiaofan-luan where does the 100K IOPS requirement come from? Can you point me to resources on how Milvus makes use of NVMe SSDs? I am exploring Milvus's use of and optimizations for NVMe drives (mainly, what optimizations has Milvus made for storage?).
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
We need SSDs mainly on the querynode (if you use mmap or DiskANN), and also for etcd.
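For a rough sense of how a storage path compares with the IOPS figure mentioned in this thread, a small random-read probe can help. The sketch below is only illustrative: `random_read_iops` is a hypothetical helper, it issues synchronous single-threaded 4 KiB reads, and it does not bypass the page cache, so a file that is already cached will give inflated numbers and a fast device will be understated. A dedicated tool such as `fio` is the usual choice for real measurements.

```python
import os
import random
import time


def random_read_iops(path, block_size=4096, duration_s=1.0, seed=0):
    """Estimate random-read operations per second on an existing file.

    Reads block_size bytes at random block-aligned offsets for roughly
    duration_s seconds and returns reads / elapsed seconds.
    """
    blocks = os.path.getsize(path) // block_size
    if blocks == 0:
        raise ValueError("file is smaller than one block")
    rng = random.Random(seed)
    fd = os.open(path, os.O_RDONLY)
    reads = 0
    start = time.perf_counter()
    try:
        while time.perf_counter() - start < duration_s:
            offset = rng.randrange(blocks) * block_size
            os.pread(fd, block_size, offset)  # positional read, Unix only
            reads += 1
    finally:
        os.close(fd)
    elapsed = time.perf_counter() - start
    return reads / elapsed
```

Pointing this at a large file on the NFS mount and then at one on a local NVMe disk gives a relative comparison only; the file should be much larger than RAM for the result to reflect the device rather than the cache.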
Is there an existing issue for this?
Environment
Current Behavior
The querynode pods often restart unexpectedly when loading the index from MinIO. All of the querynodes use the same shared NFS directory (mounted as a local directory) as localStorage to store the DiskANN index.
The total index data is dozens of TB, served by 10 querynode pods with 50 CPUs and 800 GB of memory.
A log file captured when a restart occurred is attached; please review the error.
error.log
Expected Behavior
No response
Steps To Reproduce
No response
Milvus Log
No response
Anything else?
No response