-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: [Nightly] Milvus deployment failed because of pods crash #37404
Comments
I will try to fix by updating milvus helm chart version to 4.2.18. (It is 4.2.8 now) |
maybe the same to #37402, the address is null
|
@congqixia & @LoveEachDay are working on it. /assign @congqixia |
See also milvus-io#37404 milvus-io#37402 IP address in paramtable need validation and fail fast with reasonable error message Signed-off-by: Congqi Xia <[email protected]>
why this need a fix? this is just listen tcp :19530: bind: address already in use which means the 19530 port has been used by other process |
don't think there is a issue here |
The error msg is a bit confusing, but the root cause here is the ip address was parsed to null for an empty config. |
I can manually deploy milvus now with the pr #37418, but the nightly run still failed in deployment with the same commit. @congqixia do we still need your fix pr above? |
@yanliang567 the root cause shall be the misbehavior of yaml parser. #37418 shall be handling |
if milvus failed to discover a viable ip address and put it in session, other component could never be able to connect to coordinators and vise versa |
See also milvus-io#37404 milvus-io#37402 IP address in paramtable need validation and fail fast with reasonable error message Signed-off-by: Congqi Xia <[email protected]>
Related to milvus-io#37404 Signed-off-by: Congqi Xia <[email protected]>
The root cause was behavior change of yaml parsing. |
Related to #37404 --------- Signed-off-by: Congqi Xia <[email protected]>
It seems that last nightly run started successfully, could you please verify? |
The issue has been fixed. Thanks for all your help ~ 🌈 |
See also milvus-io#37404 milvus-io#37402 IP address in paramtable need validation and fail fast with reasonable error message Signed-off-by: Congqi Xia <[email protected]>
See also #37404 #37402 IP address in paramtable need validation and fail fast with reasonable error message --------- Signed-off-by: Congqi Xia <[email protected]>
See also milvus-io#37404 milvus-io#37402 IP address in paramtable need validation and fail fast with reasonable error message --------- Signed-off-by: Congqi Xia <[email protected]>
Cherry-pick from master pr: #37416 See also #37404 #37402 IP address in paramtable need validation and fail fast with reasonable error message --------- Signed-off-by: Congqi Xia <[email protected]>
Is there an existing issue for this?
Environment
Current Behavior
Milvus deployment failed because of pods crash.
link: https://jenkins.milvus.io:18080/blue/organizations/jenkins/Milvus%20Nightly%20CI(new)/detail/master/167/pipeline/151
panic log:
server log: artifacts-milvus-distributed-kafka-mdk-master-167-py-n-167-e2e-logs.tar.gz
Expected Behavior
deploy successfully
Steps To Reproduce
No response
Milvus Log
No response
Anything else?
No response
The text was updated successfully, but these errors were encountered: