-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: milvus-querycoord and milvus-proxy milvus-datanode are not able to connect to rootcoord it is going to 127.0.0.1:53100 instead of [milvus-rootcoord]:53100 #34260
Comments
/assign @LoveEachDay |
from the log, root coord register 127.0.0.1 as it's address address it get by
so, most likely you are deploy in our official docker. You can use ifconfig check the network setting |
yes we had deployed the official docker |
ifconfig result.... lo: flags=73<UP,LOOPBACK,RUNNING> mtu 65536 virbr0: flags=4099<UP,BROADCAST,MULTICAST> mtu 1500 |
The reason os.Hostname is seeing 127.0.0.1 rather than 192.168.122.1 is that 127.0.0.1 is the address associated with the loopback interface (lo), which is the default address for the local host. The 192.168.122.1 address is associated with a virtual bridge interface (virbr0), which is typically used for virtual networking, such as with virtual machines or containers. Here’s a breakdown of what’s happening: Loopback Interface (lo): If you want the hostname to resolve to 192.168.122.1, you would need to modify your system’s network configuration. Here’s how you can adjust this on a Linux system: Edit /etc/hosts: 192.168.122.1 your-hostname sudo systemctl restart networking For programmatic access, you might need to explicitly query the IP address of the virbr0 interface instead of relying on os.Hostname. This can be done using various libraries or system calls to retrieve the IP address of a specific interface. |
this is what i got from GPT and hopefully that could help |
the ifconfig result.. that i shared it was for rootcoord only so for itself it was using the 127.0.0.1 address |
apart from this we had tried with milvus v2.3.13 and that is running fine with same config no change apart from image |
reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""] |
how did you install milvus? this logic has not been changed since 2.3.4 So this is definitely not a bug but more of a env issue |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Is there an existing issue for this?
Environment
Current Behavior
milvus-querycoord/milvus-proxy/milvus-datanode it is going to 127.0.0.1:53100 instead of [milvus-rootcoord]:53100
Expected Behavior
milvus-querycoord/milvus-proxy/milvus-datanode it is going to 127.0.0.1:53100
It have to go on below adress
[milvus-rootcoord]:53100
because if milvus-querycoord/milvus-proxy/milvus-datanode will try to connect the root-coord inside their pod by localhost that is giving the error
Steps To Reproduce
We can deploy the Milvus v2.4.4 and will get the error by default helm by passing values required in values.yaml/
Milvus Log
[2024/06/28 05:27:26.834 +00:00] [INFO] [etcd/etcd_util.go:49] ["create etcd client"] [useEmbedEtcd=false] [useSSL=false] [endpoints="[milvus-etcd:2379]"] [minVersion=1.3]
[2024/06/28 05:27:26.837 +00:00] [DEBUG] [querycoord/service.go:218] [network] [port=19531]
[2024/06/28 05:27:26.938 +00:00] [INFO] [etcd/etcd_util.go:49] ["create etcd client"] [useEmbedEtcd=false] [useSSL=false] [endpoints="[milvus-etcd:2379]"] [minVersion=1.3]
[2024/06/28 05:27:26.941 +00:00] [DEBUG] [sessionutil/session_util.go:257] ["Session try to connect to etcd"]
[2024/06/28 05:27:26.942 +00:00] [DEBUG] [sessionutil/session_util.go:272] ["Session connect to etcd success"]
[2024/06/28 05:27:26.943 +00:00] [DEBUG] [querycoord/service.go:168] ["QueryCoord try to wait for RootCoord ready"]
[2024/06/28 05:27:26.944 +00:00] [DEBUG] [sessionutil/session_util.go:620] ["SessionUtil GetSessions"] [prefix=rootcoord] [key=rootcoord] [address=127.0.0.1:53100]
[2024/06/28 05:27:26.944 +00:00] [DEBUG] [client/client.go:93] ["RootCoordClient GetSessions success"] [address=127.0.0.1:53100] [serverID=1146]
[2024/06/28 05:27:26.945 +00:00] [WARN] [grpcclient/client.go:554] ["fail to get grpc client"] [client_role=rootcoord] [error="failed to connect 127.0.0.1:53100, reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:26.945 +00:00] [WARN] [retry/retry.go:104] ["grpc client is nil, maybe fail to get client in the retry state"] [client_role=rootcoord] [error="empty grpc client: failed to connect 127.0.0.1:53100, reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:26.946 +00:00] [DEBUG] [sessionutil/session_util.go:620] ["SessionUtil GetSessions"] [prefix=rootcoord] [key=rootcoord] [address=127.0.0.1:53100]
[2024/06/28 05:27:26.946 +00:00] [DEBUG] [client/client.go:93] ["RootCoordClient GetSessions success"] [address=127.0.0.1:53100] [serverID=1146]
[2024/06/28 05:27:26.947 +00:00] [WARN] [grpcclient/client.go:476] ["fail to get grpc client in the retry state"] [client_role=rootcoord] [error="failed to connect 127.0.0.1:53100, reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:26.947 +00:00] [WARN] [grpcclient/client.go:467] ["retry func failed"] [retried=0] [error="empty grpc client: failed to connect 127.0.0.1:53100, reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:27.038 +00:00] [DEBUG] [querycoordv2/server.go:584] ["QueryCoord current state"] [StateCode=Abnormal]
[2024/06/28 05:27:27.148 +00:00] [WARN] [retry/retry.go:104] ["grpc client is nil, maybe fail to get client in the retry state"] [client_role=rootcoord] [error="empty grpc client: failed to connect 127.0.0.1:53100,
reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:27.149 +00:00] [DEBUG] [sessionutil/session_util.go:620] ["SessionUtil GetSessions"] [prefix=rootcoord] [key=rootcoord] [address=127.0.0.1:53100]
[2024/06/28 05:27:27.149 +00:00] [DEBUG] [client/client.go:93] ["RootCoordClient GetSessions success"] [address=127.0.0.1:53100] [serverID=1146]
[2024/06/28 05:27:27.150 +00:00] [WARN] [grpcclient/client.go:476] ["fail to get grpc client in the retry state"] [client_role=rootcoord] [error="failed to connect 127.0.0.1:53100, reason: connection error: desc =
"transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:27.551 +00:00] [WARN] [retry/retry.go:104] ["grpc client is nil, maybe fail to get client in the retry state"] [client_role=rootcoord] [error="empty grpc client: failed to connect 127.0.0.1:53100,
reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:27.552 +00:00] [DEBUG] [sessionutil/session_util.go:620] ["SessionUtil GetSessions"] [prefix=rootcoord] [key=rootcoord] [address=127.0.0.1:53100]
[2024/06/28 05:27:27.552 +00:00] [DEBUG] [client/client.go:93] ["RootCoordClient GetSessions success"] [address=127.0.0.1:53100] [serverID=1146]
[2024/06/28 05:27:27.553 +00:00] [WARN] [grpcclient/client.go:476] ["fail to get grpc client in the retry state"] [client_role=rootcoord] [error="failed to connect 127.0.0.1:53100, reason: connection error: desc =
"transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:28.354 +00:00] [WARN] [retry/retry.go:104] ["grpc client is nil, maybe fail to get client in the retry state"] [client_role=rootcoord] [error="empty grpc client: failed to connect 127.0.0.1:53100,
reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:28.355 +00:00] [DEBUG] [sessionutil/session_util.go:620] ["SessionUtil GetSessions"] [prefix=rootcoord] [key=rootcoord] [address=127.0.0.1:53100]
[2024/06/28 05:27:28.355 +00:00] [DEBUG] [client/client.go:93] ["RootCoordClient GetSessions success"] [address=127.0.0.1:53100] [serverID=1146]
[2024/06/28 05:27:28.356 +00:00] [WARN] [grpcclient/client.go:476] ["fail to get grpc client in the retry state"] [client_role=rootcoord] [error="failed to connect 127.0.0.1:53100, reason: connection error: desc =
"transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:29.957 +00:00] [WARN] [retry/retry.go:104] ["grpc client is nil, maybe fail to get client in the retry state"] [client_role=rootcoord] [error="empty grpc client: failed to connect 127.0.0.1:53100,
reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:29.958 +00:00] [DEBUG] [sessionutil/session_util.go:620] ["SessionUtil GetSessions"] [prefix=rootcoord] [key=rootcoord] [address=127.0.0.1:53100]
[2024/06/28 05:27:29.958 +00:00] [DEBUG] [client/client.go:93] ["RootCoordClient GetSessions success"] [address=127.0.0.1:53100] [serverID=1146]
[2024/06/28 05:27:29.959 +00:00] [WARN] [grpcclient/client.go:476] ["fail to get grpc client in the retry state"] [client_role=rootcoord] [error="failed to connect 127.0.0.1:53100, reason: connection error: desc =
"transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
:
##################
[2024/06/28 05:27:24.391 +00:00] [DEBUG] [sessionutil/session_util.go:620] ["SessionUtil GetSessions"] [prefix=rootcoord] [key=rootcoord] [address=127.0.0.1:53100]
[2024/06/28 05:27:24.391 +00:00] [DEBUG] [client/client.go:93] ["RootCoordClient GetSessions success"] [address=127.0.0.1:53100] [serverID=1146]
[2024/06/28 05:27:24.392 +00:00] [WARN] [grpcclient/client.go:554] ["fail to get grpc client"] [client_role=rootcoord] [error="failed to connect 127.0.0.1:53100, reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:24.392 +00:00] [WARN] [retry/retry.go:104] ["grpc client is nil, maybe fail to get client in the retry state"] [client_role=rootcoord] [error="empty grpc client: failed to connect 127.0.0.1:53100, reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:24.393 +00:00] [DEBUG] [sessionutil/session_util.go:620] ["SessionUtil GetSessions"] [prefix=rootcoord] [key=rootcoord] [address=127.0.0.1:53100]
[2024/06/28 05:27:24.393 +00:00] [DEBUG] [client/client.go:93] ["RootCoordClient GetSessions success"] [address=127.0.0.1:53100] [serverID=1146]
[2024/06/28 05:27:24.394 +00:00] [WARN] [grpcclient/client.go:476] ["fail to get grpc client in the retry state"] [client_role=rootcoord] [error="failed to connect 127.0.0.1:53100, reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:24.394 +00:00] [WARN] [grpcclient/client.go:467] ["retry func failed"] [retried=0] [error="empty grpc client: failed to connect 127.0.0.1:53100, reason: connection error: desc = "transport: error
while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:24.595 +00:00] [WARN] [retry/retry.go:104] ["grpc client is nil, maybe fail to get client in the retry state"] [client_role=rootcoord] [error="empty grpc client: failed to connect 127.0.0.1:53100,
reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:24.596 +00:00] [DEBUG] [sessionutil/session_util.go:620] ["SessionUtil GetSessions"] [prefix=rootcoord] [key=rootcoord] [address=127.0.0.1:53100]
[2024/06/28 05:27:24.596 +00:00] [DEBUG] [client/client.go:93] ["RootCoordClient GetSessions success"] [address=127.0.0.1:53100] [serverID=1146]
[2024/06/28 05:27:24.597 +00:00] [WARN] [grpcclient/client.go:476] ["fail to get grpc client in the retry state"] [client_role=rootcoord] [error="failed to connect 127.0.0.1:53100, reason: connection error: desc =
"transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:24.998 +00:00] [WARN] [retry/retry.go:104] ["grpc client is nil, maybe fail to get client in the retry state"] [client_role=rootcoord] [error="empty grpc client: failed to connect 127.0.0.1:53100,
reason: connection error: desc = "transport: error while dialing: dial tcp 127.0.0.1:53100: connect: connection refused""]
[2024/06/28 05:27:25.000 +00:00] [DEBUG] [sessionutil/session_util.go:620] ["SessionUtil GetSessions"] [prefix=rootcoord] [key=rootcoord] [address=127.0.0.1:53100]
Anything else?
It is working with version v2.3
############## Config.tpl is generating the expected result
and there is no issue in helm chart.
The text was updated successfully, but these errors were encountered: