[uReplicator] uReplicator worker is crashed with specific topic #313
Comments
I analyzed the issue above and can provide some more information.
Code Analysis
Log around
POSSIBILITY
SOLUTIONS
It looks like uReplicator crashed because of a producer timeout. Can you share your producer configuration?
Hi @yangy0000. Here are the consumer.properties and producer.properties files inside the worker.

consumer.properties
zookeeper.connection.timeout.ms=30000
group.id=group-uReplicator-d1-kafka-test
consumer.id=consume-uReplicator-d1-kafka

producer.properties
root@d1-kafka-ureplicator-worker-98ddf5cbb-7lb5f:/uReplicator/config# cat producer.properties
bootstrap.servers=xxx
client.id=group-uReplicator-d1-kafka-test
producer.type=async
compression.type=none
batch.size=262144
queue.buffering.max.messages=10
security.protocol=SSL
ssl.keystore.location=/uReplicator/bin/manager-clients.int.vinid.net.keystore.jks

If you need more information or have more suggestions, please let me know.
Thanks,
Can you try increasing request.timeout.ms to 120000? My suspicion is that the worker crashed because of a request timeout.
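The suggested change would amount to one extra line in the worker's producer.properties. This is a minimal sketch, assuming the other settings stay as posted above; the value 120000 is simply the one proposed in this thread, not a tuned recommendation.

```properties
# producer.properties (sketch; only this line is new, the rest of the file is unchanged)
# Assumption: 120000 ms is the value suggested in this thread.
request.timeout.ms=120000
```

request.timeout.ms is the standard Kafka client setting for how long the producer waits for a broker response to a request; its default is 30000 ms, so this raises the wait to two minutes.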
@yangy0000 Thanks for your suggestion. I will try it and get back to you when I have any information.
Hi @binhtd @thanhptr, any update on this issue?
@yangy0000 @dungnt081191 It is quite hard to reproduce this error on our side. I tried setting up a firewall on the source and target topic VMs to simulate uReplicator being unable to connect to the source and target topics. The controller and worker pods in k8s then restarted continuously, which is quite similar to our case. We will increase request.timeout.ms in the producer config and see what happens next time.
Hi men!
Hi,
We deploy uReplicator on k8s in GCP and use it to replicate data between our DC and the cloud. One of the topics is big, with 1 partition and high throughput. It worked normally for a long time. One day the uReplicator worker crashed with error code 255 after some restarts.
We found some errors in the pod:
[2020-06-08 17:00:35,440] INFO [Producer clientId=group-uReplicator-kafka-d1-prod1-null-0] Proceeding to force close the producer since pending requests could not be completed within timeout 9223372036854775807 ms. (org.apache.kafka.clients.producer.KafkaProducer:1078)
[2020-06-08 17:00:35,441] ERROR [group-uReplicator-kafka-d1-prod1-null-0] Closing producer due to send failure. topic: evoucher.event.voucher_serial (com.uber.stream.ureplicator.worker.DefaultProducer:123)
java.lang.IllegalStateException: Producer is closed forcefully
We fixed the issue temporarily by removing the failing topic from the list of replicated topics and replicating another topic instead; the worker is now up and replicating normally. We don't know exactly what happened in that case.