Use async-websocket instead of celluloid-io. #75

dblock · 2018-08-27T12:38:00Z

No description provided.

ioquatix · 2018-08-29T02:29:10Z

It looks good. Can you explain in layman's terms how I can run this to reproduce the failure you are seeing?

dblock · 2018-08-29T08:02:59Z

What I am seeing with this change is that the ping thread doesn't always successfully restart the bot. It calls driver.close, but then not seeing the disconnect as expected, the bot doesn't exit and restart. This is new compared to celluloid implementation.
Still seeing Slack-side disconnects slack-ruby-client#208, but more info there.

ioquatix · 2018-08-29T08:05:48Z

What I am seeing with this change is that the ping thread doesn't always successfully restart the bot. It calls driver.close, but then not seeing the disconnect as expected, the bot doesn't exit and restart. This is new compared to celluloid implementation.

Once you close the driver, I don't think you can expect to see any further events. WDYT?

dblock · 2018-08-29T08:12:30Z

Correct, the driver should exit run_loop in that case, terminate its thread, and gracefully get restarted from https://github.com/slack-ruby/slack-ruby-bot/blob/1708693843487f1c1f97765d0d7ebbda1ef34d24/lib/slack-ruby-bot/server.rb. I see it happen locally just fine and by forcing online? to return false as many times as I can watch it. Happens in production as well, a few times, then eventually stops doing it. Haven't had much time to debug this one though (I'm traveling ;). Needs code to log whether it was able to call driver.close, etc. (I am going to guess that yes, but didn't correctly abort the reactor.)

dblock · 2018-08-29T15:53:32Z

Can confirm with some logs that the ping thread correctly calls client.close, but that the client fails to disconnect and restart.

W, [2018-08-29T15:08:34.492086 #216]  WARN -- : DOWN: name=..., id=..., 0 retries left
W, [2018-08-29T15:08:34.492392 #216]  WARN -- : RESTART: name=..., id=..., #<Slack::RealTime::Concurrency::Async::Client:0x007fec8376b138>
W, [2018-08-29T15:08:34.492697 #216]  WARN -- : Done pinging team

@ioquatix for Celluloid we used for abort the connection with a driver.emit(:close, WebSocket::Driver::CloseEvent.new(1001, 'bot offline')) - looks like what we did in close doesn't always have the same effect. Any suggestions?

ioquatix · 2018-09-02T21:52:11Z

Where is the code for the ping thread?

dblock · 2018-09-02T22:18:50Z

https://github.com/slack-ruby/slack-ruby-bot-server/blob/master/lib/slack-ruby-bot-server/ping.rb

dblock · 2018-09-02T22:19:44Z

With async in this PR, https://github.com/slack-ruby/slack-ruby-bot-server/pull/75/files#diff-8b0ebb9a3e2dfbf59b4d90b6e827ad33

ioquatix · 2018-09-02T22:43:08Z

My apologies but slack-ruby/slack-ruby-client#222 might be the first thing to check.

dblock · 2018-09-03T06:34:54Z

Thanks for this @ioquatix, dblock/slack-ruby-client@f2062ec, testing.

dblock · 2018-09-07T12:22:35Z

Emitting a close event on the web socket similarly to what I was doing with Celluloid makes this ping situation work as before.

driver.emit(:close, WebSocket::Driver::CloseEvent.new(1001, 'bot offline'))

Without this the socket doesn't close and just sits there waiting for more. Closing the driver isn't enough.

It sounds like this is related to slack-ruby/slack-ruby-client#222 (comment). Is the right(er) solution to call close on the socket here and ignore Async::Wrapper::Cancelled?

Or is the better solution to try and send a message over the websocket and expect the next read to return nil as per faye/websocket-driver-ruby#61 (comment)?

ioquatix · 2018-09-07T14:24:06Z

In theory you can just ignore the cancelled error, but I feel like it's the wrong approach.

In almost every case where I've written network IO, the #read call fails, and doesn't need an external check to indicate that #read is broken. That being said, it's fine to catch the cancelled error (perhaps report it).

dblock · 2018-09-07T15:23:44Z

@ioquatix I mean in this case I am force-closing the socket per your recommendation in slack-ruby-client#222. Is that incorrect?

ioquatix · 2018-09-07T21:08:03Z

I think it's the best option given the circumstances.

dblock mentioned this pull request Aug 27, 2018

Added support for async-websocket. slack-ruby/slack-ruby-client#219

Merged

dblock force-pushed the async branch from 0b9feb0 to 06a559a Compare August 27, 2018 14:39

dblock changed the title ~~WIP: use async-websocket.~~ Use async-websocket instead of celluloid-io. Aug 27, 2018

dblock force-pushed the async branch 4 times, most recently from 388d3d6 to 132b871 Compare August 28, 2018 17:29

Replace celluloid-io with async-websocket.

d1aa8a4

dblock force-pushed the async branch from 132b871 to d1aa8a4 Compare August 28, 2018 18:02

dblock referenced this pull request in slack-ruby/slack-ruby-client Sep 2, 2018

WIP: close everything.

9b84886

dblock force-pushed the async branch from 16d6509 to 47fe64e Compare September 3, 2018 06:37

Added debug logging to ping thread.

308ffb2

dblock force-pushed the async branch from 47fe64e to 308ffb2 Compare September 4, 2018 11:33

Truncate error message from ping thread.

f74bb9c

dblock force-pushed the async branch 2 times, most recently from 377d0ac to 3a76f6a Compare September 6, 2018 19:23

dblock mentioned this pull request Sep 7, 2018

Unhandled server-side disconnects faye/websocket-driver-ruby#61

Open

dblock force-pushed the async branch from 3a76f6a to ebb1a76 Compare September 8, 2018 22:06

Close socket, not driver.

b902846

dblock force-pushed the async branch from ebb1a76 to b902846 Compare September 8, 2018 22:10

dblock merged commit 2de6062 into slack-ruby:master Sep 8, 2018

dblock deleted the async branch September 8, 2018 22:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use async-websocket instead of celluloid-io. #75

Use async-websocket instead of celluloid-io. #75

dblock commented Aug 27, 2018

ioquatix commented Aug 29, 2018

dblock commented Aug 29, 2018

ioquatix commented Aug 29, 2018

dblock commented Aug 29, 2018 •

edited

Loading

dblock commented Aug 29, 2018

ioquatix commented Sep 2, 2018

dblock commented Sep 2, 2018

dblock commented Sep 2, 2018

ioquatix commented Sep 2, 2018

dblock commented Sep 3, 2018

dblock commented Sep 7, 2018

ioquatix commented Sep 7, 2018

dblock commented Sep 7, 2018 •

edited

Loading

ioquatix commented Sep 7, 2018

Use async-websocket instead of celluloid-io. #75

Use async-websocket instead of celluloid-io. #75

Conversation

dblock commented Aug 27, 2018

ioquatix commented Aug 29, 2018

dblock commented Aug 29, 2018

ioquatix commented Aug 29, 2018

dblock commented Aug 29, 2018 • edited Loading

dblock commented Aug 29, 2018

ioquatix commented Sep 2, 2018

dblock commented Sep 2, 2018

dblock commented Sep 2, 2018

ioquatix commented Sep 2, 2018

dblock commented Sep 3, 2018

dblock commented Sep 7, 2018

ioquatix commented Sep 7, 2018

dblock commented Sep 7, 2018 • edited Loading

ioquatix commented Sep 7, 2018

dblock commented Aug 29, 2018 •

edited

Loading

dblock commented Sep 7, 2018 •

edited

Loading