shaka-lab-hub stuck after some kind of network failure #49

Open
joeyparrish opened this issue Mar 11, 2024 · 0 comments
Labels: package: shaka-lab-hub (Issues affecting shaka-lab-hub); platform: Linux (Issues affecting Linux); priority: P1 (Big impact or workaround impractical; resolve before feature release); type: bug (Something isn't working correctly)

Comments

@joeyparrish
Member

According to our logs, our shaka-lab-hub service got into a bad state. The network may have been down when the job started, leading Selenium to complain that it could not find a non-loopback IPv4 address for the machine. However, the job stayed "up" in spite of not being functional. Had the service quit at that error, it would have been restarted until it was able to run.

Here are the logs from sudo journalctl -xu shaka-lab-hub:

-- Boot 79734be3001a4c92a2bba28a3f6ad2b6 --
Mar 09 22:47:50 shaka-test-linux systemd[1]: Started Shaka Lab Hub.
░░ Subject: A start job for unit shaka-lab-hub.service has finished successfully
░░ Defined-By: systemd
░░ Support: http://www.ubuntu.com/support
░░ 
░░ A start job for unit shaka-lab-hub.service has finished successfully.
░░ 
░░ The job identifier is 207.
Mar 09 22:47:50 shaka-test-linux start-hub.sh[907]: 22:47:50.933 INFO [GridLauncherV3.parse] - Selenium server version: 3.141.59, revision: e82be7d358
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]: 22:47:51.002 INFO [GridLauncherV3.lambda$buildLaunchers$5] - Launching Selenium Grid hub on port 4444
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]: 2024-03-09 22:47:51.382:INFO::main: Logging initialized @735ms to org.seleniumhq.jetty9.util.log.StdErrLog
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]: Exception in thread "main" java.lang.RuntimeException: Error initializing the hub Could not find a non-loopback ip4 address for this mach>
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]: Build info: version: '3.141.59', revision: 'e82be7d358', time: '2018-11-14T08:25:53'
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]: System info: host: 'shaka-test-linux', ip: '127.0.1.1', os.name: 'Linux', os.arch: 'amd64', os.version: '5.15.0-100-generic', java.versio>
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]: Driver info: driver.version: unknown
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]:         at org.openqa.grid.web.Hub.initServer(Hub.java:228)
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]:         at org.openqa.grid.web.Hub.start(Hub.java:243)
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]:         at org.openqa.grid.selenium.GridLauncherV3.lambda$buildLaunchers$5(GridLauncherV3.java:265)
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]:         at org.openqa.grid.selenium.GridLauncherV3.lambda$launch$0(GridLauncherV3.java:86)
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]:         at java.base/java.util.Optional.map(Optional.java:265)
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]:         at org.openqa.grid.selenium.GridLauncherV3.launch(GridLauncherV3.java:86)
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]:         at org.openqa.grid.selenium.GridLauncherV3.main(GridLauncherV3.java:70)
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]: Caused by: org.openqa.selenium.WebDriverException: Could not find a non-loopback ip4 address for this machine
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]: Build info: version: '3.141.59', revision: 'e82be7d358', time: '2018-11-14T08:25:53'
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]: System info: host: 'shaka-test-linux', ip: '127.0.1.1', os.name: 'Linux', os.arch: 'amd64', os.version: '5.15.0-100-generic', java.versio>
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]: Driver info: driver.version: unknown
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]:         at org.openqa.selenium.net.NetworkUtils.getIp4NonLoopbackAddressOfThisMachine(NetworkUtils.java:73)
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]:         at org.openqa.grid.web.Hub.updateHostToNonLoopBackAddressOfThisMachine(Hub.java:309)
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]:         at org.openqa.grid.web.Hub.initServer(Hub.java:188)
Mar 09 22:47:51 shaka-test-linux start-hub.sh[907]:         ... 6 more
Mar 11 19:11:07 shaka-test-linux systemd[1]: Stopping Shaka Lab Hub...

The service started, failed but didn't die, then stayed like that for two days until I restarted it.
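
One possible mitigation, sketched here only as an illustration and not as the project's actual fix, is a pre-flight check that runs before the hub is launched and exits non-zero while no non-loopback IPv4 address is available, so that the service's restart behavior can take over. All function names below are hypothetical, and the hostname lookup via `socket.getaddrinfo` is an assumption: it only approximates the address check Selenium performs and may behave differently on some hosts.

```python
import ipaddress
import socket
import time


def has_non_loopback_ipv4(addresses):
    """Return True if any entry is an IPv4 address outside 127.0.0.0/8."""
    for text in addresses:
        try:
            addr = ipaddress.ip_address(text)
        except ValueError:
            continue  # Skip anything that is not a valid IP literal.
        if addr.version == 4 and not addr.is_loopback:
            return True
    return False


def local_ipv4_addresses():
    """Best-effort list of IPv4 addresses the local hostname resolves to.

    Assumption: resolving the hostname is a rough stand-in for the
    check the hub performs; it is not the same code path.
    """
    try:
        infos = socket.getaddrinfo(socket.gethostname(), None, socket.AF_INET)
    except socket.gaierror:
        return []
    return [info[4][0] for info in infos]


def wait_for_network(get_addresses=local_ipv4_addresses, timeout=60.0,
                     interval=2.0, sleep=time.sleep, clock=time.monotonic):
    """Poll until a non-loopback IPv4 address appears or the timeout expires."""
    deadline = clock() + timeout
    while True:
        if has_non_loopback_ipv4(get_addresses()):
            return True
        if clock() >= deadline:
            return False
        sleep(interval)


def main():
    """Return a process exit code: 0 if the network looks usable, else 1."""
    return 0 if wait_for_network() else 1

# A hypothetical launcher script would call sys.exit(main()) and only
# start the hub when the exit code is 0.
```

With the address seen in the logs above, `has_non_loopback_ipv4(["127.0.1.1"])` is false, so the check would fail fast instead of leaving a non-functional process running. This does not cover the case where the hub itself throws during startup for other reasons.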

joeyparrish added the type: bug, priority: P1, platform: Linux, and package: shaka-lab-hub labels on Mar 11, 2024
joeyparrish self-assigned this on Mar 11, 2024
github-actions bot added this to the Backlog milestone on Mar 11, 2024