fix: don't block UI when canceling lint process #688

stevearc · 2024-10-24T03:20:41Z

I've been getting mysterious nondeterministic freezes in Neovim for months now, but couldn't get a solid repro. I finally found a repro and tracked it down to LintProc:cancel(). In #522 we added some logic to cancel a process by killing it with sigint, waiting, then if it's still running kill it again with sigkill. The problem is that the delay is done with vim.wait(), which will block the UI thread. If you have a poorly-behaved linter (and I do, unfortunately it's for work and not something I can reasonably fix) that doesn't respond at all to sigint, the UI thread will be blocked for 10 seconds each time it fails to cancel.

I've replaced the synchronous vim.wait with a call to vim.defer_fn. From comments on the previous PR I gather that there is some concern about orphaned lint processes when Neovim exits. There's not much we can do if it crashes, but for normal operation I added a VimLeavePre hook that will cancel all running linters and block until they are killed.

Test Plan

First I found a repro without this patch

Open file
Call lint.try_lint() twice
Observe 10 second UI freeze

Then after applying this patch

Open file
Call lint.try_lint() twice
Observe no freeze
Run ps aux | grep <formatter> and verify that there are two processes running
Wait 10 seconds
Observe that the canceled process (the first one) was killed

And to check the exit behavior

Open file
Call lint.try_lint()
:q
Run ps aux | grep <formatter> and verify that the formatter is running
Observe 10 second UI freeze
Observe that Neovim quits and ps aux | grep <formatter> shows that the formatter process has been killed

mfussenegger

The entries in the running_procs_by_buf table get replaced after a :cancel call, so I think the VimLeavePre logic wouldn't work like this because the table won't contain the cancelled LintProcs anymore.

But the bigger problem is that if the cancel is deferred, you end up running multiple linters in parallel and you might get results updated/replaced differently.

Not sure how to deal with this. Maybe other options could be:

Wrap your misbehaving linter in a small script that forwards sigint as sigkill
try_lint could have a difference debounce mode, where instead of cancelling a proc, it lets it finish instead of starting a new process. Although this means the results would be stale
New lint process gets chained onto the previous lint process, so it only starts once the previous one got cancelled - but this is probably more complex than I'd like.
Allow to define per linter what signal to use - to allow killing it immediately without sigint. Or a custom cancel function per linter.

stevearc · 2024-10-24T15:39:55Z

The entries in the running_procs_by_buf table get replaced after a :cancel call, so I think the VimLeavePre logic wouldn't work like this because the table won't contain the cancelled LintProcs anymore.

That should be fine, right? Because we only need to cancel the currently running tasks on exit. Unless you're worried about the case of

Run lint process
Manually cancel lint process
Exit vim immediately
Vim exits before the sigkill timeout is reached, so process is orphaned

If that's what you're concerned about, I can fix it pretty easily by just having another way to keep track of all active LintProcs. LMK if this is what you were thinking of or if I'm misinterpreting.

But the bigger problem is that if the cancel is deferred, you end up running multiple linters in parallel and you might get results updated/replaced differently.

I don't think this can happen because when a LintProc is cancelled it won't publish the (now stale) diagnostics.

nvim-lint/lua/lint.lua

Lines 242 to 244 in ab83154

    
           if api.nvim_buf_is_valid(self.bufnr) and not self.cancelled then 
        
             vim.diagnostic.set(self.ns, self.bufnr, diagnostics) 
        
           end

mfussenegger · 2024-11-08T15:33:49Z

That should be fine, right? Because we only need to cancel the currently running tasks on exit. Unless you're worried about the case of

Hm, then I wonder if it is needed at all as part of this PR. They weren't getting cancelled before

I don't think this can happen because when a LintProc is cancelled it won't publish the (now stale) diagnostics.

Good point

stevearc · 2024-11-08T16:36:48Z

Hm, then I wonder if it is needed at all as part of this PR. They weren't getting cancelled before

If we want to scope the PR to just fixing the UI freeze, then we could leave out the VimLeavePre behavior. If we think of this PR as more broadly handling linter processes that don't terminate on SIGINT, then this is still useful for preventing orphaned processes when vim exits. I think this handles an important edge case, but if you think it adds too much complexity I can remove it from this PR.

fix: don't block UI when canceling lint process

ab83154

mfussenegger reviewed Oct 24, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: don't block UI when canceling lint process #688

fix: don't block UI when canceling lint process #688

stevearc commented Oct 24, 2024

mfussenegger left a comment •

edited

Loading

stevearc commented Oct 24, 2024

mfussenegger commented Nov 8, 2024

stevearc commented Nov 8, 2024

fix: don't block UI when canceling lint process #688

Are you sure you want to change the base?

fix: don't block UI when canceling lint process #688

Conversation

stevearc commented Oct 24, 2024

Test Plan

mfussenegger left a comment • edited Loading

Choose a reason for hiding this comment

stevearc commented Oct 24, 2024

mfussenegger commented Nov 8, 2024

stevearc commented Nov 8, 2024

mfussenegger left a comment •

edited

Loading