Scanner should stop queueing work #582

Closed
meejah opened this issue Nov 2, 2021 · 4 comments
Labels: enhancement (New feature or request), performance

Comments

meejah (Collaborator) commented Nov 2, 2021

If "a lot" of files are added at once, the scanner should have some sort of high-water mark where it stops adding new files for some time (e.g. until a low-water mark is reached).

Adding ~10000 new files at once will cause a lot of uploads to get queued, consuming CPU and memory.
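
A minimal sketch of the kind of high/low water-mark gate meant here (Twisted-style; the `ScanGate` name, thresholds and methods are illustrative assumptions, not the actual magic-folder API):

```python
from twisted.internet.defer import Deferred, succeed

class ScanGate(object):
    """
    Pause the scanner once 'high' items are queued and let it resume
    only after the backlog drains down to 'low'.
    (Illustrative sketch only; not the real magic-folder code.)
    """
    def __init__(self, high=1000, low=100):
        self._high = high
        self._low = low
        self._pending = 0
        self._waiters = []

    def item_queued(self):
        self._pending += 1

    def item_done(self):
        self._pending -= 1
        if self._pending <= self._low and self._waiters:
            waiters, self._waiters = self._waiters, []
            for d in waiters:
                d.callback(None)

    def wait_if_full(self):
        """
        Return a Deferred that fires immediately while we are under the
        high-water mark, otherwise only once we've drained to 'low'.
        """
        if self._pending < self._high:
            return succeed(None)
        d = Deferred()
        self._waiters.append(d)
        return d
```

The scanner would wait on `wait_if_full()` before queueing each newly-discovered file and call `item_done()` as each upload finishes, so discovery pauses at the high-water mark and resumes once the backlog drains to the low-water mark.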

meejah added the enhancement (New feature or request) label Nov 2, 2021

meejah (Collaborator, Author) commented Nov 3, 2021

In concrete terms, adding 2271 files and doing an explicit scan took 40s (which includes the time to create and serialize LocalSnapshot data in the state database).

Adding 8240 files at once took 3m, 25s.

meejah (Collaborator, Author) commented Jan 25, 2022

Since this now uses Twisted's Cooperator, the scanner no longer does an unbounded amount of work at once.
As it's tagged "performance", we need to measure before further work is very useful.

In any case, it seems a lot of the slowdown comes from producing JSON for either the status API or the Eliot logs: adding the 2409 files in my Twisted checkout produced 354MiB of Eliot logs, consisting of 1.7M lines of JSON.
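
For context, here is a minimal sketch of the cooperative-iteration pattern referred to above, using Twisted's `cooperate()` (the `scan_files` generator and its arguments are illustrative assumptions, not the actual scanner code):

```python
from twisted.internet.task import cooperate

def scan_files(paths, process_one):
    """
    Process 'paths' a little at a time, yielding control back to the
    reactor between items instead of doing all the work in one go.
    (Illustrative sketch only; not the real magic-folder scanner.)
    """
    def work():
        for path in paths:
            process_one(path)
            yield  # give the Cooperator a chance to schedule other work
    # cooperate() interleaves this iterator with other cooperative tasks
    task = cooperate(work())
    return task.whenDone()
```

Note that this bounds how much work happens per reactor iteration, but not how much ends up queued overall, which is why the water-mark idea above (or actual measurements) would still be needed.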

meejah closed this as completed Mar 1, 2022

hacklschorsch commented

2409 files leading to 1.7M lines of JSON - i.e. on the order of one thousand log lines per added file 🙀 Might be a good ticket to have, no?

meejah (Collaborator, Author) commented Oct 25, 2022

I think #632 counts ... it's not certain that the problem definitely is the "too much JSON" or whatever, and we'd need a performance test to know whether we got better or not.

(I presume that writing some of the performance tests suggested in that ticket would reveal problems like the one in the comment above.)
