
Performance: two_percent is faster and more efficient than fzf #561

Open
kimono-koans opened this issue Mar 4, 2024 · 11 comments

kimono-koans commented Mar 4, 2024

If the reason you're not using skim is raw performance, my branch, two_percent, is faster, more efficient, and uses less memory than fzf for largish inputs (~40 MB):

> hyperfine -i -w 3 "sk --algo=simple --no-sort --query=hopscotchbubble --exit-0 < ../countwords/kjvbible_x10.txt" "fzf --algo=v1 --no-sort --query=hopscotchbubble --exit-0 < ../countwords/kjvbible_x10.txt" "sk --algo=skim_v2 --no-sort --query=hopscotchbubble --exit-0 < ../countwords/kjvbible_x10.txt"
Benchmark 1: sk --algo=simple --no-sort --query=hopscotchbubble --exit-0 < ../countwords/kjvbible_x10.txt
  Time (mean ± σ):      81.2 ms ±   2.0 ms    [User: 205.2 ms, System: 18.6 ms]
  Range (min … max):    78.7 ms …  86.2 ms    35 runs

  Warning: Ignoring non-zero exit code.

Benchmark 2: fzf --algo=v1 --no-sort --query=hopscotchbubble --exit-0 < ../countwords/kjvbible_x10.txt
  Time (mean ± σ):     125.5 ms ±   2.8 ms    [User: 229.7 ms, System: 72.7 ms]
  Range (min … max):   116.7 ms … 130.8 ms    22 runs

  Warning: Ignoring non-zero exit code.
  Warning: Statistical outliers were detected. Consider re-running this benchmark on a quiet system without any interferences from other programs. It might help to use the '--warmup' or '--prepare' options.

Benchmark 3: sk --algo=skim_v2 --no-sort --query=hopscotchbubble --exit-0 < ../countwords/kjvbible_x10.txt
  Time (mean ± σ):     135.2 ms ±   2.0 ms    [User: 797.5 ms, System: 18.3 ms]
  Range (min … max):   131.6 ms … 140.4 ms    21 runs

  Warning: Ignoring non-zero exit code.

Summary
  sk --algo=simple --no-sort --query=hopscotchbubble --exit-0 < ../countwords/kjvbible_x10.txt ran
    1.55 ± 0.05 times faster than fzf --algo=v1 --no-sort --query=hopscotchbubble --exit-0 < ../countwords/kjvbible_x10.txt
    1.66 ± 0.05 times faster than sk --algo=skim_v2 --no-sort --query=hopscotchbubble --exit-0 < ../countwords/kjvbible_x10.txt

If anyone else is interested in performance improvements, and refactoring to reduce long term memory usage, I've been building on skim actively and I am eager to work with others who are similarly inclined.

Re: @lotabout's wonderful comment on performance:

The overhead of skim's item is 88B while fzf's item is 56B.

For ordinary text inputs, I'm using Arc<Box<str>>, which I think is 32 B? If there were some way to use Arc<str>, that would be even better, but I can't seem to make it work because of trait bounds. For my 10x-duplicated KJV Bible corpus (~43 MB), memory usage is about 90 MB on my Mac.
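For reference, the handle sizes on a 64-bit target can be checked with std::mem::size_of. A minimal sketch (the ~32 B figure above presumably counts per-item heap overhead, which is separate from the handle itself):

```rust
use std::mem::size_of;
use std::sync::Arc;

fn main() {
    // On a 64-bit target, Arc<Box<str>> is a thin pointer to the Arc
    // allocation, while Arc<str> is a fat pointer carrying the length.
    assert_eq!(size_of::<Arc<Box<str>>>(), 8);
    assert_eq!(size_of::<Arc<str>>(), 16);

    // Arc<str> can be built directly from a &str via Arc::from, which
    // avoids the extra Box indirection entirely:
    let line: Arc<str> = Arc::from("in the beginning");
    let shared = Arc::clone(&line); // cheap: just bumps a refcount
    assert_eq!(&*shared, "in the beginning");
}
```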

In my experience, skim's memory usage is around 1.5x ~ 2x of fzf's.
A fuzzy finder is an excellent use case for shared memory, but Rust has limited support for it.

On ingest, I have a method for deduplicating lines. This is a significant performance win for inputs with many empty or duplicate lines.
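A deduplication pass like the one described could look roughly like this. This is a hypothetical sketch, not the actual two_percent code; intern_lines and the HashSet approach are purely illustrative:

```rust
use std::collections::HashSet;
use std::sync::Arc;

// Sketch of ingest-time deduplication: every distinct line is allocated
// once, and repeats just clone the refcounted handle.
fn intern_lines<'a>(lines: impl Iterator<Item = &'a str>) -> Vec<Arc<str>> {
    let mut seen: HashSet<Arc<str>> = HashSet::new();
    lines
        .map(|line| match seen.get(line) {
            Some(existing) => Arc::clone(existing),
            None => {
                let interned: Arc<str> = Arc::from(line);
                seen.insert(Arc::clone(&interned));
                interned
            }
        })
        .collect()
}

fn main() {
    let items = intern_lines("foo\n\nbar\n\nfoo".lines());
    assert_eq!(items.len(), 5);
    // Indexes 0 and 4 ("foo"), and 1 and 3 (empty), share one allocation each:
    assert!(Arc::ptr_eq(&items[0], &items[4]));
    assert!(Arc::ptr_eq(&items[1], &items[3]));
}
```

For inputs dominated by blank or repeated lines, this trades one hash lookup per line for a large reduction in long-lived allocations.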

Both skim's and fzf's matching algorithms are O(n).

Algorithm switching is broken in the latest skim release; it's fixed in mine. I also have a simple algorithm that is much closer to the fzf v1 algo used for large inputs (see the benchmark above). You too can now write your own super simple algo, or improve mine!

But the size of structure to store matching result is different (skim is bad at it).

I've made a few changes which may help.

The performance of crossbeam's channel seems to be slower than Go's (not sure)? It's claimed to be fast, but it's still one of the bottlenecks in skim's use case.

My experience has been that ingest was not reading in enough bytes at once, and other threads were spin-waiting on the lock.
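The buffering point can be illustrated with std::io::BufReader, which lets the ingest side pull a large chunk per underlying read instead of many tiny reads. A sketch; the 64 KiB capacity is an assumed value, not what two_percent actually uses:

```rust
use std::io::{BufRead, BufReader, Cursor, Read};

// A large BufReader capacity means each underlying read() pulls in a big
// chunk, so the reading thread spends fewer syscalls per line produced.
fn ingest<R: Read>(input: R) -> Vec<String> {
    let reader = BufReader::with_capacity(64 * 1024, input);
    reader.lines().map_while(Result::ok).collect()
}

fn main() {
    // Cursor stands in for stdin here so the sketch is self-contained.
    let lines = ingest(Cursor::new("alpha\nbeta\ngamma"));
    assert_eq!(lines, vec!["alpha", "beta", "gamma"]);
}
```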

nrdxp commented Apr 1, 2024

Just curious. Is there a reason why you can't/won't merge these changes back into skim?

@kimono-koans (Author)

Just curious. Is there a reason why you can't/won't merge these changes back into skim?

As I stated in my post:

If anyone else is interested in performance improvements, and refactoring to reduce long term memory usage, I've been building on skim actively and I am eager to work with others who are similarly inclined.

My experience has been that this project is not actively maintained. I think I still have PRs outstanding: https://github.com/lotabout/skim/pulls/kimono-koans.

If anyone wants to assist, and contribute here, I'd help.

If no one else is so inclined, right now, I'll just keep developing in my own fork/branch.

@d3-X-t3r

Just curious, what's the reason for the name "two_percent"? It's kinda hard to remember (as in, what it does) and also to recommend to folks...

kimono-koans (Author) commented Apr 30, 2024

In the US, at least, we sell milk with different percentages of milk fat. skim has 0% milk fat. Whole milk has something like 4% milk fat. A popular option is called 2% milk or 2%.

If you install it via "cargo install two_percent" or "cargo deb --install", the binary/command will be installed as sk.

litoj commented May 4, 2024

Have you also implemented fzf-like color option parsing? I am very used to having some elements of the ui in bold, but that doesn't seem to be possible with the current implementation.

Edit: I tried cargo install two_percent, but the resulting executable just shows the loading indicator forever without actually loading anything.

@kimono-koans (Author)

Have you also implemented fzf-like color option parsing? I am very used to having some elements of the ui in bold, but that doesn't seem to be possible with the current implementation.

No, I haven't done any work on this, but I already see bold colors in my view? See: https://asciinema.org/a/637475

@LoricAndre (Contributor)

Hi @kimono-koans, we'd be happy to work with you to merge your improvements back into skim! Please tell us if you are interested.

litoj commented Nov 8, 2024

No, I haven't done any work on this, but I already see bold colors in my view?

Maybe the white looks a little bit whiter, but that isn't bold as in bold.

kimono-koans (Author) commented Nov 10, 2024

Hi @kimono-koans, we'd be happy to work with you to merge your improvements back into skim! Please tell us if you are interested.

Once you start merging new things, I'll perhaps work on some commits for upstream.

@LoricAndre (Contributor)

With this release, we should have some "quiet time" to start progressively merging your branch, if you'd like.

junegunn commented Dec 20, 2024

I don't see much point in measuring the performance of the v1 algorithm, because it is rarely used in practice. fzf does switch to it if an item is very long, but only for that item; the rest of the items in the list are processed using the default algorithm. This switching happens when M * N > 102400, where M is the length of the query and N is the length of an item. So for the 15-character query hopscotchbubble, a single item would have to be longer than about 6.8 KB (102400 / 15 ≈ 6827 bytes) for v1 to be used, and there are few such items in practice.
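The switching rule described above can be sketched directly. The M * N > 102400 condition is from the comment; use_v1 is an illustrative name, not fzf's actual function:

```rust
// Per-item algorithm switching: fall back to the simpler v1 algorithm
// only when query length * item length exceeds the limit for that item.
const LIMIT: usize = 102_400;

fn use_v1(query: &str, item: &str) -> bool {
    query.len() * item.len() > LIMIT
}

fn main() {
    let query = "hopscotchbubble"; // 15 bytes
    // 15 * 6826 = 102390: this item stays on the default algorithm.
    assert!(!use_v1(query, &"x".repeat(6_826)));
    // 15 * 6827 = 102405: this item alone is matched with v1.
    assert!(use_v1(query, &"x".repeat(6_827)));
}
```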

Also, please consider benchmarking against the latest version of fzf because the performance of fzf has improved a lot.
