chore: replace max_io_request #12997

JackTan25 · 2023-09-25T04:34:02Z

I hereby agree to the terms of the CLA available at: https://databend.rs/dev/policies/cla/

Summary

Summary about this PR
we need to use thread_nums to limit the permit_nums , in some cases, if the permit num is too large, we will get oom.

Closes #issue

This change is

vercel · 2023-09-25T04:34:05Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Ignored Deployment

Name	Status	Preview	Comments	Updated (UTC)
databend	⬜️ Ignored (Inspect)	Visit Preview		Sep 25, 2023 8:36am

JackTan25 · 2023-09-25T04:36:04Z

use thread_nums as permit_nums to control the concurrency. Do we need to use try_join_all directly without permits? or use thread_nums * 4 as the permit_nums? cc @dantengsky @zhyass

src/query/ee/src/storages/fuse/io/snapshots.rs

src/query/service/tests/it/storages/fuse/operations/mutation/recluster_mutator.rs

github-actions · 2023-09-25T05:05:43Z

Docker Image for PR

tag: pr-12997-5e39b00

note: this image tag is only available for internal use,
please check the internal doc for more details.

github-actions · 2023-09-25T06:05:50Z

Docker Image for PR

tag: pr-12997-cb1815e

note: this image tag is only available for internal use,
please check the internal doc for more details.

src/query/storages/fuse/src/io/segments.rs

zhang2014 · 2023-09-25T06:25:20Z

maybe we should add new settings is better, with default values same to max_threads

JackTan25 · 2023-09-25T06:29:55Z

maybe we should add new settings is better, with default values same to max_threads

I'm not sure whether we need the factor * thread_nums or just use try_join_all()

github-actions · 2023-09-25T06:41:21Z

ClickBench Report

zhang2014 · 2023-09-25T06:46:19Z

maybe we should add new settings is better, with default values same to max_threads

I'm not sure whether we need the factor * thread_nums or just use try_join_all()

it may be necessary when we need to adjust its parallel？

src/query/storages/fuse/src/io/snapshots.rs

BohuTANG · 2023-09-25T07:16:01Z

maybe we should add new settings is better, with default values same to max_threads

Looks we don't need the new setting, only adjust a factor with CPU is fine.

dantengsky · 2023-09-25T07:25:59Z

ci-benchmark

#12997 (comment)

shows no performance degradations, which is expected, since all the modifications are table mutation-related only (If I get it right).

but scenarios the ci-benchmark not covered, e.g. deletions/compactions/replace-into, etc, we do not know the impacts yet.

Maybe too conservative, but I am with @zhang2014 , that we may need to introduce a new setting(let's say, max_mutation_io_request or a better name), with a default value that equals the values of max_stroage_io_request, but can be tweaked individually.

JackTan25 · 2023-09-25T07:40:00Z

ci-benchmark

#12997 (comment)

shows no performance degradations, which is expected, since all the modifications are table mutation-related only (If I get it right).

but scenarios the ci-benchmark not covered, e.g. deletions/compactions/replace-into, etc, we do not know the impacts yet.

Maybe too conservative, but I am with @zhang2014 , that we may need to introduce a new setting(let's say, max_mutation_io_request or a better name), with a default value that equals the values of max_stroage_io_request, but can be tweaked individually.

I agree with that. When we set max_stroage_io_request, we will also set max_mutation_io_request, but we can also set max_mutation_io_request individually. And no need factor, just use max_mutation_io_request as the permit_nums in execute_futures_in_parallel. cc @dantengsky @zhang2014 @BohuTANG @zhyass can we be consistent with this?

BohuTANG · 2023-09-25T07:46:08Z

ci-benchmark
#12997 (comment)
shows no performance degradations, which is expected, since all the modifications are table mutation-related only (If I get it right).
but scenarios the ci-benchmark not covered, e.g. deletions/compactions/replace-into, etc, we do not know the impacts yet.
Maybe too conservative, but I am with @zhang2014 , that we may need to introduce a new setting(let's say, max_mutation_io_request or a better name), with a default value that equals the values of max_stroage_io_request, but can be tweaked individually.

I agree with that. When we set max_stroage_io_request, we will also set max_mutation_io_request, but we can also set max_mutation_io_request individually. And no need factor, just use max_mutation_io_request as the permit_nums in execute_futures_in_parallel. cc @dantengsky @zhang2014 @BohuTANG @zhyass can we be consistent with this?

Hmm...
This means that we do not do the CPU*fatctor right, depends another setting max_mutation_io_request .
This PR aims to make less settings.

JackTan25 · 2023-09-25T08:08:19Z

@zhyass Any concerns? shall we let the permit_nums eq scale_factor * threads_nums, where scale_factor = 4, in this case (reclustering/compaction) for example?

Set scale_factor * threads_nums, if use execute_futures_in_parallel. Otherwise use threads_nums directly.
use @zhyass 's suggestion.

* replace max_io_request * fix check * replace max_io_request with max_threads * remove write_segments * fix typo * use factor 4 when execute_futures_in_parallel advised by zhyass * rename * rename * add factor * use factor as 2

replace max_io_request

43fafc0

JackTan25 requested a review from zhyass September 25, 2023 04:34

github-actions bot added the pr-chore this PR only has small changes that no need to record, like coding styles. label Sep 25, 2023

JackTan25 requested a review from dantengsky September 25, 2023 04:34

JackTan25 added the ci-benchmark Benchmark: run all test label Sep 25, 2023

fix check

36376b0

BohuTANG reviewed Sep 25, 2023

View reviewed changes

src/query/ee/src/storages/fuse/io/snapshots.rs Outdated Show resolved Hide resolved

BohuTANG reviewed Sep 25, 2023

View reviewed changes

src/query/service/tests/it/storages/fuse/operations/mutation/recluster_mutator.rs Show resolved Hide resolved

replace max_io_request with max_threads

17fe995

dantengsky added ci-benchmark Benchmark: run all test and removed ci-benchmark Benchmark: run all test labels Sep 25, 2023

dantengsky reviewed Sep 25, 2023

View reviewed changes

src/query/storages/fuse/src/io/segments.rs Outdated Show resolved Hide resolved

remove write_segments

66afef7

dantengsky reviewed Sep 25, 2023

View reviewed changes

src/query/storages/fuse/src/io/snapshots.rs Outdated Show resolved Hide resolved

fix typo

db5f920

use factor 4 when execute_futures_in_parallel advised by zhyass

9dba639

rename

6e91a80

JackTan25 added 3 commits September 25, 2023 16:10

rename

e8cb7cb

add factor

7b5a77a

use factor as 2

7003dda

dantengsky approved these changes Sep 25, 2023

View reviewed changes

dantengsky merged commit 4ffd197 into databendlabs:main Sep 25, 2023
58 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: replace max_io_request #12997

chore: replace max_io_request #12997

JackTan25 commented Sep 25, 2023 •

edited

Loading

vercel bot commented Sep 25, 2023 •

edited

Loading

JackTan25 commented Sep 25, 2023 •

edited

Loading

github-actions bot commented Sep 25, 2023

github-actions bot commented Sep 25, 2023

zhang2014 commented Sep 25, 2023

JackTan25 commented Sep 25, 2023 •

edited

Loading

github-actions bot commented Sep 25, 2023

zhang2014 commented Sep 25, 2023

BohuTANG commented Sep 25, 2023

dantengsky commented Sep 25, 2023

JackTan25 commented Sep 25, 2023

BohuTANG commented Sep 25, 2023 •

edited

Loading

JackTan25 commented Sep 25, 2023

chore: replace max_io_request #12997

chore: replace max_io_request #12997

Conversation

JackTan25 commented Sep 25, 2023 • edited Loading

Summary

vercel bot commented Sep 25, 2023 • edited Loading

JackTan25 commented Sep 25, 2023 • edited Loading

github-actions bot commented Sep 25, 2023

Docker Image for PR

github-actions bot commented Sep 25, 2023

Docker Image for PR

zhang2014 commented Sep 25, 2023

JackTan25 commented Sep 25, 2023 • edited Loading

github-actions bot commented Sep 25, 2023

ClickBench Report

zhang2014 commented Sep 25, 2023

BohuTANG commented Sep 25, 2023

dantengsky commented Sep 25, 2023

JackTan25 commented Sep 25, 2023

BohuTANG commented Sep 25, 2023 • edited Loading

JackTan25 commented Sep 25, 2023

JackTan25 commented Sep 25, 2023 •

edited

Loading

vercel bot commented Sep 25, 2023 •

edited

Loading

JackTan25 commented Sep 25, 2023 •

edited

Loading

JackTan25 commented Sep 25, 2023 •

edited

Loading

BohuTANG commented Sep 25, 2023 •

edited

Loading