[SPIKE] Experiment with performance of catalog queries #9506

dbeatty10 · 2024-02-01T21:23:04Z

Housekeeping

I am a maintainer of dbt-core

Short description

For dbt-bigquery and dbt-snowflake, experiment with different settings for relation_count:

- how does it affect the performance of the catalog query over a range of values?
- is there a point at which the query becomes too big to be accepted for execution?
  - can we determine the size of the query to be submitted so as not to exceed the 1 MB limits set by Snowflake and BigQuery?
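
One way to approach the size question is to measure the encoded length of the generated SQL before submitting it. The following is only a rough sketch: `build_catalog_query` is a hypothetical stand-in rather than the adapters' real catalog query, the other helper names are made up for illustration, and the 1 MB figure is taken from the description above rather than checked against each warehouse's documentation.

```python
# Hypothetical sketch: none of these names come from dbt-core or its adapters.
ONE_MB = 1024 * 1024  # limit in bytes as assumed in this issue; verify per warehouse


def query_size_bytes(sql: str) -> int:
    """Return the size of the query text when encoded as UTF-8."""
    return len(sql.encode("utf-8"))


def build_catalog_query(relations):
    """Stand-in for a catalog query: one predicate per (schema, name) pair."""
    predicates = " or ".join(
        f"(table_schema = '{schema}' and table_name = '{name}')"
        for schema, name in relations
    )
    return f"select * from information_schema.columns where {predicates}"


def max_relations_under_limit(relations, limit=ONE_MB):
    """Largest prefix of `relations` whose generated query fits in `limit` bytes.

    Binary search works because the query only grows as relations are added.
    """
    lo, hi = 0, len(relations)
    while lo < hi:
        mid = (lo + hi + 1) // 2
        if query_size_bytes(build_catalog_query(relations[:mid])) <= limit:
            lo = mid
        else:
            hi = mid - 1
    return lo
```

A check like this could inform an upper bound for relation_count, though the real query text and the exact way each warehouse counts its limit would need to be confirmed.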

Acceptance criteria

We can see a graph of the time to run the catalog query (in seconds) on the y-axis vs. the number of selected nodes on the x-axis.

We'd generally expect it to look like one of the curves below (ideally the constant time blue one, but I'm guessing not 😉):
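
The data for such a graph could be collected with a small timing harness along these lines. This is a minimal sketch, not an existing dbt utility: `run_query` is a hypothetical callable that executes the catalog query for a given number of selected nodes, and taking the best of several repeats is just one way to reduce noise.

```python
import time


def time_catalog_query(run_query, node_counts, repeats=3):
    """Return (node_count, best_seconds) pairs for each count in node_counts.

    `run_query` is a hypothetical callable that runs the catalog query
    for the given number of selected nodes.
    """
    results = []
    for count in node_counts:
        timings = []
        for _ in range(repeats):
            start = time.perf_counter()
            run_query(count)
            timings.append(time.perf_counter() - start)
        # Keep the fastest run to reduce the effect of transient noise.
        results.append((count, min(timings)))
    return results
```

The resulting pairs map directly onto the x-axis (selected nodes) and y-axis (seconds) described above.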

Impact to Adapters

Depending on the results of the experiment, we may choose to use different values for relation_count in dbt-bigquery and/or dbt-snowflake. Alternatively, we may choose to change our implementation in some way.

Context

The work was initially performed in #8521 / #8648.

Then #9394 expressed the expectation that we'd get the benefits of #8648 even if more than 100 nodes are selected.


ChenyuLInx · 2024-04-09T18:42:23Z

@dbeatty10 why only bigquery and snowflake?

ChenyuLInx · 2024-04-09T18:46:01Z

Another dimension to consider: how many objects are in the schema.

dbeatty10 · 2024-04-09T19:06:41Z

@ChenyuLInx yeah, it makes good sense to do both bigquery and snowflake. And also consider the number of objects within the schema. 👍

dbeatty10 mentioned this issue Feb 1, 2024

[CT-3562] [Feature] Catalog queries filters for > 100 nodes #9394

Closed

2 tasks

graciegoheen mentioned this issue Feb 2, 2024

[Epic] Applied State (part 2) #9425

Closed

graciegoheen added the enhancement New feature or request label Feb 2, 2024

martynydbt modified the milestones: v1.8, v1.9 Feb 8, 2024

martynydbt assigned peterallenwebb Apr 6, 2024

graciegoheen unassigned peterallenwebb Apr 9, 2024

martynydbt assigned aranke and unassigned aranke Apr 25, 2024

graciegoheen removed this from the v1.9 milestone Nov 11, 2024
