Queries stuck in FINISHING time #463

KerenMousseri · 2024-06-02T09:00:58Z

Expected behavior

Queries' results should be successfully recieved to the client.

Actual behavior

In our Trino cluster, we are facing an issur that some queries remain stuck in the FINISHING state for an extended period before eventually failing with the error message: "Query was abandoned by the client, as it may have exited or stopped checking for query results."

After conducting some investigation, it appears that this issue predominantly occurs when querying Trino using the Python client. Here's a breakdown of the observed flow:

In the main module, we execute the TrinoQuery.execute function with our query.
This function initiates a POST request to the Trino coordinator.
Subsequently, it sends a GET request to the nextUri to retrieve the initial batch of query results.
As the results start arriving, the query state transitions to FINISHING.
The execution of the execute function ends.
Following this, the cursor.fetchall() function in the main module iterates over the nextUris, yielding each received row to the client. However, after a certain duration of fetching query results, the query fails with the "query abandon" error (as mentioned above).

Any assistance on resolving this significant issue would be greatly appreciated.

Thank you!!

Steps To Reproduce

Is it advisable to incorporate heartbeats to the coordinator while fetching results?
Would it be feasible to fetch multiple nextUris in parallel? I'm uncertain about this possibility due to the need to access nextUris as a linked list.

Log output

No response

Operating System

Windows

Trino Python client version

0.326.0

Trino Server version

439

Python version

3.9.3

Are you willing to submit PR?

Yes I am willing to submit a PR!

The text was updated successfully, but these errors were encountered:

njalan · 2024-09-10T07:27:05Z

@hashhar Is there any progress on it? We also face the same issue

hashhar · 2024-09-12T08:25:38Z

This is hard to reproduce and unclear if the causes for your case and our reproduction is same.

So we plan to add additional debug logging and then when someone is able to reproduce this issue we can look at the logs to figure out what is going wrong.

Probably here - https://github.com/trinodb/trino-python-client/blob/a87566794d9a9eefdd481a95f001ce2e37e20531/trino/client.py#L846C1-L846C65

hashhar added the bug Something isn't working label Jun 6, 2024

hashhar self-assigned this Jun 6, 2024

hashhar mentioned this issue Nov 21, 2024

Curious Trino retry behaviour trinodb/trino#22989

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Queries stuck in FINISHING time #463

Queries stuck in FINISHING time #463

KerenMousseri commented Jun 2, 2024 •

edited

Loading

njalan commented Sep 10, 2024

hashhar commented Sep 12, 2024 •

edited

Loading

Queries stuck in FINISHING time #463

Queries stuck in FINISHING time #463

Comments

KerenMousseri commented Jun 2, 2024 • edited Loading

Expected behavior

Actual behavior

Steps To Reproduce

Log output

Operating System

Trino Python client version

Trino Server version

Python version

Are you willing to submit PR?

njalan commented Sep 10, 2024

hashhar commented Sep 12, 2024 • edited Loading

KerenMousseri commented Jun 2, 2024 •

edited

Loading

hashhar commented Sep 12, 2024 •

edited

Loading