Fix the `FormSum` memory leak #3897

Ig-dolci · 2024-11-28T16:05:17Z

Description

We are having memory leaks and more expensive computation of solvers involving FormSum (See more details on this discussion). That is caused by this operation. My solution here is to make a sum operation through the numpy arrays. Below is a comparison (using the example added at the discussion) of time and memory computation differences between the master branch and the current PR.

github-actions · 2024-11-28T16:29:06Z

	Tests	Passed ✅	Skipped ⏭️	Failed ❌
Firedrake complex	8125 ran	6540 passed	1585 skipped	0 failed

github-actions · 2024-11-28T16:29:07Z

	Tests	Passed ✅	Skipped ⏭️	Failed ❌
Firedrake real	8131 ran	7345 passed	786 skipped	0 failed

JHopeCollins · 2024-11-28T16:45:13Z

Oh dear, if this fixes the issue then that interface could do with updating!

I think that the traversal method is taking advantage of the behaviour described here: #3348 (comment)

This behaviour is not widely known and is quite unintuitive - it's usually considered a bug. It might be best to change this method signature (and the preorder traversal method) to use visited=None as the kwarg, and use a self._visited dict with visited = visited if visited else self._visited

Ig-dolci · 2024-11-28T16:58:04Z

Oh dear, if this fixes the issue then that interface could do with updating!

I think that the traversal method is taking advantage of the behaviour described here: #3348 (comment)

This behaviour is not widely known and is quite unintuitive - it's usually considered a bug. It might be best to change this method signature (and the preorder traversal method) to use visited=None as the kwarg, and use a self._visited dict with visited = visited if visited else self._visited

No. This does not fix it. I am still debugging.

JHopeCollins · 2024-11-28T17:08:15Z

No. This does not fix it. I am still debugging.

Shame it wasn't so simple! It may still be good to change the signatures to avoid that behaviour though

Ig-dolci · 2024-11-28T17:17:45Z

Oh dear, if this fixes the issue then that interface could do with updating!

I think that the traversal method is taking advantage of the behaviour described here: #3348 (comment)

This behaviour is not widely known and is quite unintuitive - it's usually considered a bug. It might be best to change this method signature (and the preorder traversal method) to use visited=None as the kwarg, and use a self._visited dict with visited = visited if visited else self._visited

I see. I gonna try visited=None.

JHopeCollins · 2024-11-29T10:45:59Z

firedrake/assemble.py

@@ -584,7 +583,8 @@ def update_tensor(assembled_base_form, tensor):
            raise NotImplementedError("Cannot update tensor of type %s" % type(tensor))

    @staticmethod
-    def base_form_postorder_traversal(expr, visitor, visited={}):
+    def base_form_postorder_traversal(expr, visitor, visited=None):
+        visited = visited if visited is not None else {}


To have the same caching behaviour as before, without the {} default kwarg, this should stash visited as an attribute so it get's reused unless the user passes visited, e.g.

visited = visited if visited is not None else self._visited

I decided to keep the original code here.

…nstead of dat

Ig-dolci · 2024-12-02T11:08:21Z

firedrake/assemble.py

+                            dat_result.data_ro_with_halos,
+                            w * dat_op.data_ro_with_halos,
+                            out=dat_result.data_wo_with_halos)


I am not sure if make this operation with_halos is the right think to do.

Does result.assign(sum(w*arg for arg in args)) work? This code looks very very similar to what we do in assign.py.

No. result is a Cofunction, and Cofunction assigning an exp that is isinstance(expr, BaseForm) will reach this code again, which leads to a maximum recursion. See this Cofunction assignment code.

Remove visited

3359367

Ig-dolci added 2 commits November 28, 2024 17:46

Set visited as None

c66d385

wip

7151998

JHopeCollins reviewed Nov 29, 2024

View reviewed changes

Ig-dolci added 2 commits November 29, 2024 15:46

wip

243c6dc

Revert to original visited definition; optimize via data operations i…

b35d07a

…nstead of dat

Ig-dolci changed the title ~~Fix the FormSum!?~~ Fix the FormSum memory leak Dec 2, 2024

Ig-dolci commented Dec 2, 2024

View reviewed changes

Ig-dolci marked this pull request as ready for review December 2, 2024 14:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix the `FormSum` memory leak #3897

Fix the `FormSum` memory leak #3897

Ig-dolci commented Nov 28, 2024 •

edited

Loading

github-actions bot commented Nov 28, 2024 •

edited

Loading

github-actions bot commented Nov 28, 2024 •

edited

Loading

JHopeCollins commented Nov 28, 2024

Ig-dolci commented Nov 28, 2024

JHopeCollins commented Nov 28, 2024

Ig-dolci commented Nov 28, 2024

JHopeCollins Nov 29, 2024

Ig-dolci Dec 2, 2024

Ig-dolci Dec 2, 2024

connorjward Dec 2, 2024

Ig-dolci Dec 2, 2024 •

edited

Loading

Fix the FormSum memory leak #3897

Are you sure you want to change the base?

Fix the FormSum memory leak #3897

Conversation

Ig-dolci commented Nov 28, 2024 • edited Loading

Description

github-actions bot commented Nov 28, 2024 • edited Loading

github-actions bot commented Nov 28, 2024 • edited Loading

JHopeCollins commented Nov 28, 2024

Ig-dolci commented Nov 28, 2024

JHopeCollins commented Nov 28, 2024

Ig-dolci commented Nov 28, 2024

JHopeCollins Nov 29, 2024

Choose a reason for hiding this comment

Ig-dolci Dec 2, 2024

Choose a reason for hiding this comment

Ig-dolci Dec 2, 2024

Choose a reason for hiding this comment

connorjward Dec 2, 2024

Choose a reason for hiding this comment

Ig-dolci Dec 2, 2024 • edited Loading

Choose a reason for hiding this comment

Fix the `FormSum` memory leak #3897

Fix the `FormSum` memory leak #3897

Ig-dolci commented Nov 28, 2024 •

edited

Loading

github-actions bot commented Nov 28, 2024 •

edited

Loading

github-actions bot commented Nov 28, 2024 •

edited

Loading

Ig-dolci Dec 2, 2024 •

edited

Loading