Using xdist to run pytest in parallel #6620

unkcpz · 2024-11-19T23:44:04Z

Running the tests in parallel with pytest-xdist reduces the run time of the ci-code workflow (2 cores).

ci-code / tests (3.9): ~28m -> ~15m
ci-code / tests (3.12): ~20m -> 12m
ci-code / presto: ~11m -> 6m
Fix all failing tests, if possible.
Bring the inconsistent tests back into the run.
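
For reference, the approximate timings above translate into the following speedup factors. This is only a small arithmetic sketch: the numbers are the rounded values quoted in this description, and the `speedup` helper is a name chosen here for illustration.

```python
# Reported CI timings in minutes (before, after), copied from the list above.
# The values are approximate, so the resulting factors are approximate too.
timings = {
    "tests (3.9)": (28, 15),
    "tests (3.12)": (20, 12),
    "presto": (11, 6),
}


def speedup(before: float, after: float) -> float:
    """Return how many times faster the run became."""
    return before / after


speedups = {job: round(speedup(b, a), 2) for job, (b, a) in timings.items()}
for job, factor in speedups.items():
    print(f"{job}: {factor}x")
```

On a 2-core runner the ideal factor would be 2, so these numbers are close to what can be expected from parallelisation alone.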

codecov · 2024-11-20T00:36:52Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 77.89%. Comparing base (ef60b66) to head (f7d3072).
Report is 147 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #6620      +/-   ##
==========================================
+ Coverage   77.51%   77.89%   +0.39%     
==========================================
  Files         560      567       +7     
  Lines       41444    42180     +736     
==========================================
+ Hits        32120    32852     +732     
- Misses       9324     9328       +4


danielhollas

This is very cool, thanks for doing this!

The PR itself looks good, but I am running into errors when I try to run it locally. I'll need to investigate a bit.

requirements/requirements-py-3.12.txt

danielhollas · 2024-11-20T14:46:13Z

tests/orm/test_fields.py

-)
-def test_all_node_fields(group, name, data_regression):
+@pytest.fixture
+def available_entry_points():


Thanks, this is much better. Maybe the name of the fixture should be more specific, since you're only returning node and data entry points. Perhaps node_and_data_entry_points?

unkcpz · 2024-11-21T08:55:58Z

Running with my 16 cores, I found some issues caused by the storage being accessed concurrently.
The problem is that tests/tools/archive/test_simple.py::test_base_data_nodes did not clean the storage before the test, so the shared storage could contain unsealed nodes.

The restapi tests failed randomly because they each start a REST API server on port 5000 at the same time (fixed by 3bf2c72).
The following failure should be solved by "Use only one global var for marking config folder tree" #6610:

    def test_load_backend_if_not_loaded_load_once(manager, monkeypatch):
        """Test :meth:`aiida.cmdline.utils.decorators.load_backend_if_not_loaded` calls ``get_profile_storage`` once."""
        mocked = mock.Mock()
    
        # This test assumes the ``load_backend_if_not_loaded`` uses ``get_profile_storage`` to load the profile, so we need
        # to first check that this is the case. If this changes, this first test will fail alerting that it needs to be
        # adapted.
        with monkeypatch.context() as context:
            context.setattr(manager.__class__, 'get_profile_storage', mocked)
            load_backend_if_not_loaded()
>           assert mocked.call_count == 1
E           AssertionError: assert 0 == 1
E            +  where 0 = <Mock id='125376138318368'>.call_count

tests/cmdline/utils/test_decorators.py:82: AssertionError
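
Regarding the port 5000 clash between the restapi tests mentioned above: one common way to avoid a fixed port is to let the operating system pick a free one by binding to port 0. The sketch below only illustrates that idea with the standard library; it is not necessarily what 3bf2c72 does, and `find_free_port` is a hypothetical helper name.

```python
import socket


def find_free_port(host: str = "127.0.0.1") -> int:
    """Ask the OS for a currently unused TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind((host, 0))
        # getsockname() returns (host, port) for an AF_INET socket.
        return sock.getsockname()[1]


port = find_free_port()
print(f"a test server could listen on port {port}")
```

Note that the port is released again when the socket closes, so another process could in principle grab it before the server starts. For test servers this small race is usually acceptable.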

tests/transports/test_all_plugins.py should use an independent tmp path. This is addressed in "Refactoring: use tmp path fixture to mock remote and local for transport plugins" #6627.

A typical error when the tmp_path fixture is not used:

FAILED tests/transports/test_all_plugins.py::test_put_get_empty_string_file[core.ssh] - OSError: Error during mkdir of 'tmp_try' from folder '/tmp', maybe you don't have the permissions to do it, or the directory already exists? (Failure)
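
The error above comes from several workers trying to create the same fixed directory name under /tmp. pytest's built-in tmp_path fixture gives every test its own directory; the sketch below reproduces the same idea with the standard library only, so it runs without pytest. The `make_scratch_dir` helper is a hypothetical name, not something from the test suite.

```python
import shutil
import tempfile
from pathlib import Path


def make_scratch_dir(prefix: str = "tmp_try_") -> Path:
    """Create a fresh, uniquely named directory instead of a fixed '/tmp/tmp_try'."""
    return Path(tempfile.mkdtemp(prefix=prefix))


# Two "tests" running at the same time each get their own directory.
first = make_scratch_dir()
second = make_scratch_dir()
distinct = first != second
both_exist = first.is_dir() and second.is_dir()
print(distinct, both_exist)

# Clean up, which the tmp_path fixture would otherwise handle.
shutil.rmtree(first)
shutil.rmtree(second)
```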

More tests failed randomly and should be fixed:

ERROR tests/tools/archive/orm/test_groups.py::test_nodes_in_group - sqlalchemy.orm.exc.DetachedInstanceError: Instance <DbUser at 0x73ad11c478f0> is not bound to a Session; attribute refresh operation cannot proceed (Background on this error at: https://sqlalche.me/e/20...
ERROR tests/cmdline/commands/test_status.py::test_status_no_profile - TimeoutError: disconnect after 10 seconds
FAILED tests/cmdline/commands/test_rabbitmq.py::test_revive - AssertionError: Traceback (most recent call last):
FAILED tests/orm/test_querybuilder.py::TestBasic::test_tuples - AssertionError: assert 11 == 1
FAILED tests/tools/archive/orm/test_authinfo.py::test_create_all_no_authinfo - aiida.tools.archive.exceptions.ExportValidationError: All ProcessNodes must be sealed before they can be exported. Node(s) with PK(s): 61 is/are not sealed.
FAILED tests/tools/archive/orm/test_authinfo.py::test_create_all_with_authinfo - aiida.tools.archive.exceptions.ExportValidationError: All ProcessNodes must be sealed before they can be exported. Node(s) with PK(s): 61 is/are not sealed.

tests/tools/archive/test_simple.py


unkcpz · 2024-11-29T15:11:06Z

Hi @danielhollas, the archive creation tests fail randomly because they use the shared DB, which may contain unsealed nodes.
I think it would be possible to use the SQLite backend so that each test here has its own DB, correct?

@pytest.mark.usefixtures('aiida_localhost')
def test_create_all_no_authinfo(tmp_path):
    """Test archive creation that does not include authinfo."""
    filename1 = tmp_path / 'export1.aiida'
    create_archive(None, filename=filename1, include_authinfos=False)
    with get_format().open(filename1, 'r') as archive:
        assert archive.querybuilder().append(orm.AuthInfo).count() == 0


@pytest.mark.usefixtures('aiida_localhost')
def test_create_all_with_authinfo(tmp_path):
    """Test archive creation that does include authinfo."""
    filename1 = tmp_path / 'export1.aiida'
    create_archive(None, filename=filename1, include_authinfos=True)
    with get_format().open(filename1, 'r') as archive:
        assert archive.querybuilder().append(orm.AuthInfo).count() == 1

EDIT: never mind, I think I can just add the aiida_profile_clean fixture. The storage is per test and is reset by the fixture.

unkcpz · 2024-11-29T15:47:25Z

I tested multiple times with -n 16 on my laptop and -n 32 on my workstation. All tests passed 😎

unkcpz · 2024-11-29T15:49:02Z

tests/tools/visualization/test_graph.py

@@ -353,6 +353,7 @@ def test_graph_node_identifiers(self, node_id_type, monkeypatch, file_regression
        # The order of certain output lines can be randomly ordered so we split the file in lines, sort, and then join
        # them into a single string again. The node identifiers generated by the engine are of the form ``N{pk}`` and
        # they also clearly vary, so they are replaced with the ``NODE`` placeholder.
-        string = '\n'.join(sorted(graph.graphviz.source.strip().split('\n')))


If the sort happens first, the order may depend on the pk numbers, and the test fails because the line order in the resulting string changes. I now sort the lines after replacing the pk numbers.
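
A minimal illustration of why the order of the two steps matters. The sample lines and the `N\d+` pattern are assumptions made for this sketch (the real test works on the graphviz source), and both function names are hypothetical.

```python
import re


def sort_then_replace(source: str) -> str:
    """Old order: sort the raw lines, then mask the pk-based identifiers."""
    lines = sorted(source.strip().split("\n"))
    return re.sub(r"N\d+", "NODE", "\n".join(lines))


def replace_then_sort(source: str) -> str:
    """New order: mask the identifiers first, then sort the lines."""
    replaced = re.sub(r"N\d+", "NODE", source.strip())
    return "\n".join(sorted(replaced.split("\n")))


# The same two nodes, but with different pks in two test runs.
run_a = "N9 [label=a]\nN10 [label=b]"
run_b = "N1 [label=a]\nN2 [label=b]"

# Sorting first puts "N10" before "N9", so the two runs disagree.
print(sort_then_replace(run_a) == sort_then_replace(run_b))  # False
# Replacing first makes the result independent of the pk values.
print(replace_then_sort(run_a) == replace_then_sort(run_b))  # True
```

With a shared or non-empty database under xdist, the pks depend on what other tests have already stored, which is why the old order became flaky.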

unkcpz · 2024-11-29T15:51:37Z

On my workstation it now sometimes fails with an SSH banner error, but I think that is because there are too many simultaneous SSH connections and I probably have some sshd limit configured on the machine. I think that is fine.

Using xdist to run pytest in parallel

d2750fd

unkcpz marked this pull request as draft November 19, 2024 23:44

unkcpz added 2 commits November 20, 2024 01:11

Add to requirement deps list

577c188

The regular tests suits as well

5848b8c

Fix order change entry points test for xdist

115e041

unkcpz marked this pull request as ready for review November 20, 2024 07:40

unkcpz requested a review from danielhollas November 20, 2024 07:43

danielhollas reviewed Nov 20, 2024


unkcpz added 2 commits November 21, 2024 09:56

Clean storage before test_base_data_nodes

7bb6476

Rename the fixture for static node/data entry points

81895bb

danielhollas reviewed Nov 21, 2024


unkcpz and others added 4 commits November 21, 2024 10:57

Make restapi ports dynamic and non-conflict

3bf2c72

test_all_plugins conflict when run in parallel

e740a0e

[pre-commit.ci] auto fixes from pre-commit.com hooks

9b4bd40

for more information, see https://pre-commit.ci

Merge branch 'main' into xdist

d697748

This was referenced Nov 25, 2024

Refactoring: use tmp path fixture to mock remote and local for transport plugins #6627

Merged

Use only one global var for marking config folder tree #6610

Merged

unkcpz and others added 4 commits November 28, 2024 01:27

Merge branch 'main' into xdist

b2656d4

Independent profile in test_log.py

9f591fe

Merge branch 'main' into xdist

2612f52

[pre-commit.ci] auto fixes from pre-commit.com hooks

62f6a54

for more information, see https://pre-commit.ci

unkcpz added 4 commits November 29, 2024 16:19

Clean Db before query when the test need count

305cc54

Clean storage for test_authinfo tests

2b4eda3

test_group.py all require reset DB

39d083f

Replace string and then sort for graph node string tests

b832d6a

unkcpz requested a review from danielhollas November 29, 2024 15:46

unkcpz commented Nov 29, 2024


unkcpz added 2 commits November 29, 2024 16:57

Also for test in test-install

1d5b9ba

clean storage before for test_walk_nodes

f7d3072
