Summary
This test has been flaky for a while, but since #298 it has failed more consistently on Python 3.13. The failure is likely related to the underlying issue/mechanism in #276. A more robust version of the test is needed.