Memory refactor #1205

Andy-Jost · 2025-10-31T22:41:52Z

Major refactoring of the memory package.

Overview
This PR refactors the _memory.pyx module into a dedicated package (_memory/) to address its growing size and complexity, which were hindering further development. The primary goals are to physically separate the code into more manageable submodules, simplify the internal logic, and enhance the overall structure, including the addition of .pxd headers for better Cython integration.

Major Changes

Split _memory.pyx into submodules, the major ones being the following:
- Buffers: _buffer.*
- Device memory resources: _dmr.*
- IPC (Inter-Process Communication): _ipc.*
- Virtual memory management: _vmm.*
Introduced Cython headers (.pxd) for public definitions to improve modularity and type safety.
Refactored DeviceMemoryResource to isolate IPC-related code, reducing coupling.
Simplified IPC implementation by adding an IPCData class to encapsulate relevant data members and eliminating a redundant uuid field.
Streamlined the class hierarchy by removing unnecessary classes.
Simplified the Cython interface for memory allocation and deallocation operations.

Minor Improvements

Added __all__ lists to modules for explicit control over exports.
Extracted long implementation functions from class definitions to make classes more concise and readable.
Renamed various private attributes and methods for consistency (e.g., _handle instead of _mempool_handle).
Consolidated and alphabetized property definitions for better organization.
Converted additional classes and functions to Cython for performance gains.

copy-pr-bot · 2025-10-31T22:41:55Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Andy-Jost · 2025-11-07T20:43:52Z

/ok to test cf4dc9d

Andy-Jost · 2025-11-10T20:38:35Z

/ok to test 19e4b8f

cuda_core/cuda/core/experimental/_memory/_buffer.pyx

cuda_core/cuda/core/experimental/_memory/_legacy.py

rparolin

Generally looks good to me. I left some general comments that I consider non-blocking.

@leofang Any major concerns you'd like to see address before this gets merged?

cuda_core/cuda/core/experimental/_device.pyx

cuda_core/tests/test_memory.py

Andy-Jost · 2025-11-12T19:09:48Z

/ok to test ff3820f

leofang · 2025-11-12T19:19:08Z

cuda_core/cuda/core/experimental/_memory/__init__.py

+from ._buffer import *  # noqa: F403
+from ._device_memory_resource import *  # noqa: F403
+from ._ipc import *  # noqa: F403
+from ._legacy import *  # noqa: F403
+from ._virtual_memory_resource import *  # noqa: F403


Style: Would be better to call out what's being imported. Maintaining __all__ or having to chasing after each module for their __all__ is not fun. I don't recall we ever maintain __all__ in any other modules.

I guess I have the opposite preference. E.g., if I add something to, say, _buffer I find it easier to update the __all__ list there rather than in a separate file.

github-actions · 2025-11-12T20:39:55Z

Doc Preview CI
Preview removed because the pull request was closed or merged.

cuda_core/cuda/core/experimental/_memory/_buffer.pyx

Andy-Jost added 25 commits October 28, 2025 15:43

Resolve a Cython build warning.

37849a3

Make memory module into a package.

ac8a69c

Rename cyStream to _cyStream for consistency.

123aa24

Move defs to memory.pxd header

fe4b67e

Separate VMM.

ce77d44

Weaken dependencies from device to memory module.

c5179bc

Move LegacyPinnedMemoryResource to a submodule.

e192748

Move _SynchronousMemoryResource into a submodule.

729c900

Partly separates the IPC implementation.

8735455

Move IPC registry to ipc module.

b2517f6

Collect and reorder DeviceMemoryResource properties.

a61317a

Move more IPC implementation out of DeviceMemoryResource.

0e2d1d8

Minor refactoring.

5387629

Move Buffer IPC implementation.

7fa38ca

Simplify the class hierarchy (remove _cyBuffer and _cyMemoryResource).

f357abd

Refactor to shrink Cython interface.

89057f9

Simplify Buffer close.

00b60eb

Refactor DeviceMemoryResource.__init__.

ecc9405

Move Buffer into a separate module.

228936b

Refactors DeviceMemoryResource IPC implementation.

9a86bde

Removes superfluous _uuid member of DeviceMemoryResource.

c7f6cde

Adds __all__ lists.

216b4fb

Prepend underscore to submodules, add a test for package contents.

6a30a39

Refactor IPC data of DMR into IPCData class.

229ddc6

General clean up.

0fd3ca9

Andy-Jost requested review from cpcloud, leofang and mdboom and removed request for cpcloud October 31, 2025 22:41

Merge branch 'main' into memory-refactor

cf4dc9d

leofang assigned Andy-Jost Nov 10, 2025

leofang added this to the cuda.core beta 9 milestone Nov 10, 2025

leofang added enhancement Any code-related improvements P0 High priority - Must do! labels Nov 10, 2025

Merge branch 'main' into memory-refactor

19e4b8f

rparolin reviewed Nov 12, 2025

View reviewed changes

cuda_core/cuda/core/experimental/_memory/_buffer.pyx Show resolved Hide resolved

rparolin reviewed Nov 12, 2025

View reviewed changes

cuda_core/cuda/core/experimental/_memory/_buffer.pyx Show resolved Hide resolved

rparolin reviewed Nov 12, 2025

View reviewed changes

cuda_core/cuda/core/experimental/_memory/_buffer.pyx Show resolved Hide resolved

rparolin reviewed Nov 12, 2025

View reviewed changes

cuda_core/cuda/core/experimental/_memory/_buffer.pyx Show resolved Hide resolved

rparolin reviewed Nov 12, 2025

View reviewed changes

cuda_core/cuda/core/experimental/_memory/_buffer.pyx Show resolved Hide resolved

Andy-Jost added 2 commits November 12, 2025 10:34

Merge remote-tracking branch 'origin/main' into memory-refactor

743b8a3

Rename files _dmr.* and _vmm.py to avoid abbreviations.

cce7f6c

Andy-Jost force-pushed the memory-refactor branch from b5ae007 to cce7f6c Compare November 12, 2025 18:34

Merge branch 'main' into memory-refactor

ff3820f

rparolin reviewed Nov 12, 2025

View reviewed changes

cuda_core/cuda/core/experimental/_memory/_legacy.py Show resolved Hide resolved

rparolin approved these changes Nov 12, 2025

View reviewed changes

leofang reviewed Nov 12, 2025

View reviewed changes

cuda_core/cuda/core/experimental/_device.pyx Show resolved Hide resolved

leofang reviewed Nov 12, 2025

View reviewed changes

cuda_core/tests/test_memory.py Show resolved Hide resolved

Andy-Jost enabled auto-merge (squash) November 12, 2025 19:09

leofang reviewed Nov 12, 2025

View reviewed changes

Andy-Jost merged commit f9df16f into NVIDIA:main Nov 12, 2025
57 checks passed

leofang reviewed Nov 19, 2025

View reviewed changes

cuda_core/cuda/core/experimental/_memory/_buffer.pyx Show resolved Hide resolved

leofang mentioned this pull request Nov 26, 2025

feat: Introduce StridedLayout, support wrapping external allocations in Buffer, add StridedMemoryView.from_buffer #1283

Merged

2 tasks

leofang mentioned this pull request Dec 11, 2025

Is MemoryResource base class changed on the main branch on purpose? #1359

Closed

Andy-Jost deleted the memory-refactor branch January 14, 2026 01:55

Memory refactor #1205

Memory refactor #1205

Uh oh!

Conversation

Andy-Jost commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Major refactoring of the memory package.

Uh oh!

copy-pr-bot bot commented Oct 31, 2025

Uh oh!

Andy-Jost commented Nov 7, 2025

Uh oh!

Andy-Jost commented Nov 10, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rparolin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Andy-Jost commented Nov 12, 2025

Uh oh!

leofang Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

Andy-Jost Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Nov 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Andy-Jost commented Oct 31, 2025 •

edited

Loading