Conversation
…a into ig/array_api_continue import merge
flying-sheep
left a comment
OK, I just went over general code style, nothing JAX-related
src/anndata/_core/merge.py
Outdated
# Force to NumPy (materializes JAX/Cubed); fine for small tests,
# but may be slow or fail on large/lazy arrays
This code doesn’t just run for tests though. Also are you sure that this is a good idea for arrays with pandas dtypes?
Yeah, I was initially forcing everything to NumPy, but that’s no longer the case. I’ve updated it so that it should preserve arrays with pandas dtypes.
src/anndata/_core/merge.py
Outdated
    return False


def _to_numpy_if_array_api(x):
there should be no second copy of this that’s slightly different, only one!
src/anndata/_core/anndata.py
Outdated
dest = self._adata_ref._X
# Handles read-only NumPy views from backend arrays like JAX by
# making a writable copy so in-place assignment on views can succeed.
if isinstance(dest, np.ndarray) and not dest.flags.writeable:
    dest = np.array(dest, copy=True)  # make a fresh, writable buffer
self._adata_ref._X = dest
I would actually just let the error be thrown in this case. If something isn't writeable, I don't think that's our responsibility to handle
src/anndata/_core/merge.py
Outdated
    hasattr(x, "dtype") and is_extension_array_dtype(x.dtype)
):
    return x
return np.asarray(x)
Ok nice, this is the right direction no doubt! So what we want here probably is not to rely on asarray but dlpack to do the conversion. In short:
- We should have a check in _apply_to_array to see if something is array-api compatible but not a numpy ndarray.
- If this case is true, dlpack into numpy, recursively call _apply_to_array.
- Then use dlpack to take the output of the recursive call back to the original type before we went to numpy.

Does that make sense?
I think this is a nice paradigm to follow for situations where we have an existing numpy or cupy implementation and it isn't clear how to use the array-api to achieve our aims. We should still try to use it as much as possible so that we can eventually remove numpy codepaths where possible, but this is a nice first step.
… with copying introduced as an extra precaution
bf5194f to cddb604
flying-sheep
left a comment
looks great, I just have some clarifying questions!
Co-authored-by: Philipp A. <flying-sheep@web.de>
# As of 2023 dlpack, it must be possible for a library to export to this, see: https://data-apis.org/array-api/latest/API_specification/generated/array_api.array.__dlpack__.html#array_api.array.__dlpack__
# However, https://github.com/numpy/numpy/issues/20742 means we can't roundtrip jax arrays using dlpack so better to just let numpy do its thing in asarray.
self.add_array(np.asarray(self.get_default()))
res = np.from_dlpack(self._manager[(1, 0)])
Suggested change:
- res = np.from_dlpack(self._manager[(1, 0)])
+ res = np.from_dlpack(self._manager[1, 0])
Two-tier detection: tier 1 uses the canonical has_xp() protocol check from anndata.compat (catches JAX, numpy >=2.0); tier 2 falls back to duck-typing (shape/dtype/ndim) for arrays that don't yet implement the full protocol (PyTorch, TensorFlow). Also uses __array_namespace__() for backend label resolution and updates stale PR scverse#2063 → scverse#2071.
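A minimal sketch of that two-tier check (hedged: the function name here is illustrative, not necessarily what anndata.compat exposes, and the duck-typing tier is deliberately loose):

```python
import numpy as np


def is_array_api_like(x):
    # Tier 1: the standard protocol. JAX arrays and NumPy >= 2.0
    # ndarrays expose __array_namespace__.
    if hasattr(x, "__array_namespace__"):
        return True
    # Tier 2: duck-typing fallback for backends that don't implement
    # the protocol yet (e.g. PyTorch, TensorFlow tensors). This is
    # intentionally broad and can over-match other array-likes.
    return all(hasattr(x, attr) for attr in ("shape", "dtype", "ndim"))
```

The over-matching in tier 2 is the "too broad" type detection mentioned in the limitations below; pandas objects, for instance, also carry shape/dtype/ndim.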
fixes #1731, fixes #1195, fixes #697
First step in getting anndata concat and test generation to work properly with JAX (and potentially Cubed), without just converting everything into NumPy.
Random data creation and shape handling use xp.asarray so arrays stay in their original backend where possible. I also updated concat paths to actually check types before converting, added helpers for sparse detection and array API checks, and made sure backend arrays only get turned into NumPy when absolutely necessary. This fixes a bunch of concat-related test failures.
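To illustrate the xp.asarray pattern (a hypothetical helper, not the PR's actual function): generate data with NumPy, then land it in the caller's backend via the resolved namespace, so NumPy inputs stay NumPy and JAX inputs stay JAX:

```python
import numpy as np


def random_like(like, shape, seed=0):
    # Resolve the array-API namespace of `like`, falling back to NumPy
    # for arrays that don't expose the protocol (e.g. NumPy < 2.0).
    xp = like.__array_namespace__() if hasattr(like, "__array_namespace__") else np
    data = np.random.default_rng(seed).random(shape)
    # xp.asarray lands the data in the same backend as `like`.
    return xp.asarray(data)
```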
It’s still not perfect. Some pandas calls in concat still force conversion to NumPy, so the data gets copied instead of being used directly. Cubed support is only a placeholder right now. Type detection might still be a bit too broad, which can lead to extra conversions. Works for NumPy and JAX in tests, but I haven’t tried other backends.