perf: add ability to write downcasted indices #2159

ilan-gold · 2025-10-20T11:54:22Z

TODO:

gather feedback in Downcast indices for sparse matrices if possible on-disk #2153
~~tests for + run on GPU CI~~ We don't support io directly into GPU anyway!

Checks

Closes Downcast indices for sparse matrices if possible on-disk #2153
Tests added
Release note added (or unnecessary)

codecov · 2025-10-20T11:59:02Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 84.65%. Comparing base (5212db8) to head (1361d36).
✅ All tests successful. No failed tests found.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2159      +/-   ##
==========================================
- Coverage   84.77%   84.65%   -0.12%     
==========================================
  Files          46       46              
  Lines        7132     7142      +10     
==========================================
  Hits         6046     6046              
- Misses       1086     1096      +10

Files with missing lines	Coverage Δ
src/anndata/_io/specs/methods.py	`90.78% <100.00%> (+0.14%)`	⬆️
src/anndata/_settings.py	`91.86% <100.00%> (+0.04%)`	⬆️

... and 2 files with indirect coverage changes

flying-sheep

looks great!

When reading this in, will the dtype be preserved? And if so, how do certain APIs deal with tiny int dtypes? Like anndata’s concatenation and scanpy’s algorithms?

tests/test_io_elementwise.py

ilan-gold · 2025-10-23T11:21:52Z

When reading this in, will the dtype be preserved? And if so, how do certain APIs deal with tiny int dtypes? Like anndata’s concatenation and scanpy’s algorithms?

scipy will convert for us to match indptr from what I can tell from the issues and the behavior of the tests passing (i.e., the data is read back in and matches the input). So this isn't really a scanpy problem. In the future when supporting different sparse matrices, (like finch tensor), I think we will hopefully be able to preserve types. I don't see anything in https://graphblas.org/binsparse-specification/ that would indicate that the data types have to match

Co-authored-by: Philipp A. <flying-sheep@web.de>

scverse-benchmark · 2025-10-23T11:36:23Z

Benchmark changes

Change	Before [`5212db8`]	After [`1361d36`]	Ratio	Benchmark (Parameter)
-	21.1±3ms	15.6±0.4ms	0.74	backed_hdf5.BackedHDF5Indexing.time_slice_obs_to_memory('sparse')
+	1.99±0.01ms	2.36±0.01ms	1.19	dataset2d.Dataset2D.time_getitem_slice('h5ad', (-1,), 'cat')
+	301±3μs	351±6μs	1.17	sparse_dataset.SparseCSRContiguousSlice.time_getitem_adata('alternating', True)

Comparison: https://github.com/scverse/anndata/compare/5212db80485432719021445084d93407c0ce11b2..1361d36a4dd2e4bc20dfe3f89f03369f131be637
Last changed: Mon, 3 Nov 2025 13:57:48 +0000

More details: https://github.com/scverse/anndata/pull/2159/checks?check_run_id=54361625499

perf: add ability to write downcasted indices

3dc93f7

ilan-gold added this to the 0.12.4 milestone Oct 20, 2025

ilan-gold added performance 🐌 topic: io type: sparse 🫥 labels Oct 20, 2025

Merge branch 'main' into ig/downcast_indices

ecdfd00

ilan-gold added 7 commits October 20, 2025 15:57

fix: add to pyi

aa45989

chore: add some more assertions

cee5cd2

refactor: be clearer about expected behavior

781b9b8

Merge branch 'main' into ig/downcast_indices

ad7364e

chore: relnote

daa0b29

chore: add link to issue

5f2741a

fix: add explicit dtype use

e36960c

ilan-gold added the skip-gpu-ci label Oct 21, 2025

chore: add more comment

356547f

ilan-gold requested a review from flying-sheep October 21, 2025 16:27

flying-sheep approved these changes Oct 23, 2025

View reviewed changes

tests/test_io_elementwise.py Outdated Show resolved Hide resolved

Merge branch 'main' into ig/downcast_indices

366378b

ilan-gold added the benchmark label Oct 23, 2025

Update tests/test_io_elementwise.py

fd3b2f1

Co-authored-by: Philipp A. <flying-sheep@web.de>

flying-sheep approved these changes Oct 23, 2025

View reviewed changes

Merge branch 'main' into ig/downcast_indices

c55c792

ilan-gold modified the milestones: 0.12.4, 0.12.5 Oct 27, 2025

ilan-gold added 3 commits October 28, 2025 16:45

Merge branch 'main' into ig/downcast_indices

dece57d

Merge branch 'main' into ig/downcast_indices

bdad952

Merge branch 'main' into ig/downcast_indices

1361d36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf: add ability to write downcasted indices #2159

perf: add ability to write downcasted indices #2159

ilan-gold commented Oct 20, 2025 •

edited

Loading

Uh oh!

codecov bot commented Oct 20, 2025 •

edited

Loading

Uh oh!

flying-sheep left a comment

Uh oh!

Uh oh!

ilan-gold commented Oct 23, 2025

Uh oh!

scverse-benchmark bot commented Oct 23, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

perf: add ability to write downcasted indices #2159

Are you sure you want to change the base?

perf: add ability to write downcasted indices #2159

Conversation

ilan-gold commented Oct 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Oct 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

flying-sheep left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ilan-gold commented Oct 23, 2025

Uh oh!

scverse-benchmark bot commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmark changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ilan-gold commented Oct 20, 2025 •

edited

Loading

codecov bot commented Oct 20, 2025 •

edited

Loading

scverse-benchmark bot commented Oct 23, 2025 •

edited

Loading