A memory efficient implementation of the .mtx reading function #3389

gjeuken · 2024-11-28T13:19:13Z

Closes #
Tests included or not required because: test_datasets.py already implemented

Release notes not necessary because: This is a backend change

Pandas read_csv function is very memory intensive, and this makes loading data (especially large datasets from EBI Single Cell Expression Atlas) impossible on computers with 16gb of ram or less. The subsequent analysis of such datasets with scanpy, however, works well on such computers.

Loading the data into chunks, using the same pandas function, solves this problem.

codecov · 2024-11-28T13:35:55Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 76.50%. Comparing base (7131500) to head (fa91b73).

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #3389   +/-   ##
=======================================
  Coverage   76.50%   76.50%           
=======================================
  Files         111      111           
  Lines       12874    12877    +3     
=======================================
+ Hits         9849     9852    +3     
  Misses       3025     3025

Files with missing lines	Coverage Δ
src/scanpy/datasets/_ebi_expression_atlas.py	`94.18% <100.00%> (+0.21%)`	⬆️

memory efficient mtx loading

fa91b73

Zethson assigned Intron7 Nov 28, 2024

Zethson added the Area – Performance 🐌 label Nov 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A memory efficient implementation of the .mtx reading function #3389

A memory efficient implementation of the .mtx reading function #3389

gjeuken commented Nov 28, 2024

codecov bot commented Nov 28, 2024 •

edited

Loading

A memory efficient implementation of the .mtx reading function #3389

Are you sure you want to change the base?

A memory efficient implementation of the .mtx reading function #3389

Conversation

gjeuken commented Nov 28, 2024

codecov bot commented Nov 28, 2024 • edited Loading

Codecov Report

codecov bot commented Nov 28, 2024 •

edited

Loading