Investigate memory usage of data refresh HLA processing step #1288

Open
Tracked by #1280
zabeen opened this issue Apr 22, 2024 · 0 comments
Labels
matching-algorithm: Work relates to the algorithm which matches and scores donors for given patient HLA
performance: Relates to improving the performance of a part of Atlas

Comments

zabeen commented Apr 22, 2024

[Image: chart of CPU and memory usage during the data refresh job]

The above image shows CPU and memory usage during a data refresh job run on UAT-WMDA.

The first half of the chart (before the line at 7:12AM) is the donor import step, and the second half is the HLA processing step.

It's not clear to me why the memory usage during HLA processing gradually increases to high levels, and then suddenly drops off at the end of the stage.

This suggests that some data is either being cached in memory on purpose, or is not being disposed of correctly while batches of donors are being processed.

Either way, addressing this issue would mean data refresh could be safely run during business hours on working days.
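
One way to tell the two explanations apart would be to sample memory after each donor batch, once garbage collection has run: if usage still climbs batch after batch, something is holding references to per-batch data. The sketch below is a generic Python illustration of that idea, not Atlas code; `process_batches`, `process_batch`, `grows_steadily` and the growth threshold are all hypothetical names and values chosen for the example.

```python
import gc
import tracemalloc


def process_batches(batches, process_batch):
    """Run process_batch on each batch and sample traced memory after each one.

    Returns a list of (batch_index, current_bytes). Each sample is taken after
    a forced garbage collection, so it counts only objects still referenced.
    """
    samples = []
    tracemalloc.start()
    try:
        for index, batch in enumerate(batches):
            process_batch(batch)  # hypothetical stand-in for the per-batch work
            gc.collect()          # discard anything that is merely uncollected
            current, _peak = tracemalloc.get_traced_memory()
            samples.append((index, current))
    finally:
        tracemalloc.stop()
    return samples


def grows_steadily(samples, min_growth_bytes=1024):
    """Return True if every sample exceeds the previous by min_growth_bytes."""
    sizes = [size for _, size in samples]
    if len(sizes) < 2:
        return False  # not enough data to call it a trend
    return all(later - earlier >= min_growth_bytes
               for earlier, later in zip(sizes, sizes[1:]))
```

If `grows_steadily` returns True for a run, per-batch data is being retained (deliberately cached or accidentally referenced). If memory stays flat after each collection, the growth seen on the chart is more likely deferred cleanup than retained data.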

zabeen added the matching-algorithm and performance labels Apr 22, 2024
zabeen moved this to Project Backlog in Atlas Development Apr 22, 2024