AEP: Caching with remote data #35
Thanks @unkcpz for describing the problems and some solutions. I think this AEP would benefit from some slight restructuring, just to make it a bit more clear what the current problems are and what exactly you are addressing. I also think we should add the problem of cache invalidation when a `RemoteData` is cleaned. I propose you adopt a structure like the following:
Problems
- Shallow copy on `RemoteData.clone()`
- Invalidate cache after `RemoteData._clean()`
- Hash calculation of `RemoteData`
- Prospective workchain caching
Now describe each in more detail in the following subheaders.
1. Shallow copy
The contents of the `RemoteData` should really be cloned on the remote as well, not just the reference in AiiDA's database.
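To make the problem concrete, here is a rough sketch of what such a deep clone might look like. This is illustrative only, not a proposed implementation: the helper name `clone_with_remote_copy`, the explicit destination path, and the direct use of `get_authinfo().get_transport()` are all assumptions, and opening the transport directly like this is exactly the naive approach discussed further down.

```python
from aiida.orm import RemoteData


def clone_with_remote_copy(source: RemoteData, destination_path: str) -> RemoteData:
    """Clone a RemoteData node and also copy its folder contents on the remote."""
    clone = source.clone()  # clones only the database node, not the remote folder

    # Copy the actual contents on the remote machine. Opening the transport
    # directly is the naive approach; inside a daemon worker this should
    # instead go through the transport queue (see below).
    with source.get_authinfo().get_transport() as transport:
        transport.copy(source.get_remote_path(), destination_path)

    clone.set_remote_path(destination_path)
    return clone.store()
```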
2. Invalidate cache after clean
When `RemoteData._clean()` is called, an attribute or extra should be set which will cause it to no longer be considered as a valid cache source.
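A rough sketch of this idea could look as follows. This is not the actual aiida-core implementation: the subclass, the extra key `cleaned`, and the assumption that the caching machinery consults the `is_valid_cache` property (as in the aiida-core 1.x API) are all illustrative.

```python
from aiida.orm import RemoteData


class InvalidatingRemoteData(RemoteData):
    """RemoteData that stops being a cache source once its folder is cleaned."""

    def _clean(self, *args, **kwargs):
        super()._clean(*args, **kwargs)
        self.set_extra('cleaned', True)  # record that the remote folder is gone

    @property
    def is_valid_cache(self):
        # A cleaned node must no longer be used as a cache source.
        return super().is_valid_cache and not self.get_extra('cleaned', False)
```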
3. Hash calculation of RemoteData
The hash of a `RemoteData` is computed based on the absolute filepath on the remote file system. This means that two nodes that have identical contents, but have different base folders, will have different hashes. Ideally, the hash should be calculated based on the hash of all contents, independent of the location on the remote.
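A content-based hash could in principle be computed by walking the remote folder over the transport, for example along these lines. This is a rough sketch only: `hash_remote_folder` is a hypothetical helper, and a real implementation would have to worry about large files, symlinks, and transport efficiency.

```python
import hashlib
import os
import tempfile


def hash_remote_folder(transport, path: str) -> str:
    """Hash the contents of a remote folder, independent of its absolute location."""
    digest = hashlib.sha256()
    for name in sorted(transport.listdir(path)):
        full = os.path.join(path, name)
        digest.update(name.encode())  # relative name only, never the absolute path
        if transport.isdir(full):
            digest.update(hash_remote_folder(transport, full).encode())
        else:
            # Fetch the file to a temporary location and hash its bytes.
            with tempfile.TemporaryDirectory() as tmpdir:
                localpath = os.path.join(tmpdir, name)
                transport.getfile(full, localpath)
                with open(localpath, 'rb') as handle:
                    digest.update(handle.read())
    return digest.hexdigest()
```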
4. Prospective workchain caching
If an entire workchain has already been run once, in principle it is not necessary to run its individual calcjobs again and we can simply cache everything. Currently, the engine will still execute each step and consider individually for each calcjob whether it should be cached. If one of the remote folders of the cached jobs has been cleaned in the meantime, the workchain will fail, even though all results are essentially known. One might think that we could just add a check: if a workchain with the same inputs exists, we simply clone the entire matched workchain, including everything it ran, without running anything. This assumes that a workchain's execution shows no variability beyond what is determined by its inputs, but I am not 100% sure this is guaranteed.
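As a rough illustration of that check (hedged: AiiDA does not currently store a cache hash for workchain nodes the way it does for calcjobs, so the `_aiida_hash` extra on workchains and the `find_equivalent_workchain` helper below are assumptions for the sake of the example):

```python
from aiida.orm import QueryBuilder, WorkChainNode


def find_equivalent_workchain(node_hash: str):
    """Return a previously finished workchain with the same hash, if one exists."""
    builder = QueryBuilder()
    builder.append(
        WorkChainNode,
        filters={
            'extras._aiida_hash': node_hash,          # assumes workchains get hashed
            'attributes.process_state': 'finished',   # only reuse finished runs
        },
    )
    result = builder.first()
    return result[0] if result else None
```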
Proposed solution
Describe which of the problems you address and how.
I think you should definitely spend some time thinking about how you would address problem 1. It might seem easy, but the cloning will often be performed by a daemon worker, so the opening of the transport should go through the transport queue. However, the call for the clone comes from somewhere inside `Node.store()` and it is not evident how to get access to the transport queue in a "nice" non-hacky way. I think this might be more tricky to do properly than it seems on the surface.
Hi @sphuber, I have updated the description of the AEP, but I am not sure about how to implement it. I think we need to set up a discussion.