Memoize key isn't always invertible

Hey, really appreciate this library!

I have a small feature request (and an accompanying PR), which is just to make the `.__cache_key__` of a `memoize`d function invertible (which also means making `args_to_key` actually produce unique keys for different arguments).

A use case is migrating many different cached functions due to a source changes, e.g. suppose we had some caches that have been populated using:

```python
@cache.memoize
def foo(a: OldClass):
    ...

@cache.memoize
def bar(b: OldClass, c: OldClass | None = None):
    ...
```

and want to roll out new versions of these

```python
@cache.memoize
def foo(anew: NewClass):
    ...

@cache.memoize
def bar(*, bnew: NewClass, c: NewClass | None = None):
    ...
```

and want to update migrate all of `OldClass` arguments to some `NewClass`, as well as changing the signatures appropriately, so that we can keep our old caches, e.g. via something like

```python
import diskcache as dc

...

for old_key in old_cache.iterkeys():
    # some random function to deal with qualname changes
    new_key_base = get_new_qualname(old_key[0])
    # some random function to update any values/types and the signature
    new_args, new_kwargs = get_new_args_kwargs(old_key)
    new_key = dc.core.args_to_key(
        new_key_base, new_args, new_kwargs, typed=typed, ignore=ignore
    )
    new_cache.add(new_key, old_cache.get(old_key))
```

Since https://github.com/grantjenks/python-diskcache/issues/195 / https://github.com/grantjenks/python-diskcache/commit/d55a50ee083784afa9c85e14e41c4a2d132f3111, then the `args` and `kwargs` delimiter in `args_to_key` is no longer a special sentinel, and so `get_new_args_kwargs` needs to know the signature of whichever function it was previously caching to faithfully find out the arguments used for that key. Namely, there's signatures where we couldn't faithfully determine where the `(None,)` delimiter between `args` and `kwargs` in `args_to_key` is placed (if `typed=False`), e.g. for

```python
@cache.memoize
def f(*args, b=None, c=0):
    ...

assert f.__cache_key__(None, "b", c=1) == f.__cache_key__(None, b=None, c=1)
```

My proposed fix (https://github.com/grantjenks/python-diskcache/pull/312) is just to change the `key` accumulation of `kwargs` to use its (key, value) pairs.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Memoize key isn't always invertible #313

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Memoize key isn't always invertible #313

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions