
Fix cache_position initialisation for generation with use_cache=False #30485

Merged
merged 3 commits into huggingface:main on May 7, 2024

Conversation

nurlanov-zh (Contributor)

What does this PR do?

Fixes #30482
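
A minimal reproduction sketch (the checkpoint name and exact failure mode are assumptions, not quoted from issue #30482): generation with the KV cache disabled exercised the broken cache_position path.

```python
# Hypothetical reproduction sketch for issue #30482; the checkpoint is an
# assumed tiny test model, not necessarily the one from the issue report.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "hf-internal-testing/tiny-random-LlamaForCausalLM"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("Hello world", return_tensors="pt")
# With use_cache=False, generate() must still track absolute token positions;
# before this fix, cache_position could be initialised/updated incorrectly.
output = model.generate(**inputs, max_new_tokens=5, use_cache=False)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```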

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@ArthurZucker @younesbelkada @gante

@ArthurZucker (Collaborator) left a comment:


Thanks! Let's maybe avoid relying on the model's cache position initialization!
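
A minimal sketch of that direction (the function name and Cache accessor are assumptions, not the literal diff): compute the initial cache_position inside the generation loop from the visible input, so nothing relies on the model's own initialisation.

```python
# Hedged sketch, not the actual transformers change: derive the initial
# cache_position from the prompt in the generation loop itself.
import torch

def get_initial_cache_position(input_ids: torch.Tensor, past_key_values=None) -> torch.Tensor:
    # With use_cache=False there is no cache, so positions start at 0.
    past_length = 0
    if past_key_values is not None:
        # get_seq_length() exists on the transformers Cache classes; the exact
        # call site in the real code may differ (assumption).
        past_length = past_key_values.get_seq_length()
    return torch.arange(past_length, past_length + input_ids.shape[1], device=input_ids.device)

# Example: a 4-token prompt with no cache yields positions [0, 1, 2, 3].
print(get_initial_cache_position(torch.ones(1, 4, dtype=torch.long)))
```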

Review thread on src/transformers/generation/utils.py (outdated, resolved)
@gante (Member) left a comment:


LGTM, thank you for the fix 👍

@ArthurZucker tbh, when we have use_cache=False, we shouldn't even be creating or using cache_positions at all (as the name of the variable indicates, its purpose is cache-related). ATM, the only dependency is the causal mask creation function, which uses cache_positions as a proxy for the sequence length (i.e. we can easily rearrange the code to avoid using it when there is no cache :) )
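
To make that dependency concrete, a simplified sketch (not the actual transformers mask-building code) of how cache_position stands in for the sequence length when building the causal mask, and how a no-cache path could bypass it entirely:

```python
# Simplified illustration of gante's point, not the real implementation:
# the causal mask only needs cache_position as the absolute query positions.
import torch

def causal_mask_from_cache_position(cache_position: torch.Tensor, kv_len: int) -> torch.Tensor:
    # Attention is allowed where key position <= query position.
    kv_positions = torch.arange(kv_len)
    return kv_positions[None, :] <= cache_position[:, None]

def causal_mask_no_cache(seq_len: int) -> torch.Tensor:
    # Without a cache, query positions are simply 0..seq_len-1, so the mask
    # can be derived from the sequence length alone, with no cache_position.
    return causal_mask_from_cache_position(torch.arange(seq_len), seq_len)

print(causal_mask_no_cache(4))  # lower-triangular boolean mask
```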

@ArthurZucker (Collaborator) left a comment:


Yes @gante, agreed, that would split the two paths a bit better

@ArthurZucker (Collaborator):

Can you just rebase on main? 🤗

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@nurlanov-zh (Contributor, Author):

> Can you just rebase on main? 🤗

@ArthurZucker, done ✔️

@ArthurZucker ArthurZucker merged commit 4fda78c into huggingface:main May 7, 2024
20 checks passed
@ArthurZucker (Collaborator):

Thanks 🤗

zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this pull request May 10, 2024
Fix `cache_position` initialisation for generation with `use_cache=False` (huggingface#30485)

* Fix cache_position init for generation

* Update src/transformers/generation/utils.py

Co-authored-by: Arthur <[email protected]>

* Fix cache position update

---------

Co-authored-by: Arthur <[email protected]>
itazap pushed a commit that referenced this pull request May 14, 2024
Fix `cache_position` initialisation for generation with `use_cache=False` (#30485)