Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

question about RotaryPEMultiHeadAttention: rotary_percentage #246

Open
YOONSEOKHEO opened this issue Mar 13, 2024 · 0 comments
Open

question about RotaryPEMultiHeadAttention: rotary_percentage #246

YOONSEOKHEO opened this issue Mar 13, 2024 · 0 comments

Comments

@YOONSEOKHEO
Copy link

YOONSEOKHEO commented Mar 13, 2024

I confirmed that there is code in the RotaryPEMultiHeadAttention class that reduces the dimension using a parameter called rope_percentage.
(URL:

)

I am curious in what cases you would set rope_percentage to a value less than 1.

(Of course, in experiment.py, we confirmed that rope_percentage is set to 1.0.)

@YOONSEOKHEO YOONSEOKHEO changed the title question about RoPE code(rotary_percentage) question about RotaryPEMultiHeadAttention: rotary_percentage Mar 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant