
Conversation

sandyhouse

What does this PR do?

Add support for FlashAttention-3 (FA3).

Fixes #4854

@hiyouga added the pending (This problem is yet to be addressed) label on Aug 6, 2025
```diff
@@ -14,7 +14,7 @@
 from typing import TYPE_CHECKING
 
-from transformers.utils import is_flash_attn_2_available, is_torch_sdpa_available
+from transformers.utils import is_flash_attn_2_available, is_flash_attn_3_available, is_torch_sdpa_available
```
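For context, the imported helpers are only availability probes. Below is a minimal sketch (not the PR's actual code) of how such a probe could feed the `attn_implementation` choice; the `pick_attn_implementation` helper and the FA3 > FA2 > SDPA > eager preference order are assumptions for illustration, and `"flash_attention_3"` is only recognized by sufficiently recent transformers releases.

```python
# Minimal sketch, not the PR's actual code: map availability probes to a
# transformers-style attn_implementation string.
from transformers.utils import (
    is_flash_attn_2_available,
    is_flash_attn_3_available,  # requires transformers >= 4.53.0
    is_torch_sdpa_available,
)


def pick_attn_implementation(requested: str = "auto") -> str:
    """Resolve "auto" to the best available backend, preferring FA3."""
    if requested != "auto":
        return requested  # honor an explicit user choice
    if is_flash_attn_3_available():
        return "flash_attention_3"
    if is_flash_attn_2_available():
        return "flash_attention_2"
    if is_torch_sdpa_available():
        return "sdpa"
    return "eager"
```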
Collaborator
Should we add a version check here? AFAIK, the FA3 utils were introduced only recently.

Author

@sandyhouse Aug 8, 2025

FA3 was introduced in transformers 4.53.0; therefore, the transformers dependency in requirements.txt has been updated accordingly. @Kuangdd01
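If pinning the requirement alone is not desired (e.g. to keep older transformers usable), a guarded import is another way to address the reviewer's concern. This is a hedged sketch of that pattern, not code from this PR; the fallback shim is an assumption.

```python
# Sketch of a version-guarded import; not code from this PR.
from packaging import version

import transformers

if version.parse(transformers.__version__) >= version.parse("4.53.0"):
    from transformers.utils import is_flash_attn_3_available
else:
    # Older transformers has no FA3 util; degrade gracefully instead of
    # raising an ImportError at import time.
    def is_flash_attn_3_available() -> bool:
        return False
```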

penfever pushed a commit to mlfoundations/LLaMA-Factory that referenced this pull request Aug 13, 2025