Pull requests: HabanaAI/vllm-fork


Add PP support for Prefill/Decode Disaggregation
#1364 opened Jun 4, 2025 by lvliang-intel
Changes to enable LMCache v1 baseline
#1361 opened Jun 3, 2025 by hsubramony
Set simple_compile_backend for HpuPlatform
#1359 opened Jun 3, 2025 by Kacper-Pietkun
[SW-229465] Search for multiple RoPE modules
#1355 opened Jun 3, 2025 by RafLit
Adjust batch size to match bucket size
#1354 opened Jun 3, 2025 by xhaihao
Disable contiguous_pa by default on Gaudi2
#1345 opened May 30, 2025 by ccrhx4
Revise DeepSeek-R1 README and update start scripts
#1339 opened May 29, 2025 by taotod
Fix requirements/hpu.txt for the HPU extension
#1336 opened May 29, 2025 by ranzhejiang
Fix prefill warm-up issue
#1335 opened May 29, 2025 by yeonsily (Draft)
Fix vLLM crash when running with lm-eval
#1321 opened May 27, 2025 by ccrhx4
Add flag to speed up Qwen3 FP8 warmup
#1319 opened May 27, 2025 by Yanli2190
[Torch compile] Torch compilation on Sampler
#1314 opened May 26, 2025 by jczaja
Enable MoE for both BF16 and INC-based FP8 on Gaudi
#1309 opened May 23, 2025 by gyou2021
Parallel compile for faster warmup
#1304 opened May 22, 2025 by inkcherry
Optimize transfer time using Mooncake put/get_unsafe
#1297 opened May 22, 2025 by jikunshang