Pull requests: tenstorrent/tt-inference-server
Pull requests list
Collapse same heading blocks in v2 reports
#3501
opened May 12, 2026 by
ddjukicTT
Collaborator
Forge SDXL: implement LoRA (load/fuse/unload) via recompile-on-change
#3488
opened May 12, 2026 by
ctr-lelanchelianTT
Collaborator
4 tasks
SDXL: perf benchmark script + methodology for Blackhole comparison
#3487
opened May 12, 2026 by
ctr-lelanchelianTT
Collaborator
2 tasks
SDXL: per-stage perf logging in trace and Forge runners
#3486
opened May 12, 2026 by
ctr-lelanchelianTT
Collaborator
3 tasks
Forge SDXL: support full-on-device (text encoders + VAE) via TTXLA_SDXL_FULL_ON_DEVICE
#3485
opened May 12, 2026 by
ctr-lelanchelianTT
Collaborator
3 tasks
SDXL: add Blackhole accuracy eval, trace non-LoRA eval, and LoRA rollback test
#3484
opened May 12, 2026 by
ctr-lelanchelianTT
Collaborator
3 tasks
correction of the links in models_by_hardware.md for N300 and P150 #3447
#3465
opened May 12, 2026 by
vcankovicTT
Contributor
Hapaic/upgrade vllm to v0.18.1 support prefixcaching
community
#3457
opened May 12, 2026 by
jeyoon321
fix(model_spec): add max_num_batched_tokens field to DeviceModelSpec
community
#3433
opened May 10, 2026 by
Dev-X25874
cpp server - propagate cancel to blaze + handle all cases
#3428
opened May 9, 2026 by
knovokmetTT
Contributor
Draft
Fix Forge LLM models so local launches work without --override-docker-image
#3414
opened May 8, 2026 by
kmabeeTT
Contributor
Fix LLM p300x2 trace region size & set has_builtin_warmup=True
#3411
opened May 7, 2026 by
bgoelTT
Collaborator
Migrate image models to cpp_server (SDXL)
#3391
opened May 6, 2026 by
ztorlakTT
Contributor
Add p300x2 model_spec.py entries and update models-ci-config.json
#3371
opened May 6, 2026 by
bgoelTT
Collaborator