Pull requests: pytorch/ao
Delete deprecated Float8Layout and Float8AQTTensorImpl
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
module: not user facing
Use this tag if you don't want this PR to show up in release notes
#4127
opened Mar 20, 2026 by
jerryzh168
Delete deprecated CutlassSemiSparseLayout and related code
CLA Signed
module: not user facing
#4126
opened Mar 20, 2026 by
jerryzh168
Fix test_sparse_api failures for builds without hipSPARSELt (#4125)
CLA Signed
fb-exported
meta-exported
module: not user facing
#4125
opened Mar 20, 2026 by
bbeckca
[WIP]torchao support npu fp8
CLA Signed
#4124
opened Mar 20, 2026 by
frankleitong
•
Draft
Delete deprecated autoquant v1 and all references
CLA Signed
module: not user facing
#4122
opened Mar 20, 2026 by
jerryzh168
[sam2] Replace complex-number RoPE with PyTorch F.apply_rotary_emb
CLA Signed
#4118
opened Mar 19, 2026 by
jainapurva
•
Draft
Fix float8 operand precision validation
CLA Signed
#4115
opened Mar 19, 2026 by
GotFusion
[moe training] Optimize triton_fp8_per_group_colwise_scales for AMDGPU
ciflow/rocm
ciflow/4xh100
CLA Signed
module: training
quantize_ api training flow
[ROCm] Enable MXFP8 MoE training kernel and grouped GEMM tests on ROCm
CLA Signed
module: rocm
#4110
opened Mar 18, 2026 by
brucechanglongxu
4 tasks done
Enable Float8 dynamic activation quantization on Navi4
CLA Signed
#4108
opened Mar 18, 2026 by
zhangnju
[X86] intmm: Use u8s8 when only support avx512-vnni
CLA Signed
module: not user facing
#4103
opened Mar 18, 2026 by
cyxlily
Fix aten._has_compatible_shallow_copy_type', overload='default' dispatcher failed for LinearActivationQuantizedTensor in inference mode
bug
Something isn't working
ciflow/xpu
label used to trigger xpu CI jobs
CLA Signed
module: core
changes affecting multiple modules, e.g. base config/tensor, observers, quant ops
#4098
opened Mar 17, 2026 by
xiaowangintel
[xpu][feat] Add XPU support for blockwise FP8
CLA Signed
quantize_
quantize_ API
topic: for developers
Use this tag if this PR is mainly developer facing
xpu
Intel XPU related features
[ROCm] Enable Int4 weight-only quantization (TILE_PACKED_TO_4D) on ROCm
CLA Signed
module: rocm
#4092
opened Mar 16, 2026 by
brucechanglongxu
Fix _is_128_128_scaled false positive for PerTensor on 128x128 linears
CLA Signed
#4091
opened Mar 16, 2026 by
brucechanglongxu
[WIP] npu support: int4 and fp8
CLA Signed
#4087
opened Mar 16, 2026 by
frankleitong
•
Draft
Fuse expert grouping and padding into the MXFP8 EP dispatch path
CLA Signed
module: training
moe
mx
speed up mxfp8 rceil quantization kernel
CLA Signed
module: inference
quantize_ api inference flow
#4073
opened Mar 13, 2026 by
vkuzo
[moe training] Optimize FP8 MoE backward pass: fused colwise kernel + AMD tuning
CLA Signed
float8
module: training
moe
Use in-place ops in _quantize_affine_float8 to reduce peak memory (#4068)
CLA Signed
fb-exported
meta-exported
module: inference
#4068
opened Mar 12, 2026 by
r3t2
[mxfp8 moe training] fused all2all dispatch + token group padding + expert major kernel
CLA Signed
module: training
soften version guard check for low precision attention API
CLA Signed
#4057
opened Mar 11, 2026 by
howardzhang-cv
[xpu][test] Skip WIP config for Intel GPU in test_safetensors_support.py and test_x86inductor_fusion.py
CLA Signed
module: core
xpu
#4049
opened Mar 11, 2026 by
zxd1997066
[xpu][test] Port test/prototype/quant_logger/test_quant_logger.py to Intel GPU
ciflow/xpu
CLA Signed
module: inference
#4046
opened Mar 11, 2026 by
zxd1997066