Issues: huggingface/transformers
Swinv2Model reports an error when using the parameter use_obsolute_embeddings · #37161 · opened Apr 1, 2025 by SCP-KAKA
Loading the DeepSeek R1 model takes an extremely long time [bug] · #37160 · opened Apr 1, 2025 by Neo9061
Error when using trainer with default data parallelism enabled: RuntimeError: chunk expects at least a 1-dimensional tensor [bug] · #37151 · opened Mar 31, 2025 by Mekadrom
Warnings when loading Deepseek-V3 without custom code [bug] · #37134 · opened Mar 31, 2025 by Rocketknight1
Does the transformers Trainer support pipeline parallelism? · #37129 · opened Mar 31, 2025 by liuheng0111
Possible to move HybridCache from GPU to CPU? [Cache, Feature request] · #37125 · opened Mar 31, 2025 by tianhaoz95
FastAPI with LLM inference does not release accumulated VRAM [bug] · #37118 · opened Mar 31, 2025 by variable
Add Sdpa Support for Electra [Feature request] · #37105 · opened Mar 29, 2025 by nnilayy
Feature Request: Support Canary Models [Feature request] · #37098 · opened Mar 29, 2025 by fakerybakery
Release Tag Changed, Breaking Checksums, and AUR Package Building · #37090 · opened Mar 28, 2025 by daskol
LLaVa_mistral models are unrecognized [bug, New model] · #37087 · opened Mar 28, 2025 by darshpatel1052
Do not update cache when use_cache=False and past_key_values are provided? [Feature request] · #37078 · opened Mar 28, 2025 by PheelaV
A TypeError in the modeling_utils.caching_allocator_warmup function [bug] · #37074 · opened Mar 28, 2025 by ZeroMakesAll
A logic error in the _preprocess function of the Qwen2VLImageProcessor class [bug] · #37064 · opened Mar 28, 2025 by InsaneGe
Incorrect stride calculation leads to loss of param data when loading a sliced model with tensor parallelism [bug, Tensor Parallel] · #37051 · opened Mar 27, 2025 by kmehant
Persistent generation issues with MT5 models (base and fine-tuned) across environments · #37048 · opened Mar 27, 2025 by Elpharran
Optionality of attention_mask argument in Attention classes/functions · #37046 · opened Mar 27, 2025 by Godofnothing
run_mim.py script from image-pretraining example is not working [bug] · #37020 · opened Mar 26, 2025 by jafraustro
SwitchTransformer: Initialization of tensor to collect expert results is incorrect for dropped tokens (from ML POV) [bug] · #37017 · opened Mar 26, 2025 by mario-aws