-
Notifications
You must be signed in to change notification settings - Fork 328
Issues: pytorch/torchtitan
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Context parallel on Turing GPUs?
module: context parallel
question
Further information is requested
#1034
opened Mar 31, 2025 by
dingqingy
Linear layer weights are in float32 ?
question
Further information is requested
#1027
opened Mar 28, 2025 by
githubsgi
Unable to run flex attention and torch.compile
module: flex attention
#1005
opened Mar 22, 2025 by
lkhphuc
Is a PP+FSDP+TP config + toml available for pre-training 405B model ?
#986
opened Mar 19, 2025 by
githubsgi
[Feature] Add Multi-Token Prediction module
enhancement
New feature or request
#933
opened Mar 5, 2025 by
lessw2020
[TP] RuntimeError: shape '[1, 8192, -1, 128]' is invalid for input of size 524288
module: dtensor
#932
opened Mar 5, 2025 by
aahehehe
[Feature] add preflight NCCL and GEMM check to multinode slurm script
#915
opened Mar 3, 2025 by
lessw2020
[Feature] expose Torch Nan checker as configurable option in toml for those training at scale
#914
opened Mar 3, 2025 by
lessw2020
[Checkpointing] fails out if checkpoint folder does not exist when using keep_latest_k
bug
Something isn't working
module: checkpoint
#911
opened Mar 2, 2025 by
lessw2020
[Possible PR discuss] Will a PR of training HF model be welcomed?
community help wanted
huggingface integration
#903
opened Feb 28, 2025 by
junjzhang
Question about triton in deepseek implementtion
question
Further information is requested
#902
opened Feb 28, 2025 by
zqwenn
dcp.load fails on checkpoints prior to AdamW refactor
module: checkpoint
#886
opened Feb 25, 2025 by
eminorhan
[Evaluation] Minimal support for downstream tasks
community help wanted
enhancement
New feature or request
#883
opened Feb 24, 2025 by
K-H-Ismail
Previous Next
ProTip!
Updated in the last three days: updated:>2025-03-30.