Skip to content

Pull requests: huggingface/tokenizers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add unigram sampling (alpha, nbest_size)
#1994 opened Mar 27, 2026 by kennethsible Loading…
Regex split parity
#1991 opened Mar 27, 2026 by ArthurZucker Loading…
feat(tokenizer): add early exit for truncation
#1990 opened Mar 26, 2026 by McPatate Loading…
feat: add new faster whitespace split pretok
#1985 opened Mar 26, 2026 by McPatate Loading…
Add explicit license metadata for Python bindings
#1976 opened Mar 23, 2026 by julia-thorn Loading…
Implementing Parity-aware BPE
#1974 opened Mar 21, 2026 by cimeister Loading…
Fix type_ids not applied to overflow encodings
#1965 opened Mar 17, 2026 by joaquinhuigomez Loading…
feat: add pcre2 as optional feature
#1959 opened Mar 2, 2026 by wheynelau Loading…
Update release workflow for Python 3.14
#1952 opened Feb 19, 2026 by ngoldbaum Loading…
Add get_special_tokens and is_special_token methods
#1945 opened Feb 5, 2026 by ArthurZucker Loading…
2 tasks done
Add post_process_tokens and post_process_ids methods
#1944 opened Feb 5, 2026 by ArthurZucker Loading…
3 tasks done
feat: add unk_token property to Unigram model
#1943 opened Feb 5, 2026 by ArthurZucker Loading…
4 tasks done
fix: added type hints in .py files
#1932 opened Jan 20, 2026 by ashmi8 Loading…
Include license file into python wheels
#1931 opened Jan 20, 2026 by justeph Loading…
Upgrade GitHub Actions for Node 24 compatibility
#1916 opened Dec 20, 2025 by salmanmkc Loading…
Fix undefined names in docs/source/_ext/entities.py
#1895 opened Nov 28, 2025 by cclauss Loading…
Python: Add ruff rules for asyncio and performance
#1894 opened Nov 28, 2025 by cclauss Loading…
Implement Append normalizer
#1893 opened Nov 28, 2025 by ArthurZucker Loading…
ProTip! no:milestone will show everything without a milestone.