Skip to content

Pull requests: huggingface/tokenizers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix broken source links in documentation
#1934 opened Jan 21, 2026 by Shivam-Bhardwaj Loading…
fix: added type hints in .py files
#1932 opened Jan 20, 2026 by ashmi8 Loading…
Include license file into python wheels
#1931 opened Jan 20, 2026 by justeph Loading…
Add Python 3.14 CI
#1925 opened Jan 5, 2026 by ngoldbaum Loading…
feat: add progress_format option for machine-readable JSON output
#1921 opened Dec 26, 2025 by podarok Loading…
6 tasks done
Upgrade GitHub Actions for Node 24 compatibility
#1916 opened Dec 20, 2025 by salmanmkc Loading…
Fix undefined names in docs/source/_ext/entities.py
#1895 opened Nov 28, 2025 by cclauss Loading…
Python: Add ruff rules for asyncio and performance
#1894 opened Nov 28, 2025 by cclauss Loading…
Implement Append normalizer
#1893 opened Nov 28, 2025 by ArthurZucker Loading…
Mark Python tests that need network access
#1872 opened Oct 2, 2025 by gordonmessmer Loading…
feat: whitespace optimize Feature Request
#1841 opened Aug 6, 2025 by b00f Loading…
Unused Unicode Character Filter
#1832 opened Jul 23, 2025 by sanderland Loading…
Add enforce_utf8_boundaries option to BpeTrainer
#1830 opened Jul 22, 2025 by sanderland Loading…
Faster Whitespace PreTokenizer (Drop-in Replacement)
#1822 opened Jul 7, 2025 by 8ria Loading…
Adding multiprocessing for sentencepiece_extractor
#1804 opened Jun 19, 2025 by AamodThakur Loading…
ProTip! Follow long discussions with comments:>50.