Pull requests: huggingface/tokenizers
Pull requests list
feat: add MultiRegex for common regexes (gpt2, etc.) fast path
#2143
opened Jul 2, 2026 by
McPatate
Member
ci: Simple bench workflow for PipelineTokenizer
#2141
opened Jul 2, 2026 by
SBrandeis
Contributor
feat: impl pipeline::PreTokenizer for WhitespaceSplit
#2140
opened Jul 1, 2026 by
McPatate
Member
feat: handle Merge* variants of the SplitDelimiterBehavior
#2139
opened Jul 1, 2026 by
McPatate
Member
feat: impl pipeline::PreTokenizer for UnicodeScripts
#2138
opened Jul 1, 2026 by
McPatate
Member
feat: impl pipeline::PreTokenizer for FixedLength
#2137
opened Jul 1, 2026 by
McPatate
Member
feat: impl pipeline::PreTokenizer for CharDelimiterSplit
#2135
opened Jul 1, 2026 by
McPatate
Member
chore(deps): bump js-yaml from 3.14.2 to 3.15.0 in /bindings/node
dependencies
javascript
#2131
opened Jun 30, 2026 by
dependabot
Bot
docs: add docstrings to Tokenizer::from_file, from_bytes, and from_pretrained
#2124
opened Jun 23, 2026 by
BBloggsbott
chore(deps-dev): bump webpack-dev-server from 5.2.1 to 5.2.5 in /tokenizers/examples/unstable_wasm/www
dependencies
javascript
#2118
opened Jun 18, 2026 by
dependabot
Bot
chore(deps): bump @babel/core from 7.24.3 to 7.29.7 in /bindings/node
dependencies
javascript
#2113
opened Jun 16, 2026 by
dependabot
Bot
chore(deps-dev): bump launch-editor from 2.10.0 to 2.14.1 in /tokenizers/examples/unstable_wasm/www
dependencies
javascript
#2112
opened Jun 16, 2026 by
dependabot
Bot
BPE trainer: inherit continuing_subword_prefix/end_of_word_suffix from the model
#2108
opened Jun 13, 2026 by
discobot
Fix Unigram trainer prune loss to use per-piece alternative count
#2106
opened Jun 13, 2026 by
NahButch
Return an error instead of panicking on out-of-range BPE merges
#2104
opened Jun 12, 2026 by
NahButch
Fix empty Encoding.overflowing when truncation is enabled
#2098
opened Jun 12, 2026 by
discobot