[go: nahoru, domu]

Skip to content

Pull requests: huggingface/tokenizers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

build(node): Include binaries in NPM packing Stale
#1459 by aaronclong was closed Apr 22, 2024 Loading…
testable example docs for training-serialization
#373 by ropottnik was merged Aug 31, 2020 Loading…
Allow pre-tokenized inputs to encode/encode_batch
#249 by n1t0 was merged May 21, 2020 Loading…
2 tasks
Port of unigram algorithm.
#292 by Narsil was merged Sep 2, 2020 Loading…
Adding ByteFallback support for tokenizers.
#1183 by Narsil was merged Mar 23, 2023 Loading…
Replace Container.
#355 by sebpuetz was merged Aug 4, 2020 Loading…
Parallelize unigram trainer
#976 by mishig25 was merged May 22, 2023 Loading…
Added ability to inspect a 'Sequence' pre-tokenizer.
#1341 by eaplatanios was merged Sep 21, 2023 Loading…
Python - update pyo3 and start using new API
#136 by ljos was merged Apr 8, 2020 Loading…
Add unigram bytefallback
#1217 by ArthurZucker was merged Jun 26, 2023 Loading…
implement a simple max_sentencepiece_length into BPE
#1228 by chris-ha458 was merged May 16, 2023 Loading…
Upgrade pyo3 to 0.16
#956 by h-vetinari was merged May 5, 2022 Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.