[go: nahoru, domu]

Skip to content

Pull requests: huggingface/tokenizers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Adding ByteFallback support for tokenizers.
#1183 by Narsil was merged Mar 23, 2023 Loading…
Ability to train from memory
#544 by n1t0 was merged Nov 28, 2020 Loading…
1 task done
Fix BPE trainer pair counts
#179 by n1t0 was merged Mar 2, 2020 Loading…
Port of unigram algorithm.
#292 by Narsil was merged Sep 2, 2020 Loading…
Improve PreTokenizer and Model interfaces
#360 by n1t0 was merged Aug 3, 2020 Loading…
Fix broken links in docs
#1133 by hvaara was merged Dec 23, 2022 Loading…
fix fmt::Display for WordPiece error
#3 by epwalsh was merged Dec 12, 2019 Loading…
Add RustFmt and Clippy to CI pipeline
#5 by epwalsh was merged Dec 13, 2019 Loading…
Add BPE tests and documentation
#6 by epwalsh was merged Dec 20, 2019 Loading…
simplify initialization of BpeTrainer
#8 by epwalsh was merged Dec 24, 2019 Loading…
Allow importing from tokenizers modules.
#18 by mfuntowicz was closed Jan 14, 2020 Loading…
Clean up Rust docs
#20 by epwalsh was merged Dec 30, 2019 Loading…
Implement dropout for BPE
#21 by epwalsh was merged Dec 31, 2019 Loading…
replace print statements with logging
#22 by epwalsh was closed Jan 1, 2020 Loading…
make sure we don't warn on empty tokens
#1554 by ArthurZucker was merged Jun 20, 2024 Loading…
TokenizerBuilder
#27 by epwalsh was closed Oct 20, 2020 Loading…
[WIP] try parking_lot::RwLock
#28 by epwalsh was closed Jan 3, 2020 Loading…
[WIP] try chashmap in cache implementations
#29 by epwalsh was closed Jan 3, 2020 Loading…
[WIP] try evmap
#32 by epwalsh was closed Jan 3, 2020 Loading…
avoid unnecessary write locks in the BPE cache
#34 by epwalsh was merged Jan 4, 2020 Loading…
Update benchmarks
#36 by epwalsh was merged Jan 4, 2020 Loading…
make cache optional
#37 by epwalsh was merged Jan 3, 2020 Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.