[go: nahoru, domu]

Skip to content

Pull requests: huggingface/tokenizers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

C++ bindings Stale
#559 by alexeyr was closed May 7, 2024 Draft
2 of 7 tasks
Convert word counts to u64
#1433 by stephenroller was merged Feb 6, 2024 Loading…
fix fmt::Display for WordPiece error
#3 by epwalsh was merged Dec 12, 2019 Loading…
Add benchmark framework and benches for BPE (GPT2)
#4 by epwalsh was merged Jan 1, 2020 Loading…
Add RustFmt and Clippy to CI pipeline
#5 by epwalsh was merged Dec 13, 2019 Loading…
Add BPE tests and documentation
#6 by epwalsh was merged Dec 20, 2019 Loading…
remove Cargo.lock
#7 by epwalsh was merged Dec 24, 2019 Loading…
simplify initialization of BpeTrainer
#8 by epwalsh was merged Dec 24, 2019 Loading…
Allow importing from tokenizers modules.
#18 by mfuntowicz was closed Jan 14, 2020 Loading…
Clean up Rust docs
#20 by epwalsh was merged Dec 30, 2019 Loading…
Implement dropout for BPE
#21 by epwalsh was merged Dec 31, 2019 Loading…
replace print statements with logging
#22 by epwalsh was closed Jan 1, 2020 Loading…
make sure we don't warn on empty tokens
#1554 by ArthurZucker was merged Jun 20, 2024 Loading…
refactor benchmarks
#25 by epwalsh was merged Jan 2, 2020 Loading…
TokenizerBuilder
#27 by epwalsh was closed Oct 20, 2020 Loading…
[WIP] try parking_lot::RwLock
#28 by epwalsh was closed Jan 3, 2020 Loading…
[WIP] try chashmap in cache implementations
#29 by epwalsh was closed Jan 3, 2020 Loading…
[WIP] try evmap
#32 by epwalsh was closed Jan 3, 2020 Loading…
avoid unnecessary write locks in the BPE cache
#34 by epwalsh was merged Jan 4, 2020 Loading…
Update benchmarks
#36 by epwalsh was merged Jan 4, 2020 Loading…
make cache optional
#37 by epwalsh was merged Jan 3, 2020 Loading…
ProTip! Adding no:label will show everything without a label.