[go: nahoru, domu]

Skip to content

Pull requests: huggingface/tokenizers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Adding a new tests for PreTokenizer.custom.
#467 by Narsil was merged Oct 15, 2020 Loading…
TemplateProcessing serialization is now deterministic
#476 by n1t0 was merged Oct 21, 2020 Loading…
Finish exposing the UnicodeScripts PreTokenizer
#477 by n1t0 was merged Oct 21, 2020 Loading…
Last missing pieces for Unigram + ByteLevel
#480 by n1t0 was merged Oct 26, 2020 Loading…
Fix UnigramTrainer
#485 by n1t0 was merged Oct 26, 2020 Loading…
Improve Encoding mappings for pairs of sequence
#506 by n1t0 was merged Nov 6, 2020 Loading…
Python 0.9.4
#514 by n1t0 was merged Nov 9, 2020 Loading…
New PR to fix #270 (not #157).
#516 by Narsil was merged Nov 11, 2020 Loading…
Python - Mutable components
#530 by n1t0 was merged Nov 27, 2020 Loading…
Python releases build & upload Conda packages
#533 by LysandreJik was merged Nov 19, 2020 Loading…
Improve Python API Reference and help
#538 by n1t0 was merged Nov 23, 2020 Loading…
Ability to train from memory
#544 by n1t0 was merged Nov 28, 2020 Loading…
1 task done
2
Python - Prepare pre-release 0.10.0rc1
#551 by n1t0 was merged Dec 8, 2020 Loading…
2 tasks
Fix WordLevelTrainer default values
#557 by n1t0 was merged Dec 8, 2020 Loading…
Bump ini from 1.3.5 to 1.3.8 in /bindings/node dependencies Pull requests that update a dependency file
#561 by dependabot bot was merged Dec 15, 2020 Loading…
Python - Improve training with iterators
#565 by n1t0 was merged Jan 6, 2021 Loading…
Fix clippy warnings for rust 1.49
#582 by n1t0 was merged Jan 6, 2021 Loading…
Python - Add train_from_iterator to implementations
#583 by n1t0 was merged Jan 7, 2021 Loading…
Python - Fix breaking change in Model.save
#589 by n1t0 was merged Jan 11, 2021 Loading…
Python - Add components getter/setters to BaseTokenizer
#590 by n1t0 was merged Jan 11, 2021 Loading…
Simplify Whitespace pre_tokenizer
#591 by n1t0 was merged Jan 11, 2021 Loading…
Add documentation for training from iterators
#594 by n1t0 was merged Jan 12, 2021 Loading…
Python - Prepare for release 0.10.0
#595 by n1t0 was merged Jan 12, 2021 Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.