This repository has been archived by the owner on Feb 25, 2022. It is now read-only.

Pull requests: EleutherAI/gpt-neo

Pull requests list

Nuke stitch; simplify config; sh{ard,uffle} files
#28 by shawwn was closed Nov 19, 2020, updated Nov 19, 2020
Wikitext eval / fix validation
#76 by sdtblck was merged Nov 19, 2020, updated Nov 19, 2020
Update gpt2.py
#74 by sdtblck was merged Nov 17, 2020, updated Nov 17, 2020
improve on causal linear attention
#68 by lucidrains was merged Nov 1, 2020, updated Nov 1, 2020
Update gpt2.py
#67 by sdtblck was merged Nov 1, 2020, updated Nov 1, 2020
add faster sampling for global attention
#43 by lucidrains was merged Sep 12, 2020, updated Oct 10, 2020
3 of 4 tasks
Mistobaan/add summary
#44 by Mistobaan was closed Sep 12, 2020, updated Oct 10, 2020
logic for automatically generating dataset config file
#59 by lucidrains was merged Sep 22, 2020, updated Oct 10, 2020
set global random seed
#60 by lucidrains was closed Sep 20, 2020, updated Oct 10, 2020
make dataset shuffling and interleaving deterministic by addition of …
#63 by lucidrains was merged Sep 22, 2020, updated Oct 10, 2020
make sure datasets skips the number of batches equal to the current g…
#62 by lucidrains was merged Sep 22, 2020, updated Oct 10, 2020
lambada changes
#29 by kevinwatkins was merged Sep 11, 2020, updated Oct 10, 2020
Pw/revert dataset skipping
#66 by lucidrains was merged Sep 23, 2020, updated Oct 10, 2020
Cleanup
#65 by ConnorJL was merged Sep 23, 2020, updated Sep 23, 2020
Print warning if truncating input
#64 by sdtblck was merged Sep 23, 2020, updated Sep 23, 2020
Pw/resume deterministic dataset correctly
#61 by lucidrains was closed Sep 20, 2020, updated Sep 20, 2020
move scripts for tokenization and creating tfrecords to root, also ad…
#58 by lucidrains was merged Sep 19, 2020, updated Sep 19, 2020
training with multiple GPUs, update readme
#56 by lucidrains was merged Sep 19, 2020, updated Sep 19, 2020
allow use of tokenizer from gpt2 when creating tfrecords
#57 by lucidrains was merged Sep 19, 2020, updated Sep 19, 2020
add masked language modeling, for training BERT or RoBERTa like atten…
#47 by lucidrains was merged Sep 13, 2020, updated Sep 13, 2020
do not count padding tokens in loss
#45 by lucidrains was merged Sep 13, 2020, updated Sep 13, 2020
fix big bug with exclamation mark being used as padding for gpt2
#50 by lucidrains was merged Sep 13, 2020, updated Sep 13, 2020
main.py: another bug fix
#49 by kevinwatkins was merged Sep 13, 2020, updated Sep 13, 2020
loss_summary
#48 by sdtblck was merged Sep 12, 2020, updated Sep 12, 2020
add support for absl
#40 by Mistobaan was merged Sep 12, 2020, updated Sep 12, 2020