diff --git a/README.md b/README.md
index f019f96..450d7f3 100644
--- a/README.md
+++ b/README.md
@@ -141,6 +141,7 @@ answer = trainer.generate(2048, prompt = prompts[0], num_samples = 10) # (<= 204
 - [ ] allow for finetuning penultimate N layers only in either actor or critic, assuming if pretrained
 - [ ] incorporate some learning points from Sparrow, given Letitia's video
 - [ ] simple web interface with django + htmx for collecting human feedback
+- [ ] equip with the best attention
 
 ## Citations