You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi @quark0 , thanks for releasing the code. Really enjoy your paper. Here I have a problem and hope to get your help. I run the search code with 5 different random seeds, including the default seed and get 5 different RNN architectures. However, when I train these 5 architectures respectively from scratch, I get test ppl of 57.16, 61, 57.81, 60.99 and 57.53. None of them get a test ppl around 56. Is there anything I missed to get a robust result of 56.1 or 55.8 in the paper?
The text was updated successfully, but these errors were encountered:
You probably need to adjust the hyperparameters for the final evaluation. The default hyperparameters were tuned wrt the provided genotype but are likely suboptimal for the new architectures.
@quark0 Thanks for your response! Are there any suggestions for tunning hyperparams? Like what hyper parameters needed to be tuned, and the range. I see there`re a lot of hyper parameters needed to be tuned, including four dropout. Is it hard or expensive to do so?
Hi @quark0 , thanks for releasing the code. Really enjoy your paper. Here I have a problem and hope to get your help. I run the search code with 5 different random seeds, including the default seed and get 5 different RNN architectures. However, when I train these 5 architectures respectively from scratch, I get test ppl of 57.16, 61, 57.81, 60.99 and 57.53. None of them get a test ppl around 56. Is there anything I missed to get a robust result of 56.1 or 55.8 in the paper?
The text was updated successfully, but these errors were encountered: