The Decoder-only model with RoPE, SwiGLU and a BPE tokenizer is in assignment/assianment1-basics/cs336_basics. I only run one experiment on my mac because I do not ...
This assumes you've already launched a suitable MySQL or MariaDB database container. A minimal set-up using docker-compose is available in the .examples folder. If you want to use the import logs ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results