The decoder-only model with RoPE, SwiGLU, and a BPE tokenizer is in assignment/assianment1-basics/cs336_basics. I ran only one experiment on my Mac because I do not ...
This assumes you've already launched a suitable MySQL or MariaDB database container. A minimal setup using docker-compose is available in the .examples folder. If you want to use the import logs ...