Training curves: lessons from the past

Bloom

GPT3

Llama1

Llama2