PaLM2-L's Token Diet: Outsmarting Chinchilla and LLaMA
The leaked PaLM2 numbers below are a bit unusual. They point to the new scaling-law paper that was alluded to.
- Chinchilla-optimal tokens for 340B parameters: 8.7T.
- LLaMA-style optimal tokens: at least 26T.
This means PaLM2-L needed 2.5× fewer tokens than the Chinchilla standard suggests, and even fewer by LLaMA standards. https://x.com/ml_hardware/status/1658936724943142913
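A quick back-of-the-envelope sketch of the arithmetic above, using only the numbers stated in this post. The actual PaLM2-L token count is not public; the ~3.5T figure here is merely what the stated 2.5× ratio implies, not a confirmed value.

```python
# Numbers as stated in the post (tokens, not parameters).
chinchilla_optimal = 8.7e12  # Chinchilla-optimal tokens for 340B params
llama_optimal = 26e12        # LLaMA-style optimum, at least 26T

# Implied PaLM2-L training tokens, backed out from the stated 2.5x ratio.
implied_palm2_tokens = chinchilla_optimal / 2.5  # roughly 3.5T

print(f"Implied PaLM2-L tokens: {implied_palm2_tokens / 1e12:.2f}T")
print(f"Shortfall vs Chinchilla: {chinchilla_optimal / implied_palm2_tokens:.1f}x")
print(f"Shortfall vs LLaMA: {llama_optimal / implied_palm2_tokens:.1f}x")
```

The last ratio shows why "even fewer by LLaMA standards" holds: against a 26T optimum, the implied token budget is short by roughly 7.5×, not just 2.5×.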