Weaving Longer Contexts: Yarn Scaling for LLMs
·25 words·1 min
Yarn scaling technique looks amazing to extend the context length of LLMs through fine tuning with no sacrifice on the base model performance! https://x.com/EnricoShippole/status/1697317625116742119