LLM Inference on a Shoestring Budget
Very nice overview of techniques for making LLM inference cheap:
https://lilianweng.github.io/posts/2023-01-10-inference-optimization/