
Chinchillas and Flashy Attention: How ChatGPT Got 10X Faster


A lot of people are surprised that ChatGPT inference costs roughly 10X less than GPT-3, but the field has made many advances over the past two years.

The top two are:

  1. Chinchilla already showed that a compute-optimal model can match GPT-3 quality at roughly 2.5X smaller size (70B vs. 175B parameters).

  2. Using FlashAttention adds another 5-6X gain in attention speed and memory.
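
The Chinchilla numbers in point 1 are easy to sanity-check with back-of-envelope arithmetic. The figures below are the published GPT-3 and Chinchilla parameter/token counts; the "~20 tokens per parameter" rule is a common reading of the Chinchilla result, not an exact law, and `6*N*D` is the standard FLOPs approximation for training:

```python
gpt3_params = 175e9         # GPT-3 parameter count
gpt3_tokens = 300e9         # GPT-3 training tokens

chinchilla_params = 70e9    # Chinchilla parameter count
chinchilla_tokens = 1.4e12  # Chinchilla training tokens (~20 tokens/param)

# Training compute, using the standard 6 * N * D FLOPs approximation.
gpt3_flops = 6 * gpt3_params * gpt3_tokens
chinchilla_flops = 6 * chinchilla_params * chinchilla_tokens

print(f"model size reduction:  {gpt3_params / chinchilla_params:.1f}x")   # 2.5x
print(f"tokens per parameter:  {chinchilla_tokens / chinchilla_params:.0f}")
print(f"compute ratio (Chinchilla / GPT-3): {chinchilla_flops / gpt3_flops:.2f}")
```

The model is 2.5X smaller for modestly more training compute, and a 2.5X smaller model is roughly 2.5X cheaper to serve at inference time, which is where the cost savings show up.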

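The core algorithmic idea behind point 2 is the online (streaming) softmax: attention can be computed block-by-block over the keys and values, carrying a running max and normalizer per query row, so the full N×N score matrix is never materialized. Here is a minimal NumPy sketch of that trick; the real FlashAttention kernel fuses this loop into SRAM-resident GPU tiles, so this illustrates only the algorithm, not the speed:

```python
import numpy as np

def naive_attention(Q, K, V):
    # Materializes the full N x N score matrix: O(N^2) memory.
    S = Q @ K.T / np.sqrt(Q.shape[-1])
    P = np.exp(S - S.max(axis=-1, keepdims=True))
    P /= P.sum(axis=-1, keepdims=True)
    return P @ V

def tiled_attention(Q, K, V, block=32):
    # FlashAttention-style streaming softmax: process K/V in blocks,
    # keeping a running row max (m) and normalizer (l) per query,
    # rescaling the accumulated output whenever the max changes.
    N, d = Q.shape
    O = np.zeros((N, d))
    m = np.full(N, -np.inf)   # running row max
    l = np.zeros(N)           # running softmax denominator
    for j in range(0, N, block):
        Kj, Vj = K[j:j + block], V[j:j + block]
        S = Q @ Kj.T / np.sqrt(d)             # only N x block scores live
        m_new = np.maximum(m, S.max(axis=-1))
        scale = np.exp(m - m_new)             # rescale previous accumulators
        P = np.exp(S - m_new[:, None])
        l = l * scale + P.sum(axis=-1)
        O = O * scale[:, None] + P @ Vj
        m = m_new
    return O / l[:, None]
```

Both functions return identical results; the tiled version just never holds more than one N×block slice of scores at a time, which is what lets the real kernel keep everything in fast on-chip memory instead of HBM.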
Discussion