Transformer Telepathy: Estimating Flops with Just a Glance

10 November 2022·29 words·1 min · Download pdf

How do you estimate flops, latency and memory footprint of a transformer model just looking at the architecture? Transformer inference arithmetic is a great post on how:

https://kipp.ly/blog/transformer-inference-arithmetic/

Discussion