A lot of basic, important information about transformer language models can be computed quite simply. Unfortunately, the equations for this are not widely known in the NLP community. The purpose of this document is to collect these equations along with related knowledge about where they come from and why they matter.
Note: This post is primarily concerned with training costs, which are dominated by VRAM considerations. For an analogous discussion of inference costs, with a focus on latency, see this excellent blog post by Kipply.