Hrishi's Blog

What is a Transformer?

A Transformer is an architecture that makes it very efficient to compute a function (the AI model) capable of approximating arbitrary curves in high-dimensional spaces.

The Transformer is the architecture for building our AI model.

The transformer defines:

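The Transformer itself doesn’t compute anything, and it is not the AI model. It defines the form of the function (the curve family) and gives us an efficient way to train and evaluate it. The AI model is one particular function from that family, selected by the parameter values found during training.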
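To make the split between "architecture" and "model" concrete, here is a minimal sketch in plain Python. It is a deliberately stripped-down stand-in, not a full Transformer: a single self-attention layer with no feed-forward block, residual connections, or normalization, and with random rather than trained weights. The names `init_params` and `attention_layer` are made up for this illustration. The point is only that the code of `attention_layer` fixes the form of the function, while each set of parameters picks one function out of that family.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def init_params(d_model, seed):
    """Pick one member of the function family (hypothetical helper).

    Real models get these values from training; here they are random.
    """
    rng = random.Random(seed)
    scale = d_model ** -0.5

    def matrix():
        return [[rng.gauss(0.0, scale) for _ in range(d_model)] for _ in range(d_model)]

    return {"Wq": matrix(), "Wk": matrix(), "Wv": matrix()}


def attention_layer(params, x):
    """The 'architecture': a fixed recipe with free parameters.

    x is a list of token vectors (sequence_length x d_model).
    """
    q = matmul(x, params["Wq"])
    k = matmul(x, params["Wk"])
    v = matmul(x, params["Wv"])
    d = len(x[0])
    k_t = [list(col) for col in zip(*k)]          # transpose of k
    scores = matmul(q, k_t)                       # token-to-token similarity
    weights = [softmax([s / math.sqrt(d) for s in row]) for row in scores]
    return matmul(weights, v)                     # weighted mix of value vectors


# Same architecture, same input, two different parameter sets = two different models.
rng = random.Random(0)
x = [[rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(4)]  # 4 tokens, 8 dims

model_a = init_params(8, seed=1)
model_b = init_params(8, seed=2)

out_a = attention_layer(model_a, x)
out_b = attention_layer(model_b, x)
```

Both `out_a` and `out_b` have the same shape as the input (4 tokens, 8 dimensions each), but their values differ because the two parameter sets describe different functions. Training is the process of searching this family for the parameters that fit the data.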