LLM Architecture: Transformers & Attention

A deep dive into the components of the Transformer model.

1 week

Topics in this Chapter

Overview of the encoder-decoder stacks that define the model.

The core mechanism allowing the model to weigh token importance.

Injecting information about word order into the model.

Other essential components of a Transformer block.