Vanilla Transformer: An implementation of the original Transformer architecture described in "Attention Is All You Need" (Vaswani et al., 2017), including both the encoder and decoder stacks.
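
For orientation, the core operation shared by the encoder and decoder in that paper is scaled dot-product attention, softmax(QK^T / sqrt(d_k)) V. The following is a minimal pure-Python sketch of that formula, not this project's code: the function names `softmax` and `scaled_dot_product_attention` and the `causal` flag are assumptions made for illustration, and a real implementation would use a tensor library, multiple heads, and learned projections.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(Q, K, V, causal=False):
    """Compute softmax(Q K^T / sqrt(d_k)) V for lists of row vectors.

    Q, K, V are lists of equal-width rows (hypothetical layout for this sketch).
    With causal=True, position i may only attend to positions j <= i,
    as in the decoder's masked self-attention.
    """
    d_k = len(K[0])
    scale = math.sqrt(d_k)
    output = []
    for i, q in enumerate(Q):
        # Similarity of this query with every key, scaled by sqrt(d_k).
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in K]
        if causal:
            # Masked positions get -inf so their softmax weight is exactly 0.
            scores = [s if j <= i else float("-inf") for j, s in enumerate(scores)]
        weights = softmax(scores)
        # Weighted sum of the value rows.
        output.append(
            [sum(w * v[c] for w, v in zip(weights, V)) for c in range(len(V[0]))]
        )
    return output


if __name__ == "__main__":
    Q = [[1.0, 0.0], [0.0, 1.0]]
    K = [[1.0, 0.0], [0.0, 1.0]]
    V = [[1.0, 2.0], [3.0, 4.0]]
    print(scaled_dot_product_attention(Q, K, V))               # encoder-style
    print(scaled_dot_product_attention(Q, K, V, causal=True))  # decoder-style
```

The encoder uses this operation without a mask, while the decoder applies the causal mask in its self-attention so that each position cannot see later positions.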