Skip to content

Attention Is All You Need

attention-is-all-you-need

Research paper introducing transformer architecture

Replaced rnn and lstm for NLP tasks

encoder-decoder architecture with self-attention mechanism

Original use case: language-translation

Enabled multilingual-language-model


*References


*References

#ml-notes

On this page