Attention Is All You Need
attention-is-all-you-need
Research paper introducing transformer architecture
Replaced rnn and lstm for NLP tasks
encoder-decoder architecture with self-attention mechanism
Original use case: language-translation
Enabled multilingual-language-model
*References
*References
#ml-notes