This is the second article of my article series “Instructions on Transformer for people outside NLP field, but with examples of NLP.” 1 Machine translation and seq2seq models I think […]