In this post, I plan to explore aspects of cutting edge architectures in NLP like BERT/Transformers. I assume that the readers are...