Keyword: LSTM GRU Seq2Seq Attention Related Papers: Efficient Estimation of Word Representations in Vector Space Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling Sequence to Sequence Learning with Neural Networks Neural Machine Translation by Jointly Learning to Align and Translate