Papers on the architecture of Recurrent Neural Networks (RNNs)
24 Jul 2019
Bookmarking some papers mentioned in Andrew Ng’s course Sequence Models:
Gated Recurrent Unit (GRU)
- Cho, K., Van Merriënboer, B., Bahdanau, D., & Bengio, Y. (2014). On the properties of neural machine translation: Encoder-decoder approaches. arXiv preprint arXiv:1409.1259.
- Chung, J., Gulcehre, C., Cho, K., & Bengio, Y. (2014). Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555.
Long short-term memory (LSTM)
- Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural computation, 9(8), 1735-1780.
(More to update)