(n.d.). How LSTM networks solve the problem of vanishing gradients | by Nir Arbel | Data Driven Investor | Medium. Retrieved from https://medium.com/datadriveninvestor/how-do-lstm-networks-solve-the-problem-of-vanishing-gradients-a6784971a577
(n.d.). [ Back to Basics ] Deriving Back Propagation on simple RNN/LSTM (feat. Aidan Gomez) | by Jae Duk Seo | Towards Data Science. Retrieved from https://towardsdatascience.com/back-to-basics-deriving-back-propagation-on-simple-rnn-lstm-feat-aidan-gomez-c7f286ba973d
(n.d.). Loss Functions — ML Glossary documentation. Retrieved from https://ml-cheatsheet.readthedocs.io/en/latest/loss_functions.html
(n.d.). CS 230 - Recurrent Neural Networks Cheatsheet. Retrieved from https://stanford.edu/~shervine/teaching/cs-230/cheatsheet-recurrent-neural-networks