Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/29330
Title: Deep Learning: the power behind the Long Short-term Memory model
Authors: VAN HOUDT, Greg 
Advisors: NAPOLES RUIZ, Gonzalo
Issue Date: 2019
Publisher: UHasselt
Abstract: Long Short-Term Memory (LSTM) has transformed both the machine learning and neurocomputing fields. According to the webpage of Prof. Jürgen Schmidhuber, one of LSTM's fathers, this model improved speech recognition on over 2 billion Android phones, greatly improved machine translation through Google Translate, and improved the answers of Amazon's Alexa. Interestingly, recurrent neural networks had shown rather modest performance until LSTM appeared. One reason for the success of this recurrent network lies in its ability to handle the exploding and vanishing gradient problem, which is difficult to circumvent when training recurrent or very deep neural networks. In this paper, we present a comprehensible review that covers both theory and practice. First, LSTM's formulation and training are briefly described. Because this theory was recently reviewed in the literature, the second part is the most elaborate, covering relevant applications reported in the literature. From this study, we learned that LSTM is a very suitable model for, among other tasks, time series prediction, text recognition and natural language processing. The applications also showed how LSTM can be combined with other models to create hybrid deep learning architectures, which is done to increase performance, as vanilla LSTM is not always the ideal answer. Finally, we conclude with code resources implementing the neural system for a toy example, showing how easy it is to run the model on a home computer.
Notes: Master of Applied Economic Sciences: Business Engineering in Management Information Systems (master in de toegepaste economische wetenschappen: handelsingenieur in de beleidsinformatica)
Document URI: http://hdl.handle.net/1942/29330
Category: T2
Type: Theses and Dissertations
Appears in Collections: Master theses

Files in This Item:
File: ccb6397d-f73f-4512-8b9a-a4113ed8a209.pdf
Size: 627.32 kB
Format: Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.