Introduction to Recurrent Neural Network

Yan Xu
Houston Machine Learning Meetup
May 20, 2017
Introduction to Recurrent Neural Network

Roadmap
• Tour of machine learning algorithms (1 session)
• Feature engineering (1 session)
• Feature selection - Yan
• Supervised learning (4 sessions)
• Regression models -Yan
• SVM and kernel SVM - Yan
• Tree-based models - Dario
• Bayesian method - Xiaoyang
• Ensemble models - Yan
• Unsupervised learning (3 sessions)
• K-means clustering
• DBSCAN - Cheng
• Mean shift
• Agglomerative clustering – Kunal
• Spectral clustering – Yan
• Dimension reduction for data visualization - Yan
• Deep learning (4 sessions)
• Neural network - Yan
• Convolutional neural network – Hengyang Lu
• Recurrent neural networks – Yan
• Hands-on session with deep nets
Slides posted on:
http://www.slideshare.net/xuyangela

More deep learning coming up!
• Optimization in Deep learning
• Behind AlphaGo
• Mastering the game of Go with deep neural networks
and tree search
• Deep learning showcase: Share your experience!

Outline
• Recap on neural network
• Recurrent neural network overview
• Application of RNN
• Long short term memory network
• An example

Recap: Feed-forward neural network
Activation
function

Activation function
rectified linear unit (ReLU)

Training with gradient descent

Convolutional Neural Network
Full-connected
neural nets
Convolutional
neural nets
(kernel size = 2)

Recurrent NN: Considering Sequence

Recurrent Neural Network
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Weights are kept the same in cell A!

Recurrent Neural Network

Wide application of RNN
Image
classification
Image
Captioning
Sentiment
analysis
Machine
translation
Labeling each
frame of video

Special RNN: LSTM NN
• Short term memory
• Long term memory
the clouds are in the sky
I grew up in China … I speak fluent Chinese.

Special RNN: LSTM NN
SLTM in products!
• Google Translate
• Apple Siri
• Amazon Alexa
Cell
https://www.youtube.com/watch?v=93rzMHtYT_0

LSTM

Keep gate
N × 1 M × 1
（N+M) × 1N × (N+M)
N × 1 N × 1N × 1

Write Gate

Update cell state

Read gate

Training LSTM
• Back propagates like feed-forward nets
• Sum up all updates and applied to all

Example: Predicting next word
https://medium.com/towards-data-science/lstm-by-example-using-tensorflow-feb0c1968537

Each word represented by an integer. Output is a one-hot vector.
512 hidden units
Improvement?
Example: Predicting next word

Generating a story!
Input: a general council
had a general council to consider what measures they could take to outwit their
common enemy , the cat . some said this , and some said that but at last a young
mouse got
Input: mouse mouse mouse
mouse mouse mouse , neighbourhood and could receive a outwit always the neck
of the cat . some said this , and some said that but at last a young mouse got up
and said

Great reference
• http://colah.github.io/posts/2015-08-Understanding-LSTMs/
• https://medium.com/@ageitgey/machine-learning-is-fun-part-5-language-
translation-with-deep-learning-and-the-magic-of-sequences-2ace0acca0aa
• Visualizing and Understanding RNN:
• https://skillsmatter.com/skillscasts/6611-visualizing-and-understanding-recurrent-networks

Summary
• Learn about RNN, how it relates to feed forward NN
• Long short term memory RNN
• Keep gate
• Write gate
• Read gate
• Application and Example

Roadmap
• Tour of machine learning algorithms (1 session)
• Feature engineering (1 session)
• Feature selection - Yan
• Supervised learning (4 sessions)
• Regression models -Yan
• SVM and kernel SVM - Yan
• Tree-based models - Dario
• Bayesian method - Xiaoyang
• Ensemble models - Yan
• Unsupervised learning (3 sessions)
• K-means clustering
• DBSCAN - Cheng
• Mean shift
• Agglomerative clustering – Kunal
• Spectral clustering – Yan
• Dimension reduction for data visualization - Yan
• Deep learning (4 sessions)
• Neural network - Yan
• Convolutional neural network – Hengyang Lu
• Recurrent neural networks – Yan
• Hands-on session with deep nets
Slides posted on:
http://www.slideshare.net/xuyangela
More deep learning
coming up!

Thank you
Data Disruptors Conference, ddc (energy)
@ Houston, June 14
PROMO: HEDS99 to get 99$ off
Slides will be posted at: http://www.slideshare.net/xuyangela
Leave a
group
review
please 

Introduction to Recurrent Neural Network

More Related Content

Introduction to Recurrent Neural Network