Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
SlideShare a Scribd company logo
Yan Xu
Houston Machine Learning Meetup
May 20, 2017
Introduction to Recurrent Neural Network
Roadmap
• Tour of machine learning algorithms (1 session)
• Feature engineering (1 session)
• Feature selection - Yan
• Supervised learning (4 sessions)
• Regression models -Yan
• SVM and kernel SVM - Yan
• Tree-based models - Dario
• Bayesian method - Xiaoyang
• Ensemble models - Yan
• Unsupervised learning (3 sessions)
• K-means clustering
• DBSCAN - Cheng
• Mean shift
• Agglomerative clustering – Kunal
• Spectral clustering – Yan
• Dimension reduction for data visualization - Yan
• Deep learning (4 sessions)
• Neural network - Yan
• Convolutional neural network – Hengyang Lu
• Recurrent neural networks – Yan
• Hands-on session with deep nets
Slides posted on:
http://www.slideshare.net/xuyangela
More deep learning coming up!
• Optimization in Deep learning
• Behind AlphaGo
• Mastering the game of Go with deep neural networks
and tree search
• Deep learning showcase: Share your experience!
Outline
• Recap on neural network
• Recurrent neural network overview
• Application of RNN
• Long short term memory network
• An example
Recap: Feed-forward neural network
Activation
function
Activation function
rectified linear unit (ReLU)
Training with gradient descent
Convolutional Neural Network
Full-connected
neural nets
Convolutional
neural nets
(kernel size = 2)
Recurrent NN: Considering Sequence
Recurrent Neural Network
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Weights are kept the same in cell A!
Recurrent Neural Network
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Machine Translation
Conversation Bot
Image Description
Image Search
Write like Shakespeare
Wide application of RNN
Image
classification
Image
Captioning
Sentiment
analysis
Machine
translation
Labeling each
frame of video
Special RNN: LSTM NN
• Short term memory
• Long term memory
the clouds are in the sky
I grew up in China … I speak fluent Chinese.
Special RNN: LSTM NN
SLTM in products!
• Google Translate
• Apple Siri
• Amazon Alexa
Cell
https://www.youtube.com/watch?v=93rzMHtYT_0
LSTM
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Keep gate
N × 1 M × 1
(N+M) × 1N × (N+M)
N × 1 N × 1N × 1
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Write Gate
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Update cell state
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Read gate
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
Training LSTM
• Back propagates like feed-forward nets
• Sum up all updates and applied to all
Example: Predicting next word
https://medium.com/towards-data-science/lstm-by-example-using-tensorflow-feb0c1968537
Each word represented by an integer. Output is a one-hot vector.
512 hidden units
Improvement?
Example: Predicting next word
Generating a story!
Input: a general council
had a general council to consider what measures they could take to outwit their
common enemy , the cat . some said this , and some said that but at last a young
mouse got
Input: mouse mouse mouse
mouse mouse mouse , neighbourhood and could receive a outwit always the neck
of the cat . some said this , and some said that but at last a young mouse got up
and said
Great reference
• http://colah.github.io/posts/2015-08-Understanding-LSTMs/
• https://medium.com/@ageitgey/machine-learning-is-fun-part-5-language-
translation-with-deep-learning-and-the-magic-of-sequences-2ace0acca0aa
• Visualizing and Understanding RNN:
• https://skillsmatter.com/skillscasts/6611-visualizing-and-understanding-recurrent-networks
Summary
• Learn about RNN, how it relates to feed forward NN
• Long short term memory RNN
• Keep gate
• Write gate
• Read gate
• Application and Example
Roadmap
• Tour of machine learning algorithms (1 session)
• Feature engineering (1 session)
• Feature selection - Yan
• Supervised learning (4 sessions)
• Regression models -Yan
• SVM and kernel SVM - Yan
• Tree-based models - Dario
• Bayesian method - Xiaoyang
• Ensemble models - Yan
• Unsupervised learning (3 sessions)
• K-means clustering
• DBSCAN - Cheng
• Mean shift
• Agglomerative clustering – Kunal
• Spectral clustering – Yan
• Dimension reduction for data visualization - Yan
• Deep learning (4 sessions)
• Neural network - Yan
• Convolutional neural network – Hengyang Lu
• Recurrent neural networks – Yan
• Hands-on session with deep nets
Slides posted on:
http://www.slideshare.net/xuyangela
More deep learning
coming up!
Thank you
Data Disruptors Conference, ddc (energy)
@ Houston, June 14
PROMO: HEDS99 to get 99$ off
Slides will be posted at: http://www.slideshare.net/xuyangela
Leave a
group
review
please 

More Related Content

Introduction to Recurrent Neural Network