
Deep Learning for Natural Language Processing

Lecture 2: Deep Neural Networks and Word Embedding

Quan Thanh Tho


Faculty of Computer Science and Engineering
Ho Chi Minh City University of Technology
Acknowledgement

• Some slides are adapted from Prof. Andrew Ng’s Coursera course.


Agenda

• Intuition of Neural Network for Classification Task


• Neural Network and Deep NN for Text Classification
• Specific DNN Architecture: CNN and AutoEncoder
• From AutoEncoder to Word Embedding
• Doc2vec: combining tf.idf and WE
Classical Machine Learning Techniques for NLP

• Machine Learning Task


• The tf.idf weights
• Neural-based approach
Word Embedding
To compare pieces of text

• We need effective representations of:


– Words
– Sentences
– Text
• Approach 1: Use existing thesauri or ontologies like WordNet and SNOMED CT (for the medical domain). Drawbacks:
– Manual
– Not context specific
• Approach 2: Use co-occurrences for word similarity. Drawbacks:
– Quadratic space needed
– Relative position and order of words not considered

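Before moving on to Approach 3, here is a minimal sketch of Approach 2 (not from the slides): a word–word co-occurrence matrix over a toy two-sentence corpus, with words compared by cosine similarity. The corpus, window size and variable names are illustrative assumptions.

```python
# Sketch of Approach 2: co-occurrence counts + cosine similarity (toy corpus).
import numpy as np

corpus = [["the", "cat", "sat", "on", "the", "mat"],
          ["the", "dog", "sat", "on", "the", "floor"]]
window = 2

vocab = sorted({w for sent in corpus for w in sent})
idx = {w: i for i, w in enumerate(vocab)}

# Count co-occurrences within the window (note the quadratic space in |V|).
X = np.zeros((len(vocab), len(vocab)))
for sent in corpus:
    for i, w in enumerate(sent):
        lo, hi = max(0, i - window), min(len(sent), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                X[idx[w], idx[sent[j]]] += 1

def cosine(u, v):
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-9)

print(cosine(X[idx["cat"]], X[idx["dog"]]))  # similarity of two count rows
```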
Approach 3: low-dimensional vectors
• Store only the “important” information in a fixed, low-dimensional vector.
• Singular Value Decomposition (SVD) on the co-occurrence matrix X
– The truncated SVD X_k = U_k Σ_k V_kᵀ is the best rank-k approximation to X, in terms of least squares
– Motel = [0.286, 0.792, -0.177, -0.107, 0.109, -0.542, 0.349, 0.271]
• m = n = size of vocabulary
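A minimal sketch of Approach 3, using a toy symmetric matrix as a stand-in for real co-occurrence counts (in practice X would be built from a corpus as in the previous sketch, and k would be much smaller than the vocabulary size).

```python
# Sketch: truncated SVD of a (toy) co-occurrence matrix.
import numpy as np

rng = np.random.default_rng(0)
V = 6                                         # toy vocabulary size
X = rng.poisson(1.0, size=(V, V)).astype(float)
X = X + X.T                                   # symmetric, like co-occurrence counts

k = 2                                         # target dimensionality (toy value)
U, S, Vt = np.linalg.svd(X, full_matrices=False)
X_k = U[:, :k] @ np.diag(S[:k]) @ Vt[:k, :]   # best rank-k approximation (least squares)
word_vectors = U[:, :k] * S[:k]               # one k-dimensional vector per word
print(word_vectors.shape)                     # (6, 2)
```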
Problems with SVD

• Computational cost scales quadratically for an n × m matrix: O(mn²) flops (when n < m)
• Hard to incorporate new words or documents
• Does not consider order of words
word2vec: an approach to represent the meaning of words
• Represent each word with a low-dimensional vector
• Word similarity = vector similarity
• Key idea: Predict surrounding words of every word
• Faster and can easily incorporate a new sentence/document or add a
word to the vocabulary

Word2Vec

Auto-Encoder

Stacked Auto-Encoder
Represent the meaning of words – word2vec

• Two basic neural network models (a training sketch follows below):

– Continuous Bag-of-Words (CBOW): use a window of context words to predict the middle word
– Skip-gram (SG): use a word to predict the surrounding words in the window
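A minimal training sketch for the two variants, assuming gensim ≥ 4.0 is installed; the toy corpus and hyper-parameters (vector_size, window, min_count) are illustrative choices, not values from the lecture.

```python
# Minimal word2vec training sketch (assumes gensim >= 4.0).
from gensim.models import Word2Vec

sentences = [["the", "cat", "sat", "on", "the", "floor"],
             ["the", "dog", "sat", "on", "the", "mat"]]

# CBOW (sg=0): the window of context words predicts the middle word.
cbow = Word2Vec(sentences, vector_size=50, window=2, min_count=1, sg=0)

# Skip-gram (sg=1): the middle word predicts each surrounding word.
skipgram = Word2Vec(sentences, vector_size=50, window=2, min_count=1, sg=1)

print(cbow.wv["cat"].shape)             # (50,) word vector
print(skipgram.wv.most_similar("cat"))  # nearest words by cosine similarity
```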
Word2vec – Continuous Bag-of-Words

• E.g. “The cat sat on floor”
– Window size = 2

(Figure: for the centre word “sat”, the context words within the window are “the”, “cat”, “on” and “floor”.)
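A small sketch (not from the slides) that enumerates the CBOW training examples for this sentence: each centre word is paired with the context words inside the window.

```python
# Sketch: CBOW training examples (context window -> centre word)
# for "The cat sat on floor" with window size 2.
sentence = ["the", "cat", "sat", "on", "floor"]
window = 2

for i, centre in enumerate(sentence):
    context = [sentence[j]
               for j in range(max(0, i - window), min(len(sentence), i + window + 1))
               if j != i]
    print(context, "->", centre)
# e.g. ['the', 'cat', 'on', 'floor'] -> sat
```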
(Figure, “Input layer”: the context words “cat” and “on” enter the network as V-dimensional one-hot vectors, with a single 1 at the word’s index in the vocabulary (e.g. the index of “cat”) and zeros elsewhere; they feed a hidden layer and an output layer, and the target word “sat” is likewise a one-hot vector.)
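A minimal sketch of the one-hot encoding shown in the figure; the five-word vocabulary and the word order inside it are illustrative assumptions.

```python
# Sketch: one-hot vectors for the context words "cat", "on" and the target "sat".
import numpy as np

vocab = ["the", "cat", "sat", "on", "floor"]   # toy vocabulary
V = len(vocab)
index = {w: i for i, w in enumerate(vocab)}

def one_hot(word):
    x = np.zeros(V)
    x[index[word]] = 1.0       # a single 1 at the word's vocabulary index
    return x

x_cat, x_on = one_hot("cat"), one_hot("on")
y_sat = one_hot("sat")         # the target is also a one-hot vector
print(x_cat)                   # [0. 1. 0. 0. 0.]
```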
We must learn W and W’

(Figure: each V-dimensional one-hot input (“cat”, “on”) is multiplied by the input weight matrix W (V×N) to give the N-dimensional hidden layer; the output weight matrix W’ (N×V) maps the hidden layer to the V-dimensional output for the target “sat”. N will be the size of the word vectors.)
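A minimal sketch of the two parameter matrices, with toy sizes V = 5 and N = 3; real models use a much larger vocabulary and vector size.

```python
# Sketch: the two CBOW parameter matrices.
# W  (V x N) maps a one-hot input to the N-dim hidden layer;
# W' (N x V) maps the hidden layer back to V-dim output scores.
import numpy as np

V, N = 5, 3
rng = np.random.default_rng(0)
W = rng.normal(size=(V, N))        # input weights (rows will be the word vectors)
W_prime = rng.normal(size=(N, V))  # output weights

x = np.zeros(V); x[1] = 1.0        # one-hot for the word at index 1 (toy "cat")
h = W.T @ x                        # N-dimensional hidden activation
print(h.shape)                     # (3,)
```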
Wᵀ · x_cat = v_cat

(Figure: multiplying the transposed input matrix Wᵀ (N×V, since W is V×N) by the one-hot vector x_cat simply picks out the row of W for “cat”, giving its N-dimensional vector v_cat; the same holds for x_on and v_on. The hidden layer is the average of the context vectors: v̂ = (v_cat + v_on) / 2.)
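A quick numpy check of what the figure illustrates: multiplying Wᵀ by a one-hot vector returns the corresponding row of W, so v_cat is just a table lookup. The sizes and the position of “cat” are toy assumptions.

```python
# Sketch: W^T times a one-hot vector is a row lookup into W.
import numpy as np

V, N = 5, 3
rng = np.random.default_rng(1)
W = rng.normal(size=(V, N))

cat_index = 1                    # hypothetical position of "cat" in the vocabulary
x_cat = np.zeros(V)
x_cat[cat_index] = 1.0

v_cat = W.T @ x_cat              # N-dimensional vector for "cat"
print(np.allclose(v_cat, W[cat_index]))   # True: the product just selects a row of W
```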
Wᵀ · x_on = v_on

(Figure: the same multiplication for the one-hot vector x_on selects the row of W for “on”, giving v_on; again the hidden layer is v̂ = (v_cat + v_on) / 2.)
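A one-line version of the averaging step, using 3-dimensional toy values loosely echoing the visible entries in the figure (the real vectors are N-dimensional).

```python
# Sketch: the CBOW hidden layer is the average of the context word vectors.
import numpy as np

v_cat = np.array([2.4, 2.6, 1.8])   # toy values
v_on  = np.array([1.8, 2.9, 1.9])   # toy values

v_hat = (v_cat + v_on) / 2
print(v_hat)                        # [2.1  2.75 1.85]
```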
W’ᵀ · v̂ = z,   ŷ = softmax(z)

(Figure: the N-dimensional hidden vector v̂ is multiplied by the output matrix W’ to give the V-dimensional score vector z, and the softmax turns z into the predicted distribution ŷ over the vocabulary, compared against the one-hot output for “sat”. N will be the size of the word vectors.)
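A minimal sketch of the output step, with toy sizes; the softmax is written in its numerically stable form.

```python
# Sketch: output scores z = W'^T . v_hat and probabilities y_hat = softmax(z).
import numpy as np

V, N = 5, 3
rng = np.random.default_rng(2)
W_prime = rng.normal(size=(N, V))    # output weight matrix W' (N x V)
v_hat = np.array([2.1, 2.75, 1.85])  # hidden layer (toy values)

z = W_prime.T @ v_hat                # V-dimensional scores
y_hat = np.exp(z - z.max())          # numerically stable softmax
y_hat /= y_hat.sum()
print(y_hat, y_hat.sum())            # a probability distribution over the V words
```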
We would prefer ŷ to be close to y_sat

(Figure: the same network, now showing the softmax output ŷ as a probability distribution over the vocabulary (values such as 0.01, 0.02, …, 0.7, …); training should push ŷ towards the one-hot target vector for “sat”. N will be the size of the word vectors.)
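The extracted slide text does not spell out the loss, but this preference is typically expressed by minimising the cross-entropy between ŷ and the one-hot target; a minimal sketch with toy probabilities follows.

```python
# Sketch (assumption: cross-entropy objective): making y_hat close to the
# one-hot target y_sat means minimising -log y_hat[index of "sat"].
import numpy as np

y_hat = np.array([0.01, 0.02, 0.70, 0.02, 0.25])  # toy softmax output (sums to 1)
sat_index = 2                                      # hypothetical position of "sat"

loss = -np.log(y_hat[sat_index])
print(loss)   # smaller when y_hat[sat_index] is close to 1
```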
Wᵀ — contains the word’s vectors

(Figure: the trained input matrix is shown next to the network once more; each column of Wᵀ, i.e. each row of the V×N matrix W, is the N-dimensional vector of one vocabulary word.)

We can consider either W or W’ as the word’s representation, or even take the average.
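A minimal sketch of the three options the slide mentions, using toy matrices in place of trained ones.

```python
# Sketch: reading a word's embedding from W, from W'^T, or as their average.
import numpy as np

V, N = 5, 3
rng = np.random.default_rng(3)
W = rng.normal(size=(V, N))          # trained input matrix (toy stand-in)
W_prime = rng.normal(size=(N, V))    # trained output matrix (toy stand-in)

cat_index = 1                        # hypothetical index of "cat"
vec_from_W  = W[cat_index]           # row of W
vec_from_Wp = W_prime[:, cat_index]  # column of W', i.e. row of W'^T
vec_avg = (vec_from_W + vec_from_Wp) / 2
print(vec_avg.shape)                 # (3,)
```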
Some interesting results

Word analogies

Combination of tf.idf and WE

• Recall, for a document with t words:
– Doc = {tfidf_1, tfidf_2, …, tfidf_t}   (the tf.idf weight of each word)
– W_1 = {x_11, x_12, …, x_1N}   (the word vector of word 1)
– …
– W_t = {x_t1, x_t2, …, x_tN}   (the word vector of word t)
• Then d_1 = W_1 · tfidf_1, d_2 = W_2 · tfidf_2, … etc. (see the sketch below)
• Finally DocVect = the average of all d_i
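A minimal sketch of this combination, assuming the tf.idf weights and word vectors are already available; the names and values are illustrative.

```python
# Sketch: document vector as the tf.idf-weighted average of its word vectors.
import numpy as np

word_vectors = {                      # toy word embeddings (N = 3)
    "cat": np.array([2.4, 2.6, 1.8]),
    "sat": np.array([0.5, 1.5, 3.6]),
    "mat": np.array([1.9, 2.4, 2.0]),
}
tfidf = {"cat": 0.7, "sat": 0.1, "mat": 0.5}          # toy tf.idf weights

doc_words = ["cat", "sat", "mat"]
d = [tfidf[w] * word_vectors[w] for w in doc_words]   # d_i = tfidf_i * W_i
doc_vect = np.mean(d, axis=0)                         # average of all d_i
print(doc_vect)
```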
