Topic 6: Natural Language Processing (NLP)
3. Sentiment Analysis:- also known as opinion mining. It is used on the web to analyse
the attitude, behaviour, and emotional state of the sender.
4. Machine translation:- used to translate text or speech from one natural language to
another natural language.
5. Speech recognition:- used for converting spoken words into text. It is used in
applications, such as mobile
3. Semantic Analysis − It draws the exact meaning or the dictionary meaning from the
text. The text is checked for meaningfulness. This is done by mapping syntactic
structures to objects in the task domain. The semantic analyser rejects phrases
such as “hot ice-cream”, which are grammatical but not meaningful.
SCS 4101 Artificial Intelligence Wednesday, December 25, 2024
6.5 Phases of NLP
NLP has the following five steps/phases:
4. Discourse Integration − The meaning of any sentence depends upon the meaning of
the sentence just before it. In addition, it also influences the meaning of the immediately
succeeding sentence.
5. Pragmatic Analysis − During this phase, what was said is re-interpreted in terms of what
was actually meant. It involves deriving those aspects of language which require real-world knowledge.
Example:
Manya is looking for a match.
In the above example, the word "match" is ambiguous: Manya may be
looking for a partner, or Manya may be looking for a game (a cricket
match or some other match).
Example:
I saw the girl with the binoculars.
In the above example, did I have the binoculars? Or did the girl have
the binoculars?
Example:
Kiran went to Sunita. She said, "I am hungry."
In the above example, it is not clear who is hungry: "She" could refer
to either Kiran or Sunita.
3. Text Representation
Bag of Words (BoW): Representing text as a collection of words,
ignoring grammar and word order but keeping track of word
frequency.
Term Frequency-Inverse Document Frequency (TF-IDF): A
statistic that reflects the importance of a word in a document relative
to a collection of documents.
Word Embeddings: Using dense vector representations of words
where semantically similar words are closer together in the vector
space (e.g., Word2Vec, GloVe).
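The first two representations above can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation: the whitespace tokenizer, the helper names (tokenize, bag_of_words, tf_idf), and the three toy documents are all assumptions made for this example. The TF-IDF weight here is the basic form, term frequency multiplied by log(N / df); library implementations often use smoothed or normalised variants.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive tokenizer (assumption): lowercase, then split on whitespace.
    return text.lower().split()


def bag_of_words(text):
    # Bag of Words: word -> count, ignoring grammar and word order.
    return Counter(tokenize(text))


def tf_idf(docs):
    # Basic TF-IDF: (count / doc length) * log(N / document frequency).
    bags = [bag_of_words(d) for d in docs]
    n_docs = len(bags)

    # Document frequency: in how many documents does each term occur?
    df = Counter()
    for bag in bags:
        df.update(bag.keys())

    scores = []
    for bag in bags:
        total = sum(bag.values())
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in bag.items()
        })
    return scores


# Toy corpus (hypothetical data for illustration only).
docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "cats and dogs",
]

bow = bag_of_words(docs[0])
weights = tf_idf(docs)

print(bow)
print(weights[0])
```

In the first document, "the" occurs twice but also appears in another document, while "cat" occurs once and appears nowhere else, so "cat" receives the higher TF-IDF weight even though its raw count is lower.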
4. Feature Extraction
Extracting meaningful features from the text data that can be used for
various NLP tasks.
N-grams: Capturing sequences of N words to preserve some context
and word order.
Syntactic Features: Using parts of speech tags, syntactic
dependencies, and parse trees.
Semantic Features: Leveraging word embeddings and other
representations to capture word meaning and context.
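As a small illustration of the N-gram feature described above, the sketch below slides a window of N tokens over a sentence. The function name ngrams and the sample sentence are assumptions for this example, and tokenization is simple whitespace splitting.

```python
def ngrams(tokens, n):
    # Return every run of n consecutive tokens as a tuple.
    # If n is larger than the number of tokens, the result is empty.
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


# Hypothetical sample sentence, split on whitespace.
tokens = "natural language processing is fun".split()

bigrams = ngrams(tokens, 2)
trigrams = ngrams(tokens, 3)

print(bigrams)
print(trigrams)
```

Unlike a bag of words, the bigram ("natural", "language") keeps the two words in order, which is how N-grams preserve some local context.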