Project Report
on
SENTIMENT ANALYSIS OF TWITTER DATA SET
Bachelor of Technology
in
Computer Science and Engineering
By
BHARAT SINGH 1709710035
CHOUDHARY RISHAB KUMAR 1709710038
PRASHANT RAJ 1709710080
GALGOTIAS COLLEGE OF ENGINEERING & TECHNOLOGY
GREATER NOIDA, UTTAR PRADESH, INDIA - 201306
CERTIFICATE
ACKNOWLEDGEMENT
We have put great effort into this project. However, it would not have been possible
without the kind support and help of many individuals and organizations. We would
like to extend our sincere thanks to all of them.
We are highly indebted to DR. RITESH SRIVASTAVA for his guidance and
constant supervision, for providing the necessary information regarding the
project, and for his support in completing it.
We also express gratitude towards our parents for their kind co-operation and
encouragement, which helped us in the completion of this project. Our thanks and
appreciation also go to our friends who helped in developing the project and to the
people who willingly helped us with their abilities.
BHARAT SINGH
PRASHANT RAJ
ABSTRACT
This project aims to present an overview of sentiment analysis. Sentiment
analysis works in the background of many industries. Today, industries have
become more considerate of their customers' demands, likes, dislikes, opinions and
responses. The bright minds of the consumer market have realized the power of
opinions. Producers have learned to respect the needs and ideas of their
customers, as they know that in this age of neck-to-neck competition, their customers
have the power to make or break them. Keeping an eye on customer responses
has become a crucial part of the production system. Sentiment analysis has emerged as
a powerful tool to make this process easier and much faster. Sentiment analysis
involves collection of data, pre-processing and polarity detection.
TABLE OF CONTENTS
Title Page
CERTIFICATE ii
ACKNOWLEDGEMENT iii
ABSTRACT iv
CONTENTS v
LIST OF TABLES vi
LIST OF FIGURES vii
LIST OF ABBREVIATIONS viii
CHAPTER 1: INTRODUCTION 1
CHAPTER 2: LITERATURE REVIEW 5
CHAPTER 3: PROBLEM FORMULATION 10
3.1 IE Difficulties 10
3.2 IE Techniques 11
3.3 Representation Models 12
CHAPTER 4: PROPOSED WORK 13
4.1 Data Set Description 13
4.2 Model Components 13
CHAPTER 5: SYSTEM DESIGN 17
CHAPTER 6: IMPLEMENTATION 18
CHAPTER 7: RESULT ANALYSIS 29
CHAPTER 8: CONCLUSION, LIMITATIONS AND FUTURE SCOPE 31
LIST OF TABLES
LIST OF FIGURES
LIST OF ABBREVIATIONS
CHAPTER-1
INTRODUCTION
Given a large set of data, e.g., a database of movie reviews in our case, the goal of
Sentiment Analysis is to determine the emotional tone of each review and classify it
as positive or negative. Since the dataset is huge, comprising over ten thousand
review files, we want to automate the process of sentiment analysis using machine
learning and natural language processing techniques. With the explosion of data in
recent years, a lot of information is available to us that can be used to improve
business strategies, help research and also solve social problems. Organizations can
use sentiment analysis to learn their customers' reactions to their products and
services.
The ability to extract insights from social data such as Twitter, Facebook and
Instagram is a practice widely adopted by organisations all over the world. From our
research on past work in sentiment analysis, a number of methods have already been
used successfully for this problem; these are discussed in more detail in the next
chapter. We have chosen the Random Forest classifier for classification of the text.
One of its main advantages is that it handles high-dimensional spaces as well as
large numbers of training examples very well, and it mitigates the overfitting that is
prevalent in such cases.
Polarity: the attribute that describes whether the text expresses a positive or a
negative opinion.
Subject: the attribute that describes the topic being talked about.
Opinion holder: the attribute that describes the person, object or entity
expressing the opinion in the extracted text.
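These three attributes can be pictured as one record per analysed opinion. A minimal sketch using a Python dataclass follows; the class and field names (`Opinion`, `polarity`, `subject`, `holder`) and the sample values are illustrative, not part of any specific library:

```python
from dataclasses import dataclass

# One analysed opinion with the three attributes described above.
@dataclass
class Opinion:
    text: str      # the extracted expression
    polarity: str  # "positive" or "negative"
    subject: str   # the topic being talked about
    holder: str    # the person or entity expressing the opinion

review = Opinion(
    text="The battery life is fantastic",
    polarity="positive",
    subject="battery life",
    holder="@some_user",
)
print(review.polarity)  # positive
```

A real system would fill these fields automatically from the classifier and extraction steps described later.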
In the current scenario, sentiment analysis is one of the topics attracting high
interest and development, because it has many practical real-life applications. As
information constantly grows, whether publicly or privately available, a large number
of opinionated text expressions are readily available in review sites, blogs, forums,
and social media.
With the help of sentiment analysis techniques and systems, unstructured
information/text can automatically be converted into structured data on public
opinion about products, politics, services, brands, or any other topic that people
express opinions about. This structured and classified data can be useful for
different commercial applications such as public relations, marketing analysis,
product reviews, product feedback, net promoter scoring, and customer service.
Any industry's success depends primarily on its customers. Social media has come
as a boon for customers, and customer responses have become more and more
important. Social media provides a platform where every opinion matters. People can
endorse their favourites and voice criticism more openly. With people's voices louder
than ever, it is very important for a brand to maintain a good reputation in the public
eye. This is the point at which sentiment analysis comes into action.
Sentiment analysis, also known as opinion mining or emotion AI, has always been an
intriguing topic in the world of computer science. Much like any other fashionable
technology in computer science, sentiment analysis still remains a widely bandied
yet misunderstood term. Breaking the term into two words, we get 'sentiment',
meaning emotions, and 'analysis', meaning detailed examination.
In sentiment analysis, natural language processing, text analysis, computational
linguistics, and biometrics are used to systematically identify, extract, quantify, and
study affective states and subjective information.
The best businesses are those that completely understand the sentiments of their
customers. Sentiment Analysis is the field of understanding the emotions in
available text/information, and the sentiment analysis tool is a must-understand tool
for modern workplace business leaders and developers.
Along with many other fields, advances such as Deep Learning have made
Sentiment Analysis one of the cutting-edge applications of modern algorithms. In
the current scenario, we use natural language processing techniques, statistical
theory, and text analysis for the extraction and identification of the sentiment of
text in three main categories: neutral, positive, or negative.
SENTIMENT ANALYSIS FOR CUSTOMER SERVICE
Customer service agents often use sentiment analysis to sort incoming user email
into "urgent" or "not very urgent" buckets on the basis of the email's sentiment,
proactively identifying frustrated users. The agent then directs their time toward
resolving the most urgent cases first. As customer service becomes more and more
automated through Machine Learning, understanding the sentiment of a given case
becomes increasingly important.
A lot of such programs are already up and running. Bing recently incorporated
sentiment evaluation into its Multi-Perspective Answers product. Hedge funds are
almost certainly using the technology to predict price fluctuations based on public
sentiment. And businesses like CallMiner provide sentiment evaluation for client
interactions as a service.
CHAPTER 2
LITERATURE REVIEW
ABSTRACT
In the current scenario, some people spread hate through social media platforms
like Twitter, Facebook, etc. Most of the people doing this are highly followed
ministers, celebrities or other famous persons. The problem is even bigger when we
look at the number of followers these people have and the increasing scale of the
internet. These trends can develop negative impacts in our society and wrong
tendencies in our youth, and may sometimes even push someone toward illegal
steps. Due to the reserved nature and busy schedules of people, it is becoming
extremely difficult to interact with peers and family members. Therefore, social
media platforms are considered the most used platforms for getting news and
general awareness as well. People feel free to share their political views and
feelings over social media such as Twitter and Facebook with friends and family
members via services such as messaging. Therefore, analysing the tweets shared by
a famous person can help a lot in making a change in society. The aim of this task
is to detect the percentage of hate speech (as well as the percentages of neutral and
positive speech) in tweets. In simpler terms, in this project we consider a tweet
negative if it contains hate speech, i.e. a sexist or racist sentiment is associated with
it that would create a bad impact on society and especially on readers. Hence, the
task is to classify sexist, racist, happy or sad tweets against other tweets. This paper
is divided into six sections: first the introduction, second the literature survey, third
the proposed methodology, fourth the result analysis, fifth the future scope and
applications, and finally sixth the conclusion.
INTRODUCTION:
Most data-mining research based on sentiment analysis assumes that the
information to be "mined" is already present in relational database form. But the
real scenario is a little different. The data that we get from many different
applications, and mainly the social media data that we extract using different tools
(for Twitter, we use the Twitter API for extraction), is obviously not structured.
Mostly, the data takes the form of unstructured text instead of structured databases.
As a result, text mining involves different phases. For example, the phase of
discovering useful knowledge from unstructured information present in the
extracted text is becoming an increasingly important aspect of Knowledge
Discovery. Most work in text mining and analysis does not employ any form of
natural-language processing (NLP), treating extracted information as an unordered
"bag of words/sentences", which is typical in information retrieval.
The standard vector space model of text represents a document as a sparse vector
that specifies a weighted frequency for every distinct word or token that appears in
the corpus. Such a simplified representation of text has been shown to be quite
effective for a number of standard tasks such as document retrieval, classification,
and clustering. However, most of the knowledge that might be mined from text
cannot be discovered using a simple bag-of-words representation.
The entities referenced in a document, and the properties and relationships asserted
about and between those entities, cannot be determined using a standard vector-space
representation. Although full natural-language understanding is still far from the
capabilities of current technology, existing methods in information extraction (IE)
are able, with reasonable accuracy, to recognize several types of entities in text and
identify some of the relationships asserted between them. Therefore, information
extraction can serve as an important enabling technology for text mining and
analysis. If the knowledge to be discovered is expressed directly in the information
to be mined, Information Extraction alone could serve as an effective approach to
text/speech analysis. Nevertheless, if the extracted information contains data in
unstructured form rather than abstract knowledge, it may be helpful to first use an
Information Extraction process to transform the unstructured corpus into a
structured database designed according to our requirements, and then use traditional
data-mining tools to identify abstract patterns in the extracted data.
There are two approaches to text mining for sentiment analysis with information
extraction, and we use one of our own research projects to illustrate each approach.
First, we introduce the basics of information extraction. Then, we discuss using IE
to directly extract knowledge from text. Finally, we discuss finding knowledge by
mining data that was extracted in the first step from unstructured or semi-structured
text.
BRIEF LITERATURE SURVEY:
Several research studies have been done in the area of text mining, and nowadays
data mining is becoming one of the emerging technologies. Based on a study of
some papers, we have compiled our own literature survey as given below.
Table 2.1: Brief Literature Survey

Research Paper 2: "Enhancing Predictive Power of Cluster-Boosted Regression with Text Based Indexing" [2]
Authors: Mark Chignell, Nipon Charoenkitkarn, Jonathan H. Chan, Wutthipong Kongburan.
Advantages: the model implements the KNN algorithm; it analyzes Electronic Health Records to improve efficiency; it also examines whether textual features can be used to improve the accuracy of ICU mortality prediction.
Disadvantages: very complex to implement, because even a single mistake could harm someone's health.

Research Paper 3: "Financial Latent Dirichlet Allocation (FinLDA): Feature Extraction in Text and Data Mining for Financial Time Series Prediction" [5]
Authors: Nout Kaunungsukkasem, Teerapong Leelanupab.
Advantages: in this model, technical and fundamental analyses are used by investors to predict financial time evolution, such as stock prices.
Disadvantages: the model is not fast and takes much time to predict.

Research Paper 4: "Multistage Gene Normalization and SVM-Based Ranking for Protein Interactor Extraction in Full-Text Articles" [6]
Authors: Hong-Jie Dai, Po-Ting Lai, and Richard Tzong-Han Tsai.
Advantages: using the multistage GN algorithm, system performance improved by 1.719 percent compared to a one-stage GN algorithm; experimental results also show that with full text, versus abstract only, INT AUC performance was 22.6 percent higher.
Disadvantages: the model can only be used for multistage gene normalization for protein interactor extraction.
CONCLUSION:
Based on all four research papers, we gathered a lot of information about various
models. Many models implement algorithms such as the Support Vector Machine
algorithm, the k-Nearest Neighbours algorithm, the term frequency-inverse document
frequency model, data mining concepts, etc. Data mining gives insight behind
various decisions, and our model is one such application. One of the basic
understandings we gained is that if we want our model to work effectively, we need
a good volume of training data. We implemented NLP techniques, which are a
subset of data mining techniques. For this we used Python as the programming
language and employed several open-source libraries such as Tkinter and TextBlob.
TextBlob ships with pre-trained data for different sentiments.
CHAPTER 3
PROBLEM FORMULATION
The ambiguity and complexity present in human language make it hugely difficult
for a computer to successfully understand human language. Apart from large and
complex grammars, there are always issues when a computer predicts the meaning
of language used freely by people who do not adhere to rules. Abbreviations, wrong
spellings, idioms, and slang are just a few of the problems, and there is also vast
variation in people's tone. In social media, however, the emoji used in the current
scenario express the sentiment of the message up to some extent. Below, we
summarize the issues faced during the process and different methods to deal with
them.
3.1 IE DIFFICULTIES:
Usually, text documents contain many words that are not necessary for
understanding the general idea of the text. These frequently used words, which do
not have a great impact on the sentiment, such as 'of', 'a', and 'the', are called stop
words and can be directly ignored in many situations. This is the approach fields
like IR take to reduce the dimensionality of their term spaces and improve
performance. It is less useful in the field of text analysis, however, because some of
these words can help clarify semantics and lend information. For instance, consider
the statement "He was promoted", which contains two potential stop words, "he"
and "was". Without "was" we interpret an entirely different meaning, "He
promoted", and the same would be the case if "he" were not present in the sentence.
In such a scenario, stemming (or lemmatization) comes into action; it is also
generally used in information reduction (technically, reduction of the dimensionality
of the text). To group similar words together, we reduce words to their stem, or root
form. For example, "walking", "walk", "walked", and "walker" will all be reduced
to the root word "walk". Although it can be argued that, like stop-word lists, this
can be harmful to the semantics of the text, it is still an easy way to reduce the
dimensionality of the information. To eliminate noisy data, we can use spelling
correctors and acronym and abbreviation expanders, which generally require a
thesaurus or dictionary. We must also try to deal with a larger issue in NLP as a
whole, namely ambiguity. For example, lexically ambiguous words are difficult for
an algorithm to identify correctly.
There are two types of ambiguity for such words which are lexically ambiguous:
a) Homonymy
b) Polysemy
For these two types of ambiguity, the main difference lies in how the information is
represented in a person's mental lexicon. Homonymy (different words that happen
to have the same sound) is not a problem when performing sentiment analysis of
text, since the correct word is present in the information. But polysemy means that
one word carries many meanings and senses. Take, for instance, the word "lean",
which has different meanings in different parts of speech (or senses). As an
adjective, "lean" means "lacking or deficient in flesh", "containing little or
absolutely no fat", "lacking in productiveness, sufficiency, or richness", "containing
little valuable mineral", or "of fuel mixtures: low in combustible component". As a
verb, it means "to bend, incline, or deviate from a vertical position", "to cast one's
weight on another for support", "to incline in opinion, taste, or desire", or "to rely
on someone for inspiration or support".
The word sense disambiguation (WSD) problem deals with finding the most probable
sense of a polysemous word (a word with multiple meanings). We can approach this
problem by considering the context in which the word occurs, to, for instance,
determine if a word is a noun or verb.
Tagging involves the labelling of words in a corpus with part of speech (PoS) tags or
XML mark-up. PoS tags label syntactic categories like nouns, verbs, and adjectives in
order to identify syntactic structures like noun phrases or verb phrases.
A collocation is a sequence of words that are commonly used together but mean
different things if separated; an example is "light rain". In general, we treat the
collocation as a single unit rather than splitting it into separate words, because we
might lose the correct meaning of the text if we consider the words only separately.
For instance, consider the sentence "the person was playing 'Holi' and enjoyed a
lot." If we look only at the individual meanings of the words in this sentence, it
would appear that 'Holi' is a game, which it is not. These kinds of considerations
are also kept in mind while performing information reduction.
Finally, we perform tokenization, a question we would face at some point during
the above process: do we want to split the text into units of sentences, phrases,
paragraphs, sequences of a particular length, or single words? To support the
splitting, we can take advantage of delimiters present in the text, such as:
a) Spaces
b) Tabs
c) Punctuation marks
d) Certain stop words.
We can even use a method like N-gram to find the most frequent phrases and words
in the extracted and processed text.
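Delimiter-based splitting can be sketched with Python's standard `re` module. This is a simplified scheme that treats whitespace and punctuation marks as delimiters; a real tokenizer would handle contractions, hashtags and emoticons more carefully:

```python
import re
from collections import Counter

def tokenize(text):
    """Split text into lower-case word tokens, using spaces, tabs and
    punctuation marks as delimiters (a deliberately simple scheme)."""
    return [t for t in re.split(r"[\s\.,;:!\?\"'()]+", text.lower()) if t]

tokens = tokenize("He was promoted, and everyone cheered!")
print(tokens)  # ['he', 'was', 'promoted', 'and', 'everyone', 'cheered']

# The N-gram idea with n = 1: count the most frequent tokens.
print(Counter(tokens).most_common(2))
```

Certain stop words could also be added to the split pattern, at the cost of the semantic risks discussed above.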
3.2 IE TECHNIQUES:
Most IE systems learn their parameters from human-annotated (supervised) training
data, applying them to tasks such as learning patterns or inductive-logic rules that
match the beginning or ending of the phrases to be extracted. We can also take
advantage of standard feature-based classifiers, which predict the label of every
token based on the token and its surrounding context. By representing the context
using a set of features that includes the one or two tokens on either side of the
target token and the previously extracted labels, we can generalize the sequence-
labelling problem to decision trees, boosting, support-vector machines (SVMs),
memory-based learning (MBL), transformation-based learning (TBL), maximum
entropy (MaxEnt), and many others.
Using the Twitter API, we can collect a Twitter dataset for sentiment analysis.
There are many types of APIs and tools that can be used to crawl and collect data:
a) Twitter’s Firehose
b) Twitter’s Search API
c) Twitter’s Streaming API
d) NodeXL
3.3 REPRESENTATION MODELS:
The most commonly used representation of extracted text for sentiment analysis is
the Vector Space Model. The extracted and processed text is described by a vector
whose dimensions correspond to distinct text features and whose entries are a
function of the frequencies with which those features appear.
Things like the order of and relations between words are totally ignored, which is
why this model is also called the "bag-of-words" (BOW) model. Most other
representations are extensions of the BOW model. Some focus on phrases instead
of single words, some give importance to the semantics of and relations between
words, and others take advantage of the hierarchical nature of the text.
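The vector-space/BOW representation can be sketched in a few lines of standard-library Python. The vocabulary and document here are invented for illustration; real pipelines build the vocabulary from the whole corpus:

```python
from collections import Counter

def bow_vector(document, vocabulary):
    """Represent a document as term-frequency entries over a fixed
    vocabulary; word order and relations are deliberately ignored."""
    counts = Counter(w.strip(".,!?") for w in document.lower().split())
    return [counts[word] for word in vocabulary]

vocab = ["movie", "great", "bad", "acting"]
print(bow_vector("Great movie, great acting", vocab))  # [1, 2, 0, 1]
```

Note how the vector loses all ordering information: "great movie" and "movie great" produce the same vector, which is exactly the limitation the text above describes.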
CHAPTER 4
PROPOSED WORK
The proposed methodology helps to save the lives of people who may get disturbed
by negative tweets, some of whom might take serious steps after reading them. As a
further application of the analysis, it is possible to analyse the neutral, positive, or
negative tweets of a certain Twitter account or from a Twitter data set.
PROPOSED METHODOLOGY:
The aim is to extract information from the tweets of a user account on Twitter
(generally a famous person with a sufficient number of followers) and use it for
sentiment analysis, for different purposes, to find the percentage of neutral,
positive, and negative tweets the person is posting on social media. The aim of this
task is to detect the percentage of hate speech (as well as the percentages of neutral
and positive speech) in tweets. In simpler terms, in this project we consider a tweet
negative if it contains hate speech, i.e. a sexist or racist sentiment is associated with
it that would create a bad impact on society and especially on readers. Hence, the
task is to classify sexist, racist, happy or sad tweets against other tweets. The model
also includes the analysis of emoticons in order to completely parse the statements:
the aim is to extract information from the text messages of the user and use it for
different purposes such as sentiment analysis.
In this component, the data is assigned a sentiment, such as positive or negative,
and the extent of it, by performing data pre-processing and applying the Support
Vector Machine algorithm.
• TEXT PRE-PROCESSING:
The processes involved in text pre-processing are:
Tokenization: Each tweet of the account is divided into meaningful words, known
as tokens. Example: "Morning walk is a bliss" is converted to "Morning", "walk",
"is", "a", "bliss".
Data standardization: It involves converting all words in the message into a
standard form, e.g. converting all words to lower case [10]. Example: "The Market
is near Puneet's House" is converted to "the market is near puneet's house".
Emoji conversion: The emoticons present in the text messages are assigned a
keyword based on the expression they convey. There are two types of emoticons,
classified as follows:
Positive emoticons: emoticons which convey a positive sentiment; they are
replaced by positive words based on the symbol.
Negative emoticons: These emoticons reflect the sad or disturbed sentiments of the
subject and are thus replaced by negative words.
Stop-word removal: All the words in the message which do not convey a special
meaning, like "a", "the", "then", etc., are removed.
Stemming: The process of obtaining the root word corresponding to every word by
dropping suffixes like -ion, -ing, etc.
Abbreviation analysis: Replacing the abbreviations present in the message by their
full forms, e.g. FB by Facebook, GM by good morning, etc.
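The pre-processing steps above can be sketched as one small pipeline. This is a minimal stdlib sketch: the stop-word set, abbreviation map and suffix list are illustrative stand-ins, and the suffix-stripping "stemmer" is naive, not a real stemmer such as Porter's:

```python
STOP_WORDS = {"a", "an", "the", "is", "then"}
ABBREVIATIONS = {"fb": "facebook", "gm": "good morning"}
SUFFIXES = ("ing", "ion", "ed", "s")

def preprocess(message):
    tokens = message.lower().split()                      # standardization
    tokens = [ABBREVIATIONS.get(t, t) for t in tokens]    # abbreviation analysis
    tokens = [t for t in tokens if t not in STOP_WORDS]   # stop-word removal
    stemmed = []
    for t in tokens:                                      # naive suffix stripping
        for suf in SUFFIXES:
            if t.endswith(suf) and len(t) > len(suf) + 2:
                t = t[: -len(suf)]
                break
        stemmed.append(t)
    return stemmed

print(preprocess("Walking is a bliss"))  # ['walk', 'blis']
```

The output shows both the benefit (walking becomes walk) and the risk noted earlier: crude stripping can damage words (bliss becomes blis), which is why real systems use dictionary-backed stemmers or lemmatizers.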
• N-gram
The next step after data pre-processing is N-gram feature extraction. An N-gram is
a series of n tokens, and the N-gram model is very widely used in NLP tasks [3].
The model creates N-grams from the messages in the data set to extract keyword
features. For n = 3, a sequence of three words is generated for each message. The
N-gram process increases the efficiency and accuracy of the classification step
because features are extracted from three-token sequences. Example: "What is your
name" is analysed as "what is your", "is your name".
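The trigram generation just described (n = 3) can be sketched with a sliding window over the token list:

```python
def ngrams(text, n=3):
    """Return all sequences of n consecutive tokens from text."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

print(ngrams("What is your name"))  # ['what is your', 'is your name']
```

The same function gives unigrams or bigrams by changing `n`, so one implementation serves all the N-gram features the model needs.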
• Term Frequency
The number of times a token occurs in each data sample is called its term frequency.
Words which are present in high frequency are considered to have a better
relationship with the sample.
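Term frequency as described here is a simple count per sample, which the standard library's `collections.Counter` computes directly; the sample sentence is invented for illustration:

```python
from collections import Counter

def term_frequencies(tokens):
    """Count how many times each token occurs in one data sample."""
    return Counter(tokens)

tf = term_frequencies("good movie good story weak ending".split())
print(tf["good"])        # 2
print(tf.most_common(1))  # [('good', 2)]
```

In a full TF-IDF scheme these raw counts would then be weighted down for tokens that appear in many samples.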
• KNN Algorithm
The output obtained from the Support Vector Machine algorithm is clusters of two
sentiments with class labels "normal" and "critical". Based on this output, the
k-Nearest Neighbour algorithm is applied in order to deduce the overall sentiments
of the subject. The input for the k-Nearest Neighbour algorithm is the sentiments
associated with all the chats the subject is involved in. The final step is to predict
the sentiment of the person based on the collected feature set. The data is divided
into two sets, i.e. training and testing sets, and the k-Nearest Neighbour algorithm
is used to predict the sentiment of the processed text. k-Nearest Neighbour is a
method for classifying data based on the nearest training instances in the feature
space: the class label assigned is the majority class of the nearest k instances in the
training set. k-Nearest Neighbour is a type of lazy-learner strategy, and is
considered a flexible and simple classification technique based on machine learning
concepts.
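The k-Nearest Neighbour step can be sketched in pure Python over toy two-dimensional feature vectors. The feature values and the (positive-word count, negative-word count) encoding are invented for illustration; a real system would use the feature set described above:

```python
from collections import Counter
import math

def knn_predict(train, query, k=3):
    """train: list of (feature_vector, label) pairs. Assign the
    majority label among the k nearest training points, using
    Euclidean distance (the lazy-learner strategy: no training
    phase, all work happens at query time)."""
    dists = sorted((math.dist(vec, query), label) for vec, label in train)
    top = [label for _, label in dists[:k]]
    return Counter(top).most_common(1)[0][0]

# toy features: (positive-word count, negative-word count)
train = [((3, 0), "normal"), ((2, 1), "normal"),
         ((0, 4), "critical"), ((1, 3), "critical")]
print(knn_predict(train, (0, 3)))  # critical
```

`math.dist` requires Python 3.8+; for older versions the distance can be computed with `math.sqrt(sum((a - b) ** 2 for a, b in zip(vec, query)))`.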
CHAPTER 5
SYSTEM DESIGN
Fig. 5.1 A simple architecture of the recommender system
The block diagram of the system, shown below, describes the flow of the working
model of the system.
Fig. 5.2 Block diagram of process
Fig. 5.3 Classification of Sentiment Analysis
Fig. 5.4 System Flow Diagram Sentiment Analysis using Twitter API
CHAPTER 6
IMPLEMENTATION
The process of implementing information extraction using a data mining system
can be summarized in the following observations.
1) As we can see from this brief survey, the fields of text mining and information
extraction are rich with proven techniques and promising results.
2) They also offer new directions and the hope of leaps in efficiency, accuracy, and
usability. This trend will only strengthen as more and more knowledge floods the
Internet and users all over the globe strive for efficient extraction and
interpretation of information.
3) Information extraction using mining also helps to understand someone's
emotions, which might be used further.
4) Getting meaningful information from someone's text messages and analysing it
further gives an insight into that person's personality.
5) People suffering from mental disorders often hesitate to tell someone about their
problem, but our model helps them find a way out of mental trauma.
6) Due to today's busy lifestyles, many people do not even take care of themselves;
based on the information extracted from their messages, the model could suggest
how to bring positive change into their lifestyle.
7) Most IE systems are developed by training on human-annotated corpora.
However, constructing corpora sufficient for training accurate IE systems is a
burdensome chore.
8) The data access and integration service provides a web service which interacts
with the data sources. The search service communicates the user-entered search
criteria to the server.
9) The strict security-enforcement layers guard the resources and authorize access
based on user roles. The remaining users are used as training data (i.e., the set of
users to which the test users are compared for recommendations). The aim in
testing is to correctly recommend the withheld items from the test users' usage
patterns.
Fig. 6.1 System UI
Packages: There are a lot of Python packages available that can be used for
sentiment analysis. Some of the available packages are NumPy, Pandas, matplotlib,
wordcloud, etc. We can directly import these packages in our code while
implementing.
Some packages that are used in Sentiment analysis:
NumPy
Pandas
Wordcloud
TextBlob
Matplotlib
TensorFlow
Data Set Description: The data is obtained by extracting all the text messages sent
by the subject. This can be taken from Twitter: all the tweets posted on Twitter are
stored in a database, and we can analyze the sentiments there. The data set will
contain tweets in text format and emojis; data in any other format cannot be
analyzed.
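Tweets collected through the Twitter API are typically returned as JSON. A minimal stdlib sketch of loading such a response and pulling out the text fields follows; the sample payload and its field names are invented to mimic the general shape of a response, not copied from the API:

```python
import json

# A tiny invented sample mimicking the shape of a tweet listing;
# real API responses carry many more fields per tweet.
payload = '''[
  {"id": 1, "text": "Loving the new update!", "lang": "en"},
  {"id": 2, "text": "Worst service ever", "lang": "en"}
]'''

tweets = json.loads(payload)
texts = [t["text"] for t in tweets]
print(texts)  # ['Loving the new update!', 'Worst service ever']
```

In practice a client library would fetch the payload over HTTP with authentication; only the text list feeds into the cleaning and classification steps below.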
Text Cleaning: As the initial step of sentiment analysis, cleaning techniques are
applied to the data in order to reduce the dimensionality of and the noise in the text,
which also assists in improving the effectiveness of classification.
Classification of Sentiments: There are different approaches by which sentiment
analysis can be performed:
Machine Learning
Lexicon-Based
Hybrid
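Of these, the lexicon-based approach is the simplest to sketch: score a tweet by counting hits against hand-made positive and negative word lists. The word lists here are illustrative, far smaller than any real lexicon:

```python
POSITIVE = {"good", "great", "love", "happy", "bliss"}
NEGATIVE = {"bad", "hate", "sad", "worst", "angry"}

def classify(tweet):
    """Return 'positive', 'negative' or 'neutral' from word counts."""
    words = tweet.lower().split()
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

print(classify("I love this great phone"))  # positive
print(classify("worst day ever"))           # negative
```

The machine learning and hybrid approaches replace the fixed lists with models trained on labelled data (as the SVM and KNN steps described earlier do).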
CHAPTER 7
RESULT ANALYSIS
The result obtained from the proposed model is an estimated sentiment
prediction for the subject, based on the tweets posted from the user's account.
The output of the sentiment analysis can be used in many scenarios, such as
better marketing, brand management, and positive politics. The estimated
percentages of positive, negative, and neutral sentiment can flag "critical"
sentiment among the peers and society members of the subject, who can then act
accordingly. For example, the stress level and prevalence of mental disorders
in a community can be estimated, and in case of "critical" sentiment the
authorities can take action to discourage those who post content that destroys
the harmony and peace of mind of the subject or reader. In this way, sentiment
analysis models are a basic requirement for shaping society into a better
place.
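The percentage figures described above follow directly from the per-tweet labels; a standard-library sketch (the label names are assumed to match those used in this report):

```python
from collections import Counter

def sentiment_percentages(labels):
    """Return the share of each sentiment label as a percentage of all tweets."""
    counts = Counter(labels)
    total = len(labels)
    return {label: 100 * n / total for label, n in counts.items()}

labels = ["Positive", "Negative", "Positive", "Neutral", "Positive"]
print(sentiment_percentages(labels))
# {'Positive': 60.0, 'Negative': 20.0, 'Neutral': 20.0}
```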
Fig 7.2: Result Analysis of Narendra Modi's Twitter Account
CHAPTER 8
CONCLUSION, LIMITATIONS AND FUTURE SCOPE
8.1 CONCLUSION
The proposed model takes as input a data set created by accumulating all the
tweets posted by the user; the same approach can also be applied to messages
from other social media platforms such as Facebook and WhatsApp. These tweets
are pre-processed to obtain the key words from the data set. After
pre-processing we use probabilistic language models such as N-grams.
Associating weights with the data set using TF-IDF increases the overall
efficiency of the classifying algorithms. The next step is to use the
classifying algorithms to label the sentiment of each tweet as "positive",
"negative" or "neutral". [6] First a supervised algorithm, Support Vector
Machine, is used, as it proves to be highly efficient for such computations;
then the KNN algorithm is applied, which further increases the efficiency.
Thus, we propose a highly efficient method [7] of finding the sentiment of a
person by analysing text messages as well as emoticons. Emoticons [12] are very
common tokens in modern text messages, so we must also handle them
efficiently; we convert emoticons to textual form for our computations. This
makes the model a valuable tool in the modern world.
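The TF-IDF weighting mentioned above can be sketched in plain Python. This is a minimal illustration using the standard tf × idf formula with a smoothed logarithm; library implementations (e.g. scikit-learn's) differ in normalisation details:

```python
import math

def tf_idf(term, doc, corpus):
    """Weight of `term` in `doc`, given a corpus as a list of token lists."""
    tf = doc.count(term) / len(doc)              # term frequency in this document
    df = sum(1 for d in corpus if term in d)     # how many documents contain it
    idf = math.log(len(corpus) / (1 + df)) + 1   # smoothed inverse document frequency
    return tf * idf

corpus = [["good", "phone"], ["bad", "phone"], ["good", "good", "service"]]
print(round(tf_idf("good", corpus[2], corpus), 3))  # 0.667
```

Rare but locally frequent words receive high weights, which is why this scheme improves the classifiers downstream.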
8.2 LIMITATION
In practice, many commercial recommender systems are based on large datasets
extracted from different social media accounts. As a result, the user-item matrix is
used for collaborative filtering which could be very large and sparse, and it brings
some of the challenges in the performances of the recommendation system
performing sentiment analysis. Some of the problems are defined below such as the
problem caused by the sparsity of data which is the cold start problem. As we all
know that the human language always consist of some ambiguity and complexity
which is a giant hindrance to a successful understanding of a computer. Even with
complex and large grammars, there is always some issue with the way of people using
31
language freely. Some of the common issue which people perform now-a-days are
mentioned below:
a) Mostly, people do not adhere to rules of grammar while writing tweets.
b) Misspellings
c) Abbreviations
d) Use of slang
These are only some of the issue that we have mentioned. Apart from this also, there
are number of issue with the language used on the social media.
We can summarise some of these issues and the methods used to deal with them.
Text documents usually contain many words that are not necessary for
understanding the general idea of the text. High-frequency words such as:
a) 'a'
b) 'the'
c) 'of'
are generally known as stop words and can be ignored in many cases. In fields
like Information Retrieval this approach is used to reduce the dimensionality
of the term space and hence improve performance. It is less useful in text
mining, however, because these words can often carry information and clarify
semantics. For example, the statement "She got present" contains two potential
stop words, "She" and "got"; without "got" we interpret an entirely different
meaning, "She present". There is also a risk of losing the correct meaning of
the text if we consider the words only in isolation. For instance, in the
sentence "The person was playing 'Holi' and enjoyed a lot.", looking at the
individual words alone suggests that 'Holi' is a game, which it is not. Such
cases are considered limitations of sentiment analysis.
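A minimal stop-word filter over a small hand-picked list illustrates the idea (the list here is an assumed toy example; real systems use much longer lists such as NLTK's):

```python
STOP_WORDS = {"a", "an", "the", "of", "is", "in", "to"}  # tiny illustrative list

def remove_stop_words(text):
    """Drop high-frequency function words before indexing or mining."""
    return ' '.join(w for w in text.lower().split() if w not in STOP_WORDS)

print(remove_stop_words("The power of the opinions of a customer"))
# power opinions customer
```

As noted above, dropping a word like "got" from "She got present" changes the meaning, so such filtering must be applied with care in sentiment analysis.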
Stemming (or lemmatization) is also commonly used in IR. To group similar words
into one, we reduce words to their stem, or root form. For example, “walking”,
“walk”, “walked”, and “walker” will all be reduced to the root word “walk”.
To eliminate noisy data, we can use spelling correctors and acronym and abbreviation
expanders. These usually require a dictionary or thesaurus.
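Both ideas can be illustrated with a toy sketch: a naive suffix-stripping stemmer (far cruder than the Porter algorithm used in practice) and a dictionary-based abbreviation expander whose table is an assumption made for the example:

```python
ABBREVIATIONS = {"gr8": "great", "u": "you", "pls": "please"}  # assumed examples

def naive_stem(word):
    """Strip common inflectional suffixes (toy rule set, not Porter stemming)."""
    for suffix in ("ing", "ed", "er", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[:-len(suffix)]
    return word

def expand(text):
    """Replace known abbreviations with their full forms."""
    return ' '.join(ABBREVIATIONS.get(w, w) for w in text.split())

print([naive_stem(w) for w in ["walking", "walked", "walker", "walk"]])
# ['walk', 'walk', 'walk', 'walk']
print(expand("u did gr8"))  # you did great
```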
8.3 FUTURE SCOPE
The proposed model can be used wherever sentiment analysis is required, for
many different purposes such as critic reviews of hotels, [8] movies, videos,
etc. Sentiment analysis methods have so far been used to detect the polarity
of the thoughts and opinions of users of social media. Businesses are very
interested in understanding what people think and how they respond to the
products and services around them. Companies use sentiment analysis to
evaluate their advertisement campaigns and to improve their products, and aim
to apply such tools in the areas of customer feedback, marketing, CRM, and
e-commerce.
REFERENCES
[1] May, R. M. 1997. The Scientific Wealth of Nations, Science, vol. 275, no. 5301,
pp. 793-796.
[4] Fano, R. M. 1956. Information theory and the retrieval of recorded information,
in Documentation in Action, Shera, J. H. Kent, A. Perry, J. W. (Edts), New York:
Reinhold Publ. Co., pp.238–244.
[5] Small, H. 1973. Co-citation in the scientific literature: a new measure of the
relationship between two documents, Journal of the American Society for
Information Science, vol. 24, pp. 265–269
[7] Garfield, E. and Welljams-Dorof, A. 1992. Citation data: their use as
quantitative indicators for science and technology evaluation and policy-making,
Science & Public Policy, vol. 19, no. 5, pp. 321-327.
[8] G. Lin, H. Zhu, X. Kang, C. Fan, and E. Zhang, “Feature structure fusion and its
[9] J. He and N. Xiong, ‘‘An Effective Information Detection Method for Social big
data,’’ in Multimedia Tools Appl., vol. 77, no. 9, pp. 11277–11305, 2018.
[10] H. Si, Z. Chen, W. Zhang, J. Wan, J. Zhang, and N. N. Xiong, ‘‘A member
recognition approach for specific organizations based on relationships among users
in social networking Twitter,’’ Future Gener. Comput. Syst., vol. 92, pp. 1009–1020,
Mar. 2019.
[11] Positive and Negative Emoticons
(https://images.app.goo.gl/Pr3JvNdcbaVAiZB27)
APPENDIX
CODE:
# Required imports (assumed to be installed in the environment).
import re
import pandas as pd
import tweepy
from textblob import TextBlob
import matplotlib.pyplot as plt

# API credentials are read from a configuration table loaded elsewhere.
twitterApiKey = config['twitterApiKey'][0]
twitterApiSecret = config['twitterApiSecret'][0]
twitterApiAccessToken = config['twitterApiAccessToken'][0]
twitterApiAccessTokenSecret = config['twitterApiAccessTokenSecret'][0]
# Authenticate and fetch the 50 most recent tweets from the account's
# timeline (standard tweepy pattern).
auth = tweepy.OAuthHandler(twitterApiKey, twitterApiSecret)
auth.set_access_token(twitterApiAccessToken, twitterApiAccessTokenSecret)
twitterApi = tweepy.API(auth, wait_on_rate_limit=True)
tweets = tweepy.Cursor(twitterApi.user_timeline,
                       exclude_replies=True,
                       contributor_details=False,
                       include_entities=False
                       ).items(50)
df = pd.DataFrame(data=[tweet.text for tweet in tweets], columns=['Tweet'])
df.head()
def cleanUpTweet(txt):
    # Remove mentions: they start with '@' followed by a Twitter id.
    txt = re.sub(r'@[A-Za-z0-9_]+', '', txt)
    # Remove hashtag symbols: hashtags start with '#'.
    txt = re.sub(r'#', '', txt)
    # Remove retweet markers: retweeted text starts with 'RT'.
    txt = re.sub(r'RT[\s]+', '', txt)
    # Remove URLs: they start with http or https.
    txt = re.sub(r'https?:\/\/[A-Za-z0-9\.\/]+', '', txt)
    return txt
df['Tweet'] = df['Tweet'].apply(cleanUpTweet)
def getTextSubjectivity(txt):
    return TextBlob(txt).sentiment.subjectivity

def getTextPolarity(txt):
    return TextBlob(txt).sentiment.polarity
df['Subjectivity'] = df['Tweet'].apply(getTextSubjectivity)
df['Polarity'] = df['Tweet'].apply(getTextPolarity)
df.head(50)
df = df.drop(df[df['Tweet'] == ''].index)
df.head(50)
def getTextAnalysis(a):
    if a < 0:
        return "Negative"
    elif a == 0:
        return "Neutral"
    else:
        return "Positive"
df['Score'] = df['Polarity'].apply(getTextAnalysis)
df.head(50)
positive = df[df['Score'] == 'Positive']
# First part of the output: the percentage of positive tweets, printed
# numerically (the counts are also shown in the bar graph below).
print(str(positive.shape[0] / df.shape[0] * 100) + " % of positive tweets")
negative = df[df['Score'] == 'Negative']
# Percentage of negative tweets.
print(str(negative.shape[0] / df.shape[0] * 100) + " % of negative tweets")
objective = df[df['Subjectivity'] == 0]
# Percentage of objective tweets (subjectivity score of 0).
print(str(objective.shape[0] / df.shape[0] * 100) + " % of objective tweets")
labels = df.groupby('Score').count().index.values
values = df.groupby('Score').size().values
plt.bar(labels, values)
# add legend
plt.show()
# Of the many visualization options, this project uses a word cloud, which
# sizes each word by its frequency: the more often a word is used, the larger
# it appears in the word cloud.
OUTPUT: