Fake News Spreader Detection Using Naïve Bayes Classifier and Logistic Regression
ISSN No:-2456-2165
Abstract:- To date, people have worked on only one domain, namely political news, but I have worked on political news as well as crime and film industry news.

I do not think that social media is all bad. Fake news is its disadvantage, but through social media we are updated on world current affairs within seconds.

I collected this dataset from Kaggle.com. Our data is in text form, so we have to convert it to numeric form.

III. LITERATURE SURVEY

Undermining trust in institutions: Fake news spreaders can undermine trust in institutions such as the government, media, and scientific community. This can lead to a lack of confidence in public health measures or scientific research, for example.

Exacerbating social tensions: Fake news spreaders can exacerbate social tensions by spreading false information about various groups of people. This can lead to discrimination, prejudice, and even violence.

Overall, the impact of fake news spreaders on individuals can be significant and can lead to confusion, anxiety, and mistrust. It is important to take steps to detect and prevent the spread of fake news in order to ensure that accurate information is available to people and that they can make informed decisions based on facts and evidence.

But we have text data. Text data also has a counterpart to feature scaling, namely vectorization. We use one of the most popular vectorization methods, the count vectorizer.

Classification algorithm:- There are many classification algorithms, such as decision tree, random forest, SVM, and KNN, but we use Multinomial Naive Bayes, Bernoulli Naive Bayes, Gaussian Naive Bayes, and Logistic Regression.

Generally, we need a procedure for representing text information for the ML algorithm. The bag-of-words model is useful for this task. It is simple to implement and is one of the methods for extracting features from a given text for machine learning models. The bag-of-words model pre-processes the input text by changing it into a bag of words. The bag of words can be represented as a table that lists each word together with its count.
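The bag-of-words table described above can be illustrated with a minimal sketch in plain Python. The two headlines, the whitespace tokenization, and the helper names `build_vocabulary` and `bag_of_words` are assumptions made for illustration; they are not taken from the dataset or from the paper's own code. A count vectorizer such as scikit-learn's `CountVectorizer` produces the same kind of table, although its tokenization rules differ.

```python
from collections import Counter


def build_vocabulary(documents):
    """Return a sorted list of every distinct lowercase token in the corpus."""
    vocabulary = set()
    for text in documents:
        vocabulary.update(text.lower().split())
    return sorted(vocabulary)


def bag_of_words(text, vocabulary):
    """Return the count of each vocabulary word in `text`, in vocabulary order."""
    counts = Counter(text.lower().split())
    return [counts[word] for word in vocabulary]


# Hypothetical headlines, invented for illustration only.
headlines = [
    "actor wins award",
    "actor denies award rumour",
]

vocabulary = build_vocabulary(headlines)
table = [bag_of_words(text, vocabulary) for text in headlines]

# Print the table: one column per vocabulary word, one row per headline.
print(vocabulary)
for row in table:
    print(row)
```

Each row of `table` is the numeric form of one headline, which is what a classifier consumes in place of the raw text.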
Each news item is labeled as real or fake: 1 for real news and 0 for fake news. This dataset contains 20800 news items and is balanced, with 10413 positive and 10387 fake news items.
Model Performance:-
1. TN / True Negative: when a case was negative and
predicted negative
2. TP / True Positive: when a case was positive and
predicted positive
3. FN / False Negative: when a case was positive but
predicted negative
4. FP / False Positive: when a case was negative but
predicted positive
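The four cases above can be counted directly from a list of true labels and a list of predicted labels. The following is a minimal sketch, assuming label 1 is the positive class; the function names and the sample labels are hypothetical, and the accuracy formula (TP + TN) / total is a standard metric added here for illustration rather than one stated in the text.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count TN, TP, FN and FP for a binary classification task."""
    tn = tp = fn = fp = 0
    for actual, predicted in zip(y_true, y_pred):
        if actual == positive and predicted == positive:
            tp += 1  # positive case, predicted positive
        elif actual != positive and predicted != positive:
            tn += 1  # negative case, predicted negative
        elif actual == positive:
            fn += 1  # positive case, predicted negative
        else:
            fp += 1  # negative case, predicted positive
    return {"TN": tn, "TP": tp, "FN": fn, "FP": fp}


def accuracy(counts):
    """Return the share of correct predictions, (TP + TN) / total."""
    total = sum(counts.values())
    return (counts["TP"] + counts["TN"]) / total if total else 0.0


# Hypothetical labels, invented for illustration only.
y_true = [1, 0, 1, 1, 0, 0]
y_pred = [1, 0, 0, 1, 1, 0]

counts = confusion_counts(y_true, y_pred)
print(counts)
print(accuracy(counts))
```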
In conclusion, the Naive Bayes classifier and logistic
regression are two popular and effective machine learning
algorithms that can be used for fake news spreader
detection. The Naive Bayes classifier is a probabilistic model
that is based on Bayes' theorem, and it is a simple and fast
algorithm that can handle large datasets. Logistic regression
is a linear model that is used to predict binary outcomes, and
it is a widely used algorithm for classification tasks.
Both algorithms can be trained on a dataset of labeled
examples of fake news spreaders and non-spreaders, and
then used to classify new instances of news stories or social
media posts. The Naive Bayes classifier and logistic regression
have been shown to achieve high accuracy in detecting fake
news spreaders, and they can be used in combination with
other techniques such as feature engineering and ensemble
learning to improve their performance.
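To make the train-then-classify workflow concrete, the following is a minimal sketch of a multinomial Naive Bayes classifier in plain Python. It is not the implementation used in this work: the class name `TinyMultinomialNB`, the whitespace tokenization, the add-one (Laplace) smoothing, and the four training headlines are all assumptions made for illustration. In practice a library implementation such as scikit-learn's `MultinomialNB` or `LogisticRegression` would normally be used instead.

```python
import math
from collections import Counter


class TinyMultinomialNB:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, documents, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.total_docs = len(labels)
        self.word_counts = {label: Counter() for label in self.classes}
        self.vocabulary = set()
        for text, label in zip(documents, labels):
            tokens = text.lower().split()
            self.word_counts[label].update(tokens)
            self.vocabulary.update(tokens)
        return self

    def log_scores(self, text):
        """Return log P(class) + sum of log P(word | class) for each class."""
        # Words never seen during training are ignored.
        tokens = [t for t in text.lower().split() if t in self.vocabulary]
        vocab_size = len(self.vocabulary)
        scores = {}
        for label in self.classes:
            total_words = sum(self.word_counts[label].values())
            score = math.log(self.doc_counts[label] / self.total_docs)
            for token in tokens:
                count = self.word_counts[label][token]
                score += math.log((count + 1) / (total_words + vocab_size))
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Hypothetical training headlines: 0 = fake, 1 = real, as in the labeling above.
train_texts = [
    "shocking secret cure revealed",
    "miracle cure shocks doctors",
    "council approves new budget",
    "court publishes budget report",
]
train_labels = [0, 0, 1, 1]

model = TinyMultinomialNB().fit(train_texts, train_labels)
print(model.predict("secret miracle cure"))
print(model.predict("council budget report"))
```

Working in log space avoids numerical underflow when many small word probabilities are multiplied, and the smoothing term keeps a word that is missing from one class from forcing that class's probability to zero.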
Overall, the detection of fake news spreaders using
machine learning algorithms is an important area of research
that can help to mitigate the impact of fake news on society.
While no single algorithm can guarantee perfect accuracy,
the use of multiple algorithms and techniques can help to
improve the accuracy and reliability of fake news detection
systems.
