100% found this document useful (1 vote)

78 views

Fake Account Detection Using Machine Learning Techniques

This project focuses on detecting fake and real accounts using various features such as metadata, interaction patterns, and content analysis. By analyzing these features, the project aims to build a model that can accurately classify accounts, helping to maintain the integrity of online platforms.

Uploaded by

mominaayman03074

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

78 views

Fake Account Detection Using Machine Learning Techniques

Uploaded by

mominaayman03074

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Volume 12, Issue 4, April 2023

Impact Factor: 8.423

International Journal of Innovative Research in Science, Engineering and Technology (IJIRSET)

|| Volume 12, Issue 4, April 2023 ||

| DOI:10.15680/IJIRSET.2023.1204136 |

Fake Account Detection using Machine

Learning Techniques
Prof.V.G.Bharane, Momin Aayman Rafik, Pathan Nafisa Sharif, Suryavanshi Priyanka Dattatray
Department of Computer Engineering, S.B.Patil College of Engineering, Indapur, India

ABSTRACT—The online social networks are a very large growth in the world today, but the attacks are more
common, including one of the attacks is the attack of Twitter in this spammer spreading several malicious tweets that
can take the form of links or hash tags in the website and online services, which are too harmful for real users. To
prevent these attacks, training tweets are added and, moreover, these problems are solved by extracting 12 lightweight
functions, like the age of the account, no. of followers, no. to follow, no. of tweets, no. of re-tweets, etc. For the
transmission of spam detection from tweets, the discretization of a function is important for the performance of spam
detection. There is a great truth in the system that includes a total of 600 public tweets based on the URL-based security
tool. Spam detection primarily creates the classification model that includes binary classification and can also be solved
using the automatic learning algorithm. Machine learning algorithms such as the Naïve Bayesian classifier or the vector
support machine classifier have informed the behavior of the models. The system reported the impact of data-related
factors, such as the relationship between spam and non-spam, the size of training data and data sampling, and detection
performance. The implemented system function is the detection of simple and variable tweets of spam over time. The
system shows how spam detection is a major challenge and bridges the gap between performance appraisals and
focuses primarily on data, features and patterns to identify the real user and inform the user of spam when providing the
valuable response binary. The contribution work is to detect the tweets of spam in real time, because the new tweets
come in the form of sequences and use the updated training data set.

KEYWORDS-Machine Learning, Parallel Computing, Spam Detection, Scalability, Twitter

I. INTRODUCTION

Online social networking sites like Twitter, Facebook, Instagram and some online social networking companies have
become extremely popular in recent years. People spend a lot of time in OSN making friends with people they are
familiar with or interested in. Twitter, founded in 2006, has become one of the most popular microblogging service
sites. Around 200 million users create around 400 million new tweets a day for spam growth. Twitter spam, known as
unsolicited tweets containing malicious links that the non-stop victims to external sites containing the spread of
malware, spreading malicious links, etc., hit not only more legitimate users, but also the whole platform Consider the
example because during the election of the Australian Prime Minister in 2013, a notice confirming that his Twitter
account had been hacked. Many of his followers have received direct spam messages containing malicious links. The
ability to order useful information is essential for the academic and industrial world to discover hidden ideas and
predict trends on Twitter. However, spam generates a lot of noise on Twitter. To detect spam automatically, researchers
applied machine learning algorithms to make spam detection a classification problem. Ordering a tweet broadcast
instead of a Twitter user as spam or non-spam is more realistic in the real world.

II. PROPOSED SYSTEM

 The collection of tweets with respect to trending topics on Twitter. After storing the tweets in a particular file
format, the tweets are subsequently analyzed.

 Labelling of spam is performed to check through all datasets that are available to detect the malignant URL.

 Feature extraction separates the characteristics construct based on the language model that uses language as a tool
and helps in determining whether the tweets are fake or not.

IJIRSET©2023 | An ISO 9001:2008 Certified Journal | 3370

International Journal of Innovative Research in Science, Engineering and Technology (IJIRSET)

|| Volume 12, Issue 4, April 2023 ||

| DOI:10.15680/IJIRSET.2023.1204136 |
 The classification of data set is performed by shortlisting the set of tweets that is described by the set of features
provided to the classifier to instruct the model and to acquire the knowledge for spam detection.

 The spam detection uses the classification technique to accept tweets as the input and classify the spam and non-
spammer.

Architecture

Algorithm flowchart

IJIRSET©2023 | An ISO 9001:2008 Certified Journal | 3371

International Journal of Innovative Research in Science, Engineering and Technology (IJIRSET)

|| Volume 12, Issue 4, April 2023 ||

| DOI:10.15680/IJIRSET.2023.1204136 |

DFD Diagram
Level 0

IJIRSET©2023 | An ISO 9001:2008 Certified Journal | 3372

International Journal of Innovative Research in Science, Engineering and Technology (IJIRSET)

|| Volume 12, Issue 4, April 2023 ||

| DOI:10.15680/IJIRSET.2023.1204136 |
Level 1

III. ADVANTAGES

 To categories the Spammers and Non-spammers.

 To work on a performance evaluation such as Precision, Recall, F-measure.
 To categorize the tag based tweets and link based tweets.
 To try to improve detection accuracy using deep learning algorithms.

IV. APPLICATION

 Social Media Application

 Spam Detection Applications

V. CONCLUSION

In this Project, System found that classifiers ability to detect Twitter spam reduced when in a near real-world scenario
since the imbalanced data brings bias. System also identified that Feature discretization was an important preprocess to
ML-based spam detection. Second, increasing training data only cannot bring more benefits to detect Twitter spam
after a certain number of training samples. System should try to bring more discriminative features or better model to
further improve spam detection rate.

REFERENCES

[1] Q. Cao, M. Sirivianos, X. Yang, and T. Pregueiro, “Aiding the detection of fake accounts in large scale social
online services,” in Proc. Symp. Netw. Syst. Des. Implement. (NSDI), 2012, pp. 197–210.
[2] G. Stringhini, C. Kruegel, and G. Vigna, “Detecting spammers on social networks,” in Proc. 26th Annu. Comput.
Sec. Appl. Conf., 2010, pp. 1–9.

International Journal of Innovative Research in Science, Engineering and Technology (IJIRSET)

|| Volume 12, Issue 4, April 2023 ||

| DOI:10.15680/IJIRSET.2023.1204136 |
[3] J. Song, S. Lee, and J. Kim, “Spam filtering in Twitter using sender receiver relationship,” in Proc. 14th Int. Conf.
Recent Adv. Intrusion Detection, 2011, pp. 301–317.
[4] K. Lee, J. Caverlee, and S. Webb, “Uncovering social spammers: social honeypots + machine learning,” in Proc.
33rd Int. ACM SIGIR Conf. Res.Develop. Inf. Retrieval, 2010, pp. 435–442.
[5] Nathan Aston, Jacob Liddle and Wei Hu*, “Twitter Sentiment in Data Streams with Perceptron,” in Journal of
Computer and Communications, 2014, Vol-2 No-11.
[6] K. Thomas, C. Grier, D. Song, and V. Paxson, “Suspended accounts in retrospect: An analysis of Twitter spam,” in
Proc. ACM SIGCOMM Conf. Internet Meas., 2011, pp. 243–258.
[7] K. Thomas, C. Grier, J. Ma, V. Paxson, and D. Song, “Design and evaluation of a real-time URL spam filtering
service,” in Proc. IEEE Symp. Sec. Privacy, 2011, pp. 447–462.
[8] X. Jin, C. X. Lin, J. Luo, and J. Han, “Socialspamguard: A data mining based spam detection system for social
media networks,” PVLDB, vol. 4, no. 12, pp. 1458–1461, 2011.
[9] S. Ghosh et al., “Understanding and combating link farming in the Twitter social network,” in Proc. 21st Int. Conf.
World Wide Web, 2012, pp. 61–70.
[10] H. Costa, F. Benevenuto, and L. H. C. Merschmann, “Detecting tip spam in location-based social networks,” in
Proc. 28th Annu. ACM Symp. Appl. Comput., 2013, pp. 724–729.

8.423

DigitalForensics Autonomous Syllabus
No ratings yet
DigitalForensics Autonomous Syllabus
2 pages
SENTRI Manual
No ratings yet
SENTRI Manual
30 pages
Cyberbullying Detection in Social Media Using Supervised ML & NLP Techniques
No ratings yet
Cyberbullying Detection in Social Media Using Supervised ML & NLP Techniques
5 pages
UGC List of Approved Journals
No ratings yet
UGC List of Approved Journals
9 pages
The Spammer Detection and Fake User Identification On Social Networks
No ratings yet
The Spammer Detection and Fake User Identification On Social Networks
8 pages
Fake Social Media Profile Detection and Reporting
No ratings yet
Fake Social Media Profile Detection and Reporting
6 pages
Spammer Detection and Fake User Identification On Social Networks
No ratings yet
Spammer Detection and Fake User Identification On Social Networks
9 pages
A Framework To Predict Social Crimes Using Twitter Tweets
No ratings yet
A Framework To Predict Social Crimes Using Twitter Tweets
5 pages
Statistical Twitter Spam Detection Demystified: Performance, Stability and Scalability
No ratings yet
Statistical Twitter Spam Detection Demystified: Performance, Stability and Scalability
13 pages
A Batch Based Approach For Tweeting Geotags of Social Media Attributes
No ratings yet
A Batch Based Approach For Tweeting Geotags of Social Media Attributes
13 pages
Fake News Detection Using Machine Learning
No ratings yet
Fake News Detection Using Machine Learning
8 pages
Seminar f0
No ratings yet
Seminar f0
17 pages
Fake Account Detect From Machine Learning
No ratings yet
Fake Account Detect From Machine Learning
4 pages
Fileserve.php
No ratings yet
Fileserve.php
10 pages
Hate Speech Detection Using Machine Learning
No ratings yet
Hate Speech Detection Using Machine Learning
5 pages
Cyber Bullying Detection On Social Media Network
No ratings yet
Cyber Bullying Detection On Social Media Network
9 pages
Analysis and Optimization of Data Classification Using K-Means Clustering and Affinity Propagation Technique
No ratings yet
Analysis and Optimization of Data Classification Using K-Means Clustering and Affinity Propagation Technique
9 pages
Identification of Suicidal Intent Using Machine Learning Techniques Over Twitter Data
No ratings yet
Identification of Suicidal Intent Using Machine Learning Techniques Over Twitter Data
14 pages
Social Media Spam Comments Detection Analysis Using Machine Learning
No ratings yet
Social Media Spam Comments Detection Analysis Using Machine Learning
6 pages
Fraud App Detection: Jyoti Singh, Lakshita Suthar, Diksha Khabya, Simmi Pachori, Nikita Somani, Dr. Mayank Patel
No ratings yet
Fraud App Detection: Jyoti Singh, Lakshita Suthar, Diksha Khabya, Simmi Pachori, Nikita Somani, Dr. Mayank Patel
6 pages
Senti bp1
No ratings yet
Senti bp1
2 pages
0 Comparative Analysis On Social Trust Computation in Trust Based Personalised Recommender Systems
No ratings yet
0 Comparative Analysis On Social Trust Computation in Trust Based Personalised Recommender Systems
7 pages
Phishing Website Detector Using ML
No ratings yet
Phishing Website Detector Using ML
8 pages
Innovative Nitesh
No ratings yet
Innovative Nitesh
14 pages
PCRMS4
No ratings yet
PCRMS4
4 pages
Web Based Machine Learning Automated Pipeline
100% (1)
Web Based Machine Learning Automated Pipeline
6 pages
Review Paper On Detection of Malicious URLs Using Machine Learning Techniques
No ratings yet
Review Paper On Detection of Malicious URLs Using Machine Learning Techniques
4 pages
Improving Accuracy of Twitter Fake Profile Detection Using Deep Learning
No ratings yet
Improving Accuracy of Twitter Fake Profile Detection Using Deep Learning
5 pages
Social Media Analytics of The Internet of Things
No ratings yet
Social Media Analytics of The Internet of Things
15 pages
Ijitc Docu Go July Publication
No ratings yet
Ijitc Docu Go July Publication
11 pages
Iarjset 2022 9340
No ratings yet
Iarjset 2022 9340
6 pages
Resume Builder Application
No ratings yet
Resume Builder Application
7 pages
Cyber Security For Social Networking Sites: Issues, Challenges and Solutions
No ratings yet
Cyber Security For Social Networking Sites: Issues, Challenges and Solutions
7 pages
AI Based Content Controlling System Using Age Prediction Algorithm and Selenium Tool
No ratings yet
AI Based Content Controlling System Using Age Prediction Algorithm and Selenium Tool
8 pages
Resume Builder Application
No ratings yet
Resume Builder Application
8 pages
Resume Builder Application
No ratings yet
Resume Builder Application
8 pages
90_FACIAL
No ratings yet
90_FACIAL
10 pages
Personalized Mobile App Recommendation by Learning User's Interest From Social Media
No ratings yet
Personalized Mobile App Recommendation by Learning User's Interest From Social Media
5 pages
Timeline Analysis of Twitter User Timeline Analysis of Twitter User
No ratings yet
Timeline Analysis of Twitter User Timeline Analysis of Twitter User
10 pages
Twitter Spam Detection Based On Deep Learning: Tingmin Wu, Shigang Liu, Jun Zhang and Yang Xiang
No ratings yet
Twitter Spam Detection Based On Deep Learning: Tingmin Wu, Shigang Liu, Jun Zhang and Yang Xiang
8 pages
Big Data Stream Mining Using Integrated Framework With Classification and Clustering Methods
No ratings yet
Big Data Stream Mining Using Integrated Framework With Classification and Clustering Methods
9 pages
Android Application of Municipality Online
No ratings yet
Android Application of Municipality Online
5 pages
Chapter One and Two
No ratings yet
Chapter One and Two
18 pages
Referrence
No ratings yet
Referrence
5 pages
Chapter 3
No ratings yet
Chapter 3
20 pages
Fin Irjmets1651825107
No ratings yet
Fin Irjmets1651825107
4 pages
Analysis of Rumour Detection Using Deep Learning Methods On Social Media
No ratings yet
Analysis of Rumour Detection Using Deep Learning Methods On Social Media
10 pages
Smart Wi-Fi Dustbin System-IJAERDV04I0479866
No ratings yet
Smart Wi-Fi Dustbin System-IJAERDV04I0479866
4 pages
Cyber Bullying Text Detection Using Machine Learning
No ratings yet
Cyber Bullying Text Detection Using Machine Learning
7 pages
Resume Builder Application
No ratings yet
Resume Builder Application
8 pages
Fin Irjmets1682165062
No ratings yet
Fin Irjmets1682165062
4 pages
Twitter Data Preprocessing For Spam Detection: Myungsook Klassen
No ratings yet
Twitter Data Preprocessing For Spam Detection: Myungsook Klassen
6 pages
A survey on health care 2017
No ratings yet
A survey on health care 2017
21 pages
A Review On Fake Account Detection in Social Media
No ratings yet
A Review On Fake Account Detection in Social Media
7 pages
An INTRANET-Based Web Application For College Management System Using Python
No ratings yet
An INTRANET-Based Web Application For College Management System Using Python
10 pages
batch2
No ratings yet
batch2
21 pages
Detection of Cyber Bullying On Social Media Using Machine Learning
No ratings yet
Detection of Cyber Bullying On Social Media Using Machine Learning
8 pages
Real Time Twitter Sentiment Analysis
No ratings yet
Real Time Twitter Sentiment Analysis
12 pages
Techniques To Detect Spammers in Twitter-A Survey: International Journal of Computer Applications December 2013
No ratings yet
Techniques To Detect Spammers in Twitter-A Survey: International Journal of Computer Applications December 2013
7 pages
Sentiment Analysis Tool Using Machine Learning Algorithms
No ratings yet
Sentiment Analysis Tool Using Machine Learning Algorithms
5 pages
Karthik.S-Spammer Detection and Fake User Identification On Social Networks
No ratings yet
Karthik.S-Spammer Detection and Fake User Identification On Social Networks
5 pages
"Big Data Science" Basic Concepts and Applications
From Everand
"Big Data Science" Basic Concepts and Applications
Sukanta Bhattacharya
No ratings yet
Manual Itron
No ratings yet
Manual Itron
6 pages
BCD and Ascii Code
No ratings yet
BCD and Ascii Code
4 pages
Telugu Links - Ebook Preview
0% (1)
Telugu Links - Ebook Preview
10 pages
Leveled Problem Solving Least Common Multiple: Lesson
No ratings yet
Leveled Problem Solving Least Common Multiple: Lesson
1 page
Analysis of Three Phase Full Controlled Converter Using MATLAB/Simulink
100% (1)
Analysis of Three Phase Full Controlled Converter Using MATLAB/Simulink
5 pages
Disaster Recovery - Wikipedia
No ratings yet
Disaster Recovery - Wikipedia
14 pages
1.2 Linear Equations and Rational Equations: Section
No ratings yet
1.2 Linear Equations and Rational Equations: Section
15 pages
CIS-KYC For BJ GROUP HOLDING
No ratings yet
CIS-KYC For BJ GROUP HOLDING
4 pages
HvPE-II End Term
No ratings yet
HvPE-II End Term
51 pages
Airtel Payments Bank - Nanded PDF
No ratings yet
Airtel Payments Bank - Nanded PDF
19 pages
Income Tax Synopsis
No ratings yet
Income Tax Synopsis
24 pages
Products Affected / Serial Numbers Affected:: TP17 212.pdf 08-11-17
No ratings yet
Products Affected / Serial Numbers Affected:: TP17 212.pdf 08-11-17
4 pages
IT Code 402 Notes: CBSE Class 10
No ratings yet
IT Code 402 Notes: CBSE Class 10
6 pages
LESSON 2 - Purposive Communication
No ratings yet
LESSON 2 - Purposive Communication
4 pages
LM24-H200
No ratings yet
LM24-H200
2 pages
08 - Module 8
No ratings yet
08 - Module 8
87 pages
Module 5 Dbms Notes bcs403
No ratings yet
Module 5 Dbms Notes bcs403
11 pages
Free Digital Marketing Resources
100% (1)
Free Digital Marketing Resources
14 pages
File Processing
No ratings yet
File Processing
10 pages
Hacking With Kali Linux - A Comprehensive Beginner's Guide to Learn Ethical Hacking. Practical Examples to Learn the Basics of Cybersecurity. Includes Penetration Testing With Kali Linux by ITC ACADEMY
100% (1)
Hacking With Kali Linux - A Comprehensive Beginner's Guide to Learn Ethical Hacking. Practical Examples to Learn the Basics of Cybersecurity. Includes Penetration Testing With Kali Linux by ITC ACADEMY
91 pages
Packet Tracer Manual
100% (5)
Packet Tracer Manual
177 pages
SDRSharp Users Guide
No ratings yet
SDRSharp Users Guide
12 pages
7 Ways To Boot in Safe Mode in Windows 10
No ratings yet
7 Ways To Boot in Safe Mode in Windows 10
19 pages
Company Presentation - Apple
No ratings yet
Company Presentation - Apple
2 pages
White Paper Interconnect Solutions Debugging Issues Advanced ARM CoreLink
No ratings yet
White Paper Interconnect Solutions Debugging Issues Advanced ARM CoreLink
8 pages
Raw Waveform Processing - BayesMap Solutions, LLC
No ratings yet
Raw Waveform Processing - BayesMap Solutions, LLC
3 pages
BASIC SCIENCE AND TECHNOLOGY JSS 3
No ratings yet
BASIC SCIENCE AND TECHNOLOGY JSS 3
4 pages
Cheat Effect: Buddha
No ratings yet
Cheat Effect: Buddha
15 pages