Detecting Fake Accounts in Media Application Using Machine Learning
Keywords - fake accounts, fake identities, social media, data science, friends, followers, fake profiles
------------------------------------------------------------------------------------------------------------------------------------------------
I. INTRODUCTION
Today, almost everyone in society is connected to Online Social Networks (OSNs). OSNs have drastically changed the way we pursue our social lives: making new friends, keeping in contact with them and following their updates have all become easier. But with the rapid growth of social media, problems such as fake profiles and online impersonation have also grown. No feasible solution yet exists to control these problems. Fake accounts can be human-generated, computer-generated (also referred to as “bots”), or cyborgs [1]. A cyborg is a half-human, half-bot account [1]: it is opened manually by a human, but from then onwards its actions are automated by a bot.

To become a member of an OSN, a user has to create a profile by entering information such as name, photo, date of birth, email ID, graduation details, place of work, home town, interests and so on [2][3]. Some of the fields are mandatory and some are optional, and this varies from one OSN to another. These websites are popular because of people's interest in finding friends, sharing pictures, tagging people in group photos, sharing their ideas and views on common topics, and maintaining good business relationships and general interests with others.

In this paper we propose a framework in which automatic detection of fake profiles is possible and efficient. The framework uses classification techniques such as Support Vector Machine, Random Forest and Deep Neural Networks to classify profiles as fake or genuine. Because detection is automatic, it can be applied easily by an OSN with millions of profiles, which cannot be examined manually. We evaluate whether readily available and engineered features can be used for successful detection with machine learning models.

II. RELATED WORK
This paper presents some filtering algorithms that rely on classification to decide whether a profile is genuine or fake.

III. SUPPORT VECTOR MACHINE
Support Vector Machine (SVM) is a binary classification algorithm that finds the maximum-margin separating hyperplane between two classes. It is a supervised learning algorithm that, given enough training examples, divides two classes fairly well and classifies new examples. SVMs offer a principled approach to machine learning problems because of their mathematical foundation in statistical learning theory [10]. An SVM constructs its solution as a weighted sum of support vectors (SVs), which are only a subset of the training inputs. It is effective in cases where the number of dimensions is greater than the number of samples.

IV. RANDOM FOREST
Random Forest is a versatile method that performs both classification and regression tasks [8]. It has nearly the same hyperparameters as a decision tree or a bagging classifier. It creates many variations of trees. The best outcome is used to predict identity deception. Each outcome from the classifier represents a different section of a tree.

V. SPAM FILTERING
The research study by Simranjit Kaur et al. [4] is based on implementing a k-means clustering algorithm on a vector set to increase efficiency. To detect spam emails using neural networks, two phases, training and testing, are needed. The process detects spam and phishing emails using a feed-forward neural network. In the paper, 11 features are implemented as binary values, 0 or 1, with the value 1 indicating that the feature appeared
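
As a minimal sketch of the binary encoding mentioned in Section V, the following Python snippet maps an email to a vector of 0/1 values, one per feature. The indicator phrases and the function name encode_binary_features are hypothetical and chosen only for illustration; the 11 features used in the cited study are not listed in this text.

```python
# Hypothetical indicator phrases. These are illustrative assumptions only,
# not the 11 features of the cited study.
INDICATORS = ["verify your account", "click here", "password", "urgent"]


def encode_binary_features(email_text, indicators=INDICATORS):
    """Return a 0/1 vector: 1 if the indicator appears in the email, else 0."""
    lowered = email_text.lower()
    return [1 if phrase in lowered else 0 for phrase in indicators]


sample = "URGENT: click here to verify your account."
vector = encode_binary_features(sample)
print(vector)  # [1, 1, 0, 1]
```

A vector of this kind could then serve as the input layer of a classifier such as the feed-forward network described above.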
Special Issue Published in Int. Jnl. Of Advanced Networking & Applications (IJANA)
REFERENCES
[1] Estee Van Der Walt and Jan Eloff, "Using Machine Learning to Detect Fake Identities: Bots vs Humans," IEEE Trans. Emerg. Topics Comput. Intell., vol. 1, no. 1, pp. 61–71, March 2018.
[2] Loredana Caruccio, Domenico Desiato and Giuseppe Polese, "Fake Account Identification in Social