Abstract
With the growing amount of internet users, a negative form of sending email spreads that affects more and more users of email accounts: Spamming. Spamming means that the electronic mailbox is congested with unwanted advertising or personal email. Sorting out this email costs the user time and money. This paper introduces a distributed spam filter, which combines an off-the-shelf text classification with multiagent systems. Both the text classification as well as the multiagent platform are implemented in Java. The content of the emails is analyzed by the classification algorithm ‘support vector machines’. Information about spam is exchanged between the agents through the network. Identification numbers for emails which where identified as spam are generated and forwarded to all other agents connected to the network. These numbers allow agents to identify incoming spam email. In this way, the quality of the filter increases continuously.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
W.W. Cohen (1996): Learning Rules that classify e-mail. Proceedings of the AAAI Spring Symposium on Machine Learning in Information Access 18–25, AAAI Press.
V. Vapnik, H. Drucker, D. Wu (1999): Support Vector Machines for Spam Categorization. In: IEEE Transactions on Neural Networks, 10(5):1048–1054.
Homepage of the Foundation for Intelligent Physical Agent (FIPA). http://www..pa.org
Java Mail (TM) API Design Specification. Sun Microsystems, Inc.: http://java.sun.com/products/javamail/JavaMail-1.2.pdf
S. Shankar, G. Karypis (2000): A FeatureWeight Adjustment Algorithm for Document Categorization. Proceedings of the Sixth International Conference on Knowledge Discovery and Data Mining (ACM SIGKDD 2000).
P. Pantel and D. Lin (1998): SpamCop — A Spam Classification and Organization Program. Proceedings of AAAI-98Workshop on Learning for Text Categorization 95–98, AAAI Press.
Nortel Networks Corporation FIPA-OS Informations: http://www.nortelnetworks.com/products/announcements/.pa
M. F. Porter (1980): An algorithm for suffix stripping. In: Program, 14(3):130–137.
Secure Hash Standard (1995): Federal Information Processing Standards Publication 180-1. National Institute of Standards and Technology.
Spam Statistic: http://www.raingod.com/angus/Computing/Internet/Spam/Statistics/index.html
V.N. Vapnik, D. Wu (1998): Support Vector Machine for Text Categorization. AT&T Research Labs, http://citeseer.nj.nec.com/347263.htm
J. D. M. Rennie (2000): ifile: An Application of Machine Learning to E-Mail Filtering. Proceedings of the KDD-2000 Workshop on Text Mining, Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
M. Wooldridge, N. Jennings (1995): Intelligent Agents: Theory and Practice. In: Knowledge Engineering Review 10(2):115–152, Cambridge University Press.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Metzger, J., Schillo, M., Fischer, K. (2003). A Multiagent-Based Peer-to-Peer Network in Java for Distributed Spam Filtering. In: Mařík, V., Pěchouček, M., Müller, J. (eds) Multi-Agent Systems and Applications III. CEEMAS 2003. Lecture Notes in Computer Science(), vol 2691. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45023-8_59
Download citation
DOI: https://doi.org/10.1007/3-540-45023-8_59
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40450-7
Online ISBN: 978-3-540-45023-8
eBook Packages: Springer Book Archive