Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1835449.1835598acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
poster

Semi-supervised spam filtering using aggressive consistency learning

Published: 19 July 2010 Publication History

Abstract

A graph based semi-supervised method for email spam filtering, based on the local and global consistency method, yields low error rates with very few labeled examples. The motivating application of this method is spam filters with access to very few labeled message. For example, during the initial deployment of a spam filter, only a handful of labeled examples are available but unlabeled examples are plentiful. We demonstrate the performance of our approach on TREC 2007 and CEAS 2008 email corpora. Our results compare favorably with the best-known methods, using as few as just two labeled examples: one spam and one non-spam.

References

[1]
SVM Light. http://svmlight.joachims.org/.
[2]
O. Alter, P. Brown, and D. Botstein. Singular value decomposition for genome-wide expression data processing and modeling. In Proc Natl Acad Sci, USA, 2000.
[3]
C. Castillo, D. Donato, V. Murdock, and F. Silvestri. Know your neighbors: Web spam detection using the web topology. In 30st ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2007), Netherlands, 2007.
[4]
M. Mojdeh and G. Cormack. Semi supervised spam filtering: Does it work? In 31st ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), Singapore, 2008.
[5]
D. Zhou, O. Bousquet, T. Lal, J. Weston, and B. Scholkopf. Learning with local and global consistency. In Advances in Neural Information Processing Systems 16 (NIPS 2003), pages 321--328. MIT Press.

Cited By

View all
  • (2024)Exploring Algorithmic Paradigms in Message Classification: Insights from the Enron E-mail DatasetAdvances in Information Communication Technology and Computing10.1007/978-981-97-6103-6_3(27-40)Online publication date: 2-Oct-2024
  • (2023)Email spam detection using hierarchical attention hybrid deep learning methodExpert Systems with Applications10.1016/j.eswa.2023.120977233(120977)Online publication date: Dec-2023
  • (2014)Towards Designing an Email Classification System Using Multi-view Based Semi-supervised LearningProceedings of the 2014 IEEE 13th International Conference on Trust, Security and Privacy in Computing and Communications10.1109/TrustCom.2014.26(174-181)Online publication date: 24-Sep-2014
  • Show More Cited By

Index Terms

  1. Semi-supervised spam filtering using aggressive consistency learning

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
      July 2010
      944 pages
      ISBN:9781450301534
      DOI:10.1145/1835449
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 19 July 2010

      Check for updates

      Author Tags

      1. classification
      2. email
      3. filtering
      4. spam

      Qualifiers

      • Poster

      Conference

      SIGIR '10
      Sponsor:

      Acceptance Rates

      SIGIR '10 Paper Acceptance Rate 87 of 520 submissions, 17%;
      Overall Acceptance Rate 792 of 3,983 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 23 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Exploring Algorithmic Paradigms in Message Classification: Insights from the Enron E-mail DatasetAdvances in Information Communication Technology and Computing10.1007/978-981-97-6103-6_3(27-40)Online publication date: 2-Oct-2024
      • (2023)Email spam detection using hierarchical attention hybrid deep learning methodExpert Systems with Applications10.1016/j.eswa.2023.120977233(120977)Online publication date: Dec-2023
      • (2014)Towards Designing an Email Classification System Using Multi-view Based Semi-supervised LearningProceedings of the 2014 IEEE 13th International Conference on Trust, Security and Privacy in Computing and Communications10.1109/TrustCom.2014.26(174-181)Online publication date: 24-Sep-2014
      • (2011)Clustering for semi-supervised spam filteringProceedings of the 8th Annual Collaboration, Electronic messaging, Anti-Abuse and Spam Conference10.1145/2030376.2030391(125-134)Online publication date: 1-Sep-2011
      • (undefined)Email Spam Detection Using Hierarchical Attention Hybrid Deep Learning MethodSSRN Electronic Journal10.2139/ssrn.4177036
      • (undefined)Email Spam Detection Using Hierarchical Attention Hybrid Deep Learning MethodSSRN Electronic Journal10.2139/ssrn.4177035

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media