Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Abstract

In this chapter, we show a server-side deduplication component, HEDS (Hybrid Email Deduplication System) for the proposed deduplication framework. HEDS removes redundancies by trading-off of file-level and block deduplication for email systems while achieving good storage space savings and low processing overhead.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bloom, B.H.: Space/time trade-offs in hash coding with allowable errors. Commun. ACM 13, 422–426 (1970)

    Article  MATH  Google Scholar 

  2. Bolosky, W., Corbin, S., Goebel, D., Douceur, J.: Single instance storage in Windows 2000. In: Proceeding of the 4th USENIX Windows Systems Symposium (2000)

    Google Scholar 

  3. FUSE: File in UserSpacE. http://fuse.sourceforge.net/ (2016)

  4. Klimt, B., Yang, Y.: The enron corpus: a new dataset for email classification research, pp. 217–226. http://nyc.lti.cs.cmu.edu/yiming/Publications/klimt-ecml04.pdf (2004)

  5. Lillibridge, M., Eshghi, K., Bhagwat, D., Deolalikar, V., Trezise, G., Camble, P.: Sparse indexing: large scale, inline deduplication using sampling and locality. In: Proceeding of the USENIX Conference on File and Storage Technologies (FAST) (2009)

    Google Scholar 

  6. Meyer, D.T., Bolosky, W.J.: A study of practical deduplication. In: Proceeding of the USENIX Conference on File and Storage Technologies (FAST) (2011)

    Google Scholar 

  7. Milter.org: Sendmail mail filters. http://www.sendmail.com/sm/partners/milter_partners/open_source_milter_partners/ (2015)

  8. National Institute of Standards and Technology (NIST): Secure Hash Standard 1 (SHA1). http://csrc.nist.gov/publications/fips/fips180-4/fips-180-4.pdf (2015)

  9. Rabin, M.O.: Fingerprinting by random polynomials. Tech. Rep. Report TR-15-81, Harvard University (1981)

    Google Scholar 

  10. Zhu, B., Li, K., Patterson, H.: Avoiding the disk bottleneck in the data domain deduplication file system. In: Proceeding of the USENIX Conference on File and Storage Technologies (FAST) (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Kim, D., Song, S., Choi, BY. (2017). HEDS: Hybrid Email Deduplication System. In: Data Deduplication for Data Optimization for Storage and Network Systems. Springer, Cham. https://doi.org/10.1007/978-3-319-42280-0_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42280-0_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42278-7

  • Online ISBN: 978-3-319-42280-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics