Location via proxy:   
[Report a bug]   [Manage cookies]                


Juncheng Yang

Juncheng Yang

Assistant Professor (Starting Fall 2025)

School of Engineering and Applied Science, Harvard University

juncheng # g.harvard.edu

150 Western Ave, Allston, MA, 02134




I am an Assistant Professor in School of Applied Science and Engineering, Harvard University.

I am broadly interested in storage systems, data management and machine learning systems with particular interests on workload analysis, efficient storage, and sustainable system design. I like in-depth measurement and analysis to get deep understanding of systems and algorithms in the real world.

My works have received best-paper awards at NSDI'24, NSDI'21, SOSP'21, and SYSTOR'16 and have been deployed in production at Google, VMware, Twitter, Redpanda, Momento with many open-source libraries contributed by the community. My research has been sponsored by Meta, Google Cloud, and AWS. I am a 2020 Meta Fellow, a 2023 Google Cloud Research Innovator, and a 2023 Rising Star in Machine Learning and Systems.


Please read this page if you are interested in working with me or asking for a recommendation letter.


News [all news]


Research Areas and Interests

Storage systems and machine learning systems with a focus on efficiency, scalability and robustness:
  • Efficient and scalable cache management systems
  • Robust and reliable cache/storage management and machine learning systems [OSDI'20][NSDI'22][VLDB'23]
  • New approaches to make machine learning practical for storage systems (machine learning for systems) [FAST'23][SOCC'17]
  • Performance optimization and sustainability of microservices and serverless architecture [SOCC'23]
  • Reliable large model inference on wimpy hardware (system for machine learning)

Research Highlights

  • SIEVE (NSDI'24): the first cache eviction algorithm simpler than LRU but yet more effective than state-of-the-art algorithms for web caches. Implemented in many open-source libraries, e.g., Golang, Python, JavaScript, Rust, Java, Swift, Ruby, Nim, and Zig. Find more details here.
  • S3-FIFO (SOSP'23): a simple and scalable cache eviction algorithm composed of only FIFO queues. Implemented or deployed at companies including Google, VMware and Redpanda, and many open-source libraries. Find more details here.
  • GL-Cache (FAST'23): my exploration on bringing low-overhead machine learning to caching.
  • Segcache (NSDI'21) received a best-paper award, and deployed at Twitter and Momento.
  • Kangaroo (SOSP'21) received a best-paper award at SOSP'21.

I have been very fortunate to work with many talented students. If you have worked with me, but not showing up on this page, please feel free to let me know.

Ph.D. Students

Master and Undergraduate Students

Alumni

Master and Undergraduate Students


Selected Publications [Google Scholar]


Teaching


Services

Conference

  • FAST Artifact Evaluation Chair: 2025, 2026
  • FAST Reivewer: 2025, 2026
  • MLSys Reviewer: 2025
  • ICDCS Reviewer: 2025

Workshop

  • New England Systems Day: 2025
  • HotStorage Reviewer: 2025

Journal review

  • ACM Transactions on Storage (TOS): 2023
  • ACM Computing Survey: 2025
  • Springer Journal of Supercomputing: 2024
  • IEEE Transaction on Computers (TOC): 2023, 2024
  • IEEE Transaction on Reliability (TOR): 2024
  • IEEE Transactions on Cloud Computing (TCC): 2023
  • IEEE ACCESS: 2023
  • IEEE Transactions on Parallel and Distributed Systems (TPDS): 2023, 2024
  • IEEE Transactions on Mobile Computing (TMC): 2023
  • IEEE Transactions on Knowledge and Data Engineering (TKDE): 2023

Awards

  • 2023 Machine learning and System Rising Star
  • 2023 Google Cloud Innovator
  • 2020-22, Facebook Fellowship
  • 2018, AWS research grant
  • 2013, Emerson Fellowship
  • 2013, Best Thesis Award (5/3000)
  • 2012, "Person of the Year" Nomination
  • 2012, Third Place in Green Tech International Competition, Taiwan
  • 2009, First Award in National Chemistry Olympiad

Talks

  • Simple Scalable Caching with Three Static FIFO Queues.
    • [11/2023] WOS conference
    • [09/2023] VMware
    • [09/2023] Kuaishou
    • [09/2023] Microsoft Research Asia
    • [09/2023] Tsinghua University
    • [08/2023] Cloudflare
    • [08/2023] USTC
    • [07/2023] Alluxio
  • LESSCache: LEarned Segment-Structured Cache
    • [02/2023] Meta
    • [10/2022] VMware vSAN and VMware Research
  • Ubiquitous Caching: A Journey of Building Efficient Distributed and Process Caches.
  • Segcache: a memory-efficient and scalable in-memory key-value cache for small objects
    • [01/2023] Oracle
    • [2022] Shopify

Bio

Juncheng Yang is an Assistant Professor in the School of Engineering and Applied Science at Harvard University. He received his Ph.D. in Computer Science from Carnegie Mellon University in 2024, His research interests broadly cover the efficiency, performance, reliability, and sustainability of large-scale data systems.

Juncheng's works have received best paper awards at NSDI'24, NSDI'21, SOSP'21, and SYSTOR'16. His OSDI'20 paper was recognized as one of the best storage papers at the conference and invited to ACM TOS'21. Juncheng received a Facebook Ph.D. Fellowship in 2020, was recognized as a Rising Star in machine learning and systems in 2023, and a Google Cloud Research Innovator in 2023.

His work, Segcache, has been adopted for production at Twitter and Momento. The two eviction algorithms he designed (S3-FIFO, SIEVE) have been adopted for production at Google, VMware, Redpanda, and several others, with over 20 open-source libraries available on GitHub. Moreover, the open-source cache simulation library he created, libCacheSim, has been used by almost 100 research institutes and companies.