DOI:10.1145/1064212.1064258

Data streaming algorithms for accurate and efficient measurement of traffic and flow matrices

Published: 06 June 2005

Abstract

The traffic volume between origin/destination (OD) pairs in a network, known as the traffic matrix, is essential for efficient network provisioning and traffic engineering. Existing approaches to estimating the traffic matrix, based on statistical inference and/or packet sampling, usually cannot achieve very high estimation accuracy. In this work, we take a new approach to this problem. We propose a novel data streaming algorithm that can process a traffic stream at very high speed (e.g., 40 Gbps) and produce traffic digests that are orders of magnitude smaller than the traffic stream. By correlating the digests collected at any OD pair using Bayesian statistics, the volume of traffic flowing between the OD pair can be accurately determined. We also establish principles and techniques for optimally combining this streaming method with sampling, when sampling is necessary due to stringent resource constraints. In addition, we propose another data streaming algorithm that estimates the flow matrix, a finer-grained characterization than the traffic matrix. The flow matrix is concerned not only with the total traffic between an OD pair (the traffic matrix), but also with how it splits into flows of various sizes. Through rigorous theoretical analysis and extensive synthetic experiments on real Internet traffic, we demonstrate that these two algorithms can produce very accurate estimates of the traffic matrix and the flow matrix, respectively.
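
The abstract does not say how a digest is built or how two digests are correlated, so the following is only an illustrative sketch under stated assumptions, not the authors' algorithm. It assumes each digest is a fixed-size bitmap in which every packet sets one hashed bit, and it swaps in the linear-counting estimator plus inclusion-exclusion in place of the Bayesian correlation the abstract mentions. The names `BITMAP_BITS`, `make_digest`, `linear_count`, and `estimate_od_volume` are hypothetical.

```python
import hashlib
import math

BITMAP_BITS = 1 << 16  # assumed digest size in bits; not taken from the paper


def make_digest(packets, bits=BITMAP_BITS):
    """Build a bitmap digest: each packet (a bytes object) sets one hashed bit."""
    bitmap = bytearray(bits // 8)
    for pkt in packets:
        h = int.from_bytes(hashlib.blake2b(pkt, digest_size=8).digest(), "big") % bits
        bitmap[h >> 3] |= 1 << (h & 7)
    return bitmap


def linear_count(bitmap):
    """Linear-counting estimate of how many distinct packets set bits in the bitmap."""
    bits = len(bitmap) * 8
    zeros = sum(8 - bin(byte).count("1") for byte in bitmap)
    if zeros == 0:
        raise ValueError("bitmap is saturated; use a larger digest")
    return bits * math.log(bits / zeros)


def estimate_od_volume(origin_digest, dest_digest):
    """Estimate packets seen at both nodes via |A| + |B| - |A union B|."""
    if len(origin_digest) != len(dest_digest):
        raise ValueError("digests must have the same size")
    # OR-ing two bitmaps built with the same hash gives the digest of the union.
    union = bytearray(a | b for a, b in zip(origin_digest, dest_digest))
    return linear_count(origin_digest) + linear_count(dest_digest) - linear_count(union)
```

Both nodes must use the same hash function and bitmap size for the bitwise OR to be meaningful. Byte-identical packets map to the same bit, so this sketch counts distinct packets; a real deployment would hash packet fields that stay invariant along the path.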

Published In

SIGMETRICS '05: Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
June 2005
428 pages
ISBN:1595930221
DOI:10.1145/1064212
  • ACM SIGMETRICS Performance Evaluation Review, Volume 33, Issue 1
    June 2005
    417 pages
    ISSN:0163-5999
    DOI:10.1145/1071690
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. data streaming
  2. network measurement
  3. sampling
  4. statistical inference
  5. traffic matrix

Qualifiers

  • Article

Conference

SIGMETRICS05

Acceptance Rates

Overall Acceptance Rate 459 of 2,691 submissions, 17%
