Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2388996.2389106acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

On using virtual circuits for GridFTP transfers

Published: 10 November 2012 Publication History

Abstract

The goal of this work is to characterize scientific data transfers and to determine the suitability of dynamic virtual circuit service for these transfers instead of the currently used IP-routed service. Specifically, logs collected by servers executing a commonly used scientific data transfer application, GridFTP, are obtained from three US super-computing/scientific research centers, NERSC, SLAC, and NCAR, and analyzed. Dynamic virtual circuit (VC) service, a relatively new offering from providers such as ESnet and Internet2, allows for the selection of a path on which a rate-guaranteed connection is established prior to data transfer. Given VC setup overhead, the first analysis of the GridFTP transfer logs characterizes the duration of sessions, where a session consists of multiple back-to-back transfers executed in batch mode between the same two GridFTP servers. Of the NCAR-NICS sessions analyzed, 56% of all sessions (90% of all transfers) would have been long enough to be served with dynamic VC service. An analysis of transfer logs across four paths, NCAR-NICS, SLAC-BNL, NERSC-ORNL and NERSC-ANL, shows significant throughput variance, where NICS, BNL, ORNL, and ANL are other US national laboratories. For example, on the NERSC-ORNL path, the inter-quartile range was 695 Mbps, with a maximum value of 3.64 Gbps and a minimum value of 758 Mbps. An analysis of the impact of various factors that are potential causes of this variance is also presented.

References

[1]
J. T. Overpeck, G. A. Meehl, S. Bony, and D. R. Easterling, "Climate data challenges in the 21st century," Science, vol. 331, no. 6018, pp. 700--702, 2011. {Online}. Available: http://www.sciencemag.org/content/331/6018/700.abstract
[2]
GridFTP v2 Protocol Description. {Online}. Available: http://www.ggf.org/documents/GFD.47.pdf
[3]
Fast data transfer. {Online}. Available: http://monalisa.cern.ch/FDT/
[4]
bbftp. {Online}. Available: http://doc.in2p3.fr/bbftp/
[5]
S. Sarvotham, R. Riedi, and R. Baraniuk, "Connection-level analysis and modeling of nework traffic," in ACM SIGCOMM Internet Measurement Workshop 2001, November 2001, pp. 99--104.
[6]
On-Demand Secure Circuits and Advance Reservation System (OSCARS). {Online}. Available: http://www.es.net/services/virtual-circuits-oscars/
[7]
J. Postel and J. Reynolds, "File Transfer Protocol," RFC 959 (Standard), Internet Engineering Task Force, Oct. 1985, updated by RFCs 2228, 2640, 2773, 3659, 5797. {Online}. Available: http://www.ietf.org/rfc/rfc959.txt
[8]
GridFTP. {Online}. Available: http://www.globus.org/toolkit/docs/latest-stable/gridftp/
[9]
W. Allcock, J. Bresnahan, R. Kettimuthu, and M. Link, "The Globus striped GridFTP framework and server," in Supercomputing, 2005. Proceedings of the ACM/IEEE SC 2005 Conference, nov. 2005, p. 54.
[10]
M. Veeraraghavan, M. Karol, and G. Clapp, "Optical dynamic circuit services," Communications Magazine, IEEE, vol. 48, no. 11, pp. 109--117, november 2010.
[11]
MRI-R2 Consortium: Development of Dynamic Network System (DYNES). {Online}. Available: http://www.internet2.edu/ion/dynes.html
[12]
A. Lake, J. Vollbrecht, A. Brown, J. Zurawski, D. Robertson, M. Thompson, C. Guok, E. Chaniotakis, and T. Lehman, "Inter-domain Controller (IDC) Protocol Specification," May 2008.
[13]
X. Zhu and M. Veeraraghavan, "Analysis and design of book-ahead bandwidth-sharing mechanisms," Communications, IEEE Transactions on, vol. 56, no. 12, pp. 2156--2165, December 2008.
[14]
C. Anglano and M. Canonico, "Performance analysis of high-performance file-transfer systems for grid applications," Concurrency and Computation: Practice and Experience, vol. 18, no. 8, pp. 807--816, 2006. {Online}. Available: http://dx.doi.org/10.1002/cpe.976
[15]
Y. Zhu, D. Talia, A. Bassi, and P. Massonet, "Evaluation for high volume data transfer mechanisms in grids," Tech. Rep., October 14 2008, CoreGRID Technical Report, Number TR-0178. {Online}. Available: http://www.coregrid.net/mambo/images/stories/TechnicalReports/fellow\%20tr-0178.pdf
[16]
S. Zanikolas and R. Sakellariou, "A taxonomy of grid monitoring systems," Future Generation Computer Systems, vol. 21, no. 1, pp. 163--188, 2005. {Online}. Available: http://www.sciencedirect.com/science/article/pii/S0167739X04001190
[17]
B. Balis, M. Bubak, W. Funika, T. Szepieniec, and R. Wismüller, "Monitoring and performance analysis of grid applications," in Proceedings of the 1st international conference on Computational science: PartI, ser. ICCS'03. Berlin, Heidelberg: Springer-Verlag, 2003, pp. 214--224. {Online}. Available: http://dl.acm.org/citation.cfm?id=1764172.1764199
[18]
Kun-chan Lan and John Heidemann, "A measurement study of correlations of internet flow characteristics," Computer Networks, vol. 50, no. 1, pp. 46--62, 2006.
[19]
Z. Yan, C. Tracy, and M. Veeraraghavan, "A hybrid network traffic engineering system," in Proc. of IEEE 13th High Performance Switching and Routing (HPSR) 2012, June 24--27 2012.
[20]
The Lambda Station Project. {Online}. Available: http://www.lambdastation.org/
[21]
B. Allen, J. Bresnahan, L. Childers, I. Foster, G. Kandaswamy, R. Kettimuthu, J. Kordas, M. Link, S. Martin, K. Pickett, and S. Tuecke, "Software as a service for data scientists," Communications of the ACM, vol. 55, no. 2, Feb. 2012.
[22]
H. Wang, M. Veeraraghavan, R. Karri, and T. Li, "Design of a high-performance RSVP-TE hardware signaling accelerator," Selected Areas in Communications, IEEE Journal on, vol. 23, no. 8, pp. 1588--1595, aug. 2005.
[23]
M. Mellia, A. Carpani, and R. Lo Cigno, "Tstat: TCP statistic and analysis tool," in Quality of Service in Multiservice IP Networks, ser. Lecture Notes in Computer Science, M. Marsan, G. Corazza, M. Listanti, and A. Roveri, Eds. Springer Berlin/Heidelberg, 2003, vol. 2601, pp. 145--157, 10.1007/3-540-36480-3_11. {Online}. Available: http://dx.doi.org/10.1007/3-540-36480-3_11

Cited By

View all
  • (2013)On causes of GridFTP transfer throughput varianceProceedings of the Third International Workshop on Network-Aware Data Management10.1145/2534695.2534701(1-10)Online publication date: 17-Nov-2013

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SC '12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
November 2012
1161 pages
ISBN:9781467308045

Sponsors

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 10 November 2012

Check for updates

Qualifiers

  • Research-article

Conference

SC '12
Sponsor:

Acceptance Rates

SC '12 Paper Acceptance Rate 100 of 461 submissions, 22%;
Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 04 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2013)On causes of GridFTP transfer throughput varianceProceedings of the Third International Workshop on Network-Aware Data Management10.1145/2534695.2534701(1-10)Online publication date: 17-Nov-2013

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media