Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2837476.2837479acmconferencesArticle/Chapter ViewAbstractPublication PagessepsConference Proceedingsconference-collections
short-paper

Annotatable systrace: an extended Linux ftrace for tracing a parallelized program

Published: 27 October 2015 Publication History

Abstract

Investigation of the runtime behavior is one of the most important processes for performance tuning on a computer system. Profiling tools have been widely used to detect hot- spots in a program. In addition to them, tracing tools produce valuable information especially from parallelized programs, such as thread scheduling, barrier synchronizations, context switching, thread migration, and jitter by interrupts. Users can optimize a runtime system and hardware configuration in addition to a program itself by utilizing the attained in- formation. However, existing tools provide information per process or per function. Finer information like task- or loop- granularity should be required to understand the program behavior more precisely. This paper has proposed a tracing tool, Annotatable Systrace, to investigate runtime execution behavior of a parallelized program based on an extended Linux ftrace. The Annotatable Systrace can add arbitrary an- notations in a trace of a target program. The proposed tool exploits traces from 183.equake, 179.art, and mpeg2enc on Intel Xeon X7560 and ARMv7 as an evaluation. The evaluation shows that the tool enables us to observe load imbalance along with the program execution. It can also generate a trace with the inserted annotations even on a 32-core ma- chine. The overhead of one annotation on Intel Xeon is 1.07 us and the one on ARMv7 is 4.44 us, respectively.

References

[1]
Eileen Kramer, and John T. Stasko: The Visualization of Parallel Systems: An Overview, Journal of Parallel and Distributed Computing, 1993
[2]
Google: Android Systrace, http://developer.android.com/tools/help/systrace.html, 2015
[3]
Jake Edge: A look at ftrace, http://lwn.net/Articles/322666/, 2009
[4]
Steven Rostedt: Debugging the kernel using Ftrace - part1, http://lwn.net/Articles/365835/, 2009
[5]
Intel Corporation: Intel VTune Amplifier XE 2015, https://software.intel.com/en-us/intel-vtune-amplifier-xe
[6]
Stephane Eranian, Eric Gouriou, Tipp Moseley, Willem de Bruijn: Tutorial - Linux kernel profiling with perf http://perf.wiki.kernel.org/index.php/Tutorial, 2015
[7]
Ariane Keller: Kernel Space - User Space Interfaces, http://people.ee.ethz.ch/ arkeller/linux/kernel user space howto.html
[8]
Obata M., Shirako J., Kaminaga H., Ishizaka K., Kasahara H.: Hierarchical parallelism control for multigrain parallell processing, LCPC 2002
[9]
SPEC CPU 2000: https://www.spec.org/
[10]
Chunho Lee, Miodrag Potkonjak, William H. Mangione-Smith: MedhiaBench: A Tool for Evaluating and Synthesizing Multimedia and Communications Systems, 1997

Cited By

View all
  • (2024)A Novel Database Acceleration Technology for Full Table ScansIEEE Access10.1109/ACCESS.2024.345210412(127532-127544)Online publication date: 2024
  • (2022)Hardware-assisted mechanisms to enforce control flow integrityJournal of Systems Architecture: the EUROMICRO Journal10.1016/j.sysarc.2022.102644130:COnline publication date: 1-Sep-2022
  • (2021)QIHE: Quantifying the Importance of Hardware Events with Respect to Performance of Mobile ProcessorsProceedings of the 6th International Conference on Big Data and Computing10.1145/3469968.3469999(186-191)Online publication date: 22-May-2021

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SEPS 2015: Proceedings of the 2nd International Workshop on Software Engineering for Parallel Systems
October 2015
70 pages
ISBN:9781450339100
DOI:10.1145/2837476
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 October 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Automatic parallelization
  2. Linux
  3. Multicore
  4. Sys- trace
  5. ftrace

Qualifiers

  • Short-paper

Conference

SPLASH '15
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)1
Reflects downloads up to 31 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Novel Database Acceleration Technology for Full Table ScansIEEE Access10.1109/ACCESS.2024.345210412(127532-127544)Online publication date: 2024
  • (2022)Hardware-assisted mechanisms to enforce control flow integrityJournal of Systems Architecture: the EUROMICRO Journal10.1016/j.sysarc.2022.102644130:COnline publication date: 1-Sep-2022
  • (2021)QIHE: Quantifying the Importance of Hardware Events with Respect to Performance of Mobile ProcessorsProceedings of the 6th International Conference on Big Data and Computing10.1145/3469968.3469999(186-191)Online publication date: 22-May-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media