Abstract
The Reduced Complexity Many-Core architecture (RC/MC) targets to simplify timing analysis by increasing the predictability of all components. Since shared memory interference is a major source of pessimism in many-core systems, fine-grained message passing between small cores with private memories is used instead of a global shared memory.
In this paper, the RC/MC architecture is presented and evaluated by three models: a VHDL model that can be used to synthesise prototypes with up to \(6\times 6\) cores on an FPGA; a simulation model written in C that can be used for cycle-accurate simulation of more than 4096 cores; and a timing model for static timing analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Altera uses the term Adaptive Logic Module (ALM) for their elementary logic block, basically a lookup table with 6 inputs and 2 outputs (6-LUT). One ALM is equivalent to approximately 2.5 lookup tables with 4 inputs and 1 output (4-LUT).
References
Bailey, D.H., et al.: The NAS parallel benchmarks. Int. J. High Perform. Comput. Appl. 5(3), 63–73 (1991)
Ballabriga, C., Cassé, H., Rochange, C., Sainrat, P.: OTAWA: an open toolbox for adaptive WCET analysis. In: Min, S.L., Pettit, R., Puschner, P., Ungerer, T. (eds.) SEUS 2010. LNCS, vol. 6399, pp. 35–46. Springer, Heidelberg (2010). doi:10.1007/978-3-642-16256-5_6
Bell, S., et al.: Tile64-processor: a 64-core soc with mesh interconnect. In: International Solid-State Circuits Conference (ISSCC), pp. 88–598 (2008)
Berg, C., Engblom, J., Wilhelm, R.: Requirements for and design of a processor with predictable timing. In: Perspectives Workshop: Design of Systems with Predictable Behaviour. No. 03471 in Dagstuhl Seminar Proceedings (2004)
d’Ausbourg, B., Boyer, M., Noulard, E., Pagetti, C.: Deterministic execution on many-core platforms: application to the SCC. In: Many-core Applications Research Community Symposium (MARC), December 2011
de Dinechin, B.D., et al.: A distributed run-time environment for the Kalray MPPA-256 integrated manycore processor. Procedia Comput. Sci. 18, 1654–1663 (2013)
Dongarra, J., Heroux, M.A.: Toward a new metric for ranking high performance computing systems. Sandia Report, SAND2013-4744 312 (2013)
Fan, D., et al.: Godson-T: an efficient many-core processor exploring thread-level parallelism. IEEE Micro 32(2), 38–47 (2012)
Frieb, M., Stegmeier, A., Mische, J., Ungerer, T.: Employing MPI collectives for timing analysis on embedded multi-cores. In: 16th International Workshop on Worst-Case Execution Time Analysis (2016)
Goossens, K., et al.: Virtual execution platforms for mixed-time-criticality systems: the CompSOC architecture and design flow. ACM SIGBED Rev. 10(3), 23–34 (2013)
Gorlatch, S.: Send-receive considered harmful: myths and realities of message passing. ACM Trans. Program. Lang. Syst. (TOPLAS) 26(1), 47–56 (2004)
Corporation, I.: Intel Xeon Phi Coprocessor System Software Developers Guide, 2.03 edn., November 2012
Lee, Y., et al.: A 45 nm 1.3 GHz 16.7 double-precision GFLOPS/W RISC-V processor with vector accelerators. In: European Solid State Circuits Conference (ESSCIRC), pp. 199–202. IEEE (2014)
Mattson, T.G., et al.: The 48-core SCC processor: the programmer’s view. In: International Conference for High Performance Computing, Networking, Storage and Analysis (SC), pp. 1–11 (2010)
Message Passing Interface Forum. University of Tennesse: MPI: a Message-Passing Interface Standard. Version 3.1, June 2015
Mische, J., Ungerer, T.: Guaranteed service independent of the task placement in NoCs with torus topology. In: Proceedings of the 22nd International Conference on Real-Time Networks and Systems, p. 151. ACM (2014)
Mische, J., Metzlaff, S., Ungerer, T.: Distributed memory on chip - bringing together low power and real-time. In: Workshop on Reconciling Performance and Predictability (2014)
Mische, J., Ungerer, T.: Low power flitwise routing in an unidirectional torus with minimal buffering. In: International Workshop on Network on Chip Architectures (NoCArc), pp. 63–68 (2012)
Olofsson, A.: Epiphany-V: a 1024 processor 64-bit RISC system-on-chip. Technical report, Adapteva Inc. https://www.parallella.org/wp-content/uploads/2016/10/e5_1024core_soc.pdf
Reineke, J.: Caches in WCET analysis. Universität des Saarlandes, Saarbrücken, PhD Thesis (2008)
Rochange, C., et al.: WCET analysis of a parallel 3D multigrid solver executed on the MERASA multi-core. In: OASIcs-OpenAccess Series in Informatics, vol. 15. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik (2010)
Schoeberl, M., et al.: T-CREST: time-predictable multi-core architecture for embedded systems. J. Syst. Archit. 61(9), 449–471 (2015)
Stegmeier, A., Frieb, M., Mische, J., Ungerer, T.: WCTT bounds for MPI primitives in the PaterNoster NoC. In: 14th International Workshop on Real-Time Networks (2016)
Ungerer, T., et al.: MERASA: multicore execution of hard real-time applications supporting analyzability. IEEE Micro 30(5), 66–75 (2010)
Ungerer, T., et al.: Parallelizing industrial hard real-time applications for the parMERASA multicore. ACM Trans. Embed. Comput. Syst. (TECS) 15(3), 53 (2016)
Waterman, A., Lee, Y., Patterson, D.A., Asanovi, K.: The RISC-V instruction set manual, volume I: user-level ISA, version 2.1. Technical report UCB/EECS-2016-118, EECS Department, University of California, Berkeley, May 2016. http://www2.eecs.berkeley.edu/Pubs/TechRpts/2016/EECS-2016-118.html
Wilhelm, R., et al.: Memory hierarchies, pipelines, and buses for future architectures in time-critical embedded systems. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 28(7), 966–978 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Mische, J., Frieb, M., Stegmeier, A., Ungerer, T. (2017). Reduced Complexity Many-Core: Timing Predictability Due to Message-Passing. In: Knoop, J., Karl, W., Schulz, M., Inoue, K., Pionteck, T. (eds) Architecture of Computing Systems - ARCS 2017. ARCS 2017. Lecture Notes in Computer Science(), vol 10172. Springer, Cham. https://doi.org/10.1007/978-3-319-54999-6_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-54999-6_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54998-9
Online ISBN: 978-3-319-54999-6
eBook Packages: Computer ScienceComputer Science (R0)