Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3491396.3506545acmconferencesArticle/Chapter ViewAbstractPublication PagesiceaConference Proceedingsconference-collections
research-article

An automatic many-core code generation method and its implementation under Sunway environment

Published: 07 January 2022 Publication History
  • Get Citation Alerts
  • Abstract

    The parallel programming model of Sunway supercomputer based on accelerated thread library plays an important role on the acceleration performance of master-slave core parallelism. At present, two thread-based programming libraries, OpenACC and Athread, are provided. Although OpenACC is convenient, its parallel efficiency is not high and it is disadvantageous for deep optimization. While Athread is flexible and easy to deep optimization, it has a huge workload compared with OpenACC. This paper is based on a three-tier program template in which the main program calls the master program and then the master program calls the slave program, and the Rust language is used for lexical and grammatical analysis. Through the above steps, a method that can automatically convert the source program into Athread-format code is proposed, and some useful optimization methods are also integrated, such as parameters passed by a structure, local static variables and slave-core partition parallelism. Finally, a prototype of conversion tool from Fortran and C code to Athread code is designed and implemented. This method can avoid the vast majority of errors in code writing and greatly improve the efficiency of many-core work for researchers.

    References

    [1]
    Liu X, Chen D. Parallel program design and optimization on "Sunway. TaihuLight"[M]. Wuxi: National research center of parallel computer engineering and technology, 2017.
    [2]
    Jiang X. A Compiler for Automatic Translating OpenACC program to Intel multicore and manycore platform [D]. University of Science and Technology of China, 2015.
    [3]
    Jiang X, An H, Liang W, Zhang A, Li F. Automatic OpenACC to Intel Offload and Optimization for MIC [J]. Journal of Chinese Computer Systems, 2016, 37(04):824--829.
    [4]
    Da Cai. Research on OpenACC-Based Automatic Parallelization Technology [D]. China University of Mining and Technology, 2016.
    [5]
    Holewinski J A. Automatic Code Generation for Stencil Computations on GPU Architectures[M]. The Ohio State University, 2012.
    [6]
    Fu H, Liao J, Ding N, et al. Redesigning CAM-SE for peta-scale climate modeling performance and ultra-high resolution on Sunway TaihuLight[C]//Proceedings of the International Conference for High Performance Computing, Networking. Storage and Analysis. 2017: 1--12.
    [7]
    Yao W, Chen J, Su Z, Yu Y, Liao C, An H. Porting and optimizing of NAMD on SunwayTaihuLight System [J]. Computer Engineering & Science, 2017, 39(06):1022--1030.
    [8]
    Zhu X, Zeng Y, Wei Y, et al. An auto code generator for stencil on SW26010[C]//2019 IEEE 21st International Conference on High Performance Computing and Communications; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems (HPCC/SmartCity/DSS). IEEE, 2019: 182--190.
    [9]
    Zhuang Y, Guo Q, Zhang J, Zeng Y. Large Scalability Method of 2D Computation on Shenwei Many-core [J]. Computer Science, 2020, 47(08):87--92.
    [10]
    Balasubramanian A, Panda A, Baranowski M S, et al. System Programming in Rust: Beyond Safety[J]. ACM SIGOPS Operating Systems Review, 2017, 51(1):94--9.
    [11]
    Honghui Shang, Xin Chen, Xingyu Gao, Rongfen Lin, Lifang Wang, Fang Li. Qian Xiao, Lei Xu, Qiang Sun, Leilei Zhu, Fei Wang, Yunquan Zhang, and Haifeng Song. 2021. TensorKMC: kinetic Monte Carlo simulation of 50 trillion atoms driven by deep learning on a new generation of Sunway supercomputer. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '21). Association for Computing Machinery, New York, NY, USA, Article 73, 1--14.

    Index Terms

    1. An automatic many-core code generation method and its implementation under Sunway environment

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        ACM ICEA '21: Proceedings of the 2021 ACM International Conference on Intelligent Computing and its Emerging Applications
        December 2021
        241 pages
        ISBN:9781450391603
        DOI:10.1145/3491396
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 07 January 2022

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Athread library
        2. Code conversion
        3. Heterogeneous hybrid parallelism
        4. Sunway many-core processor

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Funding Sources

        • Qilu University of Technology (Shandong Academy of Sciences)

        Conference

        ACM ICEA '21
        Sponsor:

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 44
          Total Downloads
        • Downloads (Last 12 months)13
        • Downloads (Last 6 weeks)5
        Reflects downloads up to 27 Jul 2024

        Other Metrics

        Citations

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media