Performance Evaluation of the Three-Dimensional Finite-Difference Time-Domain(FDTD) Method on Fermi Architecture GPUs

Hou, Kaixi; Zhao, Ying; Huang, Jiumei; Zhang, Lingjie

doi:10.1007/978-3-642-24650-0_40

Kaixi Hou²⁰,
Ying Zhao²⁰,
Jiumei Huang²⁰ &
…
Lingjie Zhang²⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7016))

Included in the following conference series:

International Conference on Algorithms and Architectures for Parallel Processing

1224 Accesses

Abstract

GPUs excel at solving many parallel problems and hence dramatically increase the computation performance. In electrodynamics and many other fields, FDTD method is widely used due to its simplicity, accuracy, and practicability. In this paper, we applied the FDTD method on the Fermi Architecture GPUs, the latest product of NVidia, for a better understanding of Fermi’s new features, such as the double precision support and improved memory hierarchy. Then we make a comparison between the strategies using the shared memory, the traditional optimization method on GPUs, and using L1 cache. Next, the paper provides insights into the disparity of these two strategies. We demonstrate that parallel computations only using L1 cache can reach the similar or even better performance as the traditional optimization method using the shared memory does when the dataset is not too large or the frequency of repeated use of the related data is low.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

GPGPU-based parallel computing applied in the FEM using the conjugate gradient algorithm: a review

Article 22 June 2018

Scalability Issues in FFT Computation

Optimizing Memory-Bound SYMV Kernel on GPU Hardware Accelerators

References

Kane, S.Y.: Numerical Solution of Initial Boundary Value Problems Involving Maxwell’s Equations in Isotropic Media. IEEE Transactions on Antennas and Propagation (1966)
Google Scholar
Allen, T., Susan, C.H.: Computational Electrodynamics: The Finite-Difference Time-Domain Method, 3rd edn. Artech House Inc., MA (2005)
MATH Google Scholar
John N., Ian B., Michael G., Kevin S.: Scalable Parallel Programming with CUDA. Queue, 40–53 (2008)
Google Scholar
NVIDIA Corporation: NVIDIA CUDA C Programming Guide: Version 4.0 (2011)
Google Scholar
Next Generation CUDA Architecture, Code Named Fermi, http://www.nvi-dia.com/object/fermi_architecture.html
Mehmet, F.S., Ihab, E-K., David, A.B., Shawn-Yu, L.: A Novel FDTD Application Featuring Open MP-MPI Hybrid Parallelization. In: Proceedings of International Conference on Parallel Processing, Montreal, Quebec, Canada, pp. 373–379 (2004)
Google Scholar
Hong, S., Kim, H.: An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness. In: Proc. ISCA, pp. 152–163 (2009)
Google Scholar
NVIDIA Corporation: NVidia Fermi Compute Architecture Whitepaper Version 1.1
Google Scholar
Hewlett-Packard Development Company: HP ProLiant SL390s G7 2U half width Server Maintenance and Service Guide
Google Scholar
Jun, L., Tian, Y., Tong, L.: Analysis of the Electromagnetic Characteristics of Coplanar Waveguide by FDTD Method. Testing and Diagnosis (2009)
Google Scholar
Wenhua, Y.: Electromagnetic Simulation Techniques Based on the FDTD Method, pp. 84–85. John Wiley and Sons Inc., Chichester (2009)
Google Scholar
NVIDIA Corporation: Compute Visual Profiler User Guide (2010)
Google Scholar
Phuong Hoai, H., Tsigas, P., Anshus, O.J.: The Synchronization Power of Coalesced Memory Accesses. IEEE Transactions on Parallel and Distributed System, 939–953 (2010)
Google Scholar
CUDA Zone, http://www.nvidia.com/object/cuda_home_new.html
Tesla GPU Computing Solutions for Data Centers, http://www.nvidia.com/object/preconfigured-clusters.html

Download references

Author information

Authors and Affiliations

Information Centre, Beijing University of Chemical Technology, Beijing, China
Kaixi Hou, Ying Zhao, Jiumei Huang & Lingjie Zhang

Authors

Kaixi Hou
View author publications
You can also search for this author in PubMed Google Scholar
Ying Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Jiumei Huang
View author publications
You can also search for this author in PubMed Google Scholar
Lingjie Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Technology, Deakin University, Melbourne Burwood Campus, 221 Burwood Highway, 3125,, Burwood, VIC, Australia
Yang Xiang
ICAR-CNR and University of Calabria, Via P. Bucci 41 C, 87036, Rende (CS), Italy
Alfredo Cuzzocrea
School of Information Technology, Deakin University, Geelong Waurn Ponds Campus, Pigdons Road, 3217, Geelong, VIC, Australia
Michael Hobbs
School of Information Technology, Deakin University, Melbourne Burwood Campus, 221 Burwood Highway, 3125, Burwood, VIC, Australia
Wanlei Zhou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hou, K., Zhao, Y., Huang, J., Zhang, L. (2011). Performance Evaluation of the Three-Dimensional Finite-Difference Time-Domain(FDTD) Method on Fermi Architecture GPUs. In: Xiang, Y., Cuzzocrea, A., Hobbs, M., Zhou, W. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2011. Lecture Notes in Computer Science, vol 7016. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24650-0_40

Download citation

DOI: https://doi.org/10.1007/978-3-642-24650-0_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24649-4
Online ISBN: 978-3-642-24650-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Performance Evaluation of the Three-Dimensional Finite-Difference Time-Domain(FDTD) Method on Fermi Architecture GPUs

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

GPGPU-based parallel computing applied in the FEM using the conjugate gradient algorithm: a review

Scalability Issues in FFT Computation

Optimizing Memory-Bound SYMV Kernel on GPU Hardware Accelerators

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Performance Evaluation of the Three-Dimensional Finite-Difference Time-Domain(FDTD) Method on Fermi Architecture GPUs

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

GPGPU-based parallel computing applied in the FEM using the conjugate gradient algorithm: a review

Scalability Issues in FFT Computation

Optimizing Memory-Bound SYMV Kernel on GPU Hardware Accelerators

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation