Parallelization of sparse cholesky factorization on an SMP cluster

Satoh, Shigehisa; Kusano, Kazuhiro; Tanaka, Yoshio; Matsuda, Motohiko; Sato, Mitsuhisa

doi:10.1007/BFb0100582

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1593)

Included in the following conference series:

International Conference on High-Performance Computing and Networking

Abstract

In this paper, we present parallel implementations of the sparse Cholesky factorization kernel from the SPLASH-2 programs and use them to evaluate the performance of a Pentium Pro based SMP cluster. Solaris threads provide intranode parallelism, and remote memory operations provide internode communication. Sparse Cholesky factorization is a typical irregular application with a high communication-to-computation ratio and no global synchronization between steps. Because synchronization between nodes reduces performance significantly, we parallelized the kernel efficiently by using asynchronous message handling instead of lock-based mutual exclusion between nodes. We also found that the mapping of processes to processors on an SMP cluster affects performance, especially when the communication latency cannot be hidden.
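
The contrast between asynchronous message handling and lock-based mutual exclusion can be illustrated with a minimal sketch. It is not the authors' implementation: Python threads stand in for cluster nodes, a `queue.Queue` stands in for remote memory writes, and the names `Node`, `send_update`, and `run_demo` are hypothetical. The idea shown is that only the owner of a block ever modifies it, so senders post update messages and continue without acquiring a lock on the owner's data.

```python
import queue
import threading


class Node:
    """Hypothetical node that owns a set of matrix blocks (illustrative only)."""

    def __init__(self, node_id, owned_blocks):
        self.node_id = node_id
        self.blocks = dict(owned_blocks)  # block id -> value; modified only by the owner
        self.inbox = queue.Queue()        # stand-in for remote memory writes (assumption)
        self.handler = threading.Thread(target=self._serve)

    def _serve(self):
        # The owner applies updates one at a time in arrival order.
        # No other thread touches self.blocks, so no inter-node lock is needed.
        while True:
            msg = self.inbox.get()
            if msg is None:               # sentinel: no more updates will arrive
                break
            block_id, delta = msg
            self.blocks[block_id] -= delta

    def send_update(self, block_id, delta):
        # The sender enqueues the update and returns immediately.
        self.inbox.put((block_id, delta))


def run_demo():
    owner = Node(0, {"B00": 10.0, "B10": 4.0})
    owner.handler.start()

    def producer(updates):
        for block_id, delta in updates:
            owner.send_update(block_id, delta)

    # Three sender threads, each posting four updates to each block.
    updates = [("B00", 1.0), ("B10", 0.5)] * 4
    senders = [threading.Thread(target=producer, args=(updates,)) for _ in range(3)]
    for s in senders:
        s.start()
    for s in senders:
        s.join()

    owner.inbox.put(None)                 # all updates are queued before the sentinel
    owner.handler.join()
    return owner.blocks


if __name__ == "__main__":
    print(run_demo())
```

In this sketch the senders never wait for the owner, which mirrors the motivation stated in the abstract: avoiding synchronization between nodes. A real cluster implementation would also need to handle message buffering and termination detection, which are omitted here.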

Author information

Authors and Affiliations

Real World Computing Partnership, Tsukuba Research Center, 1-6-1 Takezono, 305-0032, Tsukuba, Ibaraki, Japan
Shigehisa Satoh, Kazuhiro Kusano, Yoshio Tanaka, Motohiko Matsuda & Mitsuhisa Sato

Editor information

Peter Sloot, Marian Bubak, Alfons Hoekstra, Bob Hertzberger

About this paper

Cite this paper

Satoh, S., Kusano, K., Tanaka, Y., Matsuda, M., Sato, M. (1999). Parallelization of sparse cholesky factorization on an SMP cluster. In: Sloot, P., Bubak, M., Hoekstra, A., Hertzberger, B. (eds) High-Performance Computing and Networking. HPCN-Europe 1999. Lecture Notes in Computer Science, vol 1593. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0100582

DOI: https://doi.org/10.1007/BFb0100582
Published: 17 November 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65821-4
Online ISBN: 978-3-540-48933-7
eBook Packages: Springer Book Archive
