Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

An out-of-core sparse symmetric-indefinite factorization method

Published: 01 September 2006 Publication History

Abstract

We present a new out-of-core sparse symmetric-indefinite factorization algorithm. The most significant innovation of the new algorithm is a dynamic partitioning method for the sparse factor. This partitioning method results in very low I/O traffic and allows the algorithm to run at high computational rates, even though the factor is stored on a slow disk. Our implementation of the new code compares well with both high-performance in-core sparse symmetric-indefinite codes and a high-performance out-of-core sparse Cholesky code.

References

[1]
Amestoy, P. R., Duff, I. S., Koster, J., and L'Excellent, J. 2001. A fully asynchronous multifrontal solver using distributed dynamic scheduling. SIAM J. Matrix Anal. Appl. 23, 15--41.
[2]
Amestoy, P. R., Duff, I. S., L'Excellent, J., and Koster, J. 2003. MUltifrontal massively parallel solver (MUMPS version 4.3) user's guide. Available online from http://www.enseeiht.fr/lima/apo/MUMPS/doc.html.
[3]
Amestoy, P. R., Duff, I. S., and L'Excellent, J.-Y. 2000. Multifrontal parallel distributed symmetric and unsymmetric solvers. Computer Methods Appl. Mechanics Eng. 184, 501--520.
[4]
Ashcraft, C. and Grimes, R. 1999. SPOOLES: An object-oriented sparse matrix library. In Proceedings of the 9th SIAM Conference on Parallel Processing for Scientific Computing San Antonio, Tx. 10 pages on CD-ROM.
[5]
Ashcraft, C., Grimes, R. G., and Lewis, J. G. 1998. Accurate symmetric indefinite linear equation solvers. SIAM J. Matrix Anal. Appl. 20, 513--561.
[6]
Bunch, J. R. and Kaufman, L. 1977. Some stable methods for calculating inertia and solving symmetric indefinite linear systems. Math. Comput. 31, 163--179.
[7]
Bunch, J. R., Kaufman, L., and Parlett, B. N. 1976. Decomposition of a symmetric matrix. Numer. Math. 27, 95--109.
[8]
Dobrian, F. and Pothen, A. 2006. Oblio: Design and performance. In Applied Parallel Computing: State-of-the-Art in Scientific Computing; Proceedings of the 7th International Workshop (PARA 2004) Lyngby, Denmark, held in June 2004. Lecture Notes in Computer Science, vol. 3732. Springer Verlag, 758--767.
[9]
Duff, I. and Reid, J. 1983a. The multifrontal solution of indefinite sparse symmetric linear equations. ACM Trans. Math. Softw. 9, 302--325.
[10]
Duff, I. S., Erisman, A. M., and Reid, J. K. 1986. Direct Methods for Sparse Matrices. Oxford University Press, Oxford, UK.
[11]
Duff, I. S. and Reid, J. K. 1982. MA27: A set of Fortran subroutines for solving sparse symmetric sets of linear equations. Tech. Rep. AERE R10533, AERE Harwell, Didcot, Oxon, UK.
[12]
Duff, I. S. and Reid, J. K. 1983b. The multifrontal solution of indefinite sparse symmetric linear systems. ACM Trans. Math. Softw. 9, 302--325.
[13]
Duff, I. S. and Scott, J. A. 1999. A frontal code for the solution of sparse positive-definite symmetric systems arising from finite-element applications. ACM Trans. Math. Soft. 25, 4 (Dec.), 404--424.
[14]
Gilbert, J. R. and Toledo, S. 1999. High-Performance out-of-core sparse LU factorization. In Proceedings of the 9th SIAM Conference on Parallel Processing for Scientific Computing San Antonio, Tx. 10 pages on CDROM.
[15]
Gould, N. I. M. and Scott, J. A. 2004. A numerical evaluation of HSL packages for the direct solution of large sparse, symmetric linear systems of equations. ACM Trans. Math. Soft. 30, 3 (Sept.), 300--325.
[16]
Higham, N. J. 2002. Accuracy and Stability of Numerical Algorithms, 2nd ed. Society for Industrial and Applied Mathematics, Philadelphia, PA.
[17]
Irons, B. M. 1970. A frontal solution scheme for finite element analysis. Int. J. Numeri. Methods Eng. 2, 5--32.
[18]
Karypis, G. and Kumar, V. 1998. A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J. Sci. Comput. 20, 359--392.
[19]
Liu, J. W. H. 1990. The role of elimination trees in sparse factorization. SIAM J. Matrix Anal. Appl. 11, 134--172.
[20]
Liu, J. W. H. 1992. The multifrontal method for sparse matrix solution: Theory and practice. SIAM Rev. 34, 1, 82--109.
[21]
Natanzon, A., Shamir, R., and Sharan, R. 1998. A polynomial approximation algorithm for the minimum fill-in problem. In Proceedings of the 13th Annual ACM Symposium on Theory of Computing (STOC '98). ACM Press, New York, 41--47.
[22]
Ng, E. G. and Peyton, B. W. 1993. Block sparse Cholesky algorithms on advanced uniprocessor computers. SIAM J. Sci. Comput. 14, 5, 1034--1056.
[23]
Röllin, S. and Schenk, O. 2006. Maximum-weighted matching strategies and the application to symmetric indefinite systems. In Applied Parallel Computing: State-of-the-Art in Scientific Computing; Proceedings of the 7th International Workshop (PARA 2004), Lyngby, Denmark, held in June 2004. Lecture Notes in Computer Science, vol. 3732. Springer Verl. 808--817.
[24]
Rothberg, E. and Gupta, A. 1991. Efficient sparse matrix factorization on high-performance workstations---Exploiting the memory hierarchy. ACM Trans. Math. Softw. 17, 3, 313--334.
[25]
Rothberg, E. and Schreiber, R. 1999. Efficient methods for out-of-core sparse Cholesky factorization. SIAM J. Sci. Comput. 21, 129--144.
[26]
Rotkin, V. and Toledo, S. 2004. The design and implementation of a new out-of-core sparse Cholesky factorization method. ACM Trans. Math. Softw. 30, 1, 19--46.
[27]
Schenk, O. and Gärtner, K. 2004. On fast factorization pivoting methods for sparse symmetric indefinite systems. Tech. Rep. CS-2004-004, Department of Computer Science, University of Basel. Submitted to Electro. Trans. Numer. Anal.
[28]
Schreiber, R. 1982. A new implementation of sparse Gaussian elimination. ACM Trans. Math. Softw. 8, 256--276.
[29]
Stewart, G. W. 2001. Matrix Algorithms, vol. 2: Eigensystems. SIAM, Philadelphia, PA.

Cited By

View all
  • (2017)Non-GPU-resident symmetric indefinite factorizationConcurrency and Computation: Practice & Experience10.1002/cpe.401229:5(n/a-n/a)Online publication date: 10-Mar-2017
  • (2016)A survey of direct methods for sparse linear systemsActa Numerica10.1017/S096249291600007625(383-566)Online publication date: 23-May-2016
  • (2014)Trading cache hit rate for memory performanceProceedings of the 23rd international conference on Parallel architectures and compilation10.1145/2628071.2628082(357-368)Online publication date: 24-Aug-2014
  • Show More Cited By

Recommendations

Reviews

Anthony Joseph Duben

When the dimensionality of solving a problem for a linear system AX = B exceeds the internal memory of a computer, it needs to be partitioned so that part can be brought into memory and the remainder can stay in external storage until required. When matrix A is sparse, the dimensionality of the problem may increase even further, and additional effort must be applied to store and efficiently access the elements of the matrix. Since input/output (IO) requests to external storage are two orders of magnitude slower than requests to main memory, reduction of the penalty of disk IO is absolutely necessary. Meshar, Irony, and Toledo claim the first out-of-core sparse symmetric-indefinite matrix factorization algorithm to be fully described in the open literature. They also claim that it is the first left-looking method to be fully described. It has very low disk IO traffic, supporting high computation speeds. The method is direct, and more robust than iterative methods. It is out-of-core, meaning that the triangular factors in sparse matrix management are kept on disk, avoiding the need for extra memory. The paper is fairly long (26 pages), but it is thorough and documents its claims well. After the introductory section, the development follows logically. The second section provides the background for sparse factorizations, and for the data structures that need to be constructed and used. Left-looking factorization is discussed in the third section. This section is important, in that it amplifies one of the main claims of the paper. The fourth section describes pivot admissibility and search, needed for reducing the sparse matrix. The out-of-core factorization algorithm itself is the topic of the fifth section. The last section presents a lengthy performance evaluation of the method compared to other software packages (TAUCS, multi-frontal massively parallel sparse direct solver (MUMPS), and parallel sparse direct linear solver (PARDISO)). The matrices selected for testing approximated real data and problems. This is a seminal paper, which should garner plenty of attention in the future. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Mathematical Software
ACM Transactions on Mathematical Software  Volume 32, Issue 3
September 2006
133 pages
ISSN:0098-3500
EISSN:1557-7295
DOI:10.1145/1163641
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 September 2006
Published in TOMS Volume 32, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Out-of-Core
  2. symmetric-indefinite

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)1
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2017)Non-GPU-resident symmetric indefinite factorizationConcurrency and Computation: Practice & Experience10.1002/cpe.401229:5(n/a-n/a)Online publication date: 10-Mar-2017
  • (2016)A survey of direct methods for sparse linear systemsActa Numerica10.1017/S096249291600007625(383-566)Online publication date: 23-May-2016
  • (2014)Trading cache hit rate for memory performanceProceedings of the 23rd international conference on Parallel architectures and compilation10.1145/2628071.2628082(357-368)Online publication date: 24-Aug-2014
  • (2012)Combinatorial Problems in Solving Linear SystemsCombinatorial Scientific Computing10.1201/b11644-3(21-68)Online publication date: 29-Mar-2012
  • (2011)BibliographyPolygon Mesh Processing10.1201/b10688-12(203-225)Online publication date: 14-Apr-2011
  • (2011)Elements of geometry processingSIGGRAPH Asia 2011 Courses10.1145/2077434.2077439(1-48)Online publication date: 12-Dec-2011
  • (2010)Spectral mesh processingACM SIGGRAPH 2010 Courses10.1145/1837101.1837109(1-312)Online publication date: 26-Jul-2010
  • (2010)Statistical Model Computation with UDFsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2010.4422:12(1752-1765)Online publication date: 1-Dec-2010
  • (2009)Spectral mesh processingACM SIGGRAPH ASIA 2009 Courses10.1145/1665817.1665834(1-47)Online publication date: 16-Dec-2009
  • (2009)Overview – Parallel Computing: Numerics, Applications, and TrendsParallel Computing10.1007/978-1-84882-409-6_1(1-42)Online publication date: 2009
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media