Abstract
In the dense nonsymmetric eigenvalue problem, work has focused on the Hessenberg reduction and QR iteration, using efficient algorithms and fast, Level 3 BLAS. Comparatively, computation of eigenvectors performs poorly, limited to slow, Level 2 BLAS performance with little speedup on multi-core systems. It has thus become a dominant cost in the solution of the eigenvalue problem. To address this, we present improvements for the eigenvector computation to use Level 3 BLAS and parallelize the triangular solves, achieving good parallel scaling and accelerating the overall eigenvalue problem more than three-fold.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bischof, C.: A summary of block schemes for reducing a general matrix to Hessenberg form. Technical report, ANL/MCS-TM-175, Argonne National Lab (1993)
Bischof, C., Van Loan, C.: The WY representation for products of Householder matrices. SIAM J. Sci. Stat. Comput. 8(1), s2–s13 (1987)
Braman, K., Byers, R., Mathias, R.: The multishift QR algorithm. part I: maintaining well-focused shifts and level 3 performance. SIAM J. Matrix Anal. Appl. 23(4), 929–947 (2002)
Braman, K., Byers, R., Mathias, R.: The multishift QR algorithm. part II: aggressive early deflation. SIAM J. Matrix Anal. Appl. 23(4), 948–973 (2002)
Golub, G., Van Loan, C.: Matrix Computations, 3rd edn. Johns Hopkins, Baltimore (1996)
Kågström, B., Kressner, D., Shao, M.: On aggressive early deflation in parallel variants of the QR algorithm. In: Jónasson, K. (ed.) PARA 2010, Part I. LNCS, vol. 7133, pp. 1–10. Springer, Heidelberg (2012)
Karlsson, L., Kågström, B.: Parallel two-stage reduction to Hessenberg form using dynamic scheduling on shared-memory architectures. Parallel Comput. 37(12), 771–782 (2011)
McCalpin, J.D.: Memory bandwidth and machine balance in current high performance computers. In: IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter, December 1995
Tomov, S., Nath, R., Dongarra, J.: Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing. Parallel Comput. 36(12), 645–654 (2010)
Acknowledgments
The results were obtained in part with the financial support of the Russian Scientific Fund, Agreement N14-11-00190; the National Science Foundation, U.S. Department of Energy, Intel, NVIDIA, and AMD.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Gates, M., Haidar, A., Dongarra, J. (2015). Accelerating Computation of Eigenvectors in the Dense Nonsymmetric Eigenvalue Problem. In: Daydé, M., Marques, O., Nakajima, K. (eds) High Performance Computing for Computational Science -- VECPAR 2014. VECPAR 2014. Lecture Notes in Computer Science(), vol 8969. Springer, Cham. https://doi.org/10.1007/978-3-319-17353-5_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-17353-5_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-17352-8
Online ISBN: 978-3-319-17353-5
eBook Packages: Computer ScienceComputer Science (R0)