LAPACK Users' guide (third ed.)
November 1999
Publisher:
  • Society for Industrial and Applied Mathematics
  • 3600 University City Science Center Philadelphia, PA
  • United States
ISBN: 978-0-89871-447-0
Published: 01 November 1999
Pages: 430
Abstract

No abstract available.

Cited By

  1. Struski Ł, Morkisz P, Spurek P, Bernabeu S and Trzciński T (2024). Efficient GPU implementation of randomized SVD and its applications, Expert Systems with Applications: An International Journal, 248:C, Online publication date: 15-Aug-2024.
  2. Zhang Z and Zou Q (2024). Data-driven robust iterative learning control of linear systems, Automatica (Journal of IFAC), 164:C, Online publication date: 1-Jun-2024.
  3. ACM
    Huang B, Lyubomirsky S, Li Y, He M, Smith G, Tambe T, Gaonkar A, Canumalla V, Cheung A, Wei G, Gupta A, Tatlock Z and Malik S (2024). Application-level Validation of Accelerator Designs Using a Formal Software/Hardware Interface, ACM Transactions on Design Automation of Electronic Systems, 29:2, (1-25), Online publication date: 31-Mar-2024.
  4. Pourafzal A, Skrabanek P, Cheffena M, Yildirim S and Roi-Taravella T (2024). Low complexity subspace approach for unbiased frequency estimation of a complex single-tone, Digital Signal Processing, 145:C, Online publication date: 1-Feb-2024.
  5. ACM
    Roman J, Alvarruiz F, Campos C, Dalcin L, Jolivet P and Lamas Daviña A (2023). Improvements to SLEPc in Releases 3.14–3.18, ACM Transactions on Mathematical Software, 49:3, (1-11), Online publication date: 30-Sep-2023.
  6. ACM
    Klein C and Strzodka R (2023). Tridigpu: A GPU Library for Block Tridiagonal and Banded Linear Equation Systems, ACM Transactions on Parallel Computing, 10:1, (1-33), Online publication date: 31-Mar-2023.
  7. Thien N, Wakabayashi Y, Iwai K and Nishiura T (2023). Inter-Frequency Phase Difference for Phase Reconstruction Using Deep Neural Networks and Maximum Likelihood, IEEE/ACM Transactions on Audio, Speech and Language Processing, 31, (1667-1680), Online publication date: 1-Jan-2023.
  8. Chee J, Renz M, Damle A and De Sa C Model preserving compression for neural networks Proceedings of the 36th International Conference on Neural Information Processing Systems, (38060-38074)
  9. ACM
    Herholz P, Tang X, Schneider T, Kamil S, Panozzo D and Sorkine-Hornung O (2022). Sparsity-Specific Code Optimization using Expression Trees, ACM Transactions on Graphics, 41:5, (1-19), Online publication date: 31-Oct-2022.
  10. ACM
    Schwarz A (2022). Robust level-3 BLAS Inverse Iteration from the Hessenberg Matrix, ACM Transactions on Mathematical Software, 48:3, (1-30), Online publication date: 30-Sep-2022.
  11. ACM
    Chang T, Watson L, Larson J, Neveu N, Thacker W, Deshpande S and Lux T (2022). Algorithm 1028: VTMOP: Solver for Blackbox Multiobjective Optimization Problems, ACM Transactions on Mathematical Software, 48:3, (1-34), Online publication date: 30-Sep-2022.
  12. ACM
    Heavner N, Igual F, Quintana-Ortí G and Martinsson P (2022). Algorithm 1022: Efficient Algorithms for Computing a Rank-Revealing UTV Factorization on Parallel Computing Architectures, ACM Transactions on Mathematical Software, 48:2, (1-42), Online publication date: 30-Jun-2022.
  13. ACM
    Alperen A, Afibuzzaman M, Rabbi F, Ozkaya M, Catalyurek U and Aktulga H An Evaluation of Task-Parallel Frameworks for Sparse Solvers on Multicore and Manycore CPU Architectures Proceedings of the 50th International Conference on Parallel Processing, (1-11)
  14. ACM
    Song R, Song X, Zhang Y and Ma Y Experiment in Parallel Computing for the Julia Programming Language Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence, (1-7)
  15. Ramon-Cortes C, Amela R, Ejarque J, Clauss P and Badia R (2021). AutoParallel, International Journal of High Performance Computing Applications, 34:6, (659-675), Online publication date: 1-Nov-2020.
  16. Marchand P, Claeys X, Jolivet P, Nataf F and Tournier P (2020). Two-level preconditioning for h-version boundary element approximation of hypersingular operator with GenEO, Numerische Mathematik, 146:3, (597-628), Online publication date: 1-Nov-2020.
  17. Lopes M, Erichson N and Mahoney M Error estimation for sketched SVD via the bootstrap Proceedings of the 37th International Conference on Machine Learning, (6382-6392)
  18. Li F, Ye Y, Tian Z and Zhang X (2019). CPU versus GPU: which can perform matrix computation faster—performance comparison for basic linear algebra subprograms, Neural Computing and Applications, 31:8, (4353-4365), Online publication date: 1-Aug-2019.
  19. Birgin E and Martínez J (2019). A Newton-like method with mixed factorizations and cubic regularization for unconstrained minimization, Computational Optimization and Applications, 73:3, (707-753), Online publication date: 1-Jul-2019.
  20. ACM
    Charara A, Keyes D and Ltaief H (2019). Batched Triangular Dense Linear Algebra Kernels for Very Small Matrix Sizes on GPUs, ACM Transactions on Mathematical Software, 45:2, (1-28), Online publication date: 30-Jun-2019.
  21. ACM
    Kurzak J, Gates M, Charara A, YarKhan A and Dongarra J Least squares solvers for distributed-memory machines with GPU accelerators Proceedings of the ACM International Conference on Supercomputing, (117-126)
  22. ACM
    Han D, Nam Y, Lee J, Park K, Kim H and Kim M DistME Proceedings of the 2019 International Conference on Management of Data, (759-774)
  23. ACM
    Gaihre A, Wu Z, Yao F and Liu H XBFS Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing, (121-131)
  24. ACM
    Pittino F, Bonfà P, Bartolini A, Affinito F, Benini L and Cavazzoni C Prediction of Time-to-Solution in Material Science Simulations Using Deep Learning Proceedings of the Platform for Advanced Scientific Computing Conference, (1-9)
  25. Bylina B and Bylina J (2019). The Parallel Tiled WZ Factorization Algorithm for Multicore Architectures, International Journal of Applied Mathematics and Computer Science, 29:2, (407-419), Online publication date: 1-Jun-2019.
  26. Wang X, Li X, Zhang L and Li R (2019). An Efficient Numerical Method for the Symmetric Positive Definite Second-Order Cone Linear Complementarity Problem, Journal of Scientific Computing, 79:3, (1608-1629), Online publication date: 1-Jun-2019.
  27. ACM
    Zhang T, Shirzad S, Diehl P, Tohid R, Wei W and Kaiser H An Introduction to hpxMP Proceedings of the International Workshop on OpenCL, (1-10)
  28. Cao J, Genton M, Keyes D and Turkiyyah G (2019). Hierarchical-block conditioning approximations for high-dimensional multivariate normal probabilities, Statistics and Computing, 29:3, (585-598), Online publication date: 1-May-2019.
  29. Ayala A, Claeys X and Grigori L (2019). ALORA, Journal of Scientific Computing, 79:2, (1135-1160), Online publication date: 1-May-2019.
  30. ACM
    Nguyen D, Filippone M and Michiardi P Exact gaussian process regression with distributed computations Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, (1286-1295)
  31. León G, González C, Mayo R, Mozos D and Quintana-Ortí E (2019). Noise estimation for hyperspectral subspace identification on FPGAs, The Journal of Supercomputing, 75:3, (1323-1335), Online publication date: 1-Mar-2019.
  32. Ramiro C, Simarro M, Gonzalez A and Vidal A (2019). Parallel SUMIS soft detector for large MIMO systems on multicore and GPU, The Journal of Supercomputing, 75:3, (1256-1267), Online publication date: 1-Mar-2019.
  33. Buchheim C, Montenegro M and Wiegele A (2019). SDP-based branch-and-bound for non-convex quadratic integer optimization, Journal of Global Optimization, 73:3, (485-514), Online publication date: 1-Mar-2019.
  34. Ovchinnikov G, Pavlov A and Tsetserukou D (2019). Windowed multiscan optimization using weighted least squares for improving localization accuracy of mobile robots, Autonomous Robots, 43:3, (727-739), Online publication date: 1-Mar-2019.
  35. Koev P (2019). Accurate eigenvalues and exact zero Jordan blocks of totally nonnegative matrices, Numerische Mathematik, 141:3, (693-713), Online publication date: 1-Mar-2019.
  36. ACM
    Liu J and Cong J Dataflow Systolic Array Implementations of Matrix Decomposition Using High Level Synthesis Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, (187-187)
  37. Rodríguez-Sánchez R, Catalán S, Herrero J, Quintana-Ortí E and Tomás A (2019). Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD, Numerical Algorithms, 80:2, (635-660), Online publication date: 1-Feb-2019.
  38. George S, Liao M, Jiang H, Kotra J, Kandemir M, Sampson J and Narayanan V MDACache Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, (841-854)
  39. O'Neil M (2018). Second-kind integral equations for the Laplace-Beltrami problem on surfaces in three dimensions, Advances in Computational Mathematics, 44:5, (1385-1409), Online publication date: 1-Oct-2018.
  40. ACM
    Luo S, Gao Z, Gubanov M, Perez L and Jermaine C (2018). Scalable Linear Algebra on a Relational Database System, ACM SIGMOD Record, 47:1, (24-31), Online publication date: 10-Sep-2018.
  41. Thomas A and Kumar A (2019). A comparative evaluation of systems for scalable linear algebra-based analytics, Proceedings of the VLDB Endowment, 11:13, (2168-2182), Online publication date: 1-Sep-2018.
  42. Nhan T, Maclachlan S and Madden N (2018). Boundary layer preconditioners for finite-element discretizations of singularly perturbed reaction-diffusion problems, Numerical Algorithms, 79:1, (281-310), Online publication date: 1-Sep-2018.
  43. ACM
    Lee C, Lim C and Wright S A Distributed Quasi-Newton Algorithm for Empirical Risk Minimization with Nonsmooth Regularization Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, (1646-1655)
  44. ACM
    Dumas J and Pernet C Symmetric Indefinite Triangular Factorization Revealing the Rank Profile Matrix Proceedings of the 2018 ACM International Symposium on Symbolic and Algebraic Computation, (151-158)
  45. ACM
    Ltaief H, Sukkari D, Guyon O and Keyes D Extreme Computing for Extreme Adaptive Optics Proceedings of the Platform for Advanced Scientific Computing Conference, (1-10)
  46. Miyata T (2018). A heuristic search algorithm based on subspaces for PageRank computation, The Journal of Supercomputing, 74:7, (3278-3294), Online publication date: 1-Jul-2018.
  47. ACM
    You Y, Demmel J, Hsieh C and Vuduc R Accurate, Fast and Scalable Kernel Ridge Regression on Parallel and Distributed Systems Proceedings of the 2018 International Conference on Supercomputing, (307-317)
  48. Rathore M, Son H, Ahmad A, Paul A and Jeon G (2018). Real-Time Big Data Stream Processing Using GPU with Spark Over Hadoop Ecosystem, International Journal of Parallel Programming, 46:3, (630-646), Online publication date: 1-Jun-2018.
  49. Nicholson B, Wan W, Kameswaran S and Biegler L (2018). Parallel cyclic reduction strategies for linear systems that arise in dynamic optimization problems, Computational Optimization and Applications, 70:2, (321-350), Online publication date: 1-Jun-2018.
  50. Mezzadri F and Galligani E (2018). An inexact Newton method for solving complementarity problems in hydrodynamic lubrication, Calcolo: a quarterly on numerical analysis and theory of computation, 55:1, (1-28), Online publication date: 1-Mar-2018.
  51. ACM
    Tomás A, Rodríguez-Sánchez R, Catalán S and Quintana-Ortí E Reduction to Band Form for the Singular Value Decomposition on Graphics Accelerators Proceedings of the 9th International Workshop on Programming Models and Applications for Multicores and Manycores, (51-60)
  52. ACM
    Spampinato D, Fabregat-Traver D, Bientinesi P and Püschel M Program generation for small-scale linear algebra applications Proceedings of the 2018 International Symposium on Code Generation and Optimization, (327-339)
  53. Steger C (2018). Algorithms for the Orthographic-n-Point Problem, Journal of Mathematical Imaging and Vision, 60:2, (246-266), Online publication date: 1-Feb-2018.
  54. Walker D (2018). Morton ordering of 2D arrays for efficient access to hierarchical memory, International Journal of High Performance Computing Applications, 32:1, (189-203), Online publication date: 1-Jan-2018.
  55. Videau B, Pouget K, Genovese L, Deutsch T, Komatitsch D, Desprez F and Méhaut J (2018). BOAST, International Journal of High Performance Computing Applications, 32:1, (28-44), Online publication date: 1-Jan-2018.
  56. González-Calderón A, Vivas-Cruz L and Herrera-Hernández E (2018). Application of the θ-method to a telegraphic model of fluid flow in a dual-porosity medium, Journal of Computational Physics, 352:C, (426-444), Online publication date: 1-Jan-2018.
  57. Hassan S, Mahmoud M, Hemeida A and Saber M (2018). Effective Implementation of Matrix-Vector Multiplication on Intel's AVX multicore Processor, Computer Languages, Systems and Structures, 51:C, (158-175), Online publication date: 1-Jan-2018.
  58. ACM
    Baroudi T, Seghir R and Loechner V (2017). Optimization of Triangular and Banded Matrix Operations Using 2d-Packed Layouts, ACM Transactions on Architecture and Code Optimization, 14:4, (1-19), Online publication date: 20-Dec-2017.
  59. ACM
    Matheou G and Evripidou P (2017). Data-Driven Concurrency for High Performance Computing, ACM Transactions on Architecture and Code Optimization, 14:4, (1-26), Online publication date: 20-Dec-2017.
  60. Owens A, Kópházi J and Eaton M (2017). Optimal trace inequality constants for interior penalty discontinuous Galerkin discretisations of elliptic operators using arbitrary elements with non-constant Jacobians, Journal of Computational Physics, 350:C, (847-870), Online publication date: 1-Dec-2017.
  61. ACM
    Kim K, Costa T, Deveci M, Bradley A, Hammond S, Guney M, Knepper S, Story S and Rajamanickam S Designing vector-friendly compact BLAS and LAPACK kernels Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-12)
  62. Costero L, Igual F, Olcoz K, Catalán S, Rodríguez-Sánchez R and Quintana-Ortí E (2017). Revisiting conventional task schedulers to exploit asymmetry in multi-core architectures for dense linear algebra operations, Parallel Computing, 68:C, (59-76), Online publication date: 1-Oct-2017.
  63. Kargar Z, Ruić D, Linn T and Jungemann C (2017). Numerical simulation of plasma waves in a quasi-2D electron gas based on the Boltzmann transport equation, Journal of Computational Electronics, 16:3, (487-496), Online publication date: 1-Sep-2017.
  64. Chen L, Kumar A, Naughton J and Patel J (2017). Towards linear algebra over normalized data, Proceedings of the VLDB Endowment, 10:11, (1214-1225), Online publication date: 1-Aug-2017.
  65. ACM
    Barthels H and Bientinesi P Linnea Proceedings of the International Workshop on Parallel Symbolic Computation, (1-3)
  66. Yarkhan A, Kurzak J, Luszczek P and Dongarra J (2017). Porting the PLASMA Numerical Library to the OpenMP Standard, International Journal of Parallel Programming, 45:3, (612-633), Online publication date: 1-Jun-2017.
  67. Voronin S and Martinsson P (2017). Efficient algorithms for cur and interpolative matrix decompositions, Advances in Computational Mathematics, 43:3, (495-516), Online publication date: 1-Jun-2017.
  68. Huang W, Absil P and Gallivan K (2017). Intrinsic representation of tangent vectors and vector transports on matrix manifolds, Numerische Mathematik, 136:2, (523-543), Online publication date: 1-Jun-2017.
  69. ACM
    Viviani P, Aldinucci M, Torquati M and d'Ippolito R Multiple back-end support for the armadillo linear algebra interface Proceedings of the Symposium on Applied Computing, (1566-1573)
  70. Damle A, Lin L and Ying L (2017). SCDM-k, Journal of Computational Physics, 334:C, (1-15), Online publication date: 1-Apr-2017.
  71. ACM
    Filippone S, Cardellini V, Barbieri D and Fanfarillo A (2017). Sparse Matrix-Vector Multiplication on GPGPUs, ACM Transactions on Mathematical Software, 43:4, (1-49), Online publication date: 23-Mar-2017.
  72. Kirchhart M and Obi S (2017). A splitting-free vorticity redistribution method, Journal of Computational Physics, 330:C, (282-295), Online publication date: 1-Feb-2017.
  73. Chen J, Avron H and Sindhwani V (2017). Hierarchically compositional kernels for scalable nonparametric learning, The Journal of Machine Learning Research, 18:1, (2214-2255), Online publication date: 1-Jan-2017.
  74. Aliaga J, Barreda M, Castaño M, Dolz M and Quintana-Ortí E (2017). Adapting concurrency throttling and voltage-frequency scaling for dense eigensolvers, The Journal of Supercomputing, 73:1, (29-43), Online publication date: 1-Jan-2017.
  75. Huang J, Smith T, Henry G and van de Geijn R Strassen's algorithm reloaded Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-12)
  76. Kestyn J, Kalantzis V, Polizzi E and Saad Y PFEAST Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-12)
  77. ACM
    Leykin A (2016). Polynomial homotopy continuation in Macaulay2, ACM Communications in Computer Algebra, 50:3, (113-116), Online publication date: 4-Nov-2016.
  78. Zhang M, Wu Y, Chen K, Qian X, Li X and Zheng W Exploring the hidden dimension in graph processing Proceedings of the 12th USENIX conference on Operating Systems Design and Implementation, (285-300)
  79. Hassan S, Hemeida A and Mahmoud M (2016). Performance Evaluation of Matrix-Matrix Multiplications Using Intel's Advanced Vector Extensions (AVX), Microprocessors & Microsystems, 47:PB, (369-374), Online publication date: 1-Nov-2016.
  80. ACM
    Barthels H A compiler for linear algebra operations Companion Proceedings of the 2016 ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity, (49-50)
  81. ACM
    Matheou G, Kyriacou C and Evripidou P Data-Driven execution of the Tile LU Decomposition Proceedings of the Sixth Workshop on Data-Flow Execution Models for Extreme Scale Computing, (1-8)
  82. ACM
    Rong H, Park J, Xiang L, Anderson T and Smelyanskiy M Sparso Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, (247-259)
  83. ACM
    Low T, Igual F, Smith T and Quintana-Orti E (2016). Analytical Modeling Is Enough for High-Performance BLIS, ACM Transactions on Mathematical Software, 43:2, (1-18), Online publication date: 2-Sep-2016.
  84. ACM
    Filip S (2016). A Robust and Scalable Implementation of the Parks-McClellan Algorithm for Designing FIR Filters, ACM Transactions on Mathematical Software, 43:1, (1-24), Online publication date: 29-Aug-2016.
  85. ACM
    Sukkari D, Ltaief H and Keyes D (2016). A High Performance QDWH-SVD Solver Using Hardware Accelerators, ACM Transactions on Mathematical Software, 43:1, (1-25), Online publication date: 29-Aug-2016.
  86. Sukkari D, Ltaief H and Keyes D High Performance Polar Decomposition on Distributed Memory Systems Proceedings of the 22nd International Conference on Euro-Par 2016: Parallel Processing - Volume 9833, (605-616)
  87. Sovinec C (2016). Stabilization of numerical interchange in spectral-element magnetohydrodynamics, Journal of Computational Physics, 319:C, (61-78), Online publication date: 15-Aug-2016.
  88. ACM
    Abdelfattah A, Keyes D and Ltaief H (2016). KBLAS, ACM Transactions on Mathematical Software, 42:3, (1-31), Online publication date: 15-Jun-2016.
  89. ACM
    Ltaief H, Gratadour D, Charara A and Gendron E Adaptive Optics Simulation for the World's Largest Telescope on Multicore Architectures with Multiple GPUs Proceedings of the Platform for Advanced Scientific Computing Conference, (1-12)
  90. ACM
    Van Zee F, Smith T, Marker B, Low T, Geijn R, Igual F, Smelyanskiy M, Zhang X, Kistler M, Austel V, Gunnels J and Killough L (2016). The BLIS Framework, ACM Transactions on Mathematical Software, 42:2, (1-19), Online publication date: 3-Jun-2016.
  91. Ghasemi A and Taylor L (2016). Preconditioning Large Scale Iterative Solution of Ax = b Using a Statistical Method with Application to Matrix-Free Spectral Solution of Helmholtz Equation, Procedia Computer Science, 80:C, (2266-2270), Online publication date: 1-Jun-2016.
  92. Marques O, Druinsky A, Li X, Barker A, Vassilevski P and Kalchev D (2016). Tuning the Coarse Space Construction in a Spectral AMG Solver, Procedia Computer Science, 80:C, (212-221), Online publication date: 1-Jun-2016.
  93. ACM
    Aruliah D, Veen L and Dubitski A (2016). Algorithm 956, ACM Transactions on Mathematical Software, 42:1, (1-18), Online publication date: 1-Mar-2016.
  94. ACM
    Spampinato D and Püschel M A basic linear algebra compiler for structured matrices Proceedings of the 2016 International Symposium on Code Generation and Optimization, (117-127)
  95. ACM
    Shimazaki T, Hashimoto M and Maeda T Developing a high-performance quantum chemistry program with a dynamic scripting language Proceedings of the 3rd International Workshop on Software Engineering for High Performance Computing in Computational Science and Engineering, (9-15)
  96. ACM
    Calderara M, Brück S, Pedersen A, Bani-Hashemian M, VandeVondele J and Luisier M Pushing back the limit of ab-initio quantum transport simulations on hybrid supercomputers Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-12)
  97. ACM
    Graillat S, Lauter C, Tang P, Yamanaka N and Oishi S (2015). Efficient Calculations of Faithfully Rounded l2-Norms of n-Vectors, ACM Transactions on Mathematical Software, 41:4, (1-20), Online publication date: 26-Oct-2015.
  98. Metcalfe P, Beven K and Freer J (2015). Dynamic TOPMODEL, Environmental Modelling & Software, 72:C, (155-172), Online publication date: 1-Oct-2015.
  99. Kabir K, Haidar A, Tomov S and Dongarra J (2015). Performance Analysis and Optimisation of Two-sided Factorization Algorithms for Heterogeneous Platform, Procedia Computer Science, 51:C, (180-190), Online publication date: 1-Sep-2015.
  100. Zhang J, Behzad B and Snir M (2015). Design of a Multithreaded Barnes-Hut Algorithm for Multicore Clusters, IEEE Transactions on Parallel and Distributed Systems, 26:7, (1861-1873), Online publication date: 1-Jul-2015.
  101. ACM
    Al-Saber N and Kulkarni M SemCache++ Proceedings of the 29th ACM on International Conference on Supercomputing, (79-88)
  102. ACM
    Van Zee F and van de Geijn R (2015). BLIS: A Framework for Rapidly Instantiating BLAS Functionality, ACM Transactions on Mathematical Software, 41:3, (1-33), Online publication date: 1-Jun-2015.
  103. ACM
    Hess B, Gross T and Püschel M (2014). Automatic locality-friendly interface extension of numerical functions, ACM SIGPLAN Notices, 50:3, (83-92), Online publication date: 12-May-2015.
  104. Aliaga J, Barreda M, Dolz M and Quintana-Ortí E (2015). Are our dense linear algebra libraries energy-friendly?, Computer Science - Research and Development, 30:2, (187-196), Online publication date: 1-May-2015.
  105. Kabir K, Haidar A, Tomov S and Dongarra J Performance analysis and design of a hessenberg reduction using stabilized blocked elementary transformations for new architectures Proceedings of the Symposium on High Performance Computing, (135-142)
  106. Anzt H, Tomov S and Dongarra J Accelerating the LOBPCG method on GPUs using a blocked sparse matrix vector product Proceedings of the Symposium on High Performance Computing, (75-82)
  107. Berljafa M, Wortmann D and Di Napoli E (2015). An optimized and scalable eigensolver for sequences of eigenvalue problems, Concurrency and Computation: Practice & Experience, 27:4, (905-922), Online publication date: 25-Mar-2015.
  108. Kyrtatas N, Spampinato D and Püschel M A basic linear algebra compiler for embedded processors Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, (1054-1059)
  109. ACM
    Regondi P, Zubair M and Albanese C BLAS extensions for algebraic pricing methods Proceedings of the 2nd Workshop on Parallel Programming for Analytics Applications, (25-30)
  110. ACM
    Ravishankar M, Holewinski J and Grover V Forma: a DSL for image processing applications to target GPUs and multi-core CPUs Proceedings of the 8th Workshop on General Purpose Processing using GPUs, (109-120)
  111. ACM
    Haidar A, Dong T, Luszczek P, Tomov S and Dongarra J Optimization for performance and energy for batched matrix computations on GPUs Proceedings of the 8th Workshop on General Purpose Processing using GPUs, (59-69)
  112. ACM
    Anzt H, Tomov S and Dongarra J Energy efficiency and performance frontiers for sparse computations on GPU supercomputers Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, (1-10)
  113. Du K, Fairweather G and Sun W (2015). Matrix decomposition algorithms for arbitrary order C^0 tensor product finite element systems, Journal of Computational and Applied Mathematics, 275:C, (162-182), Online publication date: 1-Feb-2015.
  114. Dongarra J, Gates M, Haidar A, Jia Y, Kabir K, Luszczek P and Tomov S (2015). HPC programming on Intel many-integrated-core hardware with MAGMA port to Xeon Phi, Scientific Programming, 2015, (9-9), Online publication date: 1-Jan-2015.
  115. Jhurani C and Mullowney P (2015). A GEMM interface and implementation on NVIDIA GPUs for multiple small matrices, Journal of Parallel and Distributed Computing, 75:C, (133-140), Online publication date: 1-Jan-2015.
  116. ACM
    Bosboom J, Rajadurai S, Wong W and Amarasinghe S (2014). StreamJIT, ACM SIGPLAN Notices, 49:10, (177-195), Online publication date: 31-Dec-2015.
  117. Yi Q, Wang Q and Cui H Specializing Compiler Optimizations through Programmable Composition for Dense Matrix Computations Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, (596-608)
  118. ACM
    Chiang W, Gopalakrishnan G, Rakamaric Z and Solovyev A (2014). Efficient search for inputs causing high floating-point errors, ACM SIGPLAN Notices, 49:8, (43-52), Online publication date: 26-Nov-2014.
  119. Elliott J, Hoemmen M and Mueller F Exploiting data representation for fault tolerance Proceedings of the 5th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, (9-16)
  120. Charara A, Ltaief H, Gratadour D, Keyes D, Sevin A, Abdelfattah A, Gendron E, Morel C and Vidal F Pipelining computational stages of the tomographic reconstructor for multi-object adaptive optics on a multi-GPU system Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (262-273)
  121. Agrawal K, Fahey M, McLay R and James D User environment tracking and problem detection with XALT Proceedings of the First International Workshop on HPC User Support Tools, (32-40)
  122. Knopp T Experimental multi-threading support for the Julia programming language Proceedings of the 1st First Workshop for High Performance Technical Computing in Dynamic Languages, (1-5)
  123. ACM
    Drebes A, Heydemann K, Drach N, Pop A and Cohen A (2014). Topology-Aware and Dependence-Aware Scheduling and Memory Allocation for Task-Parallel Languages, ACM Transactions on Architecture and Code Optimization, 11:3, (1-25), Online publication date: 27-Oct-2014.
  124. ACM
    Bosboom J, Rajadurai S, Wong W and Amarasinghe S StreamJIT Proceedings of the 2014 ACM International Conference on Object Oriented Programming Systems Languages & Applications, (177-195)
  125. ACM
    Hess B, Gross T and Püschel M Automatic locality-friendly interface extension of numerical functions Proceedings of the 2014 International Conference on Generative Programming: Concepts and Experiences, (83-92)
  126. Kastner G and Frühwirth-Schnatter S (2014). Ancillarity-sufficiency interweaving strategy (ASIS) for boosting MCMC estimation of stochastic volatility models, Computational Statistics & Data Analysis, 76:C, (408-423), Online publication date: 1-Aug-2014.
  127. ACM
    Misev D and Baumann P Extending the SQL array concept to support scientific analytics Proceedings of the 26th International Conference on Scientific and Statistical Database Management, (1-11)
  128. ACM
    Kernert D, Köhler F and Lehner W SLACID - sparse linear algebra in a column-oriented in-memory database system Proceedings of the 26th International Conference on Scientific and Statistical Database Management, (1-12)
  129. ACM
    Song F and Dongarra J Scaling up matrix computations on shared-memory manycore systems with 1000 CPU cores Proceedings of the 28th ACM international conference on Supercomputing, (333-342)
  130. de Hoon N, van Pelt R, Jalba A and Vilanova A 4D MRI flow coupled to physics-based fluid simulation for blood-flow visualization Proceedings of the 16th Eurographics Conference on Visualization, (121-130)
  131. ACM
    Boghrati B and Sapatnekar S (2014). Incremental Analysis of Power Grids Using Backward Random Walks, ACM Transactions on Design Automation of Electronic Systems, 19:3, (1-29), Online publication date: 1-Jun-2014.
  132. ACM
    Fabregat-Traver D and Bientinesi P (2014). Computing Petaflops over Terabytes of Data, ACM Transactions on Mathematical Software, 40:4, (1-22), Online publication date: 1-Jun-2014.
  133. ACM
    Cao C, Dongarra J, Du P, Gates M, Luszczek P and Tomov S clMAGMA Proceedings of the International Workshop on OpenCL 2013 & 2014, (1-9)
  134. Amos B, Easterling D, Watson L, Castle B, Trosset M and Thacker W Fortran 95 implementation of QNSTOP for global and stochastic optimization Proceedings of the High Performance Computing Symposium, (1-8)
  135. Tang P, Kestyn J and Polizzi E A new highly parallel non-Hermitian eigensolver Proceedings of the High Performance Computing Symposium, (1-9)
  136. ACM
    Van Zee F, van de Geijn R and Quintana-Ortí G (2014). Restructuring the Tridiagonal and Bidiagonal QR Algorithms for Performance, ACM Transactions on Mathematical Software, 40:3, (1-34), Online publication date: 1-Apr-2014.
  137. Sieger D, Menzel S and Botsch M (2014). RBF morphing techniques for simulation-based design optimization, Engineering with Computers, 30:2, (161-174), Online publication date: 1-Apr-2014.
  138. Golyandina N and Korobeynikov A (2014). Basic Singular Spectrum Analysis and forecasting with R, Computational Statistics & Data Analysis, 71:C, (934-954), Online publication date: 1-Mar-2014.
  139. Lawrence P and Corless R (2014). Stability of rootfinding for barycentric Lagrange interpolants, Numerical Algorithms, 65:3, (447-464), Online publication date: 1-Mar-2014.
  140. ACM
    Spampinato D and Püschel M A Basic Linear Algebra Compiler Proceedings of Annual IEEE/ACM International Symposium on Code Generation and Optimization, (23-32)
  142. ACM
    Chiang W, Gopalakrishnan G, Rakamaric Z and Solovyev A Efficient search for inputs causing high floating-point errors Proceedings of the 19th ACM SIGPLAN symposium on Principles and practice of parallel programming, (43-52)
  143. ACM
    Karlsson L, Kressner D and Lang B (2014). Optimally packed chains of bulges in multishift QR algorithms, ACM Transactions on Mathematical Software, 40:2, (1-15), Online publication date: 1-Feb-2014.
  144. ACM
    Marker B, van de Geijn R and Batory D Interfaces are key Proceedings of the 1st International Workshop on Software Engineering for High Performance Computing in Computational Science and Engineering, (21-24)
  145. ACM
    Jia Y, Luszczek P, Bosilca G and Dongarra J CPU-GPU hybrid bidiagonal reduction with soft error resilience Proceedings of the Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, (1-5)
  146. ACM
    Jia Y, Bosilca G, Luszczek P and Dongarra J Parallel reduction to Hessenberg form with algorithm-based fault tolerance Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, (1-11)
  147. ACM
    Blanco M, Perdomo P, Ezzatti P, Pardo A and Viera M Towards a functional run-time for dense NLA domain Proceedings of the 2nd ACM SIGPLAN workshop on Functional high-performance computing, (85-96)
  148. ACM
    Foster L and Davis T (2013). Algorithm 933, ACM Transactions on Mathematical Software, 40:1, (1-23), Online publication date: 1-Sep-2013.
  149. ACM
    Davis T (2013). Algorithm 930, ACM Transactions on Mathematical Software, 39:4, (1-18), Online publication date: 1-Jul-2013.
  150. ACM
    Castaldo A, Whaley R and Samuel S (2013). Scaling LAPACK panel operations using parallel cache assignment, ACM Transactions on Mathematical Software, 39:4, (1-30), Online publication date: 1-Jul-2013.
  151. ACM
    Haidar A, Gates M, Tomov S and Dongarra J Toward a scalable multi-GPU eigensolver via compute-intensive kernels and efficient communication Proceedings of the 27th international ACM conference on International conference on supercomputing, (223-232)
  152. ACM
    AlSaber N and Kulkarni M SemCache Proceedings of the 27th international ACM conference on International conference on supercomputing, (421-432)
  153. Ghoting A, Gunnels J, Kambadur P, Pednault E and Squillante M (2013). Trends and outlook for the massive-scale analytics stack, IBM Journal of Research and Development, 57:3-4, (2-2), Online publication date: 1-May-2013.
  154. Polok L, Ila V and Smrz P Cache efficient implementation for block matrix operations Proceedings of the High Performance Computing Symposium, (1-8)
  155. ACM
    Ltaief H, Luszczek P and Dongarra J (2013). High-performance bidiagonal reduction using tile algorithms on homogeneous multicore architectures, ACM Transactions on Mathematical Software, 39:3, (1-22), Online publication date: 1-Apr-2013.
  156. Hagan J and Priede J (2013). Capacitance matrix technique for avoiding spurious eigenmodes in the solution of hydrodynamic stability problems by Chebyshev collocation method, Journal of Computational Physics, 238, (210-216), Online publication date: 1-Apr-2013.
  157. Skalicky S, López S, Łukowiak M, Letendre J and Ryan M Performance modeling of pipelined linear algebra architectures on FPGAs Proceedings of the 9th international conference on Reconfigurable Computing: architectures, tools, and applications, (146-153)
  158. Haeri S and Shrimpton J (2013). A new implicit fictitious domain method for the simulation of flow in complex geometries with heat transfer, Journal of Computational Physics, 237, (21-45), Online publication date: 1-Mar-2013.
  159. ACM
    Poulson J, Marker B, van de Geijn R, Hammond J and Romero N (2013). Elemental, ACM Transactions on Mathematical Software, 39:2, (1-24), Online publication date: 1-Feb-2013.
  160. ACM
    Gustavson F, Waśniewski J, Dongarra J, Herrero J and Langou J (2013). Level-3 Cholesky Factorization Routines Improve Performance of Many Cholesky Algorithms, ACM Transactions on Mathematical Software, 39:2, (1-10), Online publication date: 1-Feb-2013.
  161. ACM
    Baboulin M, Dongarra J, Herrmann J and Tomov S (2013). Accelerating Linear System Solutions Using Randomization Techniques, ACM Transactions on Mathematical Software, 39:2, (1-13), Online publication date: 1-Feb-2013.
  162. Misener R and Floudas C (2012). Global optimization of mixed-integer quadratically-constrained quadratic programs (MIQCQP) through piecewise-linear and edge-concave relaxations, Mathematical Programming: Series A and B, 136:1, (155-182), Online publication date: 1-Dec-2012.
  163. ACM
    Van Zee F, van de Geijn R, Quintana-Ortí G and Elizondo G (2012). Families of Algorithms for Reducing a Matrix to Condensed Form, ACM Transactions on Mathematical Software, 39:1, (1-32), Online publication date: 1-Nov-2012.
  164. ACM
    Anjos A, El-Shafey L, Wallace R, Günther M, McCool C and Marcel S Bob Proceedings of the 20th ACM international conference on Multimedia, (1449-1452)
  165. ACM
    Van Voorst J, Tong Y and Kuhn L ArtSurf Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine, (36-43)
  166. ACM
    Rakić P, Stričević L and Rakić Z Statically typed matrix Proceedings of the Fifth Balkan Conference in Informatics, (217-222)
  167. König G, Moldaschl M and Gansterer W (2012). Computing eigenvectors of block tridiagonal matrices based on twisted block factorizations, Journal of Computational and Applied Mathematics, 236:15, (3696-3703), Online publication date: 1-Sep-2012.
  168. López-Espín J, Vidal A and Giménez D (2012). Two-stage least squares and indirect least squares algorithms for simultaneous equations models, Journal of Computational and Applied Mathematics, 236:15, (3676-3684), Online publication date: 1-Sep-2012.
  169. Foshati A and Khunjush F A novel implementation of double precision and real valued ICA algorithm for bioinformatics applications on GPUs Proceedings of the 18th international conference on Parallel processing workshops, (285-294)
  170. Donfack S, Grigori L and Khabou A Avoiding communication through a multilevel LU factorization Proceedings of the 18th international conference on Parallel Processing, (551-562)
  171. ACM
    Wimmer M (2012). Algorithm 923, ACM Transactions on Mathematical Software, 38:4, (1-17), Online publication date: 1-Aug-2012.
  172. ACM
    Poppe K, Cools R and Vandewoestyne B (2012). Error handling in Fortran 2003, ACM SIGPLAN Fortran Forum, 31:2, (7-19), Online publication date: 24-Jul-2012.
  173. ACM
    Song B and Li V A hybridization between memetic algorithm and semidefinite relaxation for the max-cut problem Proceedings of the 14th annual conference on Genetic and evolutionary computation, (425-432)
  174. Zhao Y, Zhang J and Chi X Implementations of main algorithms for generalized eigenproblem on GPU accelerator Proceedings of the Third international conference on Advances in Swarm Intelligence - Volume Part II, (473-481)
  175. Deadman E, Higham N and Ralha R Blocked Schur algorithms for computing the matrix square root Proceedings of the 11th international conference on Applied Parallel and Scientific Computing, (171-182)
  176. Katz A and Sankaran V (2012). An Efficient Correction Method to Obtain a Formally Third-Order Accurate Flow Solver for Node-Centered Unstructured Grids, Journal of Scientific Computing, 51:2, (375-393), Online publication date: 1-May-2012.
  177. Bock H (2012). Editorial, Advances in Data Analysis and Classification, 6:1, (1-2), Online publication date: 1-Apr-2012.
  178. Wang D, Markey M, Wilke C and Arapostathis A (2012). Eigen-Genomic System Dynamic-Pattern Analysis (ESDA), IEEE/ACM Transactions on Computational Biology and Bioinformatics, 9:2, (430-437), Online publication date: 1-Mar-2012.
  179. ACM
    Reid J and Scott J (2012). Partial factorization of a dense symmetric indefinite matrix, ACM Transactions on Mathematical Software, 38:2, (1-19), Online publication date: 1-Dec-2011.
  180. ACM
    Dongarra J, Faverge M, Ltaief H and Luszczek P High performance matrix inversion based on LU factorization for multicore architectures Proceedings of the 2011 ACM international workshop on Many task computing on grids and supercomputers, (33-42)
  181. ACM
    Perumalla K, Nutaro J and Yoginath S Towards high performance discrete-event simulations of smart electric grids Proceedings of the first international workshop on High performance computing, networking and analytics for the power grid, (51-58)
  182. ACM
    Haidar A, Ltaief H and Dongarra J Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, (1-11)
  183. ACM
    Luisier M, Boykin T, Klimeck G and Fichtner W Atomistic nanoelectronic device engineering with sustained performances up to 1.44 PFlop/s Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, (1-11)
  184. ACM
    Davis T (2011). Algorithm 915, SuiteSparseQR, ACM Transactions on Mathematical Software, 38:1, (1-22), Online publication date: 1-Nov-2011.
  185. ACM
    Jeyapaul R and Shrivastava A Smart cache cleaning Proceedings of the 14th international conference on Compilers, architectures and synthesis for embedded systems, (105-114)
  186. Luszczek P and Dongarra J Reducing the time to tune parallel dense linear algebra routines with partial execution and performance modeling Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I, (730-739)
  187. Gustavson F, Waśniewski J and Herrero J New level-3 BLAS kernels for Cholesky factorization Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I, (60-69)
  188. Ltaief H, Luszczek P and Dongarra J Enhancing parallelism of tile bidiagonal transformation on multicore architectures using tree reduction Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I, (661-670)
  189. Sørensen H Auto-tuning dense vector and matrix-vector operations for Fermi GPUs Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I, (619-629)
  190. Lirkov I, Paprzycki M and Ganzha M Performance analysis of parallel alternating directions algorithm for time dependent problems Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I, (173-182)
  191. Becker D, Baboulin M and Dongarra J Reducing the amount of pivoting in symmetric indefinite systems Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I, (133-142)
  192. Gustavson F Cache blocking for linear algebra algorithms Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part I, (122-132)
  193. Fabregat-Traver D and Bientinesi P Knowledge-based automatic generation of partitioned matrix expressions Proceedings of the 13th international conference on Computer algebra in scientific computing, (144-157)
  194. Macdonald C, Brandman J and Ruuth S (2011). Solving eigenvalue problems on curved surfaces using the Closest Point Method, Journal of Computational Physics, 230:22, (7944-7956), Online publication date: 1-Sep-2011.
  195. Sørensen H High-Performance matrix-vector multiplication on the GPU Proceedings of the 2011 international conference on Parallel Processing, (377-386)
  196. ACM
    Zheng C and James D Toward high-quality modal contact sound ACM SIGGRAPH 2011 papers, (1-12)
  197. Lavor C, Mucherino A, Liberti L and Maculan N (2011). On the computation of protein backbones by using artificial backbones of hydrogens, Journal of Global Optimization, 50:2, (329-344), Online publication date: 1-Jun-2011.
  198. Pan C (2011). Complexity Reduction by Using QR-Based Scheme in Computing Capacity for Optimal Transmission, Wireless Personal Communications: An International Journal, 58:2, (391-405), Online publication date: 1-May-2011.
  199. Zhang H and Sandu A FATODE Proceedings of the 19th High Performance Computing Symposia, (143-150)
  200. Lameed N and Hendren L Staged static techniques to efficiently implement array copy semantics in a MATLAB JIT compiler Proceedings of the 20th international conference on Compiler construction: part of the joint European conferences on theory and practice of software, (22-41)
  201. ACM
    Rozložník M, Shklarski G and Toledo S (2011). Partitioned Triangular Tridiagonalization, ACM Transactions on Mathematical Software, 37:4, (1-16), Online publication date: 1-Feb-2011.
  202. ACM
    Steffy D (2011). Exact solutions to linear systems of equations using output sensitive lifting, ACM Communications in Computer Algebra, 44:3/4, (160-182), Online publication date: 28-Jan-2011.
  203. Luisier M A Parallel Implementation of Electron-Phonon Scattering in Nanoelectronic Devices up to 95k Cores Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, (1-11)
  204. ACM
    Granat R and Kågström B (2010). Parallel Solvers for Sylvester-Type Matrix Equations with Applications in Condition Estimation, Part I, ACM Transactions on Mathematical Software, 37:3, (1-32), Online publication date: 1-Sep-2010.
  205. ACM
    Wendykier P and Nagy J (2010). Parallel Colt, ACM Transactions on Mathematical Software, 37:3, (1-22), Online publication date: 1-Sep-2010.
  206. ACM
    Hadri B, Fahey M and Jones N Identifying software usage at HPC centers with the automatic library tracking database Proceedings of the 2010 TeraGrid Conference, (1-8)
  207. Mouysset S, Noailles J, Ruiz D and Guivarch R On a strategy for spectral clustering with parallel computation Proceedings of the 9th international conference on High performance computing for computational science, (408-420)
  208. Emad N, Delannoy O and Dandouna M Numerical library reuse in parallel and distributed platforms Proceedings of the 9th international conference on High performance computing for computational science, (271-278)
  209. Cunha M, Coutinho A and Telles J On the vectorization of engineering codes using multimedia instructions Proceedings of the 9th international conference on High performance computing for computational science, (263-270)
  210. ACM
    Chan E, van de Geijn R and Chapman A Managing the complexity of lookahead for LU factorization with pivoting Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures, (200-208)
  211. Gustavson F Cache blocking Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume Part I, (22-32)
  212. Petschow M and Bientinesi P The algorithm of multiple relatively robust representations for multi-core processors Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume Part I, (152-161)
  213. Kågström B, Kressner D and Shao M On aggressive early deflation in parallel variants of the QR algorithm Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume Part I, (1-10)
  214. Diep N Efficient implementation of interval matrix multiplication Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume 2, (179-188)
  215. ACM
    Perez J, Badia R and Labarta J Handling task dependencies under strided and aliased references Proceedings of the 24th ACM International Conference on Supercomputing, (263-274)
  216. ACM
    Castaldo A and Whaley R (2010). Scaling LAPACK panel operations using parallel cache assignment, ACM SIGPLAN Notices, 45:5, (223-232), Online publication date: 1-May-2010.
  217. ACM
    Gustavson F, Waśniewski J, Dongarra J and Langou J (2010). Rectangular full packed format for Cholesky's algorithm, ACM Transactions on Mathematical Software, 37:2, (1-21), Online publication date: 1-Apr-2010.
  218. ACM
    Rasch A and Bücker H (2010). EFCOSS, ACM Transactions on Mathematical Software, 37:2, (1-37), Online publication date: 1-Apr-2010.
  219. Emiris I, Pan V and Tsigaridas E Algebraic and numerical algorithms Algorithms and theory of computation handbook, (17-17)
  220. ACM
    Castaldo A and Whaley R Scaling LAPACK panel operations using parallel cache assignment Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, (223-232)
  221. ACM
    Savage J and Zubair M (2010). Cache-optimal algorithms for option pricing, ACM Transactions on Mathematical Software, 37:1, (1-30), Online publication date: 1-Jan-2010.
  222. ACM
    Vömel C (2010). ScaLAPACK's MRRR algorithm, ACM Transactions on Mathematical Software, 37:1, (1-35), Online publication date: 1-Jan-2010.
  223. ACM
    Einarsson B, Hanson R and Hopkins T (2009). Standardized mixed language programming for Fortran and C, ACM SIGPLAN Fortran Forum, 28:3, (8-22), Online publication date: 4-Dec-2009.
  224. ACM
    Loera J, Haws D, Lee J and O'Hair A (2010). Computation in multicriteria matroid optimization, ACM Journal of Experimental Algorithmics, 14, (1.8-1.33), Online publication date: 1-Dec-2009.
  225. Ramakrishnan C (2009). Zirkonium, Organised Sound, 14:3, (268-276), Online publication date: 1-Dec-2009.
  226. ACM
    Bell N and Garland M Implementing sparse matrix-vector multiplication on throughput-oriented processors Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, (1-11)
  227. ACM
    Ries F, De Marco T, Zivieri M and Guerrieri R Triangular matrix inversion on Graphics Processing Unit Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, (1-10)
  228. Lessig C and Bientinesi P On parallelizing the MRRR algorithm for data-parallel coprocessors Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part I, (396-402)
  229. Byröd M, Josephson K and Åström K (2009). Fast and Stable Polynomial Equation Solving and Its Application to Computer Vision, International Journal of Computer Vision, 84:3, (237-256), Online publication date: 1-Sep-2009.
  230. Pan C (2009). Complexity reduction by using triangular matrix multiplication in computing capacity for an optimal transmission, WSEAS TRANSACTIONS on COMMUNICATIONS, 8:8, (959-969), Online publication date: 1-Aug-2009.
  231. Anderson C (2009). Efficient solution of the Schroedinger-Poisson equations in layered semiconductor devices, Journal of Computational Physics, 228:13, (4745-4756), Online publication date: 30-Jul-2009.
  232. ACM
    Mehrotra A and Somani A A robust and efficient harmonic balance (HB) using direct solution of HB Jacobian Proceedings of the 46th Annual Design Automation Conference, (370-375)
  233. ACM
    Mueller C, Baumgartner B, Ofenbeck G, Schrader B and Sbalzarini I pCMALib Proceedings of the 11th Annual conference on Genetic and evolutionary computation, (1411-1418)
  234. ACM
    Quintana-Ortí G, Quintana-Ortí E, Geijn R, Zee F and Chan E (2009). Programming matrix algorithms-by-blocks for thread-level parallelism, ACM Transactions on Mathematical Software, 36:3, (1-26), Online publication date: 1-Jul-2009.
  235. ACM
    Baker C, Hetmaniuk U, Lehoucq R and Thornquist H (2009). Anasazi software for the numerical solution of large-scale eigenvalue problems, ACM Transactions on Mathematical Software, 36:3, (1-23), Online publication date: 1-Jul-2009.
  236. Hurault A, Daydé M and Pantel M (2009). Advanced service trading for scientific computing over the grid, The Journal of Supercomputing, 49:1, (64-83), Online publication date: 1-Jul-2009.
  237. ACM
    Ansel J, Chan C, Wong Y, Olszewski M, Zhao Q, Edelman A and Amarasinghe S PetaBricks Proceedings of the 30th ACM SIGPLAN Conference on Programming Language Design and Implementation, (38-49)
  238. Lu L, Li K and Guan Y Blind detection of interleaver parameters for non-binary coded data streams Proceedings of the 2009 IEEE international conference on Communications, (3668-3671)
  239. Yaghoobi M, Blumensath T and Davies M (2009). Dictionary learning for sparse approximations with the majorization method, IEEE Transactions on Signal Processing, 57:6, (2178-2191), Online publication date: 1-Jun-2009.
  240. ACM
    Ansel J, Chan C, Wong Y, Olszewski M, Zhao Q, Edelman A and Amarasinghe S (2009). PetaBricks, ACM SIGPLAN Notices, 44:6, (38-49), Online publication date: 28-May-2009.
  241. ACM
    Venetis I and Gao G Mapping the LU decomposition on a many-core architecture Proceedings of the 6th ACM conference on Computing frontiers, (71-80)
  242. Bryson M, Johnson-Roberson M and Sukkarieh S Airborne smoothing and mapping using vision and inertial sensors Proceedings of the 2009 IEEE international conference on Robotics and Automation, (3143-3148)
  243. Wang Y and Lu L (2009). Preconditioned Lanczos method for generalized Toeplitz eigenvalue problems, Journal of Computational and Applied Mathematics, 226:1, (66-76), Online publication date: 1-Apr-2009.
  244. ACM
    Koikari S (2009). Algorithm 894, ACM Transactions on Mathematical Software, 36:2, (1-20), Online publication date: 1-Mar-2009.
  245. ACM
    D'Alberto P and Nicolau A (2009). Adaptive Winograd's matrix multiplications, ACM Transactions on Mathematical Software, 36:1, (1-23), Online publication date: 1-Mar-2009.
  246. ACM
    Lourakis M and Argyros A (2009). SBA, ACM Transactions on Mathematical Software, 36:1, (1-30), Online publication date: 1-Mar-2009.
  247. ACM
    Davis T and Hager W (2009). Dynamic Supernodes in Sparse Cholesky Update/Downdate and Triangular Solves, ACM Transactions on Mathematical Software, 35:4, (1-23), Online publication date: 1-Feb-2009.
  248. ACM
    Taylor A and Higham D (2009). CONTEST, ACM Transactions on Mathematical Software, 35:4, (1-17), Online publication date: 1-Feb-2009.
  249. McBain G, Chubb T and Armfield S (2009). Numerical solution of the Orr-Sommerfeld equation using the viscous Green function and split-Gaussian quadrature, Journal of Computational and Applied Mathematics, 224:1, (397-404), Online publication date: 1-Feb-2009.
  250. Hendriks R, Heusdens R, Jensen J and Kjems U (2009). Low complexity DFT-domain noise PSD tracking using high-resolution periodograms, EURASIP Journal on Advances in Signal Processing, 2009, (15-15), Online publication date: 1-Jan-2009.
  251. Saxena V, Agrawal P, Sabharwal Y, Garg V, Kuruvilla V and Gunnels J Optimization of BLAS on the cell processor Proceedings of the 15th international conference on High performance computing, (18-29)
  252. ACM
    Baranoski G and Krishnaswamy A Light interaction with human skin ACM SIGGRAPH ASIA 2008 courses, (1-80)
  253. Hovsepian K, Anselmo P and Mazumdar S A modeling-based classification algorithm validated with simulated data Proceedings of the 40th Conference on Winter Simulation, (768-776)
  254. Driscoll T, Bornemann F and Trefethen L (2008). The chebop system for automatic solution of differential equations, BIT, 48:4, (701-723), Online publication date: 1-Dec-2008.
  255. Luisier M and Klimeck G A multi-level parallel simulation approach to electron transport in nano-scale transistors Proceedings of the 2008 ACM/IEEE conference on Supercomputing, (1-10)
  256. ACM
    Drake J, Worley P and D’Azevedo E (2008). Algorithm 888, ACM Transactions on Mathematical Software, 35:3, (1-23), Online publication date: 1-Oct-2008.
  257. ACM
    Chen Y, Davis T, Hager W and Rajamanickam S (2008). Algorithm 887, ACM Transactions on Mathematical Software, 35:3, (1-14), Online publication date: 1-Oct-2008.
  258. ACM
    Dumas J, Giorgi P and Pernet C (2008). Dense Linear Algebra over Word-Size Prime Fields, ACM Transactions on Mathematical Software, 35:3, (1-42), Online publication date: 1-Oct-2008.
  259. Kågström B, Kressner D, Quintana-Ortí E and Quintana-Ortí G (2008). Blocked algorithms for the reduction to Hessenberg-triangular form revisited, BIT, 48:3, (563-584), Online publication date: 1-Sep-2008.
  260. Salawdeh I, César E, Morajko A, Margalef T and Luque E Performance Model for Parallel Mathematical Libraries Based on Historical Knowledgebase Proceedings of the 14th international Euro-Par conference on Parallel Processing, (110-119)
  261. Vidal A, Garcia V, Alonso P and Bernabeu M (2008). Parallel computation of the eigenvalues of symmetric Toeplitz matrices through iterative methods, Journal of Parallel and Distributed Computing, 68:8, (1113-1121), Online publication date: 1-Aug-2008.
  262. ACM
    Marques O, Vömel C, Demmel J and Parlett B (2008). Algorithm 880, ACM Transactions on Mathematical Software, 35:1, (1-13), Online publication date: 22-Jul-2008.
  263. ACM
    Bientinesi P, Gunter B and Geijn R (2008). Families of algorithms related to the inversion of a Symmetric Positive Definite matrix, ACM Transactions on Mathematical Software, 35:1, (1-22), Online publication date: 22-Jul-2008.
  264. ACM
    Buttari A, Dongarra J, Kurzak J, Luszczek P and Tomov S (2008). Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy, ACM Transactions on Mathematical Software, 34:4, (1-22), Online publication date: 15-Jul-2008.
  265. ACM
    Lee F and Bailer W Organizing rushes video by visually similar setting Proceedings of the 2008 international conference on Content-based image and video retrieval, (279-288)
  266. Chanrion O and Neubert T (2008). A PIC-MCC code for simulation of streamer propagation in air, Journal of Computational Physics, 227:15, (7222-7245), Online publication date: 1-Jul-2008.
  267. ACM
    Youseff L, Seymour K, You H, Dongarra J and Wolski R The impact of paravirtualized memory hierarchy on linear algebra computational kernels and software Proceedings of the 17th international symposium on High performance distributed computing, (141-152)
  268. ACM
    Catanzaro B, Keutzer K and Su B Parallelizing CAD Proceedings of the 45th annual Design Automation Conference, (12-17)
  269. ACM
    Jiao X and Zha H Consistent computation of first- and second-order differential quantities for surface meshes Proceedings of the 2008 ACM symposium on Solid and physical modeling, (159-170)
  270. Jaroszewicz S Minimum variance associations Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining, (172-183)
  271. ACM
    Howell G, Demmel J, Fulton C, Hammarling S and Marmol K (2008). Cache efficient bidiagonalization using BLAS 2.5 operators, ACM Transactions on Mathematical Software, 34:3, (1-33), Online publication date: 1-May-2008.
  272. ACM
    Goto K and Geijn R (2008). Anatomy of high-performance matrix multiplication, ACM Transactions on Mathematical Software, 34:3, (1-25), Online publication date: 1-May-2008.
  273. Heyouni M and Sadok H (2008). A new implementation of the CMRH method for solving dense linear systems, Journal of Computational and Applied Mathematics, 213:2, (387-399), Online publication date: 20-Mar-2008.
  274. ACM
    Sala M, Stanley K and Heroux M (2008). On the design of interfaces to sparse direct solvers, ACM Transactions on Mathematical Software, 34:2, (1-22), Online publication date: 1-Mar-2008.
  275. ACM
    Avron H, Shklarski G and Toledo S (2008). Parallel unsymmetric-pattern multifrontal sparse LU with column preordering, ACM Transactions on Mathematical Software, 34:2, (1-31), Online publication date: 1-Mar-2008.
  276. ACM
    Chan E, Van Zee F, Bientinesi P, Quintana-Orti E, Quintana-Orti G and van de Geijn R SuperMatrix Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, (123-132)
  277. ACM
    Guo J, Bikshandi G, Fraguela B, Garzaran M and Padua D Programming with tiles Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, (111-122)
  278. ACM
    Diamond J, Robatmili B, Keckler S, van de Geijn R, Goto K and Burger D High performance dense linear algebra on a spatially distributed processor Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, (63-72)
  279. ACM
    Kressner D (2008). Block variants of Hammarling's method for solving Lyapunov equations, ACM Transactions on Mathematical Software, 34:1, (1-15), Online publication date: 1-Jan-2008.
  280. Granat R, Kågström B and Kressner D (2007). Computing periodic deflating subspaces associated with a specified set of eigenvalues, BIT, 47:4, (763-791), Online publication date: 1-Dec-2007.
  281. ACM
    Schnitter S, Hartleb F and Horneffer M Quality-of-service class specific traffic matrices in IP/MPLS networks Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, (253-258)
  282. Remón A, Quintana-Ortí E and Quintana-Ortí G Parallel solution of band linear systems in model reduction Proceedings of the 7th international conference on Parallel processing and applied mathematics, (678-687)
  283. Buttari A, Langou J, Kurzak J and Dongarra J Parallel tiled QR factorization for multicore architectures Proceedings of the 7th international conference on Parallel processing and applied mathematics, (639-648)
  284. Waśniewski J and Gustavson F Three versions of a minimal storage Cholesky algorithm using new data structures gives high performance speeds as verified on many computers Proceedings of the 7th international conference on Parallel processing and applied mathematics, (622-627)
  285. Gustavson F The relevance of new data structure approaches for dense linear algebra in the new multi-core/many core environments Proceedings of the 7th international conference on Parallel processing and applied mathematics, (618-621)
  286. Korch M and Rauber T Locality optimized shared-memory implementations of iterated Runge-Kutta methods Proceedings of the 13th international Euro-Par conference on Parallel Processing, (737-747)
  287. Luszczek P and Dongarra J (2007). High Performance Development for High End Computing With Python Language Wrapper (PLW), International Journal of High Performance Computing Applications, 21:3, (360-369), Online publication date: 1-Aug-2007.
  288. ACM
    Chan E, Quintana-Orti E, Quintana-Orti G and van de Geijn R Supermatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures, (116-125)
  289. ACM
    Yotov K, Roeder T, Pingali K, Gunnels J and Gustavson F An experimental comparison of cache-oblivious and cache-conscious programs Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures, (93-104)
  290. ACM
    Zhang H, Smith B, Sternberg M and Zapol P (2007). SIPs, ACM Transactions on Mathematical Software, 33:2, (9-es), Online publication date: 1-Jun-2007.
  291. de Moura Pinto F and Freitas C Design of multi-dimensional transfer functions using dimensional reduction Proceedings of the 9th Joint Eurographics / IEEE VGTC conference on Visualization, (131-138)
  292. Cawley G and Talbot N (2007). Preventing Over-Fitting during Model Selection via Bayesian Regularisation of the Hyper-Parameters, The Journal of Machine Learning Research, 8, (841-861), Online publication date: 1-May-2007.
  293. ACM
    Morandini M and Mantegazza P (2007). Using dense storage to solve small sparse linear systems, ACM Transactions on Mathematical Software, 33:1, (5-es), Online publication date: 1-Mar-2007.
  294. Qi L, Qian L, Woodruff S and Cartes D (2007). Prony analysis for power system transient harmonics, EURASIP Journal on Advances in Signal Processing, 2007:1, (170-170), Online publication date: 1-Jan-2007.
  295. Bernabeu M and Vidal A Static versus dynamic heterogeneous parallel schemes to solve the symmetric tridiagonal eigenvalue problem Proceedings of the 6th WSEAS international conference on Applied computer science, (301-306)
  296. ACM
    Dhillon I, Parlett B and Vömel C (2006). The design and implementation of the MRRR algorithm, ACM Transactions on Mathematical Software, 32:4, (533-560), Online publication date: 1-Dec-2006.
  297. ACM
    Kressner D (2006). Block algorithms for reordering standard and generalized Schur forms, ACM Transactions on Mathematical Software, 32:4, (521-532), Online publication date: 1-Dec-2006.
  298. ACM
    Langou J, Langou J, Luszczek P, Kurzak J, Buttari A and Dongarra J Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems) Proceedings of the 2006 ACM/IEEE conference on Supercomputing, (113-es)
  299. Sangwine S and Le Bihan N (2006). Quaternion singular value decomposition based on bidiagonalization to a real or complex matrix using quaternion Householder transformations, Applied Mathematics and Computation, 182:1, (727-738), Online publication date: 1-Nov-2006.
  300. ACM
    Akar N A matrix analytical method for the discrete time Lindley equation using the generalized Schur decomposition Proceeding from the 2006 workshop on Tools for solving structured Markov chains, (12-es)
  301. Hetmaniuk U and Lehoucq R (2006). Basis selection in LOBPCG, Journal of Computational Physics, 218:1, (324-332), Online publication date: 10-Oct-2006.
  302. van Hoeve W (2006). Exploiting semidefinite relaxations in constraint programming, Computers and Operations Research, 33:10, (2787-2804), Online publication date: 1-Oct-2006.
  303. Scitovski R, Kralik G, Sabo K and Jelen T (2006). A mathematical model of controlling the growth of tissue in pigs, Applied Mathematics and Computation, 181:2, (1126-1138), Online publication date: 1-Oct-2006.
  304. Leykin A and Verschelde J Interfacing with the numerical homotopy algorithms in PHCpack Proceedings of the Second international conference on Mathematical Software, (354-360)
  305. Gradl T, Spörl A, Huckle T, Glaser S and Schulte-Herbrüggen T Parallelising matrix operations on clusters for an optimal control-based quantum compiler Proceedings of the 12th international conference on Parallel Processing, (751-762)
  306. Badía J, Benner P, Mayo R and Quintana-Ortí E Parallel solution of large-scale and sparse generalized algebraic Riccati equations Proceedings of the 12th international conference on Parallel Processing, (710-719)
  307. Karátson J, Kurics T and Lirkov I A parallel algorithm for systems of convection-diffusion equations Proceedings of the 6th international conference on Numerical methods and applications, (65-73)
  308. Zhuo L and Prasanna V Scalable Hybrid Designs for Linear Algebra on Reconfigurable Computing Systems Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1, (87-95)
  309. Sørensen T and Mosegaard J An introduction to GPU accelerated surgical simulation Proceedings of the Third international conference on Biomedical Simulation, (93-104)
  310. Gustavson F and Waśniewski J Rectangular full packed format for LAPACK algorithms timings on several computers Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing, (570-579)
  311. Gustavson F, Gunnels J and Sexton J Minimal data copy for dense linear algebra factorization Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing, (540-549)
  312. Hammarling S, Higham N and Lucas C LAPACK-style codes for pivoted Cholesky and QR updating Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing, (137-146)
  313. Granat R and Kågström B Parallel algorithms and condition estimators for standard and generalized triangular Sylvester-type matrix equations Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing, (127-136)
  314. Adlerborn B, Kågström B and Kressner D Parallel variants of the multishift QZ algorithm with advanced deflation techniques Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing, (117-126)
  315. Daydé M, Hurault A and Pantel M Semantic-based service trading Proceedings of the 7th international conference on High performance computing for computational science, (622-633)
  316. Marques O and Vasconcelos P Evaluation of linear solvers for astrophysics transfer problems Proceedings of the 7th international conference on High performance computing for computational science, (466-475)
  317. Drummond L, Galiano V, Marques O, Migallón V and Penadés J PyACTS Proceedings of the 7th international conference on High performance computing for computational science, (417-425)
  318. Flores-Becerra G, Garcia V and Vidal A Efficient parallel algorithm for constructing a unit triangular matrix with prescribed singular values Proceedings of the 7th international conference on High performance computing for computational science, (349-362)
  319. ACM
    Milenkovic V and Sacks E An approximate arrangement algorithm for semi-algebraic curves Proceedings of the twenty-second annual symposium on Computational geometry, (237-246)
  320. ACM
    Jiang L and Su Z Osprey Proceedings of the 28th international conference on Software engineering, (262-271)
  321. ACM
    Hoi S, Jin R and Lyu M Large-scale text categorization by batch mode active learning Proceedings of the 15th international conference on World Wide Web, (633-642)
  322. Liu C (2006). Capitalize on Dimensionality Increasing Techniques for Improving Face Recognition Grand Challenge Performance, IEEE Transactions on Pattern Analysis and Machine Intelligence, 28:5, (725-737), Online publication date: 1-May-2006.
  323. O'Leary D (2006). Computer Memory and Arithmetic, Computing in Science and Engineering, 8:3, (54-59), Online publication date: 1-May-2006.
  324. Keith D, Hoge C, Frank R and Malony A Parallel ICA methods for EEG neuroimaging Proceedings of the 20th international conference on Parallel and distributed processing, (61-61)
  325. ACM
    Lastovetsky A, Reddy R and Higgins R Building the functional performance model of a processor Proceedings of the 2006 ACM symposium on Applied computing, (746-753)
  326. Dongarra J, Bosilca G, Chen Z, Eijkhout V, Fagg G, Fuentes E, Langou J, Luszczek P, Pjesivac-Grbovic J, Seymour K, You H and Vadhiyar S (2006). Self-adapting numerical software (SANS) effort, IBM Journal of Research and Development, 50:2/3, (223-238), Online publication date: 1-Mar-2006.
  327. Korch M and Rauber T (2006). Optimizing locality and scalability of embedded Runge-Kutta solvers using block-based pipelining, Journal of Parallel and Distributed Computing, 66:3, (444-468), Online publication date: 1-Mar-2006.
  328. Polizzi E and Sameh A (2006). A parallel hybrid banded system solver, Parallel Computing, 32:2, (177-194), Online publication date: 1-Feb-2006.
  329. Mangasarian O and Wild E (2006). Multisurface Proximal Support Vector Machine Classification via Generalized Eigenvalues, IEEE Transactions on Pattern Analysis and Machine Intelligence, 28:1, (69-74), Online publication date: 1-Jan-2006.
  330. Xue J Aggressive loop fusion for improving locality and parallelism Proceedings of the Third international conference on Parallel and Distributed Processing and Applications, (224-238)
  331. Recine G, Rosen B and Cui H (2005). Numerical simulation of two-dimensional electron transport in cylindrical nanostructures using Wigner function methods, Journal of Computational Physics, 209:2, (421-447), Online publication date: 1-Nov-2005.
  332. Rünger G and Schwind M Comparison of different parallel modified Gram-Schmidt algorithms Proceedings of the 11th international Euro-Par conference on Parallel Processing, (826-836)
  333. Lourakis M and Argyros A (2005). Efficient, causal camera tracking in unprepared environments, Computer Vision and Image Understanding, 99:2, (259-290), Online publication date: 1-Aug-2005.
  334. ACM
    Krüger J and Westermann R Linear algebra operators for GPU implementation of numerical algorithms ACM SIGGRAPH 2005 Courses, (234-es)
  335. Steck T and Meyer G A 7-step approach to the design and implementation of parallel algorithms Proceedings of the 7th WSEAS International Conference on Applied Mathematics, (1-6)
  336. Fung G and Mangasarian O (2005). Multicategory Proximal Support Vector Machine Classifiers, Machine Learning, 59:1-2, (77-97), Online publication date: 1-May-2005.
  337. ACM
    Kavan L and Žára J Spherical blend skinning Proceedings of the 2005 symposium on Interactive 3D graphics and games, (9-16)
  338. Zobeley J, Lebiedz D, Kammerer J, Ishmurzin A and Kummer U A new time-dependent complexity reduction method for biochemical systems Transactions on Computational Systems Biology I, (90-110)
  339. Becerra G and Maciá A Parallel global and local convergent algorithms for solving the inverse additive singular value problem Proceedings of the 4th WSEAS International Conference on Systems Theory and Scientific Computation, (1-6)
  340. Zaldívar F, Maciá A and Salvador A A parallel algorithm based on a variant of the Kalman filter for solving the RLS problem Proceedings of the 4th WSEAS International Conference on Signal Processing, Computational Geometry & Artificial Vision, (1-6)
  341. Shen K, Chu L and Yang T Supporting Cluster-Based Network Services on Functionally Symmetric Software Architecture Proceedings of the 2004 ACM/IEEE conference on Supercomputing
  342. Hu X and Xu L (2004). A comparative investigation on subspace dimension determination, Neural Networks, 17:8-9, (1051-1059), Online publication date: 1-Oct-2004.
  343. ACM
    Matthey T, Cickovski T, Hampton S, Ko A, Ma Q, Nyerges M, Raeder T, Slabach T and Izaguirre J (2004). ProtoMol, an object-oriented framework for prototyping novel algorithms for molecular dynamics, ACM Transactions on Mathematical Software, 30:3, (237-265), Online publication date: 1-Sep-2004.
  344. Gansterer W, Bai Y, Day R and Ward R (2004). A Framework for Approximating Eigenpairs in Electronic Structure Computations, Computing in Science and Engineering, 6:5, (50-59), Online publication date: 1-Sep-2004.
  345. ACM
    Chen Y, Bindel D, Song H and Katz R (2004). An algebraic approach to practical and scalable overlay network monitoring, ACM SIGCOMM Computer Communication Review, 34:4, (55-66), Online publication date: 30-Aug-2004.
  346. ACM
    Chen Y, Bindel D, Song H and Katz R An algebraic approach to practical and scalable overlay network monitoring Proceedings of the 2004 conference on Applications, technologies, architectures, and protocols for computer communications, (55-66)
  347. ACM
    Fatahalian K, Sugerman J and Hanrahan P Understanding the efficiency of GPU algorithms for matrix-matrix multiplication Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware, (133-137)
  348. Rauber T and Rünger G (2004). Improving locality for ODE solvers by program transformations, Scientific Programming, 12:3, (133-154), Online publication date: 1-Aug-2004.
  349. Arias E and Hernández V Numerical integration of the differential Riccati equation Proceedings of the 6th international conference on High Performance Computing for Computational Science, (671-684)
  350. Cunha M, Telles J and Coutinho A Parallel boundary elements Proceedings of the 6th international conference on High Performance Computing for Computational Science, (514-526)
  351. Benner P, Quintana-Ortí E and Quintana-Ortí G Parallel model reduction of large linear descriptor systems via balanced truncation Proceedings of the 6th international conference on High Performance Computing for Computational Science, (340-353)
  352. Granat R and Kågström B Evaluating parallel algorithms for solving Sylvester-type matrix equations Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing, (719-729)
  353. Lee K and Bojańczyk A ALPS Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing, (423-432)
  354. Elmroth E and Skelander R Semi-automatic generation of grid computing interfaces for numerical software libraries Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing, (404-412)
  355. Kågström B Management of deep memory hierarchies Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing, (21-32)
  356. Gunnels J and Gustavson F A new array format for symmetric and triangular matrices Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing, (247-255)
  357. Barnes D and Hopkins T Applying software testing metrics to LAPACK Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing, (228-236)
  358. Gustavson F and Waśniewski J High performance linear algebra algorithms Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing, (225-227)
  359. ACM
    Yi Q, Kennedy K, You H, Seymour K and Dongarra J Automatic blocking of QR and LU factorizations for locality Proceedings of the 2004 workshop on Memory system performance, (12-22)
  360. Yi Q, Kennedy K and Adve V (2004). Transforming Complex Loop Nests for Locality, The Journal of Supercomputing, 27:3, (219-264), Online publication date: 1-Mar-2004.
  361. Korch M, Rauber T and Rünger G Performance optimization of RK methods using block-based pipelining Performance analysis and grid computing, (41-56)
  362. ACM
    Menon V, Pingali K and Mateev N (2003). Fractal symbolic analysis, ACM Transactions on Programming Languages and Systems, 25:6, (776-813), Online publication date: 1-Nov-2003.
  363. Chen Z, Dongarra J, Luszczek P and Roche K (2003). Self-adapting software for numerical linear algebra and LAPACK for clusters, Parallel Computing, 29:11-12, (1723-1743), Online publication date: 1-Nov-2003.
  364. Sim L, Leedham G, Jian L and Schroder H (2003). Fast solution of large N × N matrix equations in an MIMD-SIMD hybrid system, Parallel Computing, 29:11-12, (1669-1684), Online publication date: 1-Nov-2003.
  365. ACM
    Chen Y, Bindel D and Katz R Tomography-based overlay network monitoring Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement, (216-231)
  366. Dellar P (2003). Incompressible limits of lattice Boltzmann equations using multiple relaxation times, Journal of Computational Physics, 190:2, (351-370), Online publication date: 20-Sep-2003.
  367. ACM
    Krüger J and Westermann R Linear algebra operators for GPU implementation of numerical algorithms ACM SIGGRAPH 2003 Papers, (908-916)
  368. ACM
    Krüger J and Westermann R (2003). Linear algebra operators for GPU implementation of numerical algorithms, ACM Transactions on Graphics, 22:3, (908-916), Online publication date: 1-Jul-2003.
  369. ACM
    Zhang Y, Roughan M, Duffield N and Greenberg A Fast accurate computation of large-scale IP traffic matrices from link loads Proceedings of the 2003 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, (206-217)
  370. ACM
    Zhang Y, Roughan M, Duffield N and Greenberg A (2003). Fast accurate computation of large-scale IP traffic matrices from link loads, ACM SIGMETRICS Performance Evaluation Review, 31:1, (206-217), Online publication date: 10-Jun-2003.
  371. Chen Z, Dongarra J, Luszczek P and Roche K Self-adapting software for numerical linear algebra library routines on clusters Proceedings of the 2003 international conference on Computational science: Part III, (665-672)
  372. Teranishi K, Raghavan P and Yang C Time-memory trade-offs using sparse matrix methods for large-scale eigenvalue problems Proceedings of the 2003 international conference on Computational science and its applications: Part I, (840-847)
  373. Bastoul C and Feautrier P Improving data locality by chunking Proceedings of the 12th international conference on Compiler construction, (320-334)
  374. Addison C, Ren Y and van Waveren M (2003). OpenMP issues arising in the development of parallel BLAS and LAPACK libraries, Scientific Programming, 11:2, (95-104), Online publication date: 1-Apr-2003.
  375. Dongarra J, Foster I, Fox G, Gropp W, Kennedy K, Torczon L and White A References Sourcebook of parallel computing, (729-789)
  376. ACM
    O'Brien J, Shen C and Gatchalian C Synthesizing sounds from rigid-body simulations Proceedings of the 2002 ACM SIGGRAPH/Eurographics symposium on Computer animation, (175-181)
  377. Toledo S and Rabani E (2002). Very Large Electronic Structure Calculations Using an Out-of-Core Filter-Diagonalization Method, Journal of Computational Physics, 180:1, (256-269), Online publication date: 20-Jul-2002.
  378. Fortune S (2002). An Iterated Eigenvalue Algorithm for Approximating Roots of Univariate Polynomials, Journal of Symbolic Computation, 33:5, (627-646), Online publication date: 1-May-2002.
  379. ACM
    Chen J and He L A decoupling method for analysis of coupled RLC interconnects Proceedings of the 12th ACM Great Lakes symposium on VLSI, (41-46)
  380. Becka M, Oksa G and Vajtersic M (2002). Dynamic ordering for a parallel block-Jacobi SVD algorithm, Parallel Computing, 28:2, (243-262), Online publication date: 1-Feb-2002.
  381. Elkins D and Wortman M (2001). On Numerical Solution of the Markov Renewal Equation: Tight Upper and Lower Kernel Bounds, Methodology and Computing in Applied Probability, 3:3, (239-253), Online publication date: 1-Sep-2001.
  382. ACM
    Fung G and Mangasarian O Proximal support vector machine classifiers Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, (77-86)
  383. ACM
    Hiroyuki S Array form representation of idiom recognition system for numerical programs Proceedings of the 2001 conference on APL: an arrays odyssey, (87-98)
  384. ACM
    Rauber T and Rünger G Optimizing locality for ODE solvers Proceedings of the 15th international conference on Supercomputing, (123-132)
  385. ACM
    Mateev N, Menon V and Pingali K Fractal symbolic analysis Proceedings of the 15th international conference on Supercomputing, (38-49)
  386. ACM
    Seymour K and Dongarra J Automatic translation of Fortran to JVM bytecode Proceedings of the 2001 joint ACM-ISCOPE conference on Java Grande, (126-133)
  387. Kodukula I and Pingali K (2001). Data-Centric Transformations for Locality Enhancement, International Journal of Parallel Programming, 29:3, (319-364), Online publication date: 1-Jun-2001.
  388. ACM
    Hiroyuki S (2000). Array form representation of idiom recognition system for numerical programs, ACM SIGAPL APL Quote Quad, 31:2, (87-98), Online publication date: 1-Dec-2000.
  389. Ahmed N, Mateev N and Pingali K Tiling imperfectly-nested loop nests Proceedings of the 2000 ACM/IEEE conference on Supercomputing, (31-es)
  390. Gustavson F and Jonsson I (2000). Minimal-storage high-performance Cholesky factorization via blocking and recursion, IBM Journal of Research and Development, 44:6, (823-850), Online publication date: 1-Nov-2000.
  391. ACM
    Luján M, Freeman T and Gurd J (2000). OoLALA, ACM SIGPLAN Notices, 35:10, (229-252), Online publication date: 1-Oct-2000.
  392. ACM
    Luján M, Freeman T and Gurd J OoLALA Proceedings of the 15th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications, (229-252)
  393. Kargupta H, Huang W, Sivakumar K, Park B and Wang S Collective Principal Component Analysis from Distributed, Heterogeneous Data Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery, (452-457)
  394. Benner P, Castillo M, Quintana-Ortí E and Hernández V (2000). Parallel Partial Stabilizing Algorithms for Large Linear Control Systems, The Journal of Supercomputing, 15:2, (193-206), Online publication date: 1-Feb-2000.
  395. Petitet A and Dongarra J (1999). Algorithmic Redistribution Methods for Block-Cyclic Decompositions, IEEE Transactions on Parallel and Distributed Systems, 10:12, (1201-1216), Online publication date: 1-Dec-1999.
  396. Lê D and Stott Parker D (1999). Using randomization to make recursive matrix algorithms practical, Journal of Functional Programming, 9:6, (605-624), Online publication date: 1-Nov-1999.
  397. Triggs B, McLauchlan P, Hartley R and Fitzgibbon A Bundle Adjustment - A Modern Synthesis Proceedings of the International Workshop on Vision Algorithms: Theory and Practice, (298-372)
  398. ACM
    Chatterjee S, Jain V, Lebeck A, Mundhra S and Thottethodi M Nonlinear array layouts for hierarchical memory systems Proceedings of the 13th international conference on Supercomputing, (444-453)
  399. ACM
    Menon V and Pingali K High-level semantic optimization of numerical codes Proceedings of the 13th international conference on Supercomputing, (434-443)
  400. ACM
    Keyser J, Culver T, Manocha D and Krishnan S MAPC Proceedings of the fifteenth annual symposium on Computational geometry, (360-369)
  401. Whaley R and Dongarra J Automatically tuned linear algebra software Proceedings of the 1998 ACM/IEEE conference on Supercomputing, (1-27)
  402. Li X and Demmel J Making sparse Gaussian elimination scalable by static pivoting Proceedings of the 1998 ACM/IEEE conference on Supercomputing, (1-17)
  403. Casanova H and Dongarra J (1998). Applying NetSolve's Network-Enabled Server, IEEE Computational Science & Engineering, 5:3, (57-67), Online publication date: 1-Jul-1998.
  404. ACM
    Menon V and Trefethen A MultiMATLAB Proceedings of the 1997 ACM/IEEE conference on Supercomputing, (1-18)
  405. Kapur S and Long D IES3 Proceedings of the 1997 IEEE/ACM international conference on Computer-aided design, (448-455)
  406. Tisseur F (1997). Parallel Implementation of the Yau and Lu Method for Eigenvalue Computation, International Journal of High Performance Computing Applications, 11:3, (197-204), Online publication date: 1-Sep-1997.
  407. ACM
    Kodukula I, Ahmed N and Pingali K (1997). Data-centric multi-level blocking, ACM SIGPLAN Notices, 32:5, (346-357), Online publication date: 1-May-1997.
  408. ACM
    Kodukula I, Ahmed N and Pingali K Data-centric multi-level blocking Proceedings of the ACM SIGPLAN 1997 conference on Programming language design and implementation, (346-357)
  409. ACM
    Manocha D and Krishnan S (1996). Solving algebraic systems using matrix computations, ACM SIGSAM Bulletin, 30:4, (4-21), Online publication date: 1-Dec-1996.
  410. Charny B (1996). Matrix Partitioning on a Virtual Shared Memory Parallel Machine, IEEE Transactions on Parallel and Distributed Systems, 7:4, (343-355), Online publication date: 1-Apr-1996.
  411. Plank J, Kim Y and Dongarra J Algorithm-Based Diskless Checkpointing for Fault-Tolerant Matrix Operations Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
  412. ACM
    Krishnan S and Manocha D Numeric-symbolic algorithms for evaluating one-dimensional algebraic sets Proceedings of the 1995 international symposium on Symbolic and algebraic computation, (59-67)
  413. Saini S and Simon H Applications performance under OSF/1 AD and SUNMOS on Intel Paragon XP/S-15 Proceedings of the 1994 ACM/IEEE conference on Supercomputing, (580-589)
  414. ACM
    Manocha D Computing selected solutions of polynomial equations Proceedings of the international symposium on Symbolic and algebraic computation, (1-8)
  415. ACM
    Manocha D Solving polynomial systems for curve, surface and solid modeling Proceedings on the second ACM symposium on Solid modeling and applications, (169-178)
  416. Dongarra J (1993). Linear Algebra Libraries for High-Performance Computers, IEEE Parallel & Distributed Technology: Systems & Technology, 1:1, (17-24), Online publication date: 1-Feb-1993.
Contributors
  • The University of Tennessee, Knoxville
  • University of California, Davis
  • Technical University of Darmstadt
  • The University of Tennessee System
  • University of California, Berkeley
  • The University of Tennessee, Knoxville
  • Numerical Algorithms Group
  • The University of Manchester
  • University of Washington
  • Rice University
  • Rice University

Recommendations