Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
Skip header Section
Solving problems on concurrent processors. Vol. 1: General techniques and regular problemsJuly 1988
Publisher:
  • Prentice-Hall, Inc.
  • Division of Simon and Schuster One Lake Street Upper Saddle River, NJ
  • United States
ISBN:978-0-13-823022-7
Published:01 July 1988
Pages:
592
Skip Bibliometrics Section
Reflects downloads up to 03 Sep 2024Bibliometrics
Abstract

No abstract available.

Cited By

  1. Min M, Lan Y, Fischer P, Merzari E, Kerkemeier S, Phillips M, Rathnayake T, Novak A, Gaston D, Chalmers N and Warburton T Optimization of full-core reactor simulations on summit Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, (1-11)
  2. ACM
    Burtscher M, Devale S, Azimi S, Jaiganesh J and Powers E (2018). A High-Quality and Fast Maximal Independent Set Implementation for GPUs, ACM Transactions on Parallel Computing, 5:2, (1-27), Online publication date: 23-Jan-2019.
  3. Duan J and Yang Y (2018). MCL: A Cost-Efficient Nonblocking Multicast Interconnection Network, IEEE Transactions on Parallel and Distributed Systems, 29:9, (2046-2058), Online publication date: 1-Sep-2018.
  4. Sheraton M and Sloot P Parallel Performance Analysis of Bacterial Biofilm Simulation Models Computational Science – ICCS 2018, (496-505)
  5. Duan J and Yang Y A Cost Efficient Multicast Nonblocking Interconnection Network GLOBECOM 2017 - 2017 IEEE Global Communications Conference, (1-7)
  6. Ray N, Grindeanu I, Zhao X, Mahadevan V and Jiao X (2017). Array-based, parallel hierarchical mesh refinement algorithms for unstructured meshes, Computer-Aided Design, 85:C, (68-82), Online publication date: 1-Apr-2017.
  7. Sikora A, Margalef T and Jorba J (2016). Automated and dynamic abstraction of MPI application performance, Cluster Computing, 19:3, (1105-1137), Online publication date: 1-Sep-2016.
  8. Zaitsev D, Shmeleva T, Retschitzegger W and Pröll B (2016). Security of grid structures under disguised traffic attacks, Cluster Computing, 19:3, (1183-1200), Online publication date: 1-Sep-2016.
  9. ACM
    Peplinski A, Fischer P and Schlatter P Parallel performance of h-type Adaptive Mesh Refinement for Nek5000 Proceedings of the Exascale Applications and Software Conference 2016, (1-9)
  10. Oryspayev D, Aktulga H, Sosonkina M, Maris P and Vary J (2015). Performance analysis of distributed symmetric sparse matrix vector multiplication algorithm for multi-core architectures, Concurrency and Computation: Practice & Experience, 27:17, (5019-5036), Online publication date: 10-Dec-2015.
  11. Ben Youssef B (2015). A parallel cellular automata algorithm for the deterministic simulation of 3-D multicellular tissue growth, Cluster Computing, 18:4, (1561-1579), Online publication date: 1-Dec-2015.
  12. Sikora A, Margalef T and Jorba J (2018). Online root-cause performance analysis of parallel applications, Parallel Computing, 48:C, (81-107), Online publication date: 1-Oct-2015.
  13. Touzene A (2015). All-To-All Broadcast in Hexagonal Torus Networks On-Chip, IEEE Transactions on Parallel and Distributed Systems, 26:9, (2410-2420), Online publication date: 1-Sep-2015.
  14. Träff J mpicroscope Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface, (100-109)
  15. ACM
    Fu J, Latham R, Min M and Carothers C I/O threads to reduce checkpoint blocking for an electromagnetics solver on Blue Gene/P and Cray XK6 Proceedings of the 2nd International Workshop on Runtime and Operating Systems for Supercomputers, (1-8)
  16. Hawick K and Playne D Hard-sphere collision simulations with multiple GPUs, PCIe extension buses and GPU-GPU communications Proceedings of the Tenth Australasian Symposium on Parallel and Distributed Computing - Volume 127, (13-22)
  17. ACM
    Yoo A, Baker A, Pearce R and Van Emden Henson A scalable eigensolver for large scale-free graphs using 2D graph partitioning Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, (1-11)
  18. ACM
    Salmon J, Moraes M, Dror R and Shaw D Parallel random numbers Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, (1-12)
  19. ACM
    Carrington L, Tikir M, Olschanowsky C, Laurenzano M, Peraza J, Snavely A and Poole S An idiom-finding tool for increasing productivity of accelerators Proceedings of the international conference on Supercomputing, (202-212)
  20. ACM
    Ben Youssef B Using surface effect measures to model parallel performance Proceedings of the 2010 ACM Symposium on Applied Computing, (2400-2401)
  21. Korkhov V, Krzhizhanovskaya V and Sloot P (2008). A Grid-based Virtual Reactor, Journal of Parallel and Distributed Computing, 68:5, (596-608), Online publication date: 1-May-2008.
  22. ACM
    Guo J, Bikshandi G, Fraguela B, Garzaran M and Padua D Programming with tiles Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, (111-122)
  23. ACM
    Snyder L The design and development of ZPL Proceedings of the third ACM SIGPLAN conference on History of programming languages, (8-1-8-37)
  24. Ben Youssef B, Gang Cheng , Zygourakis K and Markenscoff P (2007). Regular Paper, International Journal of High Performance Computing Applications, 21:2, (196-209), Online publication date: 1-May-2007.
  25. Wang Y Relations between two common types of rectangular tilings Proceedings of the 17th international conference on Algorithms and Computation, (193-202)
  26. Damaj I (2006). Parallel algorithms development for programmable logic devices, Advances in Engineering Software, 37:9, (561-582), Online publication date: 1-Sep-2006.
  27. ACM
    Krishnan M and Nieplocha J Memory efficient parallel matrix multiplication operation for irregular problems Proceedings of the 3rd conference on Computing frontiers, (229-240)
  28. ACM
    Chan E, van de Geijn R, Gropp W and Thakur R Collective communication on architectures that support simultaneous communication over multiple links Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming, (2-11)
  29. Izaguirre J, Hampton S and Matthey T (2005). Parallel multigrid summation for the N-body problem, Journal of Parallel and Distributed Computing, 65:8, (949-962), Online publication date: 1-Aug-2005.
  30. ACM
    Wiese K, Hendriks A, Deschênes A and Youssef B The impact of pseudorandom number quality on P-RnaPredict, a parallel genetic algorithm for RNA secondary structure prediction Proceedings of the 7th annual conference on Genetic and evolutionary computation, (479-480)
  31. Koenig G and Kale L Using Message-Driven Objects to Mask Latency in Grid Computing Applications Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
  32. Muthukrishnan S and Suel T (2005). Approximation algorithms for array partitioning problems, Journal of Algorithms, 54:1, (85-104), Online publication date: 1-Jan-2005.
  33. Shu J, Wang B and Zheng W Cluster-Based parallel simulation for large scale molecular dynamics in microscale thermophysics Proceedings of the Second international conference on Parallel and Distributed Processing and Applications, (200-211)
  34. References Grid resource management, (507-566)
  35. ACM
    Yang L, Schopf J and Foster I Conservative Scheduling Proceedings of the 2003 ACM/IEEE conference on Supercomputing
  36. Yamamoto Y, Igai M and Naono K A vector-parallel FFT with a user-specifiable data distribution scheme Proceedings of the 2003 international conference on Parallel and distributed processing and applications, (362-374)
  37. Santos E (2019). Parallel Complexity of Matrix Multiplication, The Journal of Supercomputing, 25:2, (155-175), Online publication date: 1-Jun-2003.
  38. Lu Q and Getov V (2018). Mixed-language high-performance computing for plasma simulations, Scientific Programming, 11:1, (57-66), Online publication date: 1-Jan-2003.
  39. Lui J and Chan M (2002). An Efficient Partitioning Algorithm for Distributed Virtual Environment Systems, IEEE Transactions on Parallel and Distributed Systems, 13:3, (193-211), Online publication date: 1-Mar-2002.
  40. Yang Y and Wang J (2002). Near-Optimal All-to-All Broadcast in Multidimensional All-Port Meshes and Tori, IEEE Transactions on Parallel and Distributed Systems, 13:2, (128-141), Online publication date: 1-Feb-2002.
  41. Chrisochoides N A new approach to parallel mesh generation and partitioning problems Computational science, mathematics and software, (335-359)
  42. Xu C, Wang L and Fong N (2019). Stochastic Prediction of Execution Time for Dynamic Bulk Synchronous Computations, The Journal of Supercomputing, 21:1, (91-103), Online publication date: 1-Jan-2002.
  43. Johnson B, Karthikeyan R and Janaki Ram D (2001). DP, IEEE Transactions on Parallel and Distributed Systems, 12:10, (1052-1065), Online publication date: 1-Oct-2001.
  44. Yang Y and Wang J (2001). Pipelined All-to-All Broadcast in All-Port Meshes and Tori, IEEE Transactions on Computers, 50:10, (1020-1032), Online publication date: 1-Oct-2001.
  45. Kendemir M and Ramanujam J (2001). Data Relation Vectors, IEEE Transactions on Computers, 50:8, (798-810), Online publication date: 1-Aug-2001.
  46. ACM
    Briguglio S, Di Martino B and Vlad G Workload decomposition for particle simulation applications on hierarchical distributed-shared memory parallel systems with integration of HPF and OpenMP Proceedings of the 15th international conference on Supercomputing
  47. Kennedy K and Koelbel C High performance Fortran 2.0 Compiler optimizations for scalable parallel systems, (3-43)
  48. Ding C (2001). An Optimal Index Reshuffle Algorithm for Multidimensional Arrays and Its Applications for Parallel Architectures, IEEE Transactions on Parallel and Distributed Systems, 12:3, (306-315), Online publication date: 1-Mar-2001.
  49. He Y and Ding C (2019). Using Accurate Arithmetics to Improve Numerical Reproducibility and Stability in Parallel Applications, The Journal of Supercomputing, 18:3, (259-277), Online publication date: 1-Mar-2001.
  50. ACM
    He Y and Ding C Using accurate arithmetics to improve numerical reproducibility and stability in parallel applications Proceedings of the 14th international conference on Supercomputing, (225-234)
  51. Liang D and Tripathi S (2000). On Performance Prediction of Parallel Computations with Precedent Constraints, IEEE Transactions on Parallel and Distributed Systems, 11:5, (491-508), Online publication date: 1-May-2000.
  52. Gorlatch S (2000). Toward Formally-Based Design of Message Passing Programs, IEEE Transactions on Software Engineering, 26:3, (276-288), Online publication date: 1-Mar-2000.
  53. Petitet A and Dongarra J (1999). Algorithmic Redistribution Methods for Block-Cyclic Decompositions, IEEE Transactions on Parallel and Distributed Systems, 10:12, (1201-1216), Online publication date: 1-Dec-1999.
  54. Royo D, González A and Valero-García M (1999). Low Communication Overhead Jacobi Algorithms for Eigenvalues Computation on Hypercubes, The Journal of Supercomputing, 14:2, (171-193), Online publication date: 1-Sep-1999.
  55. Bar-Noy A and Ho C (1999). Broadcasting Multiple Messages in the Multiport Model, IEEE Transactions on Parallel and Distributed Systems, 10:5, (500-508), Online publication date: 1-May-1999.
  56. Sun X, Pantano M and Fahringer T (1999). Integrated Range Comparison for Data-Parallel Compilation Systems, IEEE Transactions on Parallel and Distributed Systems, 10:5, (448-458), Online publication date: 1-May-1999.
  57. Liao C and Chung Y (1999). Tree-Based Parallel Load-Balancing Methods for Solution-Adaptive Finite Element Graphs on Distributed Memory Multicomputers, IEEE Transactions on Parallel and Distributed Systems, 10:4, (360-370), Online publication date: 1-Apr-1999.
  58. ACM
    Coddington P and Ko S Techniques for empirical testing of parallel random number generators Proceedings of the 12th international conference on Supercomputing, (282-288)
  59. ACM
    Kennedy K and Kremer U (1998). Automatic data layout for distributed-memory machines, ACM Transactions on Programming Languages and Systems (TOPLAS), 20:4, (869-916), Online publication date: 1-Jul-1998.
  60. ACM
    Entacher K, Uhl A and Wegenkittl S (2019). Linear and inversive pseudorandom numbers for parallel and distributed simulation, ACM SIGSIM Simulation Digest, 28:1, (90-97), Online publication date: 1-Jul-1998.
  61. Entacher K, Uhl A and Wegenkittl S Linear and inversive pseudorandom numbers for parallel and distributed simulation Proceedings of the twelfth workshop on Parallel and distributed simulation, (90-97)
  62. ACM
    Bar-Noy A, Guha S, Naor J and Schieber B Multicasting in heterogeneous networks Proceedings of the thirtieth annual ACM symposium on Theory of computing, (448-453)
  63. John A and Browne J (2018). Compilation of Constraint Programs with Noncyclic and Cyclic Dependencies to Procedural Parallel Programs, International Journal of Parallel Programming, 26:1, (65-119), Online publication date: 1-Feb-1998.
  64. Khanna S, Muthukrishnan S and Paterson M On approximating rectangle tiling and packing Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms, (384-393)
  65. Day W (1997). Expressive Applications of Constraint Logic Programming, Artificial Intelligence Review, 11:6, (427-452), Online publication date: 1-Dec-1997.
  66. Bruck J, Ho C, Upfal E, Kipnis S and Weathersby D (1997). Efficient Algorithms for All-to-All Communications in Multiport Message-Passing Systems, IEEE Transactions on Parallel and Distributed Systems, 8:11, (1143-1156), Online publication date: 1-Nov-1997.
  67. Ou C and Ranka S (1997). Parallel Incremental Graph Partitioning, IEEE Transactions on Parallel and Distributed Systems, 8:8, (884-896), Online publication date: 1-Aug-1997.
  68. Lee P (1997). Efficient Algorithms for Data Distribution on Distributed Memory Parallel Computers, IEEE Transactions on Parallel and Distributed Systems, 8:8, (825-839), Online publication date: 1-Aug-1997.
  69. ACM
    Chrisochoides N, Kodukula I and Pingali K Compiler and run-time support for semi-structured applications Proceedings of the 11th international conference on Supercomputing, (229-236)
  70. ACM
    van Engelen R, Heitlager I, Wolters L and Cats G Incorporating application dependent information in an automatic code generating environment Proceedings of the 11th international conference on Supercomputing, (180-187)
  71. Yun B and Choi D Analysis of heat and fluid flow in a PCB channel using KAICUBE/Hanbit-1 parallel computer Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
  72. Noh M, Kim Y, Han T, Kim S and Yang S Matrix Multiplications on the Memory_based Processor Array Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
  73. ACM
    Burkhart H (1997). Parallel programming using public domain software, ACM SIGCSE Bulletin, 29:1, (224-228), Online publication date: 1-Mar-1997.
  74. ACM
    Burkhart H Parallel programming using public domain software Proceedings of the twenty-eighth SIGCSE technical symposium on Computer science education, (224-228)
  75. Wu M (1997). On Runtime Parallel Scheduling for Processor Load Balancing, IEEE Transactions on Parallel and Distributed Systems, 8:2, (173-186), Online publication date: 1-Feb-1997.
  76. Sarin V and Sameh A Parallel preconditioners for elliptic PDEs Proceedings of the 1996 ACM/IEEE conference on Supercomputing, (30-es)
  77. Plimpton S, Hendrickson B, Attaway S, Swegle J, Vaughan C and Gardner D Transient dynamics simulations Proceedings of the 1996 ACM/IEEE conference on Supercomputing, (28-es)
  78. Blackford L, Choi J, Cleary A, Petitet A, Whaley R, Demmel J, Dhillon I, Stanley K, Dongarra J, Hammarling S, Henry G and Walker D ScaLAPACK Proceedings of the 1996 ACM/IEEE conference on Supercomputing, (5-es)
  79. Fang N Engineering Parallel Algorithms Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
  80. Yeşilyurt S, Ghaddar C, Cruz M and Patera A (1996). Bayesian-Validated Surrogates for Noisy Computer Simulations; Application to Random Media, SIAM Journal on Scientific Computing, 17:4, (973-992), Online publication date: 1-Jul-1996.
  81. Shu W and Wu M (1996). Runtime Incremental Parallel Scheduling (RIPS) on Distributed Memory Computers, IEEE Transactions on Parallel and Distributed Systems, 7:6, (637-649), Online publication date: 1-Jun-1996.
  82. Rothberg E (1996). Performance of Panel and Block Approaches to Sparse Cholesky Factorization on the iPSC/860 and Paragon Multicomputers, SIAM Journal on Scientific Computing, 17:3, (699-713), Online publication date: 1-May-1996.
  83. Xu C and Lau F (1996). Efficient Termination Detection for Loosely Synchronous Applications in Multicomputers, IEEE Transactions on Parallel and Distributed Systems, 7:5, (537-544), Online publication date: 1-May-1996.
  84. Bar-Noy A and Ho C Broadcasting Multiple Messages in the Multiport Model Proceedings of the 10th International Parallel Processing Symposium, (781-788)
  85. Banerjee D and Browne J Complete Parallelization of Computations Proceedings of the 10th International Parallel Processing Symposium, (354-360)
  86. Bruck J, De Coster L, Dewulf N, Ho C and Lauwereins R (1996). On the Design and Implementation of Broadcast and Global Combine Operations Using the Postal Model, IEEE Transactions on Parallel and Distributed Systems, 7:3, (256-265), Online publication date: 1-Mar-1996.
  87. Chung S and Yang J (1996). A Parallel Distributive Join Algorithm for Cube-Connected Multiprocessors, IEEE Transactions on Parallel and Distributed Systems, 7:2, (127-137), Online publication date: 1-Feb-1996.
  88. Clémençon C, Mukherjee B and Schwan K (1996). Distributed Shared Abstractions (DSA) on Multiprocessors, IEEE Transactions on Software Engineering, 22:2, (132-152), Online publication date: 1-Feb-1996.
  89. ACM
    Lumetta S and Culler D The Mantis parallel debugger Proceedings of the SIGMETRICS symposium on Parallel and distributed tools, (118-126)
  90. ACM
    Bordawekar R, Choudhary A and Ramanujam J Automatic optimization of communication in compiling out-of-core stencil codes Proceedings of the 10th international conference on Supercomputing, (366-373)
  91. ACM
    Hyder R and Wood D Synchronization hardware for networks of workstations Proceedings of the 10th international conference on Supercomputing, (245-252)
  92. ACM
    Aluru S Parallel additive lagged Fibonacci random number generators Proceedings of the 10th international conference on Supercomputing, (102-108)
  93. ACM
    Kennedy K and Kremer U Automatic data layout for high performance Fortran Proceedings of the 1995 ACM/IEEE conference on Supercomputing, (76-es)
  94. ACM
    Lou J and Ferraro R A parallel incompressible flow solver package with a parallel multigrid elliptic kernel Proceedings of the 1995 ACM/IEEE conference on Supercomputing, (66-es)
  95. ACM
    Wu M and Shu W High-performance incremental scheduling on massively parallel computers Proceedings of the 1995 ACM/IEEE conference on Supercomputing, (54-es)
  96. ACM
    Hansen P (1995). Efficient parallel recursion, ACM SIGPLAN Notices, 30:12, (9-16), Online publication date: 1-Dec-1995.
  97. Chang H and Oldham W (1995). Dynamic Task Allocation Models for Large Distributed Computing Systems, IEEE Transactions on Parallel and Distributed Systems, 6:12, (1301-1315), Online publication date: 1-Dec-1995.
  98. Nabhan T and Zomaya A (1995). A Parallel Simulated Annealing Algorithm with Low Communication Overhead, IEEE Transactions on Parallel and Distributed Systems, 6:12, (1226-1233), Online publication date: 1-Dec-1995.
  99. ACM
    Brewer E (1995). High-level optimization via automated statistical modeling, ACM SIGPLAN Notices, 30:8, (80-91), Online publication date: 1-Aug-1995.
  100. ACM
    Brewer E High-level optimization via automated statistical modeling Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming, (80-91)
  101. Bar-Noy A, Bruck J, Ho C, Kipnis S and Schieber B (1995). Computing Global Combine Operations in the Multiport Postal Model, IEEE Transactions on Parallel and Distributed Systems, 6:8, (896-900), Online publication date: 1-Aug-1995.
  102. González A, Valero-García M and Díaz de Cerio L (1995). Executing Algorithms with Hypercube Topology on Torus Multicomputers, IEEE Transactions on Parallel and Distributed Systems, 6:8, (803-814), Online publication date: 1-Aug-1995.
  103. ACM
    Brewer E, Chong F, Liu L, Sharma S and Kubiatowicz J Remote queues Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures, (42-53)
  104. ACM
    Ou C, Gunwani M and Ranka S Architecture-independent locality-improving transformations of computational graphs embedded in k-dimensions Proceedings of the 9th international conference on Supercomputing, (289-298)
  105. ACM
    Rauber T and Rünger G (2019). Performance predictions for parallel diagonal-implicitly iterated Runge-Kutta methods, ACM SIGSIM Simulation Digest, 25:1, (21-28), Online publication date: 1-Jul-1995.
  106. Rauber T and Rünger G Performance predictions for parallel diagonal-implicitly iterated Runge-Kutta methods Proceedings of the ninth workshop on Parallel and distributed simulation, (21-28)
  107. Shu W and Wu M (1995). Asynchronous Problems on SIMD Parallel Computers, IEEE Transactions on Parallel and Distributed Systems, 6:7, (704-713), Online publication date: 1-Jul-1995.
  108. Das R, Wu J, Saltz J, Berryman H and Hiranandani S (1995). Distributed Memory Compiler Design For Sparse Problems, IEEE Transactions on Computers, 44:6, (737-753), Online publication date: 1-Jun-1995.
  109. Debbage M, Hill M and Nicole D (2019). An Interface to a Reliable Packet Delivery Service for Parallel Systems, IEEE Transactions on Parallel and Distributed Systems, 6:4, (400-411), Online publication date: 1-Apr-1995.
  110. ACM
    Olszewski L A timing comparison of the conjugate gradient and Gauss-Seidel parallel algorithms in a one-dimensional flow equation using PVM Proceedings of the 33rd annual ACM Southeast Conference, (205-212)
  111. Bala V, Bruck J, Cypher R, Elustando P, Ho A, Ho C, Kipnis S and Snir M (2019). CCL, IEEE Transactions on Parallel and Distributed Systems, 6:2, (154-164), Online publication date: 1-Feb-1995.
  112. Kee K and Hariri S Efficient communication algorithms for pipeline multicomputers Proceedings of the 1994 ACM/IEEE conference on Supercomputing, (468-477)
  113. Ou C and Ranka S Parallel incremental graph partitioning using linear programming Proceedings of the 1994 ACM/IEEE conference on Supercomputing, (458-467)
  114. Koester D, Ranka S and Fox G A parallel Gauss-Seidel algorithm for sparse power system matrices Proceedings of the 1994 ACM/IEEE conference on Supercomputing, (184-193)
  115. Rothberg E and Gupta A (1994). An Efficient Block-Oriented Approach to Parallel Sparse Cholesky Factorization, SIAM Journal on Scientific Computing, 15:6, (1413-1439), Online publication date: 1-Nov-1994.
  116. ACM
    Li H and Sevcik K Parallel sorting by over partitioning Proceedings of the sixth annual ACM symposium on Parallel algorithms and architectures, (46-56)
  117. ACM
    Bruck J, Ho C, Kipnis S and Weathersby D Efficient algorithms for all-to-all communications in multi-port message-passing systems Proceedings of the sixth annual ACM symposium on Parallel algorithms and architectures, (298-309)
  118. ACM
    Kaushik S, Huang C, Johnson R and Sadayappan P An approach to communication-efficient data redistribution Proceedings of the 8th international conference on Supercomputing, (364-373)
  119. ACM
    Sarukkai S, Yan J and Gotwals J Normalized performance indices for message passing parallel programs Proceedings of the 8th international conference on Supercomputing, (323-332)
  120. Ponnusamy R, Mansour N, Choudhary A and Fox G (2018). Graph contraction for mapping data on parallel computers, Scientific Programming, 3:1, (73-82), Online publication date: 1-May-1994.
  121. ACM
    Camp W, Plimpton S, Hendrickson B and Leland R (1994). Massively parallel methods for engineering and science problems, Communications of the ACM, 37:4, (30-41), Online publication date: 1-Apr-1994.
  122. Iannello G and Mazzocca N (2019). Communication Workload Analysis for Symmetrical Concurrent Systems, Journal of Parallel and Distributed Computing, 20:2, (224-235), Online publication date: 1-Feb-1994.
  123. ACM
    Choudhary A, Koelbel C and Zosel M High performance Fortran Proceedings of the 1993 ACM/IEEE conference on Supercomputing, (610-613)
  124. ACM
    Lewis J and van de Geijn R Distributed memory matrix-vector multiplication and conjugate gradient algorithms Proceedings of the 1993 ACM/IEEE conference on Supercomputing, (484-492)
  125. ACM
    Bozkus Z, Choudhary A, Fox G, Haupt T and Ranka S Fortran 90D/HPF compiler for distributed memory MIMD computers Proceedings of the 1993 ACM/IEEE conference on Supercomputing, (351-360)
  126. ACM
    Kwan T, Totty B and Reed D Communication and computation performance of the CM-5 Proceedings of the 1993 ACM/IEEE conference on Supercomputing, (192-201)
  127. ACM
    Leutenegger S and Sun X Distributed computing feasibility in a non-dedicated homogeneous distributed system Proceedings of the 1993 ACM/IEEE conference on Supercomputing, (143-152)
  128. ACM
    Wells J, Oberacker V, Umar A, Bottcher C, Strayer M, Drake J and Flanery R The quantum structure of matter grand challenge project Proceedings of the 1993 ACM/IEEE conference on Supercomputing, (44-53)
  129. ACM
    Ding H Monte Carlo simulations of Quantum systems on massively parallel computers Proceedings of the 1993 ACM/IEEE conference on Supercomputing, (34-43)
  130. ACM
    Fahringer T and Zima H A static parameter based performance prediction tool for parallel programs Proceedings of the 7th international conference on Supercomputing, (207-219)
  131. ACM
    Mansour N, Ponnusamy R, Choudhary A and Fox G Graph contraction for physical optimization methods Proceedings of the 7th international conference on Supercomputing, (1-10)
  132. ACM
    Setia S, Squillante M and Tripathi S (1993). Processor scheduling on multiprogrammed, distributed memory parallel computers, ACM SIGMETRICS Performance Evaluation Review, 21:1, (158-170), Online publication date: 1-Jun-1993.
  133. ACM
    Setia S, Squillante M and Tripathi S Processor scheduling on multiprogrammed, distributed memory parallel computers Proceedings of the 1993 ACM SIGMETRICS conference on Measurement and modeling of computer systems, (158-170)
  134. ACM
    Rothberg E, Singh J and Gupta A Working sets, cache sizes, and node granularity issues for large-scale multiprocessors Proceedings of the 20th annual international symposium on computer architecture, (14-26)
  135. ACM
    Rothberg E, Singh J and Gupta A (1993). Working sets, cache sizes, and node granularity issues for large-scale multiprocessors, ACM SIGARCH Computer Architecture News, 21:2, (14-26), Online publication date: 1-May-1993.
  136. Abali B, özgü F and Bataineh A (2019). Balanced Parallel Sort on Hypercube Multiprocessors, IEEE Transactions on Parallel and Distributed Systems, 4:5, (572-581), Online publication date: 1-May-1993.
  137. ACM
    Vakalis I and de Doncker E Parallel global adaptive integration and dynamic load balancing on loosely coupled systems Proceedings of the 1993 ACM/SIGAPP symposium on Applied computing: states of the art and practice, (554-561)
  138. Warren M and Salmon J Astrophysical N-body simulations using hierarchical tree data structures Proceedings of the 1992 ACM/IEEE conference on Supercomputing, (570-576)
  139. Ponnusamy R, Thakur R, Choudhary A and Fox G Scheduling regular and irregular communication patterns on the CM-5 Proceedings of the 1992 ACM/IEEE conference on Supercomputing, (394-402)
  140. von Laszewski G, Parashar M, Mohamed A and Fox G On the parallelization of blocked LU factorization algorithms on distributed memory architectures Proceedings of the 1992 ACM/IEEE conference on Supercomputing, (170-179)
  141. ACM
    Feeley M and Levy H (2019). Distributed shared memory with versioned objects, ACM SIGPLAN Notices, 27:10, (247-262), Online publication date: 31-Oct-1992.
  142. ACM
    Feeley M and Levy H Distributed shared memory with versioned objects Conference proceedings on Object-oriented programming systems, languages, and applications, (247-262)
  143. ACM
    Carey G, Schmidt J, Singh V and Yelton D A scalable, object-oriented finite element solver for partial differential equations on multicomputers Proceedings of the 6th international conference on Supercomputing, (387-396)
  144. ACM
    Gupta M and Banerjee P A methodology for high-level synthesis of communication on multicomputers Proceedings of the 6th international conference on Supercomputing, (357-367)
  145. ACM
    Fahringer T, Blasko R and Zima H Automatic performance prediction to support parallelization of Fortran programs for massively parallel systems Proceedings of the 6th international conference on Supercomputing, (347-356)
  146. ACM
    O'Boyle M and Hedayat G A transformational approach to compiling Sisal for distributed memory architectures Proceedings of the 6th international conference on Supercomputing, (335-346)
  147. ACM
    Helin J Performance analysis of the CM-2, a massively parallel SIMD computer Proceedings of the 6th international conference on Supercomputing, (45-52)
  148. ACM
    Hiranandani S, Kennedy K and Tseng C Evaluation of compiler optimizations for Fortran D on MIMD distributed memory machines Proceedings of the 6th international conference on Supercomputing, (1-14)
  149. ACM
    Hiranandani S, Kennedy K and Tseng C (1992). Compiling Fortran D for MIMD distributed-memory machines, Communications of the ACM, 35:8, (66-80), Online publication date: 1-Aug-1992.
  150. ACM
    Bar-Noy A and Kipnis S Designing broadcasting algorithms in the postal model for message-passing systems Proceedings of the fourth annual ACM symposium on Parallel algorithms and architectures, (13-22)
  151. Sokol L, Weissman J and Mutchler P MTW Proceedings of the 23rd conference on Winter simulation, (557-563)
  152. Suter D (2019). Constraint Networks in Vision, IEEE Transactions on Computers, 40:12, (1359-1367), Online publication date: 1-Dec-1991.
  153. ACM
    Vaswani R and Zahorjan J (1991). The implications of cache affinity on processor scheduling for multiprogrammed, shared memory multiprocessors, ACM SIGOPS Operating Systems Review, 25:5, (26-40), Online publication date: 2-Oct-1991.
  154. Ramanujam J and Sadayappan P (2019). Compile-Time Techniques for Data Distribution in Distributed Memory Machines, IEEE Transactions on Parallel and Distributed Systems, 2:4, (472-482), Online publication date: 1-Oct-1991.
  155. Koelbel C and Mehrotra P (2019). Compiling Global Name-Space Parallel Loops for Distributed Execution, IEEE Transactions on Parallel and Distributed Systems, 2:4, (440-451), Online publication date: 1-Oct-1991.
  156. ACM
    Vaswani R and Zahorjan J The implications of cache affinity on processor scheduling for multiprogrammed, shared memory multiprocessors Proceedings of the thirteenth ACM symposium on Operating systems principles, (26-40)
  157. Fischer D (1991). Short Communication, Parallel Computing, 17:6-7, (695-697), Online publication date: 1-Sep-1991.
  158. ACM
    Venugopal S and Naik V Effects of partitioning and scheduling sparse matrix factorization on communication and load balance Proceedings of the 1991 ACM/IEEE conference on Supercomputing, (866-875)
  159. ACM
    Hiranandani S, Kennedy K and Tseng C Compiler optimizations for Fortran D on MIMD distributed-memory machines Proceedings of the 1991 ACM/IEEE conference on Supercomputing, (86-100)
  160. ACM
    Balasundaram V, Fox G, Kennedy K and Kremer U (1991). A static performance estimator to guide data partitioning decisions, ACM SIGPLAN Notices, 26:7, (213-223), Online publication date: 1-Jul-1991.
  161. Li J and Chen M (1991). Compiling Communication-Efficient Programs for Massively Parallel Machines, IEEE Transactions on Parallel and Distributed Systems, 2:3, (361-376), Online publication date: 1-Jul-1991.
  162. ACM
    Kumar V and Gupta A Analysis of scalability of parallel algorithms and architectures Proceedings of the 5th international conference on Supercomputing, (396-405)
  163. ACM
    Helin J and Berrendorf R Analyzing the performance of message passing MIMD Hypercubes Proceedings of the 5th international conference on Supercomputing, (376-385)
  164. ACM
    Gerndt M Work distribution in parallel programs for distributed memory multiprocessors Proceedings of the 5th international conference on Supercomputing, (96-104)
  165. ACM
    Hege H and Stüben H Vectorization and parallelization of irregular problems via graph coloring Proceedings of the 5th international conference on Supercomputing, (47-56)
  166. Ghosal D, Serazzi G and Tripathi S (2019). The Processor Working Set and its Use in Scheduling Multiprocessor Systems, IEEE Transactions on Software Engineering, 17:5, (443-453), Online publication date: 1-May-1991.
  167. ACM
    Maggio M and Krumme D (1991). A flexible system call interface for interprocessor communication in a distributed memory multicomputer, ACM SIGOPS Operating Systems Review, 25:2, (4-21), Online publication date: 1-Apr-1991.
  168. ACM
    Balasundaram V, Fox G, Kennedy K and Kremer U A static performance estimator to guide data partitioning decisions Proceedings of the third ACM SIGPLAN symposium on Principles and practice of parallel programming, (213-223)
  169. Li J and Chen M Generating explicit communication from shared-memory program references Proceedings of the 1990 ACM/IEEE conference on Supercomputing, (865-876)
  170. Hanson F A real introduction to supercomputing Proceedings of the 1990 ACM/IEEE conference on Supercomputing, (376-385)
  171. ACM
    Agha G (1990). Concurrent object-oriented programming, Communications of the ACM, 33:9, (125-141), Online publication date: 1-Sep-1990.
  172. ACM
    Hey A (1990). Supercomputing with transputers—past, present and future, ACM SIGARCH Computer Architecture News, 18:3b, (479-489), Online publication date: 1-Sep-1990.
  173. ACM
    Brochard L and Freau A (1990). Designing algorithms on hierarchical memory multiprocessors, ACM SIGARCH Computer Architecture News, 18:3b, (414-427), Online publication date: 1-Sep-1990.
  174. Banerjee P, Rahmeh J, Stunkel C, Nair V, Roy K, Balasubramanian V and Abraham J (2019). Algorithm-Based Fault Tolerance on a Hypercube Multiprocessor, IEEE Transactions on Computers, 39:9, (1132-1145), Online publication date: 1-Sep-1990.
  175. ACM
    Hey A Supercomputing with transputers—past, present and future Proceedings of the 4th international conference on Supercomputing, (479-489)
  176. ACM
    Brochard L and Freau A Designing algorithms on hierarchical memory multiprocessors Proceedings of the 4th international conference on Supercomputing, (414-427)
  177. ACM
    Hsu J and Banerjee P (2019). Performance measurement and trace driven simulation of parallel CAD and numeric applications on a hypercube multicomputer, ACM SIGARCH Computer Architecture News, 18:2SI, (260-269), Online publication date: 1-Jun-1990.
  178. ACM
    Hsu J and Banerjee P Performance measurement and trace driven simulation of parallel CAD and numeric applications on a hypercube multicomputer Proceedings of the 17th annual international symposium on Computer Architecture, (260-269)
  179. ACM
    Cringean J, England R, Manson G and Willett P Parallel text searching in serial files using a processor farm Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval, (429-453)
  180. ACM
    Steele G Making asynchronous parallelism safe for the world Proceedings of the 17th ACM SIGPLAN-SIGACT symposium on Principles of programming languages, (218-231)
  181. ACM
    Ramanujam J and Sadayappan P A methodology for parallelizing programs for multicomputers and complex memory multiprocessors Proceedings of the 1989 ACM/IEEE conference on Supercomputing, (637-646)
  182. ACM
    Sadayappan P and Rao S Communication reduction for distributed sparse matrix factorization on a processor mesh Proceedings of the 1989 ACM/IEEE conference on Supercomputing, (371-379)
  183. ACM
    Gustafson J, Benner R, Sears M and Sullivan T A radar simulation program for a 1024-processor hypercube Proceedings of the 1989 ACM/IEEE conference on Supercomputing, (96-105)
  184. ACM
    Fox G, Hipes P and Salmon J Practical parallel supercomputing: examples from chemistry and physics Proceedings of the 1989 ACM/IEEE conference on Supercomputing, (58-69)
  185. ACM
    Carmona E Parallelizing a large scientific code - methods, issues, and concerns Proceedings of the 1989 ACM/IEEE conference on Supercomputing, (21-31)
  186. ACM
    Rogers A and Pingali K (1989). Process decomposition through locality of reference, ACM SIGPLAN Notices, 24:7, (69-80), Online publication date: 1-Jul-1989.
  187. ACM
    Rogers A and Pingali K Process decomposition through locality of reference Proceedings of the ACM SIGPLAN 1989 conference on Programming language design and implementation, (69-80)
  188. ACM
    Ono S and Ohta N The NOV-II super parallel computer for signal processing Proceedings of the 3rd international conference on Supercomputing, (381-390)
  189. ACM
    Padmanabhan K The composite binary cube — a family of interconnection networks for multiprocessors Proceedings of the 3rd international conference on Supercomputing, (62-71)
  190. Fox G, Furmanski W, Ho A, Koller J, Simic P and Wong I Neural networks and dynamic complex systems Proceedings of the 22nd annual symposium on Simulation, (129-143)
  191. ACM
    Lyzenga G, Raefsky A and Nour-Omid B Implementing finite element software on hypercube machines Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2, (1755-1761)
  192. ACM
    Fox G, Furmanski W and Walker D Optimal matrix algorithms on homogeneous hypercubes Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2, (1656-1673)
  193. ACM
    Hipes P and Kuppermann A Gauss-Jordan inversion with pivoting on the Caltech Mark II hypercube Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2, (1621-1634)
  194. ACM
    Walker D Portable programming within a message-passing model: the FFT as an example Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2, (1438-1450)
  195. ACM
    Skjellum A, Morari M and Nattisson S Waveform relaxation for concurrent dynamic simulation of distillation columns Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2, (1062-1071)
  196. ACM
    Battiti R Collective steropsis on the hypercube Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2, (1000-1006)
  197. ACM
    Bower J, Nelson M, Wilson M, Fox G and Furmanski W Piriform (Olfactory) cortex model on the hypercube Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2, (977-999)
  198. ACM
    Warren M and Salmon J An O(NlogN) hypercube N-body integrator Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2, (971-975)
  199. ACM
    Gorham P, Prince T and Anderson S Hypercube data analysis in astronomy: optical interferometry and millisecond pulsar searches Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2, (957-962)
  200. ACM
    Fox G What have we learnt from using real parallel machines to solve real problems? Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2, (897-955)
  201. ACM
    Reynolds P A spectrum of options for parallel simulation Proceedings of the 20th conference on Winter simulation, (325-332)
  202. ACM
    Reynolds P Heterogenous distributed simulation Proceedings of the 20th conference on Winter simulation, (206-209)
  203. ACM
    Fox G and Furmanski W Hypercube algorithms for neural network simulation: the Crystal_Accumulator and the Crystal Router Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (714-724)
  204. ACM
    Fox G and Furmanski W Optimal communication algorithms for regular decompositions on the hypercube Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (648-713)
  205. ACM
    Schwan K and Bo W Topologies' - computational messaging for multicomputers Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (580-593)
  206. ACM
    Felten E and Otto S A safe vertex Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (560-564)
  207. ACM
    Zimmermann B and Crichton G A programming model for the Mark III hypercube with multiple processor nodes Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (528-535)
  208. ACM
    Martin A A message-passing model for highly concurrent computation Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (520-527)
  209. ACM
    Felten E and Otto S Coherent parallel C Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (440-450)
  210. ACM
    Salmon J, Callahan S, Flower J and Kolawa A Moose: a multi-tasking operating system of hypercubes Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (391-396)
  211. ACM
    Fox G and Furmanski W A string theory for time dependent complex systems and its application to automatic decomposition Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (285-305)
  212. ACM
    Fox G and Furmanski W Load balancing loosely synchronous problems with a neural network Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (241-278)
  213. ACM
    Salmon J A mathematical analysis of the scattered decomposition Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (239-240)
  214. ACM
    Walker D Performance of a QCD code on the NCUBE hypercube Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (180-187)
  215. ACM
    Walker D, Messina D and Baillie C Performance evaluation of scientific programs on advanced architecture computers Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (173-179)
  216. ACM
    Ho A, Fox G, Walker D, Breaden M, Chen S, Knutson A, Kuwamoto S and Cole T MAC-CUBE, the Macintosh-based hypercube Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (98-103)
  217. ACM
    Ho A, Fox G, Walker D, Snyder S, Chang D, Chen S, Breaden M and Cole T PC-CUBE, a personal computer based hypercube Proceedings of the third conference on Hypercube concurrent computers and applications: Architecture, software, computer systems, and general issues - Volume 1, (92-97)
  218. ACM
    Organick E Algorithms, concurrent processors, and computer science education Proceedings of the sixteenth SIGCSE technical symposium on Computer science education, (1-5)
  219. ACM
    Organick E (1985). Algorithms, concurrent processors, and computer science education, ACM SIGCSE Bulletin, 17:1, (1-5), Online publication date: 1-Mar-1985.
Contributors
  • Indiana University Bloomington
  • California Institute of Technology
  • Jet Propulsion Laboratory
  • Oregon Health & Science University
  • D.E. Shaw Research, LLC
  • Cardiff University

Reviews

David Malcolm Nicol

Solving problems on concurrent processors is a long-awaited outgrowth of the extensive experience at the California Institute of Technology in the field of solving scientific problems on message-passing parallel processors. The book shows how many standard mathematical operations can be distributed, including iterative relaxation methods, finite-element methods, common matrix operations, FFTs, simulated annealing, particle dynamics, and circuit simulations. Simple analyses that describe the overhead cost of parallelism accompany the examples. These analyses serve to demonstrate that the problem decompositions can be expected to scale well onto much larger parallel computers. The book is sprinkled with side discussions about CUBIX, the operating system and user interface developed on the Cal Tech hypercube. The authors are physicists, and operating system purists may sniff at CUBIX. The system interface presented to a programmer is quite practical, though, even if unexciting. A person with a baccalaureate degree in a hard science will have sufficient background to read this book. Anyone interested in practical parallel computing must read it. I intend to use the book in a first-year graduate course on parallel computing. Many of the programs described in the book are sketched in pseudocode; most of these programs (and the operating system) have been placed in the public domain and are available for a nominal fee.

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Recommendations