Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1687399.1687477acmconferencesArticle/Chapter ViewAbstractPublication PagesiccadConference Proceedingsconference-collections
research-article

A parallel preconditioning strategy for efficient transistor-level circuit simulation

Published: 02 November 2009 Publication History

Abstract

We describe a parallel computing approach for large-scale SPICE-accurate circuit simulation, which is based on a new strategy for the parallel preconditioned iterative solution of circuit matrices. This strategy consists of several steps, including singleton removal, block triangular form (BTF) reordering, hypergraph partitioning, and a block Jacobi pre-conditioner. Our parallel implementation makes use of a mixed load balance, employing a different parallel partition for the matrix load and solve. Based on message-passing, our circuit simulation code was originally designed for large parallel computers, but for the purposes of this paper we demonstrate that it also gives good parallel speedup in modern multi-core environments. We show that our new parallel solver outperforms a serial direct solver, a parallel direct solver and an alternative iterative solver on a set of circuit test problems.

References

[1]
Cadence UltraSim. http://www.cadence.com/products/custom_ic/ultrasim/.
[2]
Nascentric OmegaSim. http://www.nascentric.com/omegasim_gx.html.
[3]
Synopsys HSIM. http://www.synopsys.com/products/mixedsignal/hsim/hsim.html.
[4]
Synopsys NanoSim. http://www.synopsys.com/products/mixedsignal/nanosim.
[5]
Xyce Parallel Circuit Simulator. http://xyce.sandia.gov.
[6]
P. Amestoy, T. Davis, and I. Duff. An approximate minimum degree ordering algorithm. SIAM J. Matrix Anal. Appl., 17(4):886--905, 1996.
[7]
J. Barby and R. Guindi. Circuitsim93: A circuit simulator benchmarking methodology case study. In Proceedings Sixth Annual IEEE International ASIC Conference and Exhibit, Rochester NY, 1993.
[8]
A. Basermann, U. Jaekel, and K. Hachiy. Preconditioning parallel sparse iterative solvers for circuit simulation. In Proceedings of the 8th SIAM Proceedings on Applied Linear Algebra, Williamsburg VA, 2003.
[9]
A. Basermann, U. Jaekel, and M. Nordhausen. Parallel iterative solvers for sparse linear systems in circuit simulation. Fut. Gen. Comput. Sys., 21(8):1275--1284, 2005.
[10]
C. Bomhof and H. vanderVorst. A parallel linear system solver for circuit simulation problems. Num. Lin. Alg. Appl., 7(7--8):649--665, 2000.
[11]
Ü. Çatalyürek and C. Aykanat. Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication. IEEE Trans. Parallel Dist. Systems, 10(7):673--693, 1999.
[12]
T. A. Davis. Direct Methods for Sparse Linear System. SIAM, 2006.
[13]
D. M. Day, M. K. Bhardwaj, G. M. Reese, and J. S. Peery. Mechanism free domain decomposition. Comput. Meth. Appl. Mech. Engrg., 182(7):763--776, 2005.
[14]
K. Devine, E. Boman, R. Heaphy, R. Bisseling, and U. Catalyurek. Parallel hypergraph partitioning for scientific computing. In Proc. of 20th International Parallel and Distributed Processing Symposium (IPDPS'06). IEEE, 2006.
[15]
L. Fleischer, B. Hendrickson, and A. Pinar. On identifying strongly connected components in parallel. In Proc. of IPDPS'00, LNCS 1800, pages 505--511. Springer, 2000.
[16]
N. Fröhlich, B. Riess, U. Wever, and Q. Zheng. A new approach for parallel simulation of vlsi-circuits on a transistor level. IEEE Transactions on Circuits and Systems Part I, 45(6):601--613, 1998.
[17]
W. Gropp, E. Lusk, N. Doss, and A. Skjellum. A high-performance, portable implementation of the MPI message passing interface standard. Parallel Computing, 22(6):789--828, Sept. 1996.
[18]
K. Gulati, J. F. Croix, S. P. Khatr, and R. Shastry. Fast circuit simulation on graphics processing units. In Proceedings of the 2009 Conference on Asia and South Pacific Design Automation, pages 403--408. IEEE Press, 2009.
[19]
B. Hendrickson and R. Leland. A multilevel algorithm for partitioning graphs. In Proc. Supercomputing '95. ACM, December 1995.
[20]
J. Huang, Z. Liu, M. Jeng, K. Hui, M. Chan, P. KO, and C. Hu. BSIM3 Manual. Technical report, Department of Electrical Engineering and Computer Science, University of California, Berkeley, CA 94720.
[21]
G. Karypis, R. Aggarwal, V. Kumar, and S. Shekhar. Multilevel hypergraph partitioning: Applications in VLSI domain. IEEE Trans. VLSI Systems, 20(1), 1999.
[22]
G. Karypis and V. Kumar. ParMETIS: Parallel graph partitioning and sparse matrix ordering library. Technical Report 97-060, CS Dept., Univ. Minn., 1997.
[23]
B. W. Kernighan and S. Lin. An efficient heuristic procedure for partitioning graphs. The Bell System Technical Journal, 49(2):291--307, 1970.
[24]
K. Kundert. Sparse matrix techniques. In A. Ruehli, editor, Circuit Analysis, Simulation and Design. North - Holland, 1987.
[25]
E. Lelarasmee, A. E. Ruehli, and A. L. Sangiovanni-Vincentelli. The waveform relaxation method for the time-domain analysis of large scale integrated circuits. IEEE Trans. Comput.-Aided Design Integr. Circuits Syst., CAD-1(3):131--145, July 1982.
[26]
X. S. Li and J. W. Demmel. SuperLU_DIST: A scalable distributed-memory sparse direct solver for unsymmetric linear systems. ACM Trans. Math. Softw., 29(2):110--140, 2003.
[27]
M. Heroux, et. al. An overview of the Trilinos project. ACM TOMS, 31(3):397--423, 2005. http://trilinos.sandia.gov.
[28]
F. Manne and R. H. Bisseling. A parallel approximation algorithm for the weighted maximum matching problem. In Proc. of PMAA'08, LNCS 4967, pages 708--717. Springer, Berlin, 2008.
[29]
W. McLendon, B. Hendrickson, S. Plimpton, and L. Rauchwerger. Finding strongly connected components in distributed graphs. J. of Parallel and Distributed Computing, 65:901--910, 2005.
[30]
L. W. Nagel. Spice 2, a computer program to simulate semiconductor circuits. Technical Report Memorandum ERL-M250, 1975.
[31]
J. N. K. S. R. Nassif and F. N. Najm. A multigrid-like technique for power grid analysis. IEEE Transactions on Computer-aided Design, 21:1148--1160, Oct. 2002.
[32]
A. R. Newton and A. L. Sangiovanni-Vincentelli. Relaxation based electrical simulation. IEEE Trans. Comput.-Aided Design Integr. Circuits Syst., CAD-3(4):308--330, October 1984.
[33]
H. Peng and C.-K. Cheng. Parallel transistor level circuit simulation using domain decomposition methods. In Proceedings of the 2009 Conference on Asia and South Pacific Design Automation, pages 397--402. IEEE Press, 2009.
[34]
Y. Saad. Iterative Methods for Sparse Linear Systems. SIAM, Philadelphia, PA., second edition, 2003.
[35]
O. Schenk and K. Gärtner. Solving unsymmetric sparse systems of linear equations with PARDISO. Fut. Gen. Comput. Sys., 20(3):475--487, 2005.
[36]
N. Selvakkumaran and G. Karypis. Multi-objective hypergraph partitioning algorithms for cut and maximum subdomain degree minimization. IEEE Transactions on Computer-Aided Design, 25(3):504--517, 2006.
[37]
K. Stanley and T. Davis. KLU: a 'Clark Kent' sparse LU factorization algorithm for circuit matrices. In SIAM Conference on Parallel Processing for Scientfic Computing (PP04), 2004.
[38]
P. Su, H. Wan, S. Fung, M. Chan, A. Niknejad, and C. Hu. BSIM3SOI3.1 MOSFET MODEL Users' Manual. Technical report, Department of Electrical Engineering and Computer Science, University of California, Berkeley, CA 94720.
[39]
K. Sun, Q. Zhou, K. Mohanram, and D. C. Sorensen. Parallel domain decomposition for simulation of large-scale power grids. IEEE/ACM International Conference on Computer-Aided Design (ICCAD), pages 54--59, November 2007.
[40]
B. Ucar and C. Aykanat. Partitioning sparse matrices for parallel preconditioned iterative methods. SIAM J. Sci. Comput., 29(4):1683--1709, 2007.
[41]
J. White and A. L. Sangiovanni-Vincentelli. Relax2: A new waveform relaxation approach for the analysis of lsi mos circuits. in Proc. Int. Symp. Circuits Syst., 2:756--759, May 1983.
[42]
J. K. White and A. Sangiovanni-Vincentelli. Relaxation techniques for the simulation of VLSI circuits. Kluwer Academic Publishers, 1987.
[43]
C. Zhuo, J. Hu, M. Zhao, and K. Chen. Power grid analysis and optimization using algebraic multigrid. IEEE Transactions on Computer-aided Design, 27:738--751, April 2008.

Cited By

View all
  • (2024)GraphZeppelin: How to Find Connected Components (Even When Graphs Are Dense, Dynamic, and Massive)ACM Transactions on Database Systems10.1145/364384649:3(1-31)Online publication date: 16-May-2024
  • (2023)Parallel Incomplete LU Factorization Based Iterative Solver for Fixed-Structure Linear Equations in Circuit SimulationProceedings of the 28th Asia and South Pacific Design Automation Conference10.1145/3566097.3567882(339-345)Online publication date: 16-Jan-2023
  • (2022)GraphZeppelin: Storage-Friendly Sketching for Connected Components on Dynamic Graph StreamsProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3526146(325-339)Online publication date: 10-Jun-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICCAD '09: Proceedings of the 2009 International Conference on Computer-Aided Design
November 2009
803 pages
ISBN:9781605588001
DOI:10.1145/1687399
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 November 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. circuit simulation
  2. hypergraph partitioning
  3. iterative matrix solvers
  4. multi-core
  5. parallel simulation
  6. preconditioners

Qualifiers

  • Research-article

Conference

ICCAD '09
Sponsor:

Acceptance Rates

Overall Acceptance Rate 457 of 1,762 submissions, 26%

Upcoming Conference

ICCAD '24
IEEE/ACM International Conference on Computer-Aided Design
October 27 - 31, 2024
New York , NY , USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)11
  • Downloads (Last 6 weeks)0
Reflects downloads up to 17 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)GraphZeppelin: How to Find Connected Components (Even When Graphs Are Dense, Dynamic, and Massive)ACM Transactions on Database Systems10.1145/364384649:3(1-31)Online publication date: 16-May-2024
  • (2023)Parallel Incomplete LU Factorization Based Iterative Solver for Fixed-Structure Linear Equations in Circuit SimulationProceedings of the 28th Asia and South Pacific Design Automation Conference10.1145/3566097.3567882(339-345)Online publication date: 16-Jan-2023
  • (2022)GraphZeppelin: Storage-Friendly Sketching for Connected Components on Dynamic Graph StreamsProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3526146(325-339)Online publication date: 10-Jun-2022
  • (2021)Mathematical Modeling of the Dynamics of Linear Electrical Systems with Parallel CalculationsEnergies10.3390/en1410293014:10(2930)Online publication date: 19-May-2021
  • (2019)LACC: A Linear-Algebraic Algorithm for Finding Connected Components in Distributed Memory2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2019.00012(2-12)Online publication date: May-2019
  • (2018)CommAnalyzerProceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing10.1145/3208040.3208042(80-91)Online publication date: 11-Jun-2018
  • (2017)Convergence-Boosted Graph Partitioning using Maximum Spanning Trees for Iterative Solution of Large Linear CircuitsProceedings of the 54th Annual Design Automation Conference 201710.1145/3061639.3062215(1-6)Online publication date: 18-Jun-2017
  • (2017)Compensating for Parasitic Voltage Drops in Resistive Memory Arrays2017 IEEE International Memory Workshop (IMW)10.1109/IMW.2017.7939075(1-4)Online publication date: May-2017
  • (2017)BaskerParallel Computing10.1016/j.parco.2017.06.00368:C(17-31)Online publication date: 1-Oct-2017
  • (2017)Related WorkParallel Sparse Direct Solver for Integrated Circuit Simulation10.1007/978-3-319-53429-9_2(13-41)Online publication date: 12-Feb-2017
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media