J. Ramanujam
Person information
- affiliation: Louisiana State University, Baton Rouge, LA, USA
2020 – today
- 2024
- [j58]Mengmeng Liu, Gopal Srivastava, J. Ramanujam, Michal Brylinski:
Insights from Augmented Data Integration and Strong Regularization in Drug Synergy Prediction with SynerGNet. Mach. Learn. Knowl. Extr. 6(3): 1782-1797 (2024) - [j57]Elham Ravanbakhsh, Yongqing Liang, J. Ramanujam, Xin Li:
Deep video representation learning: a survey. Multim. Tools Appl. 83(20): 59195-59225 (2024) - [c122]Ashish Srivastava, Shalabh Bhatnagar, M. Narasimha Murty, Jagannathan Ramanujam:
Learning Dynamic Representations in Large Language Models for Evolving Data Streams. ICPR (5) 2024: 239-253 - [i9]Elham Ravanbakhsh, Yongqing Liang, J. Ramanujam, Xin Li:
Deep video representation learning: a survey. CoRR abs/2405.06574 (2024) - [i8]Elham Ravanbakhsh, Cheng Niu, Yongqing Liang, J. Ramanujam, Xin Li:
Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach. CoRR abs/2405.06586 (2024) - 2021
- [j56]Guannan Liu, Manali Singha, Limeng Pu, Prasanga Neupane, Joseph Feinstein, Hsiao-Chun Wu, J. Ramanujam, Michal Brylinski:
GraphDTI: A robust deep learning predictor of drug-target interactions from multiple heterogeneous data. J. Cheminformatics 13(1): 58 (2021) - 2020
- [j55]Wentao Shi, Jeffrey Mitchell Lemoine, Abd-El-Monsif A. Shawky, Manali Singha, Limeng Pu, Shuangyan Yang, J. Ramanujam, Michal Brylinski:
BionoiNet: ligand-binding site classification with off-the-shelf deep neural network. Bioinform. 36(10): 3077-3083 (2020)
2010 – 2019
- 2019
- [j54]Fabio Luporini, Michael Lange, Christian T. Jacobs, Gerard J. Gorman, J. Ramanujam, Paul H. J. Kelly:
Automated Tiling of Unstructured Mesh Computations with Application to Seismological Modeling. ACM Trans. Math. Softw. 45(2): 17:1-17:30 (2019) - 2018
- [j53]Aisha I. Ali-Gombe, Brendan Saltaformaggio, J. Ramanujam, Dongyan Xu, Golden G. Richard III:
Toward a more dependable hybrid analysis of Android malware using aspect-oriented programming. Comput. Secur. 73: 235-248 (2018) - 2017
- [j52]Andrew Case, Arghya Kusum Das, Seung-Jong Park, J. Ramanujam, Golden G. Richard III:
Gaslight: A comprehensive fuzzing architecture for memory forensics frameworks. Digit. Investig. 22 Supplement: S86-S93 (2017) - [c121]Zahra Khatami, Hartmut Kaiser, J. Ramanujam:
Improving the Parallel Performance of an NBody Application Using Adaptive Techniques in HPX. HPCC/SmartCity/DSS 2017: 621-622 - [c120]Zahra Khatami, Hartmut Kaiser, J. Ramanujam:
Redesigning OP2 Compiler to Use HPX Runtime Asynchronous Techniques. IPDPS Workshops 2017: 1198-1207 - [c119]Zahra Khatami, Sungpack Hong, Jinsoo Lee, Siegfried Depner, Hassan Chafi, J. Ramanujam, Hartmut Kaiser:
A Load-Balanced Parallel and Distributed Sorting Algorithm Implemented with PGX.D. IPDPS Workshops 2017: 1317-1324 - [c118]Zahra Khatami, Lukas Troska, Hartmut Kaiser, J. Ramanujam, Adrian Serio:
HPX Smart Executors. ESPM2@SC 2017: 3:1-3:8 - [i7]Zahra Khatami, Hartmut Kaiser, J. Ramanujam:
Redesigning OP2 Compiler to Use HPX Runtime Asynchronous Techniques. CoRR abs/1703.09264 (2017) - [i6]Fabio Luporini, Michael Lange, Christian T. Jacobs, Gerard J. Gorman, J. Ramanujam, Paul H. J. Kelly:
Automated Tiling of Unstructured Mesh Computations with Application to Seismological Modelling. CoRR abs/1708.03183 (2017) - [i5]Zahra Khatami, Lukas Troska, Hartmut Kaiser, J. Ramanujam, Adrian Serio:
HPX Smart Executors. CoRR abs/1711.01519 (2017) - 2016
- [j51]Yun Ding, Ye Fang, Juana Moreno, J. Ramanujam, Mark Jarrell, Michal Brylinski:
Assessing the similarity of ligand binding conformations with the Contact Mode Score. Comput. Biol. Chem. 64: 403-413 (2016) - [c117]Zahra Khatami, Hartmut Kaiser, J. Ramanujam:
Using HPX and OP2 for Improving Parallel Scaling Performance of Unstructured Grid Applications. ICPP Workshops 2016: 190-199 - [c116]Changwan Hong, Wenlei Bao, Albert Cohen, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, J. Ramanujam, P. Sadayappan:
Effective padding of multidimensional arrays to avoid cache conflict misses. PLDI 2016: 129-144 - [c115]Zahra Khatami, Hartmut Kaiser, Patricia Grubel, Adrian Serio, J. Ramanujam:
A Massively Parallel Distributed N-body Application Implemented with HPX. ScalA@SC 2016: 57-64 - 2015
- [j50]Yun Ding, Ye Fang, Wei Pan Feinstein, Jagannathan Ramanujam, David M. Koppelman, Juana Moreno, Michal Brylinski, Mark Jarrell:
GeauxDock: A novel approach for mixed-resolution ligand docking using a descriptor-based force field. J. Comput. Chem. 36(27): 2013-2026 (2015) - [j49]Keshav Pingali, J. Ramanujam, P. Sadayappan:
Introduction to the Special Issue on PPoPP'12. ACM Trans. Parallel Comput. 1(2): 9:1-9:2 (2015) - [c114]Tobias Grosser, Jagannathan Ramanujam, Louis-Noël Pouchet, P. Sadayappan, Sebastian Pop:
Optimistic Delinearization of Parametrically Sized Arrays. ICS 2015: 351-360 - [c113]Venmugil Elango, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan:
On Characterizing the Data Access Complexity of Programs. POPL 2015: 567-580 - [c112]Mahesh Ravishankar, Roshan Dathathri, Venmugil Elango, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Distributed memory code generation for mixed Irregular/Regular computations. PPoPP 2015: 65-75 - [c111]Sameer AbuAsal, R. Tohid, J. Ramanujam:
Lost in heterogeneity: architectural selection based on code features. Co-HPC@SC 2015: 6:1-6:6 - [c110]Prashant Singh Rawat, Martin Kong, Thomas Henretty, Justin Holewinski, Kevin Stock, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, P. Sadayappan:
SDSLc: a multi-target domain-specific compiler for stencil computations. WOLFHPC@SC 2015: 6:1-6:10 - 2014
- [j48]Zhifeng Yun, Zhou Lei, Gabrielle Allen, Daniel S. Katz, Jagannathan Ramanujam:
DA-TC: a novel application execution model in multicluster systems. Clust. Comput. 17(2): 371-387 (2014) - [j47]Ye Fang, Sheng Feng, Ka-Ming Tam, Zhifeng Yun, Juana Moreno, Jagannathan Ramanujam, Mark Jarrell:
Parallel tempering simulation of the three-dimensional Edwards-Anderson model with compact asynchronous multispin coding on GPU. Comput. Phys. Commun. 185(10): 2467-2478 (2014) - [j46]Sriram Krishnamoorthy, J. Ramanujam, P. Sadayappan:
Introduction to the JPDC Special Issue on Domain-Specific Languages and High-Level Frameworks for High-Performance Computing. J. Parallel Distributed Comput. 74(12): 3175 (2014) - [j45]Fabio Luporini, Ana Lucia Varbanescu, Florian Rathgeber, Gheorghe-Teodor Bercea, J. Ramanujam, David A. Ham, Paul H. J. Kelly:
Cross-Loop Optimization of Arithmetic Intensity for Finite Element Local Assembly. ACM Trans. Archit. Code Optim. 11(4): 57:1-57:25 (2014) - [j44]Venmugil Elango, Naser Sedaghati, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, Radu Teodorescu, P. Sadayappan:
On Using the Roofline Model with Lower Bounds on Data Movement. ACM Trans. Archit. Code Optim. 11(4): 67:1-67:23 (2014) - [j43]Mahesh Ravishankar, John Eisenlohr, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Automatic parallelization of a class of irregular loops for distributed memory systems. ACM Trans. Parallel Comput. 1(1): 7:1-7:37 (2014) - [c109]Michelle Mills Strout, Fabio Luporini, Christopher D. Krieger, Carlo Bertolli, Gheorghe-Teodor Bercea, Catherine Olschanowsky, J. Ramanujam, Paul H. J. Kelly:
Generalizing Run-Time Tiling with the Loop Chain Abstraction. IPDPS 2014: 1136-1145 - [c108]Kevin Stock, Martin Kong, Tobias Grosser, Louis-Noël Pouchet, Fabrice Rastello, J. Ramanujam, P. Sadayappan:
A framework for enhancing data reuse via associative reordering. PLDI 2014: 65-76 - [c107]Venmugil Elango, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan:
On characterizing the data movement complexity of computational DAGs for parallel execution. SPAA 2014: 296-306 - [i4]Naznin Fauzia, Venmugil Elango, Mahesh Ravishankar, J. Ramanujam, Fabrice Rastello, Atanas Rountev, Louis-Noël Pouchet, P. Sadayappan:
Beyond Reuse Distance Analysis: Dynamic Analysis for Characterization of Data Locality Potential. CoRR abs/1401.5024 (2014) - [i3]Venmugil Elango, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan:
On Characterizing the Data Movement Complexity of Computational DAGs for Parallel Execution. CoRR abs/1404.4767 (2014) - [i2]Fabio Luporini, Ana Lucia Varbanescu, Florian Rathgeber, Gheorghe-Teodor Bercea, J. Ramanujam, David A. Ham, Paul H. J. Kelly:
COFFEE: an Optimizing Compiler for Finite Element Local Assembly. CoRR abs/1407.0904 (2014) - [i1]Venmugil Elango, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan:
On Characterizing the Data Access Complexity of Programs. CoRR abs/1411.2286 (2014) - 2013
- [j42]Sanket Tavarageri, J. Ramanujam, P. Sadayappan:
Adaptive parallel tiled code generation and accelerated auto-tuning. Int. J. High Perform. Comput. Appl. 27(4): 412-425 (2013) - [j41]Naznin Fauzia, Venmugil Elango, Mahesh Ravishankar, J. Ramanujam, Fabrice Rastello, Atanas Rountev, Louis-Noël Pouchet, P. Sadayappan:
Beyond reuse distance analysis: Dynamic analysis for characterization of data locality potential. ACM Trans. Archit. Code Optim. 10(4): 53:1-53:29 (2013) - [c106]Tobias Grosser, Albert Cohen, Paul H. J. Kelly, J. Ramanujam, P. Sadayappan, Sven Verdoolaege:
Split tiling for GPUs: automatic parallelization using trapezoidal tiles. GPGPU@ASPLOS 2013: 24-31 - [c105]Thomas Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan:
A stencil compiler for short-vector SIMD architectures. ICS 2013: 13-24 - [c104]Athanasios Konstantinidis, Paul H. J. Kelly, J. Ramanujam, P. Sadayappan:
Parametric GPU Code Generation for Affine Loop Programs. LCPC 2013: 136-151 - 2012
- [j40]Hassan A. Salamy, J. Ramanujam:
Code Size Reduction for Array Intensive Applications on Digital Signal Processors. J. Circuits Syst. Comput. 21(3) (2012) - [j39]Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions. J. Parallel Distributed Comput. 72(3): 338-352 (2012) - [j38]Hassan A. Salamy, J. Ramanujam:
An Effective Solution to Task Scheduling and Memory Partitioning for Multiprocessor System-on-Chip. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 31(5): 717-725 (2012) - [j37]Hassan A. Salamy, J. Ramanujam:
Storage Optimization through Offset Assignment with Variable Coalescing. ACM Trans. Embed. Comput. Syst. 11(S1): 16 (2012) - [j36]Hassan A. Salamy, J. Ramanujam:
An ILP solution to address code generation for embedded applications on digital signal processors. ACM Trans. Design Autom. Electr. Syst. 17(3): 28:1-28:23 (2012) - [c103]Jun Shirako, Kamal Sharma, Naznin Fauzia, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan, Vivek Sarkar:
Analytical Bounds for Optimal Tile Size Selection. CC 2012: 101-121 - [c102]Mahesh Ravishankar, John Eisenlohr, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Code generation for parallel execution of a class of irregular loops on distributed memory systems. SC 2012: 72 - [e2]J. Ramanujam, P. Sadayappan:
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2012, New Orleans, LA, USA, February 25-29, 2012. ACM 2012, ISBN 978-1-4503-1160-1 - 2011
- [c101]Thomas Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam, P. Sadayappan:
Data Layout Transformation for Stencil Computations on Short-Vector SIMD Architectures. CC 2011: 225-245 - [c100]Sanket Tavarageri, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Dynamic selection of tile sizes. HiPC 2011: 1-10 - [c99]Louis-Noël Pouchet, Uday Bondhugula, Cédric Bastoul, Albert Cohen, J. Ramanujam, P. Sadayappan, Nicolas Vasilache:
Loop transformations: convexity, pruning and optimization. POPL 2011: 549-562 - 2010
- [c98]Muthu Manikandan Baskaran, J. Ramanujam, P. Sadayappan:
Automatic C-to-CUDA Code Generation for Affine Programs. CC 2010: 244-263 - [c97]Muthu Manikandan Baskaran, Albert Hartono, Sanket Tavarageri, Thomas Henretty, J. Ramanujam, P. Sadayappan:
Parameterized tiling revisited. CGO 2010: 200-209 - [c96]Albert Hartono, Muthu Manikandan Baskaran, J. Ramanujam, Ponnuswamy Sadayappan:
DynTile: Parametric tiled loop generation for parallel execution on multicore processors. IPDPS 2010: 1-12 - [c95]Louis-Noël Pouchet, Uday Bondhugula, Cédric Bastoul, Albert Cohen, J. Ramanujam, P. Sadayappan:
Combined Iterative and Model-driven Optimization in an Automatic Parallelization Framework. SC 2010: 1-11
2000 – 2009
- 2009
- [c94]Qingda Lu, Christophe Alias, Uday Bondhugula, Thomas Henretty, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan, Yongjian Chen, Haibo Lin, Tin-Fook Ngai:
Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors. PACT 2009: 348-357 - [c93]Zhifeng Yun, Zhou Lei, Gabrielle Allen, Daniel S. Katz, Tevfik Kosar, Shantenu Jha, Jagannathan Ramanujam:
An innovative application execution toolkit for multicluster grids. CLUSTER 2009: 1-4 - [c92]Hassan A. Salamy, J. Ramanujam:
A Framework for Task Scheduling and Memory Partitioning for Multi-Processor System-on-Chip. HiPEAC 2009: 263-277 - [c91]Albert Hartono, Muthu Manikandan Baskaran, Cédric Bastoul, Albert Cohen, Sriram Krishnamoorthy, Boyana Norris, J. Ramanujam, P. Sadayappan:
Parametric multi-level tiling of imperfectly nested loops. ICS 2009: 147-157 - [c90]Muthu Manikandan Baskaran, Nagavijayalakshmi Vydyanathan, Uday Bondhugula, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors. PPoPP 2009: 219-228 - [c89]Rajesh Sankaran, Brygg Ullmer, Jagannathan Ramanujam, Karun Kallakuri, Srikanth Jandhyala, Cornelius Toole, Christopher Laan:
Decoupling interaction hardware design using libraries of reusable electronics. TEI 2009: 331-337 - 2008
- [c88]Uday Bondhugula, Muthu Manikandan Baskaran, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model. CC 2008: 132-146 - [c87]Hassan A. Salamy, J. Ramanujam:
Optimal address register allocation for arrays in DSP applications. ESTIMedia 2008: 67-72 - [c86]Hassan A. Salamy, J. Ramanujam:
Storage optimization through code size reduction for digital signal processors. ESTIMedia 2008: 107-112 - [c85]Jinpyo Hong, J. Ramanujam:
Address Register Allocation in Digital Signal Processors. ICESS 2008: 331-337 - [c84]Jinpyo Hong, J. Ramanujam:
Scheduling DAGs for Fixed-point DSP Processors by Using Worm Partitions. ICESS 2008: 567-574 - [c83]Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan:
A compiler framework for optimization of affine loop nests for GPGPUs. ICS 2008: 225-234 - [c82]Uday Bondhugula, Muthu Manikandan Baskaran, Albert Hartono, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Towards effective automatic parallelization for multicore systems. IPDPS 2008: 1-5 - [c81]Uday Bondhugula, Albert Hartono, J. Ramanujam, P. Sadayappan:
A practical automatic polyhedral parallelizer and locality optimizer. PLDI 2008: 101-113 - [c80]Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories. PPoPP 2008: 1-10 - 2007
- [j35]Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Efficient search-space pruning for integrated fusion and tiling transformations. Concurr. Comput. Pract. Exp. 19(18): 2425-2443 (2007) - [c79]Jinpyo Hong, J. Ramanujam:
Memory Offset Assignment for DSPs. ICESS 2007: 80-87 - [c78]Sriram Krishnamoorthy, Muthu Manikandan Baskaran, Uday Bondhugula, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Effective automatic parallelization of stencil computations. PLDI 2007: 235-244 - [c77]Uday Bondhugula, J. Ramanujam, P. Sadayappan:
Automatic mapping of nested loops to FPGAs. PPoPP 2007: 101-111 - [c76]Sai Pinnepalli, Jinpyo Hong, J. Ramanujam, Doris L. Carver:
Code Size Optimization for Embedded Processors using Commutative Transformations. RTCSA 2007: 409-416 - 2006
- [j34]Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella:
Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver. J. Parallel Distributed Comput. 66(5): 659-673 (2006) - [j33]Guilin Chen, Mahmut T. Kandemir, Mary Jane Irwin, J. Ramanujam:
Reducing code size through address register assignment. ACM Trans. Embed. Comput. Syst. 5(1): 225-258 (2006) - [j32]Mahmut T. Kandemir, J. Ramanujam, Ugur Sezer:
Improving the energy behavior of block buffering using compiler optimizations. ACM Trans. Design Autom. Electr. Syst. 11(1): 228-250 (2006) - [j31]J. Ramanujam, Jinpyo Hong, Mahmut T. Kandemir, Amit Narayan, Ankush Agarwal:
Estimating and reducing the memory requirements of signal processing codes for embedded systems. IEEE Trans. Signal Process. 54(1): 286-294 (2006) - [c75]Albert Hartono, Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, Venkatesh Choppella, Russell M. Pitzer, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Identifying Cost-Effective Common Subexpressions to Reduce Operation Count in Tensor Contraction Evaluations. International Conference on Computational Science (1) 2006: 267-275 - [c74]A. Allam, J. Ramanujam, Gerald Baumgartner, P. Sadayappan:
Memory minimization for tensor contractions using integer linear programming. IPDPS 2006 - [c73]Hassan A. Salamy, J. Ramanujam:
An Effective Heuristic for Simple Offset Assignment with Variable Coalescing. LCPC 2006: 158-172 - [e1]Eduard Ayguadé, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Languages and Compilers for Parallel Computing, 18th International Workshop, LCPC 2005, Hawthorne, NY, USA, October 20-22, 2005, Revised Selected Papers. Lecture Notes in Computer Science 4339, Springer 2006, ISBN 978-3-540-69329-1 - 2005
- [j30]Gerald Baumgartner, Alexander A. Auer, David E. Bernholdt, Alina Bibireata, Venkatesh Choppella, Daniel Cociorva, Xiaoyang Gao, Robert J. Harrison, So Hirata, Sriram Krishnamoorthy, Sandhya Krishnan, Chi-Chung Lam, Qingda Lu, Marcel Nooijen, Russell M. Pitzer, J. Ramanujam, P. Sadayappan, Alexander Sibiryakov:
Synthesis of High-Performance Parallel Programs for a Class of ab Initio Quantum Chemistry Models. Proc. IEEE 93(2): 276-292 (2005) - [c72]Albert Hartono, Alexander Sibiryakov, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, So Hirata, Chi-Chung Lam, Russell M. Pitzer, J. Ramanujam, P. Sadayappan:
Automated Operation Minimization of Tensor Contraction Expressions in Electronic Structure Calculations. International Conference on Computational Science (1) 2005: 155-164 - [c71]Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Efficient Search-Space Pruning for Integrated Fusion and Tiling Transformations. LCPC 2005: 215-229 - [c70]Xiaoyang Gao, Swarup Kumar Sahoo, Chi-Chung Lam, J. Ramanujam, Qingda Lu, Gerald Baumgartner, P. Sadayappan:
Performance modeling and optimization of parallel out-of-core tensor contractions. PPoPP 2005: 266-276 - 2004
- [j29]Mahmut T. Kandemir, J. Ramanujam, Mary Jane Irwin, Narayanan Vijaykrishnan, Ismail Kadayif, Amisha Parikh:
A compiler-based approach for dynamically managing scratch-pad memories in embedded systems. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 23(2): 243-260 (2004) - [c69]Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella:
Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver. IPDPS 2004 - [c68]Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Empirical Performance-Model Driven Data Layout Optimization. LCPC 2004: 72-86 - 2003
- [j28]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee:
Reducing False Sharing and Improving Spatial Locality in a Unified Compilation Framework. IEEE Trans. Parallel Distributed Syst. 14(4): 337-354 (2003) - [c67]Mahmut T. Kandemir, Mary Jane Irwin, Guilin Chen, J. Ramanujam:
Address Register Assignment for Reducing Code Size. CC 2003: 273-289 - [c66]Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella:
Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms. HiPC 2003: 406-417 - [c65]Daniel Cociorva, Xiaoyang Gao, Sandhya Krishnan, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam:
Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints. IPDPS 2003: 37 - [c64]Alina Bibireata, Sandhya Krishnan, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella:
Memory-Constrained Data Locality Optimization for Tensor Contractions. LCPC 2003: 93-108 - 2002
- [j27]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam:
An I/O-Conscious Tiling Strategy for Disk-Resident Data Sets. J. Supercomput. 21(3): 257-284 (2002) - [c63]Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary:
Exploiting shared scratch pad memory space in embedded multiprocessor systems. DAC 2002: 219-224 - [c62]Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Chi-Chung Lam, J. Ramanujam, Robert J. Harrison, Marcel Nooijen, P. Sadayappan:
A Performance Optimization Framework for Compilation of Tensor Contraction Expressions into Parallel Programs. IPDPS 2002 - [c61]Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam:
Memory-Constrained Communication Minimization for a Class of Array Computations. LCPC 2002: 1-15 - [c60]Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, Marcel Nooijen, David E. Bernholdt, Robert J. Harrison:
Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations. PLDI 2002: 177-186 - [c59]Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Robert J. Harrison, So Hirata, Chi-Chung Lam, Marcel Nooijen, Russell M. Pitzer, J. Ramanujam, P. Sadayappan:
A high-level approach to synthesis of high-performance codes for quantum chemistry. SC 2002: 33:1-33:10 - [c58]J. Ramanujam, Sandeep Deshpande, Jinpyo Hong, Mahmut T. Kandemir:
A Heuristic for Clock Selection in High-Level Synthesis. ASP-DAC/VLSI Design 2002: 414-419 - [c57]J. Ramanujam, Satish Krishnamurthy, Jinpyo Hong, Mahmut T. Kandemir:
Address Code and Arithmetic Optimizations for Embedded Systems. ASP-DAC/VLSI Design 2002: 619-624 - [c56]N. E. Crosbie, Mahmut T. Kandemir, Ibrahim Kolcu, J. Ramanujam, Alok N. Choudhary:
Strategies for Improving Data Locality in Embedded Applications. ASP-DAC/VLSI Design 2002: 631- - [p1]J. Ramanujam:
Automatic Data Distribution. The Compiler Design Handbook 2002: 409-460 - 2001
- [j26]Mahmut T. Kandemir, J. Ramanujam:
Data Relation Vectors: A New Abstraction for Data Optimizations. IEEE Trans. Computers 50(8): 798-810 (2001) - [j25]Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary, Prithviraj Banerjee:
A Layout-Conscious Iteration Space Transformation Technique. IEEE Trans. Computers 50(12): 1321-1336 (2001) - [j24]Siddharth Rele, Vipin Jain, Santosh Pande, J. Ramanujam:
Compact and efficient code generation through program restructuring on limited memory embedded DSPs. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 20(4): 477-494 (2001) - [j23]M. Narasimhan, J. Ramanujam:
A fast approach to computing exact solutions to the resource-constrained scheduling problem. ACM Trans. Design Autom. Electr. Syst. 6(4): 490-500 (2001) - [j22]Mahmut T. Kandemir, Prithviraj Banerjee, Alok N. Choudhary, J. Ramanujam, Eduard Ayguadé:
Static and Dynamic Locality Optimizations Using Integer Linear Programming. IEEE Trans. Parallel Distributed Syst. 12(9): 922-941 (2001) - [c55]J. Ramanujam:
Integer Lattice Based Methods for Local Address Generation for Block-Cyclic Distributions. Compiler Optimizations for Scalable Parallel Systems Languages 2001: 597-648 - [c54]J. Ramanujam, Jinpyo Hong, Mahmut T. Kandemir, Amit Narayan:
Reducing Memory Requirements of Nested Loops for Embedded Systems. DAC 2001: 359-364 - [c53]Mahmut T. Kandemir, J. Ramanujam, Mary Jane Irwin, Narayanan Vijaykrishnan, Ismail Kadayif, Amisha Parikh:
Dynamic Management of Scratch-Pad Memory Space. DAC 2001: 690-695 - [c52]Daniel Cociorva, J. W. Wilkins, Gerald Baumgartner, P. Sadayappan, J. Ramanujam, Marcel Nooijen, David E. Bernholdt, Robert J. Harrison:
Towards Automatic Synthesis of High-Performance Codes for Electronic Structure Calculations: Data Locality Optimization. HiPC 2001: 237-248 - [c51]Daniel Cociorva, J. W. Wilkins, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Loop optimization for a class of memory-constrained computations. ICS 2001: 103-113 - [c50]Mahmut T. Kandemir, J. Ramanujam, Ugur Sezer:
Compiler support for block buffering. ISLPED 2001: 76-79 - [c49]Ismail Kadayif, Mahmut T. Kandemir, Narayanan Vijaykrishnan, Mary Jane Irwin, J. Ramanujam:
Morphable Cache Architectures: Potential Benefits. LCTES/OM 2001: 128-137 - [c48]Ismail Kadayif, Mahmut T. Kandemir, Narayanan Vijaykrishnan, Mary Jane Irwin, Jagannathan Ramanujam:
Morphable Cache Architectures: Potential Benefits. OM@PLDI 2001: 128-137 - 2000
- [j21]Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary:
Compiler Algorithms for Optimizing Locality and Parallelism on Shared and Distributed-Memory Machines. J. Parallel Distributed Comput. 60(8): 924-965 (2000) - [j20]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Meenakshi A. Kandaswamy:
A Unified Framework for Optimizing Locality, Parallelism, and Communication in Out-of-Core Computations. IEEE Trans. Parallel Distributed Syst. 11(7): 648-668 (2000) - [j19]Mahmut T. Kandemir, Alok N. Choudhary, Prithviraj Banerjee, J. Ramanujam, U. Nagaraj Shenoy:
Minimizing Data and Synchronization Costs in One-Way Communication. IEEE Trans. Parallel Distributed Syst. 11(12): 1232-1251 (2000) - [c47]Mahmut T. Kandemir, J. Ramanujam:
Data Relation Vectors: A New Abstraction for Data Optimizations. IEEE PACT 2000: 227-236 - [c46]M. Narasimhan, J. Ramanujam:
On lower bounds for scheduling problems in high-level synthesis. DAC 2000: 546-551 - [c45]Sunil Atri, J. Ramanujam, Mahmut T. Kandemir:
Improving Offset Assignment on Embedded Processors Using Transformations. HiPC 2000: 367-374 - [c44]Sunil Atri, J. Ramanujam, Mahmut T. Kandemir:
Improving Offset Assignment for Embedded Processors. LCPC 2000: 158-172
1990 – 1999
- 1999
- [j18]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee:
A Matrix-Based Approach to Global Locality Optimization. J. Parallel Distributed Comput. 58(2): 190-235 (1999) - [j17]Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary:
Improving Cache Locality by a Combination of Loop and Data Transformation. IEEE Trans. Computers 48(2): 159-167 (1999) - [j16]Mahmut T. Kandemir, Prithviraj Banerjee, Alok N. Choudhary, J. Ramanujam, U. Nagaraj Shenoy:
A global communication optimization technique based on data-flow analysis and linear algebra. ACM Trans. Program. Lang. Syst. 21(6): 1251-1297 (1999) - [j15]Mahmut T. Kandemir, Alok N. Choudhary, U. Nagaraj Shenoy, Prithviraj Banerjee, J. Ramanujam:
A Linear Algebra Framework for Automatic Determination of Optimal Data Layouts. IEEE Trans. Parallel Distributed Syst. 10(2): 115-135 (1999) - [c43]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee:
On Reducing False Sharing while Improving Locality on Shared Memory Multiprocessors. IEEE PACT 1999: 203-211 - [c42]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam:
I/O-Conscious Tiling for Disk-Resident Data Sets. Euro-Par 1999: 430-439 - [c41]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam:
Restructuring I/O-Intensive Computations for Locality. HPCN Europe 1999: 1097-1106 - [c40]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee:
A Framework for Interprocedural Locality Optimization Using Both Loop and Data Layout Transformations. ICPP 1999: 95-102 - [c39]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam:
Compiler Optimizations for I/O-Intensive Computations. ICPP 1999: 164-171 - [c38]Mahmut T. Kandemir, Prithviraj Banerjee, Alok N. Choudhary, J. Ramanujam, Eduard Ayguadé:
An integer linear programming approach for optimizing cache locality. International Conference on Supercomputing 1999: 500-509 - [c37]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee:
A Graph Based Framework to Detect Optimal Memory Layouts for Improving Data Locality. IPPS/SPDP 1999: 738-743 - [c36]Vipin Jain, Siddharth Rele, Santosh Pande, J. Ramanujam:
Code Restructuring for Improving Real Time Response through Code Speed, Size Trade-offs on Limited Memory Embedded DSPs. LCPC 1999: 459-463 - [c35]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee:
Improving Locality Using a Graph-Based Technique for Detecting Memory Layouts of Arrays. PP 1999 - 1998
- [j14]P. Sadayappan, Fikret Erçal, J. Ramanujam:
Partitioning Graphs on Message-Passing Machines by Pairwise Mincut. Inf. Sci. 111(1-4): 223-237 (1998) - [j13]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Meenakshi A. Kandaswamy:
Locality Optimization Algorithms for Compilation of Out-of-Core Codes. J. Inf. Sci. Eng. 14(1): 107-138 (1998) - [j12]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Rajesh Bordawekar:
Compilation Techniques for Out-of-Core Parallel Computations. Parallel Comput. 24(3-4): 597-628 (1998) - [c34]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee:
A Matrix-Based Approach to the Global Locality Optimization Problem. IEEE PACT 1998: 306-313 - [c33]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, U. Nagaraj Shenoy, Prithviraj Banerjee:
Enhancing Spatial Locality via Data Layout Optimizations. Euro-Par 1998: 422-434 - [c32]Jagannathan Ramanujam, Arun Venkatachar, Swaroop Dutta:
Efficient address sequence generation for two-level mappings in High Performance Fortran. HiPC 1998: 132-139 - [c31]M. Narasimhan, J. Ramanujam:
Improving the computational performance of ILP-based problems. ICCAD 1998: 593-596 - [c30]Mahmut T. Kandemir, U. Nagaraj Shenoy, Prithviraj Banerjee, J. Ramanujam, Alok N. Choudhary:
Minimizing Data and Synchronization Costs in One-Way Communication. ICPP 1998: 180-188 - [c29]Mahmut T. Kandemir, Alok N. Choudhary, U. Nagaraj Shenoy, Prithviraj Banerjee, J. Ramanujam:
A Hyperplane Based Approach for Optimizing Spatial Locality in Loop Nests. International Conference on Supercomputing 1998: 69-76 - [c28]Mahmut T. Kandemir, Prithviraj Banerjee, Alok N. Choudhary, J. Ramanujam, U. Nagaraj Shenoy:
A Generalized Framework for Global Communication Optimization. IPPS/SPDP 1998: 69-73 - [c27]Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary, Prithviraj Banerjee:
A Loop Transformation Algorithm Based on Explicit Data Layout Representation for Optimizing Locality. LCPC 1998: 34-50 - [c26]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam:
Improving Locality in Out-of-Core Computations Using Data Layout Transformations. LCR 1998: 359-366 - [c25]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee:
Improving Locality Using Loop and Data Transformations in an Integrated Framework. MICRO 1998: 285-297 - 1997
- [j11]Arun Venkatachar, J. Ramanujam, Ashwath Thirumalai:
Communication Generation for Block-Cyclic Distributions. Parallel Process. Lett. 7(2): 195-202 (1997) - [c24]Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary:
Compiler Algorithms for Optimizing Locality and Parallelism on Shared and Distributed Memory Machines. IEEE PACT 1997: 236- - [c23]Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary:
Optimization of Out-of-Core Computations Using Chain Vectors. Euro-Par 1997: 601-608 - [c22]Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary:
Improving the Performance of Out-of-Core Computations. ICPP 1997: 128-136 - [c21]Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary:
A Compiler Algorithm for Optimizing Locality in Loop Nests. International Conference on Supercomputing 1997: 269-276 - [c20]Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Meenakshi A. Kandaswamy:
A Unified Compiler Algorithm for Optimizing Locality, Parallelism and Communication in Out-of-core Computations. IOPADS 1997: 79-92 - [c19]J. Ramanujam, Swaroop Dutta, Arun Venkatachar:
Code Generation for Complex Subscripts in Data-Parallel Programs. LCPC 1997: 49-63 - 1996
- [j10]Ashwath Thirumalai, J. Ramanujam:
Efficient Computation of Address Sequences in Data Parallel Programs Using Closed Forms for Basis Vectors. J. Parallel Distributed Comput. 38(2): 188-203 (1996) - [j9]Rajesh Bordawekar, Alok N. Choudhary, J. Ramanujam:
Compilation and Communication Strategies for Out-of-Core Programs on Distributed Memory Machines. J. Parallel Distributed Comput. 38(2): 277-288 (1996) - [j8]Rajeev Thakur, Alok N. Choudhary, J. Ramanujam:
Efficient Algorithms for Array Redistribution. IEEE Trans. Parallel Distributed Syst. 7(6): 587-594 (1996) - [j7]Ashok K. Goel, J. Ramanujam:
A neural architecture for a class of abduction problems. IEEE Trans. Syst. Man Cybern. Part B 26(6): 854-860 (1996) - [c18]Rajesh Bordawekar, Alok N. Choudhary, J. Ramanujam:
A Framework for Integrated Communication and I/O Placement. Euro-Par, Vol. I 1996: 541-552 - [c17]Rajesh Bordawekar, Alok N. Choudhary, J. Ramanujam:
Automatic Optimization of Communication in Compiling Out-of-Core Stencil Codes. International Conference on Supercomputing 1996: 366-373 - [c16]Arun Venkatachar, J. Ramanujam, Ashwath Thirumalai:
Generalized Overlap Regions for Communication Optimization in Data-Parallel Programs. LCPC 1996: 404-419 - 1995
- [j6]J. Ramanujam, P. Sadayappan:
Mapping combinatorial optimization problems onto neural networks. Inf. Sci. 82(3-4): 239-255 (1995) - [j5]J. Ramanujam:
Beyond unimodular transformations. J. Supercomput. 9(4): 365-389 (1995) - [c15]J. Ramanujam, S. Vasanthakumar:
Statement-level independent partitioning of uniform recurrences. IPPS 1995: 229-233 - [c14]S. D. Kaushik, Chua-Huang Huang, J. Ramanujam, P. Sadayappan:
Multi-phase array redistribution: modeling and evaluation. IPPS 1995: 441-445 - [c13]Ashwath Thirumalai, J. Ramanujam:
Fast Address Sequence Generation for Data-Parallel Programs Using Integer Lattices. LCPC 1995: 191-208 - [c12]Ashwath Thirumalai, J. Ramanujam, Arun Venkatachar:
Communication Generation and Optimization for HPF. LCR 1995: 311-316 - [c11]J. Ramanujam, Amit Narayan:
Integrating Data Distribution and Loop Transformations. PP 1995: 668-673 - 1994
- [c10]J. Ramanujam:
Optimal Software Pipelining of Nested Loops. IPPS 1994: 335-342 - [c9]J. Ramanujam, Ashvin Mathew:
Analysis of Event Synchronization in Parallel Programs. LCPC 1994: 300-315 - 1992
- [j4]J. Ramanujam, P. Sadayappan:
Tiling Multidimensional Iteration Spaces for Multicomputers. J. Parallel Distributed Comput. 16(2): 108-120 (1992) - [c8]J. Ramanujam:
Non-Unimodular Transformations of Nested Loops. SC 1992: 214-223 - 1991
- [j3]J. Ramanujam, P. Sadayappan:
Compile-Time Techniques for Data Distribution in Distributed Memory Machines. IEEE Trans. Parallel Distributed Syst. 2(4): 472-482 (1991) - [c7]J. Ramanujam:
A Linear Algebraic View of Loop Transformations and Their Interaction. PP 1991: 543-548 - [c6]J. Ramanujam, P. Sadayappan:
Tiling multidimensional iteration spaces for nonshared memory machines. SC 1991: 111-120 - 1990
- [j2]Fikret Erçal, J. Ramanujam, P. Sadayappan:
Task Allocation onto a Hypercube by Recursive Mincut Bipartitioning. J. Parallel Distributed Comput. 10(1): 35-44 (1990) - [j1]P. Sadayappan, Fikret Erçal, J. Ramanujam:
Cluster partitioning approaches to mapping parallel programs onto a hypercube. Parallel Comput. 13(1): 1-16 (1990) - [c5]J. Ramanujam, P. Sadayappan:
Tiling of Iteration Spaces for Multicomputers. ICPP (2) 1990: 179-186
1980 – 1989
- 1989
- [c4]J. Ramanujam, P. Sadayappan:
A methodology for parallelizing programs for multicomputers and complex memory multiprocessors. SC 1989: 637-646 - 1988
- [c3]Fikret Erçal, J. Ramanujam, P. Sadayappan:
Task allocation onto a hypercube by recursive mincut bipartitioning. C³P 1988: 210-221 - [c2]J. Ramanujam, P. Sadayappan:
Optimization by neural networks. ICNN 1988: 325-332 - [c1]Ashok K. Goel, J. Ramanujam, P. Sadayappan:
Towards a 'neural' architecture for abductive reasoning. ICNN 1988: 681-688
last updated on 2024-12-26 01:52 CET by the dblp team
all metadata released as open data under CC0 1.0 license