


default search action
IPDPS 2014: Phoenix, AZ, USA - Workshops
- 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, Phoenix, AZ, USA, May 19-23, 2014. IEEE Computer Society 2014, ISBN 978-0-7695-5208-8
Workshop 1: HCW - Heterogeneity in Computing Workshop
- Behrooz A. Shirazi, Uwe Schwiegelshohn:
HCW Introduction. 1-2 - Behrooz A. Shirazi:
Message from the HCW Steering Committee Chair. 3 - Uwe Schwiegelshohn:
Message from the HCW General Chair. 4 - Shoukat Ali:
Message from the HCW Program Chair. 5 - David Abramson
:
HCW 2014 Keynote Talk. 6
HCW Session 1: Heterogeneous Environments for Basic Linear Algebra
- Dimitar Lukarski, Hartwig Anzt
, Stanimire Tomov
, Jack J. Dongarra:
Hybrid Multi-elimination ILU Preconditioners on GPUs. 7-16 - Ashley M. DeFlumere, Alexey L. Lastovetsky
:
Searching for the Optimal Data Partitioning Shape for Parallel Matrix Matrix Multiplication on 3 Heterogeneous Processors. 17-28 - Xavier Lacoste, Mathieu Faverge, George Bosilca, Pierre Ramet
, Samuel Thibault:
Taking Advantage of Hybrid Systems for Sparse Direct Solvers via Task-Based Runtimes. 29-38 - Tania Malik
, Vladimir Rychkov, Alexey L. Lastovetsky
, Jean-Noël Quintin:
Topology-Aware Optimization of Communications for Parallel Matrix Multiplication on Hierarchical Heterogeneous HPC Platform. 39-47
HCW Session 2: Scheduling and Resource Allocation
- Linchuan Chen, Xin Huo, Gagan Agrawal:
Scheduling Methods for Accelerating Applications on Architectures with Heterogeneous Cores. 48-57 - Bhavesh Khemka, Ryan D. Friese
, Sudeep Pasricha, Anthony A. Maciejewski
, Howard Jay Siegel, Gregory A. Koenig, Sarah Powers, Marcia Hilton, Rajendra Rambharos, Steve Poole:
Utility Driven Dynamic Resource Management in an Oversubscribed Energy-Constrained Heterogeneous System. 58-67 - Adel Essafi, Denis Trystram, Zied Zaidi:
An Efficient Algorithm for Scheduling Jobs in Volunteer Computing Platforms. 68-76
HCW Session 3: Resource-Related Performance Optimization
- Jens Gustedt
, Stéphane Vialle, Patrick P. Mercier:
Resource Centered Computing Delivering High Parallel Performance. 77-88 - Lionel Eyraud-Dubois, Przemyslaw Uznanski
:
Point-to-Point and Congestion Bandwidth Estimation: Experimental Evaluation on PlanetLab Data. 89-96 - Ayman Tarakji, Niels Ole Salscheider:
Runtime Behavior Comparison of Modern Accelerators and Coprocessors. 97-108
Workshop 2: RAW - Reconfigurable Architectures Workshop
- Jürgen Becker, Ramachandran Vaidyanathan, Marco D. Santambrogio
, Jim Tørresen, Ron Sass, Philip Heng Wai Leong
:
RAW Introduction and Committees. 109-110 - Joshua D. Walstrom, Maya B. Gokhale:
RAW 2014 Keynotes. 111
RAW Session 1: Compilers and Binary Translation for Reconfigurable Architectures
- Doug Gallatin, Aaron W. Keen, Chris Lupo
, John Y. Oliver
:
Twill: A Hybrid Microcontroller-FPGA Framework for Parallelizing Single-Threaded C Programs. 112-121 - Ali Mustafa Zaidi, David J. Greaves:
A New Dataflow Compiler IR for Accelerating Control-Intensive Code in Spatial Hardware. 122-131 - Toan X. Mai, Jongeun Lee:
Efficient Software-Based Runtime Binary Translation for Coarse-Grained Reconfigurable Architectures. 132-140
RAW Session 2: New Reconfigurable Architectures
- Georgios Smaragdos
, Danish Anis Khan, Ioannis Sourdis, Christos Strydis
, Alirad Malek, Stavros Tzilis:
A Dependable Coarse-Grain Reconfigurable Multicore Array. 141-150 - Cuong Pham-Quoc
, Zaid Al-Ars, Koen Bertels:
Automated Hybrid Interconnect Design for FPGA Accelerators Using Data Communication Profiling. 151-160 - Anil Kumar Sistla, Xiaozhong Luo, Mukund Malladi, Marc Reisner, Rajasekhar Ganduri, Gayatri Mehta:
SmartBricks: A Visual Environment to Design and Explore Novel Custom Domain-Specific Architectures. 161-169
RAW Session 3: ViPES Papers
- Harry Sidiropoulos, Kostas Siozios
, Dimitrios Soudris:
A Framework for Mapping Dynamic Virtual Kernels onto Heterogeneous Reconfigurable Platforms. 170-175 - Andreas Emeretlis, George Theodoridis, Panayiotis Alefragis
, Nikolaos S. Voros
:
A Hybrid ILP-CP Model for Mapping Directed Acyclic Task Graphs to Multicore Architectures. 176-182 - Kostas Siozios
, Dimitrios Soudris
, Michael Hübner:
A Framework for Customizing Virtual 3-D Reconfigurable Platforms at Run-Time. 183-188
RAW Session 4: Circuit-Level Applications
- Rui Policarpo Duarte, Christos-Savvas Bouganis
:
Over-clocking of Linear Projection Designs through Device Specific Optimisations. 189-198 - Michael Raitza, Markus Vogt, Christian Hochberger, Thilo Pionteck
:
Influence of Magnetic Fields and X-Radiation on Ring Oscillators in FPGAs. 199-204 - Takumi Fujimori, Minoru Watanabe:
Radiation Tolerance of Color Configuration on an Optically Reconfigurable Gate Array. 205-210
RAW Session 5: Numerical Reconfigurable Computing Applications
- Esti Stein, Yosi Ben-Asher:
Adaptive Booth Algorithm for Three-Integers Multiplication for Reconfigurable Mesh. 211-219 - Xinying Wang, Joseph Zambreno:
An FPGA Implementation of the Hestenes-Jacobi Algorithm for Singular Value Decomposition. 220-227
RAW Session 6: Applications of Reconfigurable Computing
- Osama G. Attia, Tyler Johnson, Kevin Townsend, Phillip H. Jones, Joseph Zambreno:
CyGraph: A Reconfigurable Architecture for Parallel Breadth-First Search. 228-235 - Gianluca Durelli, Fabrizio Spada, Riccardo Cattaneo
, Christian Pilato
, Danilo Pau
, Marco D. Santambrogio
:
Adaptive Raytracing Implementation Using Partial Dynamic Reconfiguration. 236-242 - Riccardo Cattaneo
, Riccardo Bellini, Gianluca Durelli, Christian Pilato
, Marco D. Santambrogio
, Donatella Sciuto
:
PaRA-Sched: A Reconfiguration-Aware Scheduler for Reconfigurable Architectures. 243-250
RAW Poster Session 1
- Hiroki Nishiyama, Masato Inagi, Shin'ichi Wakabayashi, Shinobu Nagayama, Keisuke Inoue, Mineo Kaneko:
An ILP-Based Optimal Circuit Mapping Method for PLDs. 251-256 - Cristiano Bacelar de Oliveira, João M. P. Cardoso
, Eduardo Marques
:
High-Level Synthesis from C vs. a DSL-Based Approach. 257-262 - Zhang Zhang, Swamy D. Ponpandi, Akhilesh Tyagi:
An Evaluation of User Satisfaction Driven Scheduling in a Polymorphic Embedded System. 263-268 - Georgios Tzimpragos
, Christoforos Kachris, Dimitrios Soudris, Ioannis Tomkos
:
A Low-Latency Algorithm and FPGA Design for the Min-Search of LDPC Decoders. 269-274 - Jahanzeb Anwer, Marco Platzner
, Sebastian Meisner:
FPGA Redundancy Configurations: An Automated Design Space Exploration. 275-280
RAW Poster Session 2
- Chen Mei, Peng Cao, Yang Zhang, Bo Liu, Leibo Liu
:
Hierarchical Pipeline Optimization of Coarse Grained Reconfigurable Processor for Multimedia Applications. 281-286 - Alexander Wold, Andreas Agne, Jim Tørresen:
Module Placement Using Constraint Programming in Run-Time Reconfigurable Systems. 287-292 - Hasan Erdem Yantir, Arda Yurdakul:
An Efficient Heterogeneous Register File Implementation for FPGAs. 293-298 - Bernhard Schmidt, Daniel Ziener
, Jürgen Teich:
Minimizing Scrubbing Effort through Automatic Netlist Partitioning and Floorplanning. 299-304 - Viet Vu Duy, Timo Sandmann, Steffen Baehr, Oliver Sander, Jürgen Becker
:
Virtualization Support for FPGA-Based Coprocessors Connected via PCI Express to an Intel Multicore Platform. 305-310
Workshop 3: HIPS - Workshop on High-Level Parallel Programming Models and Supportive Environments
- John Cavazos:
HIPS Introduction and Committees. 311
HIPS Session 1: System Support
- Mads Ruben Burgdorff Kristensen
, Simon Andreas Frimann Lund, Troels Blum, Kenneth Skovhede
, Brian Vinter:
Bohrium: A Virtual Machine Approach to Portable Parallelism. 312-321 - Juan Carlos Martínez Santos
, Yunsi Fei
:
HATI: Hardware Assisted Thread Isolation for Concurrent C/C++ Programs. 322-331 - Tatsuya Abe
, Toshiyuki Maeda:
A General Model Checking Framework for Various Memory Consistency Models. 332-341
HIPS Session 2: Optimization
- Lai Wei, John M. Mellor-Crummey
:
Autotuning Tensor Transposition. 342-351 - Weifeng Liu, Isaías A. Comprés Ureña, Michael Gerndt, Bin Gong:
Automatic MPI-IO Tuning with the Periscope Tuning Framework. 352-360 - Jithin Jose, Khaled Hamidouche, Jie Zhang, Akshay Venkatesh, Dhabaleswar K. Panda:
Optimizing Collective Communication in UPC. 361-370
HIPS Session 3: Effective Communication
- Simon Pickartz
, Pablo Reble, Carsten Clauss, Stefan Lankes
:
SWIFT: A Transparent and Flexible Communication Layer for PCIe-Coupled Accelerators and (Co-)Processors. 371-380 - Christopher Boelmann, Lorenz Schwittmann, Torben Weis
:
Deterministic Synchronization of Multi-threaded Programs with Operational Transformation. 381-390 - Sai Charan Koduru, Keval Vora
, Rajiv Gupta
:
ABC2: Adaptively Balancing Computation and Communication in a DSM Cluster of Multicores for Irregular Applications. 391-400
Workshop 4: NIDISC - Workshop on Nature Inspired Distributed Computing
- Pascal Bouvry
, Franciszek Seredynski
, El-Ghazali Talbi:
NIDISC Introduction and Committees. 401
NIDISC Session 1: Applications of Bio-Inspired Algorithms
- Theodore P. Pavlic
:
Using Physical Stigmergy in Decentralized Optimization under Multiple Non-separable Constraints: Formal Methods and an Intelligent Lighting Example. 402-411 - Amir Nakib
, El-Ghazali Talbi, A. Fuser:
Hybrid Metaheuristic for Annual Hydropower Generation Optimization. 412-419 - Fatima Adly, Paul D. Yoo
, Sami Muhaidat
, Yousof Al-Hammadi
:
Machine-Learning-Based Identification of Defect Patterns in Semiconductor Wafer Maps: An Overview and Proposal. 420-429 - Alain Fuser, Florent Fontaine, Jack Copper:
Data Quality, Consistency, and Interpretation Management for Wind Farms by Using Neural Networks. 430-438
NIDISC Session 2: Wireless Networks and Mobility Management
- Antonina Tretyakova, Franciszek Seredynski
, Pascal Bouvry
:
Graph-Based Cellular Automata Approach to Maximum Lifetime Coverage Problem in Wireless Sensor Networks. 439-447 - Sankha Baran Dutta, Robert D. McLeod, Marcia R. Friesen:
GPU Accelerated Nature Inspired Methods for Modelling Large Scale Bi-directional Pedestrian Movement. 448-456 - Marcin Seredynski, Patricia Ruiz
, Krzysztof Szczypiorski
, Djamel Khadraoui:
Improving Bus Ride Comfort Using GLOSA-Based Dynamic Speed Optimisation. 457-463 - Huang Cheng, Xin Fei, Azzedine Boukerche, Mohammed Almulla
:
A Genetic Algorithm-Based Sparse Coverage over Urban VANETs. 464-469
NIDISC Session 3: Multi-objective Optimization
- Jakub Gasior
, Franciszek Seredynski
:
A Game-Theoretic Approach to Multiobjective Job Scheduling in Cloud Computing Systems. 470-479 - Yacine Kessaci, Nouredine Melab, El-Ghazali Talbi:
Multi-level and Multi-objective Survey on Cloud Scheduling. 480-488 - Benoît Bertholon, Sébastien Varrette, Pascal Bouvry
:
Comparison of Multi-objective Optimization Algorithms for the JShadObf JavaScript Obfuscator. 489-496
Workshop 5: HiCOMB - Workshop on High Performance Computational Biology
- Alba Cristina Magalhaes Alves de Melo
, Srinivas Aluru, David A. Bader
:
HiCOMB Introduction and Committees. 497-498 - Stephen Larson, Ümit V. Çatalyürek, Ananth Kalyanaraman:
HiCOMB Keynote and Invited Talks. 499
HiCOMB Session 1: Parallel Algorithms for Biological Sequence Analysis
- Jaroslaw Zola
:
Constructing Similarity Graphs from Large-Scale Biological Sequence Collections. 500-507 - Yi Wang, Gagan Agrawal, Hatice Gulcin Ozer, Kun Huang:
Removing Sequential Bottlenecks in Analysis of Next-Generation Sequencing Data. 508-517
HiCOMB Session 2: Parallel/Distributed Architectures for Biological Applications
- Alexey M. Kozlov, Christian Goll, Alexandros Stamatakis
:
Efficient Computation of the Phylogenetic Likelihood Function on the Intel MIC Architecture. 518-527 - Jie Li, Amin Salighehdar, Narayan Ganesan:
Process Simulation of Complex Biochemical Pathways in Explicit 3D Space Enabled by Heterogeneous Computing Platform. 528-535 - Kary A. C. S. Ocaña
, Silvia Benza, Daniel de Oliveira, Jonas Dias, Marta Mattoso
:
Exploring Large Scale Receptor-Ligand Pairs in Molecular Docking Workflows in HPC Clouds. 536-545 - Natasha Pavlovikj, Kevin Begcy
, Sairam Behera, Malachy Campbell, Harkamal Walia, Jitender S. Deogun:
A Comparison of a Campus Cluster and Open Science Grid Platforms for Protein-Guided Assembly Using Pegasus Workflow Management System. 546-555
HiCOMB Session 3: Metagenomics and Assembly
- Sasha Ames, Jonathan E. Allen
, David A. Hysom, G. Scott Lloyd, Maya B. Gokhale:
Design and Optimization of a Metagenomics Analysis Workflow for NVRAM. 556-565 - Vipin Sachdeva
, Chang Sik Kim, Kirk E. Jordan, Martyn D. Winn:
Parallelization of the Trinity Pipeline for De Novo Transcriptome Assembly. 566-575 - Xiaohui Duan, Kun Zhao, Weiguo Liu:
HiPGA: A High Performance Genome Assembler for Short Read Sequence Data. 576-584
Workshop 6: APDCM - Advances in Parallel and Distributed Computing Models
- Oscar H. Ibarra:
APDCM Introduction and Committees. 585
APDCM Session 1
- Kazuya Tani, Daisuke Takafuji, Koji Nakano
, Yasuaki Ito:
Bulk Execution of Oblivious Algorithms on the Unified Memory Machine, with GPU Implementation. 586-595 - Mario Alberto Chapa Martell, Sato Hiroyuki:
A Linear Performance-Breakdown Model for GPU Programming Optimization Guidance. 596-603 - Guangping Tang, Kenli Li, Keqin Li, Hang Chen, Jiayi Du:
A Hybrid Parallel Tridiagonal Solver on Multi-core Architectures. 604-613 - Atsushi Koike, Kunihiko Sadakane
:
A Novel Computational Model for GPUs with Application to I/O Optimal Sorting Algorithms. 614-623 - Munara Tolubaeva, Yonghong Yan, Barbara M. Chapman:
Predicting Cache Contention for Multithread Applications at Compile Time. 624-631
APDCM Session 2
- Guyue Wang, Shinichi Yamagiwa, Koichi Wada:
Parallelism Extraction Algorithm from Stream-Based Processing Flow Applying Spanning Tree. 632-641 - Quan Chen, Long Zheng, Minyi Guo, Zhiyi Huang:
EEWA: Energy-Efficient Workload-Aware Task Scheduling in Multi-core Architectures. 642-651 - Chunyan Wang, Shoichi Hirasawa, Hiroyuki Takizawa
, Hiroaki Kobayashi:
A Platform-Specific Code Smell Alert System for High Performance Computing Applications. 652-661 - Anne Benoit
, Jean-Marc Nicod, Veronika Rehn-Sonigo:
Optimizing Buffer Sizes for Pipeline Workflow Scheduling with Setup Times. 662-670 - Hatem M. El-Boghdadi:
WECPAR: List Ranking Algorithm and Relative Computational Power. 671-678
APDCM Session 3
- George Bosilca, Aurélien Bouteiller
, Thomas Hérault
, Yves Robert
, Jack J. Dongarra:
Assessing the Impact of ABFT and Checkpoint Composite Strategies. 679-688 - Julien Herrmann, Loris Marchal
, Yves Robert:
Memory-Aware List Scheduling for Hybrid Platforms. 689-698 - Jocelyne Faddoul, Wendy MacCaull:
A Parallel Framework for Handling Non-determinism with Expressive Description Logics. 699-708 - Martti Forsell, Jussi Roivainen, Ville Leppänen
:
Prototyping the MBTAC Processor for the REPLICA CMP. 709-716 - Jens Breitbart, Mareike Schmidtobreick, Vincent Heuveline
:
Evaluation of the Global Address Space Programming Interface (GASPI). 717-726
APDCM Session 4
- Chong Li, Gaétan Hains:
GPS: Towards Simplified Communication on SGL Model. 727-736 - Gokarna Sharma, Hari Krishnan, Costas Busch, Steven R. Brandt:
Near-Optimal Location Tracking Using Sensor Networks. 737-746 - Yihua Ding, James Zijun Wang, Pradip K. Srimani:
Self-Stabilizing Algorithm for Maximal 2-Packing with Safe Convergence in an Arbitrary Graph. 747-754 - Satoshi Fujita:
Minimum Set Cover of Sparsely Distributed Sensor Nodes by a Collection of Unit Disks. 755-761 - Xin Zhou, Yasuaki Ito, Koji Nakano
:
An Efficient Implementation of the Gradient-Based Hough Transform Using DSP Slices and Block RAMs on the FPGA. 762-770
Workshop 7: HPPAC - High-Performance, Power-Aware Computing
- Dong Li, Robert J. Fowler:
HPPAC Introduction and Committees. 771-772
HPPAC Session 1: Power and Energy Analysis and Profiling
- Edgar A. León
, Ian Karlin:
Characterizing the Impact of Program Optimizations on Power and Energy for Explicit Hydrodynamics. 773-781 - Chung-Hsing Hsu, Jacob Combs, Jolie Nazor, Fabian Santiago, Rachelle Thysell, Suzanne Rivoire, Stephen W. Poole:
Application Power Signature Analysis. 782-789 - Ryan E. Grant, Stephen L. Olivier
, James H. Laros III, Ron Brightwell, Allan Porterfield:
Metrics for Evaluating Energy Saving Techniques for Resilient HPC Systems. 790-797
HPPAC Session 2: Power-Efficient Hardware
- Ehsan Atoofian:
Reducing Static and Dynamic Power of L1 Data Caches in GPGPUs. 798-804 - Gilbert Netzer, S. Lennart Johnsson, Daniel Ahlin, Eric Stotzer, Pekka Varis, Erwin Laure:
Exploiting DMA for Performance and Energy Optimized STREAM on a DSP. 805-814 - Nico Reissmann, Jan Christian Meyer, Magnus Jahre
:
A Study of Energy and Locality Effects Using Space-Filling Curves. 815-822
HPPAC Session 3: Large Scale Power Management
- Ashkan Paya, Dan C. Marinescu:
Energy-Aware Load Balancing Policies for the Cloud Ecosystem. 823-832 - George Terzopoulos, Helen D. Karatza
:
Bag-of-Task Scheduling on Power-Aware Clusters Using a DVFS-Based Mechanism. 833-840 - Haibo Zhang, Wenting Han, Feng Li, Songtao He, Yichao Cheng, Hong An, Zhitao Chen:
A Criticality-Aware DVFS Runtime Utility for Optimizing Power Efficiency of Multithreaded Applications. 841-848
Workshop 8: HPGC - High-Performance Grid and Cloud Computing Workshop
- Eric E. Aubanel, Virendrakumar C. Bhavsar, Michael A. Frumkin:
HPGC Introduction and Committees. 849 - Rajkumar Buyya, Derek Murray:
HPGC Keynotes. 850-851
HPGC Session 1
- Andrew J. Younge, John Paul Walters, Stephen P. Crago, Geoffrey Charles Fox:
Evaluating GPU Passthrough in Xen for High Performance Cloud Computing. 852-859 - Teng Long, Il-Chul Yoon, Alan Sussman
, Adam A. Porter, Atif M. Memon:
Scalable System Environment Caching and Sharing for Distributed Virtual Machines. 860-867 - Hangwei Qian, Michael Rabinovich
:
Mega Data Center for Elastic Internet Applications. 868-874
HPGC Session 2
- Ashkan Paya, Dan C. Marinescu:
Cloud-Based Simulation of a Smart Power Grid. 875-884 - Seung-Hwan Lim, Gautam S. Thakur
, James L. Horey:
Analyzing Reliability of Virtual Machine Instances with Dynamic Pricing in the Public Cloud. 885-893 - Mohammad Ahmadian, Ashkan Paya, Dan C. Marinescu:
Security of Applications Involving Multiple Organizations and Order Preserving Encryption in Hybrid Cloud Environments. 894-903
Workshop 9: AsHES - Accelerators and Hybrid Exascale Systems
- Yunquan Zhang:
AsHES Introduction and Committees. 904-906 - Jeffrey S. Vetter:
AsHES Keynote. 907
AsHES Session 1: Programming Model and Performance Optimizations
- Felix Schmitt, Robert Dietrich, Guido Juckeland
:
Scalable Critical Path Analysis for Hybrid MPI-CUDA Applications. 908-915 - Shuai Che, Jiayuan Meng, Kevin Skadron
:
Dymaxion++: A Directive-Based API to Optimize Data Layout and Memory Mapping for Heterogeneous Systems. 916-924 - Chenggang Lai, Zhijun Hao, Miaoqing Huang, Xuan Shi, Haihang You:
Comparison of Parallel Programming Models on Intel MIC Computer Cluster. 925-932 - Marco Maggioni, Tanya Y. Berger-Wolf
:
CoAdELL: Adaptivity and Compression for Improving Sparse Matrix-Vector Multiplication on GPUs. 933-940
AsHES Session 2: Accelerating Applications
- Hartwig Anzt
, William B. Sawyer, Stanimire Tomov
, Piotr Luszczek, Ichitaro Yamazaki, Jack J. Dongarra:
Optimizing Krylov Subspace Solvers on Graphics Processing Units. 941-949 - Lipeng Wang, Yuandong Chan, Xiaohui Duan, Haidong Lan, Xiangxu Meng, Weiguo Liu:
XSW: Accelerating Biological Database Search on Xeon Phi. 950-957 - Simplice Donfack, Stanimire Tomov
, Jack J. Dongarra:
Dynamically Balanced Synchronization-Avoiding LU Factorization with Multicore and GPUs. 958-965 - Qi Hu, Nail A. Gumerov, Rio Yokota
, Lorena A. Barba
, Ramani Duraiswami
:
Scalable Fast Multipole Accelerated Vortex Methods. 966-975
AsHES Session 3: Emerging Hybrid Systems
- Lena Oden, Holger Fröning, Franz-Josef Pfreundt:
Infiniband-Verbs on GPU: A Case Study of Controlling an Infiniband Network Device from the GPU. 976-983 - Anish Varghese, Bob Edwards, Gaurav Mitra, Alistair P. Rendell
:
Programming the Adapteva Epiphany 64-Core Network-on-Chip Coprocessor. 984-992 - Jianting Zhang, Dali Wang:
High-Performance Zonal Histogramming on Large-Scale Geospatial Rasters Using GPUs and GPU-Accelerated Clusters. 993-1000
Workshop 10: PLC - Programming Models, Languages, and Compilers Workshop for Manycore and Heterogeneous Architectures
- Barbara M. Chapman:
PLC Introduction and Committees. 1001
PLC Session 1: Programming and Compilation Techniques for GPUs
- Troels Blum, Mads Ruben Burgdorff Kristensen, Brian Vinter:
Transparent GPU Execution of NumPy Applications. 1002-1010 - Dmitry Mikushin, Nikolay Likhogrud, Eddy Z. Zhang, Christopher Bergstrom:
KernelGen - The Design and Implementation of a Next Generation Compiler Platform for Accelerating Numerical Models on GPUs. 1011-1020 - Wei Ding
, Ligang Lu, Mauricio Araya-Polo, Amik St.-Cyr, Detlef Hohl, Barbara M. Chapman:
Using GPU Shared Memory with a Directive-Based Approach. 1021-1028
PLC Session 2: Libraries and Optimization Frameworks
- Jagan Jayaraj, Pei-Hung Lin
, Paul R. Woodward, Pen-Chung Yew
:
CFD Builder: A Library Builder for Computational Fluid Dynamics. 1029-1038 - Benjamin Ranft, Oliver Denninger, Philip Pfaffe:
A Stream Processing Framework for On-Line Optimization of Performance and Energy Efficiency on Heterogeneous Systems. 1039-1048
PLC Session 3: Tools and Performance Evaluation
- Ahmad Qawasmeh
, Abid Muslim Malik, Barbara M. Chapman:
OpenMP Task Scheduling Analysis via OpenMP Runtime API and Tool Visualization. 1049-1058 - Pavel Zaichenkov, Bert Gijsbers, Clemens Grelck, Olga Tveretina, Alex Shafarenko:
A Case Study in Coordination Programming: Performance Evaluation of S-Net vs Intel's Concurrent Collections. 1059-1067
Workshop 11: EduPar-NSF/TCPP Workshop on Parallel and Distributed Computing Education
- Sushil K. Prasad
:
EduPar Introduction and Committees. 1068-1069 - Randy H. Katz:
EduPar Keynote. 1070
EduPar Session: Introductory Course and Across Curriculum
- Steven Bogaerts:
Limited Time and Experience: Parallelism in CS1. 1071-1078 - Victor P. Gergel, Alexey Liniov
, Iosif B. Meyerov
, Alexander Sysoyev
:
NSF/IEEE-TCPP Curriculum Implementation at the State University of Nizhni Novgorod. 1079-1084 - David J. John, Stan J. Thomas:
Parallel and Distributed Computing across the Computer Science Curriculum. 1085-1090 - Yinong Chen
, Zhizheng Zhou:
Service-Oriented Computing and Software Integration in Computing Curriculum. 1091-1098 - Nasser Giacaman, Oliver Sinnen
:
EA: Research-Infused Teaching of Parallel Programming Concepts for Undergraduate Software Engineering Students. 1099-1105 - Clayton Ferner, Barry Wilkinson, Barbara Heath:
Using Patterns to Teach Parallel Computing. 1106-1113
EduPar Session: Miscellaneous
- Linh Bao Ngo, Edward B. Duffy, Amy W. Apon:
Teaching HDFS/MapReduce Systems Concepts to Undergraduates. 1114-1121 - H. Martin Bücker
, M. Ali Rostami
:
Interactively Exploring the Connection between Nested Dissection Orderings for Parallel Cholesky Factorization and Vertex Separators. 1122-1129 - David Toth:
A Portable Cluster for Each Student. 1130-1134
Workshop 12: GABB - Graph Algorithms Building Blocks
- Tim Mattson, David A. Bader
, Aydin Buluç
, John R. Gilbert, Joseph Gonzalez
, Jeremy Kepner:
GABB Introduction. 1135-1137
Workshop 13: PDSEC - Workshop on Parallel and Distributed Scientific and Engineering Computing
- Peter E. Strazdins, Raphaël Couturier
, Michelle Mills Strout, Keita Teranishi, Thomas Rauber, Gudula Rünger, Laurence T. Yang:
PDSEC Introduction and Committees. 1138-1139
PDSEC Session 1: Best Papers
- William A. Magato, Philip A. Wilsey:
llamaOS: A Solution for Virtualized High-Performance Computing Clusters. 1140-1149 - Azzam Haidar, Piotr Luszczek, Jack J. Dongarra:
New Algorithm for Computing Eigenvectors of the Symmetric Eigenvalue Problem. 1150-1159
PDSEC Session 2: Algorithms (I)
- Davide Barbieri, Valeria Cardellini
, Salvatore Filippone
:
Exhaustive Key Search on Clusters of GPUs. 1160-1168 - Md. Mohsin Ali
, James Southern, Peter E. Strazdins, Brendan Harding
:
Application Level Fault Recovery: Using Fault-Tolerant Open MPI in a PDE Solver. 1169-1178 - Sudip K. Seal, Srikanth B. Yoginath
, Michael K. Miller:
Nanoscale Cluster Detection in Massive Atom Probe Tomography Data. 1179-1188 - Angel Gonzalez Mendez, Graciela Román-Alonso, Fernando Rojas-González, Miguel Alfonso Castro-García, Miguel Aguilar Cornejo, Salomón Cordero-Sánchez
:
Construction of Porous Networks Subjected to Geometric Restrictions by Using OpenMP. 1189-1197
PDSEC Session 3: Systems and Performance Analysis
- Daniel Espling, Per-Olov Östberg, Erik Elmroth:
Integration and Evaluation of Decentralized Fairshare Prioritization (Aequus). 1198-1207 - Jeremiah J. Wilke:
Coordination Languages and MPI Perturbation Theory: The FOX Tuple Space Framework for Resilience. 1208-1217 - Tyson Kendon
, Jörg Denzinger
:
DisSLib: CC: A Library for Distributed Search with a Central Common Search State. 1218-1227 - Hongbo Zou, Yongen Yu, Wei Tang, Hsuanwei Michelle Chen:
Improving I/O Performance with Adaptive Data Compression for Big Data Applications. 1228-1237 - Bertrand Putigny, Benoit Ruelle, Brice Goglin
:
Analysis of MPI Shared-Memory Communication Performance from a Cache Coherence Perspective. 1238-1247
PDSEC Session 4: Algorithms (II)
- Andrew A. Haigh, Eric C. McCreath:
Acceleration of GPU-Based Ultrasound Simulation via Data Compression. 1248-1255 - Klaus Kofler, Dominik Steinhauser, Biagio Cosenza
, Ivan Grasso, Sabine Schindler, Thomas Fahringer
:
Kd-Tree Based N-Body Simulations with Volume-Mass Heuristic on the GPU. 1256-1265 - Norihisa Fujita, Hideo Nuga
, Taisuke Boku, Yasuhiro Idomura
:
Nuclear Fusion Simulation Code Optimization and Performance Evaluation on GPU Cluster. 1266-1274 - Zhe Weng, Peter E. Strazdins:
Acceleration of a Python-Based Tsunami Modelling Application via CUDA and OpenHMPP. 1275-1284 - Roksana Hossain, Sebastian Magierowski, Geoffrey G. Messier:
GPU Enhanced Path Finding for an Unmanned Aerial Vehicle. 1285-1293
Workshop 14: DPDNS - Dependable Parallel, Distributed, and Network-Centric Systems
- Dimiter Avresky, Erik Maehle, Salvatore Distefano
:
DPDNS Introduction and Committees. 1294-1295 - Edgar Nett:
DPDNS Keynote. 1296
DPDNS Session: Applications
- Timo Lindhorst, Burkhard Weseloh, Edgar Nett:
Maintaining Dependable Communication Service for Mobile Stations in Wireless Mesh Networks by Tracking Capacity Demands. 1297-1305 - Ammar Amory, Thomas Tosik, Erik Maehle:
A Load Balancing Behavior for Underwater Robot Swarms to Increase Mission Time and Fault Tolerance. 1306-1313 - Andreas Dittrich, Stefan Wanja, Miroslaw Malek:
ExCovery - A Framework for Distributed System Experiments and a Case Study of Service Discovery. 1314-1323 - Mohamed Mohamedin
, Roberto Palmieri
, Binoy Ravindran
:
Managing Soft-Errors in Transactional Systems. 1324-1329
DPDNS Session: Theoretical Aspects
- Salvatore Distefano
:
Standby System Reliability through DRBD. 1330-1337 - Yingxu Lai, Qiuyue Pan, Zenghui Liu, Yinong Chen
, Zhizheng Zhou:
Trust-Based Security for the Spanning Tree Protocol. 1338-1343 - Emil Vassev, Mike Hinchey
:
Autonomy Requirements Engineering for Self-Adaptive Science Clouds. 1344-1353
Workshop 15: MTAAP - Workshop on Multi-threaded Architectures and Applications
- Luiz DeRose:
MTAAP Introduction and Committees. 1354
MTAAP Session: Algorithms and Position Papers
- Siddharth Gupta, Diana Palsetia, Md. Mostofa Ali Patwary, Ankit Agrawal
, Alok N. Choudhary:
A New Parallel Algorithm for Two-Pass Connected Component Labeling. 1355-1362 - Jaime Arteaga, Stéphane Zuckerman, Elkin Garcia, Guang R. Gao:
Position Paper: Locality-Driven Scheduling of Tasks for Data-Dependent Multithreading. 1363-1367 - Walid J. Ghandour, Nadine J. Ghandour:
Position Paper: Leveraging Strength-Based Dynamic Slicing to Identify Control Reconvergence Instructions. 1368-1373
MTAAP Session: Graph Analytics
- Hao Lu
, Mahantesh Halappanavar, Ananth Kalyanaraman, Sutanay Choudhury:
Parallel Heuristics for Scalable Community Detection. 1374-1385 - Ahmet Erdem Sariyüce, Erik Saule, Kamer Kaya, Ümit V. Çatalyürek:
Hardware/Software Vectorization for Closeness Centrality on Multi-/Many-Core Architectures. 1386-1395 - Adam McLaughlin, David A. Bader
:
Revisiting Edge and Node Parallelism for Dynamic GPU Graph Analytics. 1396-1406
MTAAP Session: Accelerators
- Cheng Wang, Rengan Xu, Sunita Chandrasekaran, Barbara M. Chapman, Oscar R. Hernandez:
A Validation Testsuite for OpenACC 1.0. 1407-1416 - Anas Abu-Doleh, Kamer Kaya, Mohamed Abouelhoda, Ümit V. Çatalyürek:
Extracting Maximal Exact Matches on GPU. 1417-1426 - B. Neelima
, G. Ram Mohana Reddy, Prakash S. Raghavendra:
Predicting an Optimal Sparse Matrix Format for SpMV Computation on GPU. 1427-1436
Workshop 16: LSPP - Workshop on Large-Scale Parallel Processing
- Darren J. Kerbyson, Ram Rajamony, Charles C. Weems:
LSPP Introduction and Committees. 1437
LSPP Session 1: Performance Analysis and Optimization
- Arash Shamaei, Bella Bose, Mary Flahive:
Higher Dimensional Gaussian Networks. 1438-1447
LSPP Session 2: Modeling Performance for Scaling
- Bo Li, Hung-Ching Chang, Shuaiwen Song, Chun-Yi Su, Timmy Meyer, John Mooring, Kirk W. Cameron
:
The Power-Performance Tradeoffs of the Intel Xeon Phi on HPC Applications. 1448-1456 - Ying-Chieh Wang, Che-Rung Lee, Yeh-Ching Chung, I-Hsin Chung, Michael Perrone:
Performance Modeling for Hardware Thread-Level Speculation. 1457-1464 - John D. Leidel, Yong Chen
:
HMC-Sim: A Simulation Framework for Hybrid Memory Cube Devices. 1465-1474
LSPP Session 3: Large-Scale Systems
- Roberto Gioiosa, Gokcen Kestor
, Darren J. Kerbyson:
Online Monitoring System for Performance Fault Detection. 1475-1484
LSPP Session 4: Scheduling
- Paul T. Lin, Matthew T. Bettencourt, Stefan Domino, Travis Fisher, Mark Hoemmen, Jonathan J. Hu, Eric T. Phipps, Andrey Prokopenko, Sivasankaran Rajamanickam, Christopher M. Siefert, Eric C. Cyr
, Stephen Kennon:
Towards Extreme-Scale Simulations with Next-Generation Trilinos: A Low Mach Fluid Application Case Study. 1485-1494 - Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra:
Design and Implementation of a Large Scale Tree-Based QR Decomposition Using a 3D Virtual Systolic Array and a Lightweight Runtime. 1495-1504 - Michael Sevilla, Ike Nassi
, Kleoni Ioannidou, Scott A. Brandt, Carlos Maltzahn:
SupMR: Circumventing Disk and Memory Bandwidth Bottlenecks for Scale-up MapReduce. 1505-1514
Workshop 17: PCO - Parallel Computing and Optimization
- Didier El Baz
:
PCO Introduction and Committees. 1515
PCO Session 1: Optimization Techniques for Parallel or Distributed Architectures
- Congfeng Jiang, Jian Wan, Christophe Cérin, Paolo Gianessi
, Yanik Ngoko:
Towards Energy Efficient Allocation for Applications in Volunteer Cloud. 1516-1525 - Karl-Eduard Berger, François Galea, Bertrand Le Cun, Renaud Sirdey
:
Fast Generation of Large Task Network Mappings. 1526-1530
PCO Session 2: Parallel Optimization Algorithms
- Tarek Menouer
, Bertrand Le Cun:
Adaptive N to P Portfolio for Solving Constraint Programming Problems on Top of the Parallel Bobpp Framework. 1531-1540 - Yves Caniou, Philippe Codognet:
Dependent Walks in Parallel Local Search. 1541-1546 - Mhand Hifi, Stéphane Nègre, Toufik Saadi, Sagvan Saleh, Lei Wu:
A Parallel Large Neighborhood Search-Based Heuristic for the Disjunctively Constrained Knapsack Problem. 1547-1551 - Yuji Shinano, Tobias Achterberg, Timo Berthold, Stefan Heinz, Thorsten Koch
, Michael Winkler:
Solving Hard MIPLIB2003 Problems with ParaSCIP on Supercomputers: An Update. 1552-1561
PCO Session 3: Task Scheduling and Miscellaneous
- Shuli Wang, Kenli Li, Jing Mei, Keqin Li, Yan Wang:
A Task Scheduling Algorithm Based on Replication for Maximizing Reliability on Heterogeneous Computing Systems. 1562-1571 - Si Zheng, Yunhuai Liu, Tian He, Shanshan Li, Xiangke Liao:
SkewControl: Gini Out of the Bottle. 1572-1580 - Yuri Alexeev, Sheri A. Mickelson, Sven Leyffer
, Robert L. Jacob, Anthony P. Craig:
The Heuristic Static Load-Balancing Algorithm Applied to the Community Earth System Model. 1581-1590 - Didier El Baz
, Benoît Piranda, Julien Bourgeois:
A Distributed Algorithm for a Reconfigurable Modular Surface. 1591-1598
Workshop 18: ParLearning - Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics
- Abhinav Vishnu, Yinglong Xia:
ParLearning Introduction and Committees. 1599-1600 - Eric P. Xing:
ParLearning Keynote. 1601
ParLearning Session 1
- Hsuan-Yi Chu, Yinglong Xia, Anand V. Panangadan, Viktor K. Prasanna:
Wait-Free Primitives for Initializing Bayesian Network Structure Learning on Multicore Processors. 1602-1611 - Karl Jansson, Håkan Sundell, Henrik Boström:
gpuRF and gpuERT: Efficient and Scalable GPU Algorithms for Decision Tree Ensembles. 1612-1621 - Lei Jin, Zhaokang Wang, Rong Gu, Chunfeng Yuan, Yihua Huang:
Training Large Scale Deep Neural Networks on the Intel Xeon Phi Many-Core Coprocessor. 1622-1630 - Xiujuan Qian, Yongli Wang, Xiaohui Jiang:
Parallel Bayesian Network Modelling for Pervasive Health Monitoring System. 1631-1637
ParLearning Session 2
- Nitin Sukhija, Brandon M. Malone, Srishti Srivastava, Ioana Banicescu, Florina M. Ciorba
:
Portfolio-Based Selection of Robust Dynamic Loop Scheduling Algorithms Using Machine Learning. 1638-1647 - Wei Wang, Guisong Yang, Naixue Xiong, Xingyu He, Wenzhong Guo:
A General P2P Scheme for Constructing Large-Scale Virtual Environments. 1648-1655
ParLearning Session 3
- Peter D. Kirchner, Matthias Böhm, Berthold Reinwald, Daby M. Sow, J. Michael Schmidt, Deepak S. Turaga, Alain Biem:
Large Scale Discriminative Metric Learning. 1656-1663 - Hongjian Qiu, Rong Gu, Chunfeng Yuan, Yihua Huang:
YAFIM: A Parallel Frequent Itemset Mining Algorithm with Spark. 1664-1671 - Yang Bo, Naixue Xiong, Wenzhong Guo:
The Empirical Research of Virtual Enterprise Knowledge Transfer's Effectiveness Faced to the Independent Innovation Ability. 1672-1679 - Naixue Xiong, Guoxiang Tong, Wenzhong Guo, Jian Tan, Guanning Wu:
A Distributed Speech Algorithm for Large Scale Data Communication Systems. 1680-1687
Workshop 19: HPDIC - High Performance Data Intensive Computing
- Christophe Cérin, Congfeng Jiang:
HPDIC Introduction and Committees. 1688
HPDIC Session 1: Memory, I/O, and Performance Enhancement
- Vishwanath Venkatesan, Mohamad Chaarawi, Quincey Koziol, Edgar Gabriel:
Compactor: Optimization Framework at Staging I/O Nodes. 1689-1697 - Keita Iwabuchi, Hitoshi Sato
, Ryo Mizote, Yuichiro Yasui, Katsuki Fujisawa
, Satoshi Matsuoka:
Hybrid BFS Approach Using Semi-external Memory. 1698-1707 - Jialin Liu, Surendra Byna
, Bin Dong, Kesheng Wu
, Yong Chen
:
Model-Driven Data Layout Selection for Improving Read Performance. 1708-1716
HPDIC Session 2: Clustering, Data Management, and Applications
- Stephane Martin, Tomasz Buchert, Pierric Willemet, Olivier Richard, Emmanuel Jeanvoine, Lucas Nussbaum:
Scalable and Reliable Data Broadcast with Kascade. 1717-1726 - Tugdual Sarazin, Hanane Azzag, Mustapha Lebbah:
SOM Clustering Using Spark-MapReduce. 1727-1734 - Liang Li
, Dixin Tang, Taoying Liu, Hong Liu, Wei Li, Chenzhou Cui
:
Optimizing the Join Operation on Hive to Accelerate Cross-Matching in Astronomy. 1735-1745
Workshop 20: JSSPP - Workshop on Job Scheduling Strategies for Parallel Processing
- Walfredo Cirne, Narayan Desai:
JSSPP Introduction and Committees. 1746
Workshop 21: CHIUW - Chapel Implementers and Users Workshop
- Brad Chamberlain
:
CHIUW Introduction and Committees. 1747-1749

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.