default search action
Mohamed Wahib
Person information
- affiliation: RIKEN Center for Computational Science, Kobe, Japan
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j16]Johann Rudi, Youngjun Lee, Aidan H. Chadha, Mohamed Wahib, Klaus Weide, Jared P. O'Neal, Anshu Dubey:
CG-Kit: Code Generation Toolkit for performant and maintainable variants of source code applied to Flash-X hydrodynamics simulations. Future Gener. Comput. Syst. 163: 107511 (2025) - 2024
- [j15]Yu Xue, Jiajie Zha, Mohamed Wahib, Tinghui Ouyang, Xiao Wang:
Neural architecture search via similarity adaptive guidance. Appl. Soft Comput. 162: 111821 (2024) - [j14]Enzhi Zhang, Mohamed Wahib, Rui Zhong, Masaharu Munetomo:
Learning from the Past Training Trajectories: Regularization by Validation. J. Adv. Comput. Intell. Intell. Informatics 28(1): 67-78 (2024) - [j13]Enzhi Zhang, Bochen Dong, Mohamed Wahib, Rui Zhong, Masaharu Munetomo:
Meta generative image and text data augmentation optimization. J. Supercomput. 80(9): 12644-12662 (2024) - [c55]Chen Zhuang, Peng Chen, Xin Liu, Toshio Endo, Satoshi Matsuoka, Mohamed Wahib:
Communication Optimization for Distributed GCN Training on ABCI Supercomputer. CLUSTER Workshops 2024: 160-161 - [c54]Lingqi Zhang, Ryan Barton, Peng Chen, Xiao Wang, Toshio Endo, Satoshi Matsuoka, Mohamed Wahib:
Investigating Nvidia GPU Architecture Trends via Microbenchmarks. CLUSTER Workshops 2024: 174-175 - [c53]Du Wu, Peng Chen, Yiyu Tan, Yusuke Tanimura, Toshio Endo, Satoshi Matsuoka, Mohamed Wahib:
Asynchronous I/O Optimization for X-Ray Imaging via GPUDirect Storage. CLUSTER Workshops 2024: 196-197 - [c52]Yutong Lu, Wuchun Feng, Mohamed Wahib:
Welcome Message from the IEEE Cluster 2024 Program Chairs. CLUSTER 2024: xiii - [c51]Pengcheng Jiang, Yu Xue, Ferrante Neri, Mohamed Wahib:
Surrogate-Assisted Evolutionary Neural Architecture Search with Isomorphic Training and Prediction. ICIC (2) 2024: 191-203 - [c50]Du Wu, Peng Chen, Xiao Wang, Isaac Lyngaas, Takaaki Miyajima, Toshio Endo, Satoshi Matsuoka, Mohamed Wahib:
Real-time High-resolution X-Ray Computed Tomography. ICS 2024: 110-123 - [c49]Liwen Jiang, Yu Xue, Ferrante Neri, Xiaoping Zhao, Mohamed Wahib:
Progressive Neural Predictor with Score-Based Sampling. IJCNN 2024: 1-8 - [c48]Enzhi Zhang, Rui Zhong, Masaharu Munetomo, Mohamed Wahib:
Validation Loss Landscape Exploration with Deep Q-Learning. IJCNN 2024: 1-9 - [c47]Du Wu, Jintao Meng, Wenxi Zhu, Minwen Deng, Xiao Wang, Tao Luo, Mohamed Wahib, Yanjie Wei:
autoGEMM: Pushing the Limits of Irregular Matrix Multiplication on Arm Architectures. SC 2024: 21 - [c46]Enzhi Zhang, Isaac Lyngaas, Peng Chen, Xiao Wang, Jun Igarashi, Yuankai Huo, Masaharu Munetomo, Mohamed Wahib:
Adaptive Patching for High-resolution Image Segmentation with Transformers. SC 2024: 76 - [i27]Johann Rudi, Youngjun Lee, Aidan H. Chadha, Mohamed Wahib, Klaus Weide, Jared P. O'Neal, Anshu Dubey:
CG-Kit: Code Generation Toolkit for Performant and Maintainable Variants of Source Code Applied to Flash-X Hydrodynamics Simulations. CoRR abs/2401.03378 (2024) - [i26]Enzhi Zhang, Isaac Lyngaas, Peng Chen, Xiao Wang, Jun Igarashi, Yuankai Huo, Mohamed Wahib, Masaharu Munetomo:
Adaptive Patching for High-resolution Image Segmentation with Transformers. CoRR abs/2404.09707 (2024) - [i25]Aristeidis Tsaris, Chengming Zhang, Xiao Wang, Junqi Yin, Siyan Liu, Moetasim Ashfaq, Ming Fan, Jong-Youl Choi, Mohamed Wahib, Dan Lu, Prasanna Balaprakash, Fei-Yue Wang:
Sequence Length Scaling in Vision Transformers for Scientific Images on Frontier. CoRR abs/2405.15780 (2024) - [i24]Yu Xue, Chenchen Zhu, MengChu Zhou, Mohamed Wahib, Moncef Gabbouj:
A Pairwise Comparison Relation-assisted Multi-objective Evolutionary Neural Architecture Search Method with Multi-population Mechanism. CoRR abs/2407.15600 (2024) - [i23]Emil Vatai, Aleksandr Drozd, Ivan R. Ivanov, Yinghao Ren, Mohamed Wahib:
Tadashi: Enabling AI-Based Automated Code Generation With Guaranteed Correctness. CoRR abs/2410.03210 (2024) - 2023
- [j12]Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Torsten Hoefler:
Myths and legends in high-performance computing. Int. J. High Perform. Comput. Appl. 37(3-4): 245-259 (2023) - [j11]Jens Domke, Emil Vatai, Balazs Gerofi, Yuetsu Kodama, Mohamed Wahib, Artur Podobas, Sparsh Mittal, Miquel Pericàs, Lingqi Zhang, Peng Chen, Aleksandr Drozd, Satoshi Matsuoka:
At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads. ACM Trans. Archit. Code Optim. 20(4): 57:1-57:26 (2023) - [j10]Huaipeng Zhang, Nhut-Minh Ho, Dogukan Yigit Polat, Peng Chen, Mohamed Wahib, Truong Thao Nguyen, Jintao Meng, Rick Siow Mong Goh, Satoshi Matsuoka, Tao Luo, Weng-Fai Wong:
Simeuro: A Hybrid CPU-GPU Parallel Simulator for Neuromorphic Computing Chips. IEEE Trans. Parallel Distributed Syst. 34(10): 2767-2782 (2023) - [c45]Lingqi Zhang, Mohamed Wahib, Peng Chen, Jintao Meng, Xiao Wang, Toshio Endo, Satoshi Matsuoka:
Exploiting Scratchpad Memory for Deep Temporal Blocking: A case study for 2D Jacobian 5-point iterative stencil kernel (j2d5pt). GPGPU@PPoPP 2023: 34-35 - [c44]Lingqi Zhang, Mohamed Wahib, Peng Chen, Jintao Meng, Xiao Wang, Toshio Endo, Satoshi Matsuoka:
PERKS: a Locality-Optimized Execution Model for Iterative Memory-bound GPU Applications. ICS 2023: 167-179 - [c43]Ismayil Ismayilov, Javid Baydamirli, Dogan Sagbili, Mohamed Wahib, Didem Unat:
Multi-GPU Communication Schemes for Iterative Solvers: When CPUs are Not in Charge. ICS 2023: 192-202 - [c42]Lingqi Zhang, Mohamed Wahib, Peng Chen, Jintao Meng, Xiao Wang, Toshio Endo, Satoshi Matsuoka:
Revisiting Temporal Blocking Stencil Optimizations. ICS 2023: 251-263 - [c41]Truong Thao Nguyen, Balazs Gerofi, Edgar Josafat Martinez-Noriega, François Trahay, Mohamed Wahib:
KAKURENBO: Adaptively Hiding Samples in Deep Neural Network Training. NeurIPS 2023 - [c40]Enzhi Zhang, Ruqin Wang, Mohamed Wahib, Rui Zhong, Masaharu Munetomo:
Training Knowledge Inheritance Through Deep Q-Net. SMC 2023: 899-904 - [i22]Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Torsten Hoefler:
Myths and Legends in High-Performance Computing. CoRR abs/2301.02432 (2023) - [i21]Lingqi Zhang, Mohamed Wahib, Peng Chen, Jintao Meng, Xiao Wang, Toshio Endo, Satoshi Matsuoka:
Revisiting Temporal Blocking Stencil Optimizations. CoRR abs/2305.07390 (2023) - [i20]Lingqi Zhang, Mohamed Wahib, Peng Chen, Jintao Meng, Xiao Wang, Toshio Endo, Satoshi Matsuoka:
Exploiting Scratchpad Memory for Deep Temporal Blocking: A case study for 2D Jacobian 5-point iterative stencil kernel (j2d5pt). CoRR abs/2306.03336 (2023) - [i19]Truong Thao Nguyen, Balazs Gerofi, Edgar Josafat Martinez-Noriega, François Trahay, Mohamed Wahib:
KAKURENBO: Adaptively Hiding Samples in Deep Neural Network Training. CoRR abs/2310.10102 (2023) - [i18]Xiao Wang, Isaac Lyngaas, Aristeidis Tsaris, Peng Chen, Sajal Dash, Mayanka Chandra Shekar, Tao Luo, Hong-Jun Yoon, Mohamed Wahib, John Gounley:
Ultra-Long Sequence Distributed Transformer. CoRR abs/2311.02382 (2023) - 2022
- [j9]Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Andrew A. Chien, Raymond Bair, Jeffrey S. Vetter, John Shalf:
Preparing for the Future - Rethinking Proxy Applications. Comput. Sci. Eng. 24(2): 85-90 (2022) - [j8]Anshu Dubey, Klaus Weide, Jared O'Neal, Akash Dhruv, Sean M. Couch, James Austin Harris, Tom Klosterman, Rajeev Jain, Johann Rudi, Bronson Messer, Michael Pajkos, Jared Carlson, Ran Chu, Mohamed Wahib, Saurabh Chawdhary, Paul M. Ricker, Dongwook Lee, Katie Antypas, Katherine M. Riley, Christopher S. Daley, Murali K. Ganapathy, Francis X. Timmes, Dean M. Townsley, Marcos Vanella, John Bachan, Paul M. Rich, Shravan Kumar, Eirik Endeve, W. Raphael Hix, Anthony Mezzacappa, Thomas Papatheodore:
Flash-X: A multiphysics simulation software instrument. SoftwareX 19: 101168 (2022) - [j7]Jintao Meng, Chen Zhuang, Peng Chen, Mohamed Wahib, Bertil Schmidt, Xiao Wang, Haidong Lan, Dou Wu, Minwen Deng, Yanjie Wei, Shengzhong Feng:
Automatic Generation of High-Performance Convolution Kernels on ARM CPUs for Deep Learning. IEEE Trans. Parallel Distributed Syst. 33(11): 2885-2899 (2022) - [c39]Truong Thao Nguyen, François Trahay, Jens Domke, Aleksandr Drozd, Emil Vatai, Jianwei Liao, Mohamed Wahib, Balazs Gerofi:
Why Globally Re-shuffle? Revisiting Data Shuffling in Large Scale Deep Learning. IPDPS 2022: 1085-1096 - [c38]Xiao Wang, Aristeidis Tsaris, Debangshu Mukherjee, Mohamed Wahib, Peng Chen, Mark Oxley, Olga Ovchinnikova, Jacob D. Hinkle:
Image Gradient Decomposition for Parallel and Memory-Efficient Ptychographic Reconstruction. SC 2022: 8:1-8:13 - [c37]Enzhi Zhang, Mohamed Wahib, Masaharu Munetomo:
Learning from the Past: Regularization by Validation. SCIS/ISIS 2022: 1-8 - [i17]Lingqi Zhang, Mohamed Wahib, Peng Chen, Jintao Meng, Xiao Wang, Satoshi Matsuoka:
Persistent Kernels for Iterative Memory-bound GPU Applications. CoRR abs/2204.02064 (2022) - [i16]Jens Domke, Emil Vatai, Balazs Gerofi, Yuetsu Kodama, Mohamed Wahib, Artur Podobas, Sparsh Mittal, Miquel Pericàs, Lingqi Zhang, Peng Chen, Aleksandr Drozd, Satoshi Matsuoka:
At the Locus of Performance: A Case Study in Enhancing CPUs with Copious 3D-Stacked Cache. CoRR abs/2204.02235 (2022) - [i15]Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Ray Bair, Andrew A. Chien, Jeffrey S. Vetter, John Shalf:
Preparing for the Future - Rethinking Proxy Apps. CoRR abs/2204.07336 (2022) - [i14]Xiao Wang, Aristeidis Tsaris, Debangshu Mukherjee, Mohamed Wahib, Peng Chen, Mark Oxley, Olga Ovchinnikova, Jacob D. Hinkle:
Image Gradient Decomposition for Parallel and Memory-Efficient Ptychographic Reconstruction. CoRR abs/2205.06327 (2022) - [i13]Anshu Dubey, Klaus Weide, Jared O'Neal, Akash Dhruv, Sean M. Couch, James Austin Harris, Tom Klosterman, Rajeev Jain, Johann Rudi, Bronson Messer, Michael Pajkos, Jared Carlson, Ran Chu, Mohamed Wahib, Saurabh Chawdhary, Paul M. Ricker, Dongwook Lee, Katie Antypas, Katherine M. Riley, Christopher S. Daley, Murali K. Ganapathy, Francis X. Timmes, Dean M. Townsley, Marcos Vanella, John Bachan, Paul Rich, Shravan Kumar, Eirik Endeve, W. Raphael Hix, Anthony Mezzacappa, Thomas Papatheodore:
Flash-X, a multiphysics simulation software instrument. CoRR abs/2208.11630 (2022) - 2021
- [j6]Truong Thao Nguyen, Mohamed Wahib, Ryousei Takano:
Efficient MPI-AllReduce for large-scale deep learning on GPU-clusters. Concurr. Comput. Pract. Exp. 33(12) (2021) - [j5]Anshu Dubey, Martin Berzins, Carsten Burstedde, Michael L. Norman, Didem Unat, Mohamed Wahib:
Structured Adaptive Mesh Refinement Adaptations to Retain Performance Portability With Increasing Heterogeneity. Comput. Sci. Eng. 23(5): 62-66 (2021) - [j4]Fareed Qararyah, Mohamed Wahib, Doga Dikbayir, Mehmet Esat Belviranli, Didem Unat:
A computational-graph partitioning method for training memory-constrained DNNs. Parallel Comput. 104-105: 102792 (2021) - [j3]Martin Schlueter, Mehdi Neshat, Mohamed Wahib, Masaharu Munetomo, Markus Wagner:
GTOPX space mission benchmarks. SoftwareX 14: 100666 (2021) - [c36]Truong Thao Nguyen, Mohamed Wahib:
An Allreduce Algorithm and Network Co-design for Large-Scale Training of Distributed Deep Learning. CCGRID 2021: 396-405 - [c35]Jared O'Neal, Mohamed Wahib, Anshu Dubey, Klaus Weide, Tom Klosterman, Johann Rudi:
Domain-Specific Runtime to Orchestrate Computation on Heterogeneous Platforms. Euro-Par Workshops 2021: 154-165 - [c34]Albert Njoroge Kahira, Truong Thao Nguyen, Leonardo Bautista-Gomez, Ryousei Takano, Rosa M. Badia, Mohamed Wahib:
An Oracle for Guiding Large-Scale Model/Hybrid Parallel Training of Convolutional Neural Networks. HPDC 2021: 161-173 - [c33]Jun Li, Minjun Li, Zhigang Cai, François Trahay, Mohamed Wahib, Balazs Gerofi, Zhiming Liu, Min Huang, Jianwei Liao:
Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs. ICPP 2021: 46:1-46:10 - [c32]Peng Chen, Mohamed Wahib, Xiao Wang, Shin'ichiro Takizawa, Takahiro Hirofuchi, Hirotaka Ogawa, Satoshi Matsuoka:
Performance portable back-projection algorithms on CPUs: agnostic data locality and vectorization optimizations. ICS 2021: 316-328 - [c31]Jens Domke, Emil Vatai, Aleksandr Drozd, Peng Chen, Yosuke Oyama, Lingqi Zhang, Shweta Salaria, Daichi Mukunoki, Artur Podobas, Mohamed Wahib, Satoshi Matsuoka:
Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws? IPDPS 2021: 1056-1065 - [c30]Steven Farrell, Murali Emani, Jacob Balma, Lukas Drescher, Aleksandr Drozd, Andreas Fink, Geoffrey C. Fox, David Kanter, Thorsten Kurth, Peter Mattson, Dawei Mu, Amit Ruhela, Kento Sato, Koichi Shirahata, Tsuguchika Tabaru, Aristeidis Tsaris, Jan Balewski, Ben Cumming, Takumi Danjo, Jens Domke, Takaaki Fukai, Naoto Fukumoto, Tatsuya Fukushi, Balazs Gerofi, Takumi Honda, Toshiyuki Imamura, Akihiko Kasagi, Kentaro Kawakami, Shuhei Kudo, Akiyoshi Kuroda, Maxime Martinasso, Satoshi Matsuoka, Henrique Mendonça, Kazuki Minami, Prabhat Ram, Takashi Sawada, Mallikarjun Shankar, Tom St. John, Akihiro Tabuchi, Venkatram Vishwanath, Mohamed Wahib, Masafumi Yamazaki, Junqi Yin:
MLPerf™ HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems. MLHPC@SC 2021: 33-45 - [c29]Peng Chen, Mohamed Wahib, Xiao Wang, Takahiro Hirofuchi, Hirotaka Ogawa, Ander Biguri, Richard P. Boardman, Thomas Blumensath, Satoshi Matsuoka:
Scalable FBP decomposition for cone-beam CT reconstruction. SC 2021: 9 - [i12]Albert Njoroge Kahira, Truong Thao Nguyen, Leonardo Bautista-Gomez, Ryousei Takano, Rosa M. Badia, Mohamed Wahib:
An Oracle for Guiding Large-Scale Model/Hybrid Parallel Training of Convolutional Neural Networks. CoRR abs/2104.09075 (2021) - [i11]Peng Chen, Mohamed Wahib, Xiao Wang, Shin'ichiro Takizawa, Takahiro Hirofuchi, Hirotaka Ogawa, Satoshi Matsuoka:
Performance Portable Back-projection Algorithms on CPUs: Agnostic Data Locality and Vectorization Optimizations. CoRR abs/2104.13248 (2021) - [i10]Steven Farrell, Murali Emani, Jacob Balma, Lukas Drescher, Aleksandr Drozd, Andreas Fink, Geoffrey C. Fox, David Kanter, Thorsten Kurth, Peter Mattson, Dawei Mu, Amit Ruhela, Kento Sato, Koichi Shirahata, Tsuguchika Tabaru, Aristeidis Tsaris, Jan Balewski, Ben Cumming, Takumi Danjo, Jens Domke, Takaaki Fukai, Naoto Fukumoto, Tatsuya Fukushi, Balazs Gerofi, Takumi Honda, Toshiyuki Imamura, Akihiko Kasagi, Kentaro Kawakami, Shuhei Kudo, Akiyoshi Kuroda, Maxime Martinasso, Satoshi Matsuoka, Henrique Mendonça, Kazuki Minami, Prabhat Ram, Takashi Sawada, Mallikarjun Shankar, Tom St. John, Akihiro Tabuchi, Venkatram Vishwanath, Mohamed Wahib, Masafumi Yamazaki, Junqi Yin:
MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems. CoRR abs/2110.11466 (2021) - 2020
- [c28]Kazuaki Matsumura, Hamid Reza Zohouri, Mohamed Wahib, Toshio Endo, Satoshi Matsuoka:
AN5D: automated stencil framework for high-degree temporal blocking on GPUs. CGO 2020: 199-211 - [c27]Lingqi Zhang, Mohamed Wahib, Haoyu Zhang, Satoshi Matsuoka:
A Study of Single and Multi-device Synchronization Methods in Nvidia GPUs. IPDPS 2020: 483-493 - [c26]Mohamed Wahib, Haoyu Zhang, Truong Thao Nguyen, Aleksandr Drozd, Jens Domke, Lingqi Zhang, Ryousei Takano, Satoshi Matsuoka:
Scaling distributed deep learning workloads beyond the memory capacity with KARMA. SC 2020: 19 - [p3]Julian M. Kunkel, Nabeeh Jumah, Anastasiia Novikova, Thomas Ludwig, Hisashi Yashiro, Naoya Maruyama, Mohamed Wahib, John Thuburn:
AIMES: Advanced Computation and I/O Methods for Earth-System Simulations. Software for Exascale Computing 2020: 61-102 - [i9]Kazuaki Matsumura, Hamid Reza Zohouri, Mohamed Wahib, Toshio Endo, Satoshi Matsuoka:
AN5D: Automated Stencil Framework for High-Degree Temporal Blocking on GPUs. CoRR abs/2001.01473 (2020) - [i8]Lingqi Zhang, Mohamed Wahib, Haoyu Zhang, Satoshi Matsuoka:
A Study of Single and Multi-device Synchronization Methods in Nvidia GPUs. CoRR abs/2004.05371 (2020) - [i7]Fareed Qararyah, Mohamed Wahib, Doga Dikbayir, Mehmet Esat Belviranli, Didem Unat:
A Computational-Graph Partitioning Method for Training Memory-Constrained DNNs. CoRR abs/2008.08636 (2020) - [i6]Mohamed Wahib, Haoyu Zhang, Truong Thao Nguyen, Aleksandr Drozd, Jens Domke, Lingqi Zhang, Ryousei Takano, Satoshi Matsuoka:
Scaling Distributed Deep Learning Workloads beyond the Memory Capacity with KARMA. CoRR abs/2008.11421 (2020) - [i5]Martin Schlueter, Mehdi Neshat, Mohamed Wahib, Masaharu Munetomo, Markus Wagner:
GTOPX Space Mission Benchmarks. CoRR abs/2010.07517 (2020) - [i4]Jens Domke, Emil Vatai, Aleksandr Drozd, Peng Chen, Yosuke Oyama, Lingqi Zhang, Shweta Salaria, Daichi Mukunoki, Artur Podobas, Mohamed Wahib, Satoshi Matsuoka:
Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws? CoRR abs/2010.14373 (2020)
2010 – 2019
- 2019
- [c25]Truong Thao Nguyen, Mohamed Wahib, Ryousei Takano:
Topology-aware Sparse Allreduce for Large-scale Deep Learning. IPCCC 2019: 1-8 - [c24]Jens Domke, Kazuaki Matsumura, Mohamed Wahib, Haoyu Zhang, Keita Yashima, Toshiki Tsuchikawa, Yohei Tsuji, Artur Podobas, Satoshi Matsuoka:
Double-Precision FPUs in High-Performance Computing: An Embarrassment of Riches? IPDPS 2019: 78-88 - [c23]Peng Chen, Mohamed Wahib, Shin'ichiro Takizawa, Ryousei Takano, Satoshi Matsuoka:
A versatile software systolic execution model for GPU memory-bound kernels. SC 2019: 53:1-53:81 - [c22]Peng Chen, Mohamed Wahib, Shin'ichiro Takizawa, Ryousei Takano, Satoshi Matsuoka:
iFDK: a scalable framework for instant high-resolution image reconstruction. SC 2019: 84:1-84:24 - [i3]Peng Chen, Mohamed Wahib, Shin'ichiro Takizawa, Ryousei Takano, Satoshi Matsuoka:
A Versatile Software Systolic Execution Model for GPU Memory-Bound Kernels. CoRR abs/1907.06154 (2019) - [i2]Peng Chen, Mohamed Wahib, Shin'ichiro Takizawa, Ryousei Takano, Satoshi Matsuoka:
iFDK: A Scalable Framework for Instant High-resolution Image Reconstruction. CoRR abs/1909.02724 (2019) - 2018
- [c21]Peng Chen, Mohamed Wahib, Shin'ichiro Takizawa, Ryousei Takano, Satoshi Matsuoka:
Efficient Algorithms for the Summed Area Tables Primitive on GPUs. CLUSTER 2018: 482-493 - [c20]Truong Thao Nguyen, Mohamed Wahib, Ryousei Takano:
Hierarchical Distributed-Memory Multi-Leader MPI-Allreduce for Deep Learning Workloads. CANDAR Workshops 2018: 216-222 - [i1]Jens Domke, Kazuaki Matsumura, Mohamed Wahib, Haoyu Zhang, Keita Yashima, Toshiki Tsuchikawa, Yohei Tsuji, Artur Podobas, Satoshi Matsuoka:
Double-precision FPUs in High-Performance Computing: an Embarrassment of Riches? CoRR abs/1810.09330 (2018) - 2017
- [c19]Martin Schlueter, Mohamed Wahib, Masaharu Munetomo:
Numerical Optimization of ESA's Messenger Space Mission Benchmark. EvoApplications (1) 2017: 725-737 - 2016
- [c18]Mohamed Wahib, Naoya Maruyama, Takayuki Aoki:
Daino: a high-level framework for parallel and efficient AMR on GPUs. SC 2016: 621-632 - [e1]Naoya Maruyama, Bronis R. de Supinski, Mohamed Wahib:
OpenMP: Memory, Devices, and Tasks - 12th International Workshop on OpenMP, IWOMP 2016, Nara, Japan, October 5-7, 2016, Proceedings. Lecture Notes in Computer Science 9903, 2016, ISBN 978-3-319-45549-5 [contents] - 2015
- [c17]Mohamed Wahib, Naoya Maruyama:
Automated GPU Kernel Transformations in Large-Scale Production Stencil Applications. HPDC 2015: 259-270 - [c16]Mohamed Wahib, Naoya Maruyama:
Data-centric GPU-based adaptive mesh refinement. IA3@SC 2015: 3:1-3:7 - 2014
- [c15]Mohamed Wahib, Naoya Maruyama:
Scalable Kernel Fusion for Memory-Bound GPU Applications. SC 2014: 191-202 - 2013
- [c14]Mohamed Wahib, Naoya Maruyama:
Highly optimized full GPU-acceleration of non-hydrostatic weather model SCALE-LES. CLUSTER 2013: 1-8 - [p2]Asim Munawar, Mohamed Wahib, Masaharu Munetomo, Kiyoshi Akama:
arGA: Adaptive Resolution Micro-genetic Algorithm with Tabu Search to Solve MINLP Problems Using GPU. Massively Parallel Evolutionary Computation on GPGPUs 2013: 83-104 - 2011
- [c13]Asim Munawar, Mohamed Wahib, Masaharu Munetomo, Kiyoshi Akama:
Advanced genetic algorithm to solve MINLP problems over GPU. IEEE Congress on Evolutionary Computation 2011: 318-325 - [c12]Mohamed Wahib, Asim Munawar, Masaharu Munetomo, Kiyoshi Akama:
Optimization of parallel Genetic Algorithms for nVidia GPUs. IEEE Congress on Evolutionary Computation 2011: 803-811 - [c11]Asim Munawar, Mohamed Wahib, Masaharu Munetomo, Kiyoshi Akama:
Solving Extremely Difficult MINLP Problems Using Adaptive Resolution Micro-GA with Tabu Search. LION 2011: 203-217 - [c10]Mohamed Wahib, Asim Munawar, Masaharu Munetomo, Kiyoshi Akama:
A Framework for Cloud Embedded Web Services Utilized by Cloud Applications. SERVICES 2011: 265-271 - 2010
- [j2]Asim Munawar, Mohamed Wahib, Masaharu Munetomo, Kiyoshi Akama:
The design, usage, and performance of GridUFO: A Grid based Unified Framework for Optimization. Future Gener. Comput. Syst. 26(4): 633-644 (2010) - [c9]Mohamed Wahib, Asim Munawar, Masaharu Munetomo, Kiyoshi Akama:
A Bayesian Optimization Algorithm for De Novo ligand design based docking running over GPU. IEEE Congress on Evolutionary Computation 2010: 1-8 - [c8]Mohamed Wahib, Asim Munawar, Masaharu Munetomo, Kiyoshi Akama:
A Light Framework for the Unified Representation and Execution of Variant Tasks in a Grid Based Environment. PDPTA 2010: 658-664
2000 – 2009
- 2009
- [j1]Asim Munawar, Mohamed Wahib, Masaharu Munetomo, Kiyoshi Akama:
Hybrid of genetic algorithm and local search to solve MAX-SAT problem using nVidia CUDA framework. Genet. Program. Evolvable Mach. 10(4): 391-415 (2009) - [c7]Asim Munawar, Mohamed Wahib, Masaharu Munetomo, Kiyoshi Akama:
Theoretical and Empirical Analysis of a GPU Based Parallel Bayesian Optimization Algorithm. PDCAT 2009: 457-462 - 2008
- [c6]Mohamed Wahib, Asim Munawar, Masaharu Munetomo, Kiyoshi Akama:
A General Service-Oriented Grid Computing Framework for Global Optimization Problem Solving. IEEE SCC (2) 2008: 563-566 - [c5]Mohamed Wahib, Asim Munawar, Masaharu Munetomo, Kiyoshi Akama:
Model for dynamic grain sizing through compound parallelization for an optimization problem solving grid application. GRID 2008: 316-321 - [c4]Mohamed Wahib, Asim Munawar, Masaharu Munetomo, Kiyoshi Akama:
SOAG: Service Oriented Architectured Grids and adoption of application specific QoS attributes. GRID 2008: 346-351 - [c3]Asim Munawar, Mohamed Wahib, Masaharu Munetomo, Kiyoshi Akama:
Solving Large Instances of Capacitated Vehicle Routing Problem over Cell BE. HPCC 2008: 131-138 - [c2]Asim Munawar, Mohamed Wahib, Masaharu Munetomo, Kiyoshi Akama:
A Survey: Genetic Algorithms and the Fast Evolving World of Parallel Computing. HPCC 2008: 897-902 - [p1]Asim Munawar, Mohamed Wahib, Masaharu Munetomo, Kiyoshi Akama:
Parallel GEAs with Linkage Analysis over Grid. Linkage in Evolutionary Computation 2008: 159-187 - 2007
- [c1]Mohamed Wahib, Asim Munawar, Masaharu Munetomo, Kiyoshi Akama:
MHGrid: Towards an Ideal Optimization Environment for Global Optimization Problems Using Grid Computing. PDCAT 2007: 167-168
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-22 19:00 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint