default search action
Jidong Zhai
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j53]Jianbin Fang, Jidong Zhai, Zheng Wang:
Editorial for the special issue on programming models and system software for High-Performance Computing (HPC) environments. CCF Trans. High Perform. Comput. 6(3): 241-242 (2024) - [j52]Kezhao Huang, Haitian Jiang, Minjie Wang, Guangxuan Xiao, David Wipf, Xiang Song, Quan Gan, Zengfeng Huang, Jidong Zhai, Zheng Zhang:
FreshGNN: Reducing Memory Access via Stable Historical Embeddings for Graph Neural Network Training. Proc. VLDB Endow. 17(6): 1473-1486 (2024) - [j51]Weitao Wan, Feng Zhang, Chenyang Zhang, Mingde Zhang, Jidong Zhai, Yunpeng Chai, Huanchen Zhang, Wei Lu, Yuxing Chen, Haixiang Li, Anqun Pan, Xiaoyong Du:
Compressed Data Direct Computing for Databases. IEEE Trans. Knowl. Data Eng. 36(5): 1902-1918 (2024) - [j50]Jiesong Liu, Feng Zhang, Lv Lu, Chang Qi, Xiaoguang Guo, Dong Deng, Guoliang Li, Huanchen Zhang, Jidong Zhai, Hechen Zhang, Yuxing Chen, Anqun Pan, Xiaoyong Du:
G-Learned Index: Enabling Efficient Learned Index on GPU. IEEE Trans. Parallel Distributed Syst. 35(6): 795-812 (2024) - [j49]Yuyang Jin, Haojie Wang, Runxin Zhong, Chen Zhang, Xia Liao, Feng Zhang, Jidong Zhai:
Graph-Centric Performance Analysis for Large-Scale Parallel Applications. IEEE Trans. Parallel Distributed Syst. 35(7): 1221-1238 (2024) - [j48]Yuyang Jin, Runxin Zhong, Saiqin Long, Jidong Zhai:
Efficient Inference for Pruned CNN Models on Mobile Devices With Holistic Sparsity Alignment. IEEE Trans. Parallel Distributed Syst. 35(11): 2208-2223 (2024) - [j47]Liang Wang, Jinzhe Yang, Jidong Zhai, Guangwen Yang:
Optimizing I/O Performance Through Effective vCPU Scheduling Interference Management. IEEE Trans. Parallel Distributed Syst. 35(12): 2315-2330 (2024) - [j46]Zhenhua Guo, Yinan Tang, Jidong Zhai, Tongtong Yuan, Jian Jin, Li Wang, Yaqian Zhao, Rengang Li:
A Survey on Performance Modeling and Prediction for Distributed DNN Training. IEEE Trans. Parallel Distributed Syst. 35(12): 2463-2478 (2024) - [c75]Muyan Hu, Ashwin Venkatram, Shreyashri Biswas, Balamurugan Marimuthu, Bohan Hou, Gabriele Oliaro, Haojie Wang, Liyan Zheng, Xupeng Miao, Jidong Zhai, Zhihao Jia:
Optimal Kernel Orchestration for Tensor Programs with Korch. ASPLOS (3) 2024: 755-769 - [c74]Kezhao Huang, Jidong Zhai, Liyan Zheng, Haojie Wang, Yuyang Jin, Qihao Zhang, Runqing Zhang, Zhen Zheng, Youngmin Yi, Xipeng Shen:
WiseGraph: Optimizing GNN with Joint Workload Partition of Graph and Operations. EuroSys 2024: 1-17 - [c73]Yanliang Zhou, Feng Zhang, Tuo Lin, Yuanjie Huang, Saiqin Long, Jidong Zhai, Xiaoyong Du:
F-TADOC: FPGA-Based Text Analytics Directly on Compression with HLS. ICDE 2024: 3739-3752 - [c72]Jiaao He, Shengqi Chen, Jidong Zhai:
POSTER: Pattern-Aware Sparse Communication for Scalable Recommendation Model Training. PPoPP 2024: 466-468 - [c71]Yidong Chen, Chen Zhang, Rongchao Dong, Haoyuan Zhang, Yonghua Zhang, Zhonghua Lu, Jidong Zhai:
MixQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction. SC 2024: 74 - [c70]Kinman Lei, Yuyang Jin, Mingshu Zhai, Kezhao Huang, Haoxing Ye, Jidong Zhai:
PUZZLE: Efficiently Aligning Large Language Models through Light-Weight Context Switch. USENIX ATC 2024: 127-140 - [c69]Chen Zhang, Rongchao Dong, Haojie Wang, Runxin Zhong, Jike Chen, Jidong Zhai:
MAGPY: Compiling Eager Mode DNN Programs by Monitoring Execution States. USENIX ATC 2024: 683-698 - [i15]Jiaao He, Jidong Zhai:
FastDecode: High-Throughput GPU-Efficient LLM Serving using Heterogeneous Pipelines. CoRR abs/2403.11421 (2024) - [i14]Muyan Hu, Ashwin Venkatram, Shreyashri Biswas, Balamurugan Marimuthu, Bohan Hou, Gabriele Oliaro, Haojie Wang, Liyan Zheng, Xupeng Miao, Jidong Zhai:
Optimal Kernel Orchestration for Tensor Programs with Korch. CoRR abs/2406.09465 (2024) - 2023
- [j45]Zixuan Ma, Yuyang Jin, Shizhi Tang, Haojie Wang, Wei-Cheng Xue, Jidong Zhai, Wei-Min Zheng:
Unified Programming Models for Heterogeneous High-Performance Computers. J. Comput. Sci. Technol. 38(1): 211-218 (2023) - [j44]Zheng Chen, Feng Zhang, Jiawei Guan, Jidong Zhai, Xipeng Shen, Huanchen Zhang, Wentong Shu, Xiaoyong Du:
CompressGraph: Efficient Parallel Graph Analytics with Rule-Based Compression. Proc. ACM Manag. Data 1(1): 4:1-4:31 (2023) - [j43]Zhen Zheng, Zaifeng Pan, Dalin Wang, Kai Zhu, Wenyi Zhao, Tianyou Guo, Xiafei Qiu, Minmin Sun, Junjie Bai, Feng Zhang, Xiaoyong Du, Jidong Zhai, Wei Lin:
BladeDISC: Optimizing Dynamic Shape Machine Learning Workloads via Compiler Approach. Proc. ACM Manag. Data 1(3): 206:1-206:29 (2023) - [j42]Sunita Chandrasekaran, Min Si, Jidong Zhai, Lena Oden:
Special issue on new trends in high-performance computing: Software systems and applications. Softw. Pract. Exp. 53(1): 3-5 (2023) - [j41]Haojie Wang, Jidong Zhai, Mingyu Gao, Feng Zhang, Tuowei Wang, Zixuan Ma, Shizhi Tang, Liyan Zheng, Wen Wang, Kaiyuan Rong, Yuanyong Chen, Zhihao Jia:
Optimizing DNNs With Partially Equivalent Transformations and Automated Corrections. IEEE Trans. Computers 72(12): 3546-3560 (2023) - [j40]Juncheng Cao, Kaiyuan Rong, Mingshu Zhai, Zeyu Song, Yanyu Ren, Yuxi Zhu, Wentao Han, Jidong Zhai:
Critique of "A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery" by SCC Team From Tsinghua University. IEEE Trans. Parallel Distributed Syst. 34(6): 1723-1726 (2023) - [j39]Yihua Hu, Feng Zhang, Yifei Xia, Zhiming Yao, Letian Zeng, Haipeng Ding, Zhewei Wei, Xiao Zhang, Jidong Zhai, Xiaoyong Du, Siqi Ma:
Enabling Efficient Random Access to Hierarchically Compressed Text Data on Diverse GPU Platforms. IEEE Trans. Parallel Distributed Syst. 34(10): 2699-2717 (2023) - [c68]Lunyiu Nie, Jiuding Sun, Yanlin Wang, Lun Du, Shi Han, Dongmei Zhang, Lei Hou, Juanzi Li, Jidong Zhai:
Unveiling the Black Box of PLMs with Semantic Anchors: Towards Interpretable Neural Semantic Parsing. AAAI 2023: 13400-13408 - [c67]Qianjin Du, Shiji Zhou, Xiaohui Kuang, Gang Zhao, Jidong Zhai:
Joint Geometrical and Statistical Domain Adaptation for Cross-domain Code Vulnerability Detection. EMNLP 2023: 12791-12800 - [c66]Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Zhiyuan Liu, Peng Zhang, Yuxiao Dong, Jie Tang:
GLM-130B: An Open Bilingual Pre-trained Model. ICLR 2023 - [c65]Chen Zhang, Lingxiao Ma, Jilong Xue, Yining Shi, Ziming Miao, Fan Yang, Jidong Zhai, Zhi Yang, Mao Yang:
Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning. OSDI 2023: 681-699 - [c64]Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei Wang, Shuhong Huang, Xupeng Miao, Shizhi Tang, Kezhao Huang, Zhihao Jia:
EINNET: Optimizing Tensor Programs with Derivation-Based Transformations. OSDI 2023: 739-755 - [c63]Tianhui Shi, Jidong Zhai, Haojie Wang, Qiqian Chen, Mingshu Zhai, Zixu Hao, Haoyu Yang, Wenguang Chen:
GraphSet: High Performance Graph Mining through Equivalent Set Transformations. SC 2023: 32:1-32:14 - [c62]Mingshu Zhai, Jiaao He, Zixuan Ma, Zan Zong, Runqing Zhang, Jidong Zhai:
SmartMoE: Efficiently Training Sparsely-Activated Models through Combining Offline and Online Parallelization. USENIX ATC 2023: 961-975 - [i13]Kezhao Huang, Haitian Jiang, Minjie Wang, Guangxuan Xiao, David Wipf, Xiang Song, Quan Gan, Zengfeng Huang, Jidong Zhai, Zheng Zhang:
ReFresh: Reducing Memory Access from Exploiting Stable Historical Embeddings for Graph Neural Network Training. CoRR abs/2301.07482 (2023) - [i12]Zixuan Ma, Haojie Wang, Jingze Xing, Liyan Zheng, Chen Zhang, Huanqi Cao, Kezhao Huang, Shizhi Tang, Penghan Wang, Jidong Zhai:
PowerFusion: A Tensor Compiler with Explicit Data Movement Description and Instruction-level Graph IR. CoRR abs/2307.04995 (2023) - 2022
- [j38]Wei Liu, Jiangming Jin, Hao Wu, Yifan Gong, Ziyue Jiang, Jidong Zhai:
Zoro: A robotic middleware combining high performance and high reliability. J. Parallel Distributed Comput. 166: 126-138 (2022) - [j37]Feng Zhang, Yani Liu, Ningxuan Feng, Cheng Yang, Jidong Zhai, Shuhao Zhang, Bingsheng He, Jiazao Lin, Xiao Zhang, Xiaoyong Du:
Periodic Weather-Aware LSTM With Event Mechanism for Parking Behavior Prediction. IEEE Trans. Knowl. Data Eng. 34(12): 5896-5909 (2022) - [j36]Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression. IEEE Trans. Parallel Distributed Syst. 33(2): 459-475 (2022) - [j35]Zaifeng Pan, Feng Zhang, Yanliang Zhou, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
Exploring Data Analytics Without Decompression on Embedded GPU Systems. IEEE Trans. Parallel Distributed Syst. 33(7): 1553-1568 (2022) - [j34]Runxin Zhong, Jiajie Chen, Chen Zhang, Mingshu Zhai, Zeyu Song, Yutian Wang, Wentao Han, Lin Gan, Jidong Zhai:
Critique of "MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization" by SCC Team From Tsinghua University. IEEE Trans. Parallel Distributed Syst. 33(9): 2050-2053 (2022) - [j33]Jiesong Liu, Feng Zhang, Hourun Li, Dalin Wang, Weitao Wan, Xiaokun Fang, Jidong Zhai, Xiaoyong Du:
Exploring Query Processing on CPU-GPU Integrated Edge Device. IEEE Trans. Parallel Distributed Syst. 33(10): 4057-4070 (2022) - [j32]Jidong Zhai, Liyan Zheng, Feng Zhang, Xiongchao Tang, Haojie Wang, Teng Yu, Yuyang Jin, Shuaiwen Leon Song, Wenguang Chen:
Detecting Performance Variance for Parallel Applications Without Source Code. IEEE Trans. Parallel Distributed Syst. 33(10): 4239-4255 (2022) - [j31]Jidong Zhai, Min Si, Antonio J. Peña:
Guest Editorial. IEEE Trans. Parallel Distributed Syst. 33(11): 2644-2647 (2022) - [j30]Jidong Zhai, Liyan Zheng, Jinghan Sun, Feng Zhang, Xiongchao Tang, Xuehai Qian, Bingsheng He, Wei Xue, Wenguang Chen, Weimin Zheng:
Leveraging Code Snippets to Detect Variations in the Performance of HPC Systems. IEEE Trans. Parallel Distributed Syst. 33(12): 3558-3574 (2022) - [j29]Qingyu Xu, Feng Zhang, Mingde Zhang, Jidong Zhai, Bingsheng He, Cheng Yang, Shuhao Zhang, Jiazao Lin, Haidi Liu, Xiaoyong Du:
Payment behavior prediction on shared parking lots with TR-GCN. VLDB J. 31(5): 1035-1058 (2022) - [c61]Zhen Zheng, Xuanda Yang, Pengzhan Zhao, Guoping Long, Kai Zhu, Feiwen Zhu, Wenyi Zhao, Xiaoyong Liu, Jun Yang, Jidong Zhai, Shuaiwen Leon Song, Wei Lin:
AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures. ASPLOS 2022: 359-373 - [c60]Lei Xie, Jidong Zhai, Zhenxing Zhang, Jonathan Allcock, Shengyu Zhang, Yicong Zheng:
Suppressing ZZ crosstalk of Quantum computers through pulse and scheduling co-optimization. ASPLOS 2022: 499-513 - [c59]Lunyiu Nie, Shulin Cao, Jiaxin Shi, Jiuding Sun, Qi Tian, Lei Hou, Juanzi Li, Jidong Zhai:
GraphQ IR: Unifying the Semantic Parsing of Graph Query Languages with One Intermediate Representation. EMNLP 2022: 5848-5865 - [c58]Yunquan Zhang, Jidong Zhai, Rajiv Ranjan:
Message from the High Performance Computing and Communications 2022 Program Chairs. HPCC/DSS/SmartCity/DependSys 2022: lv - [c57]Zixuan Ma, Haojie Wang, Guanyu Feng, Chen Zhang, Lei Xie, Jiaao He, Shengqi Chen, Jidong Zhai:
Efficiently emulating high-bitwidth computation with low-bitwidth hardware. ICS 2022: 5:1-5:12 - [c56]Shizhi Tang, Jidong Zhai, Haojie Wang, Lin Jiang, Liyan Zheng, Zhenhao Yuan, Chen Zhang:
FreeTensor: a free-form DSL with holistic optimizations for irregular tensor programs. PLDI 2022: 872-887 - [c55]Jiaao He, Jidong Zhai, Tiago Antunes, Haojie Wang, Fuwen Luo, Shangfeng Shi, Qin Li:
FasterMoE: modeling and optimizing training of large-scale dynamic pre-trained models. PPoPP 2022: 120-134 - [c54]Liyan Zheng, Jidong Zhai, Xiongchao Tang, Haojie Wang, Teng Yu, Yuyang Jin, Shuaiwen Leon Song, Wenguang Chen:
Vapro: performance variance detection and diagnosis for production-run parallel applications. PPoPP 2022: 150-162 - [c53]Yuyang Jin, Haojie Wang, Runxin Zhong, Chen Zhang, Jidong Zhai:
PerFlow: a domain specific framework for automatic performance analysis of parallel applications. PPoPP 2022: 177-191 - [c52]Zixuan Ma, Jiaao He, Jiezhong Qiu, Huanqi Cao, Yuanwei Wang, Zhenbo Sun, Liyan Zheng, Haojie Wang, Shizhi Tang, Tianyu Zheng, Junyang Lin, Guanyu Feng, Zeqiang Huang, Jie Gao, Aohan Zeng, Jianwei Zhang, Runxin Zhong, Tianhui Shi, Sha Liu, Weimin Zheng, Jie Tang, Hongxia Yang, Xin Liu, Jidong Zhai, Wenguang Chen:
BaGuaLu: targeting brain scale pretrained models with over 37 million cores. PPoPP 2022: 192-204 - [c51]Chen Zhang, Haojie Wang, Zixuan Ma, Lei Xie, Zeyu Song, Jidong Zhai:
UniQ: A Unified Programming Model for Efficient Quantum Circuit Simulation. SC 2022: 49:1-49:16 - [c50]Feng Zhang, Weitao Wan, Chenyang Zhang, Jidong Zhai, Yunpeng Chai, Haixiang Li, Xiaoyong Du:
CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases. SIGMOD Conference 2022: 1655-1669 - [i11]Lunyiu Nie, Shulin Cao, Jiaxin Shi, Qi Tian, Lei Hou, Juanzi Li, Jidong Zhai:
GraphQ IR: Unifying Semantic Parsing of Graph Query Language with Intermediate Representation. CoRR abs/2205.12078 (2022) - [i10]Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei Wang, Shizhi Tang, Lei Xie, Kezhao Huang, Zhihao Jia:
OLLIE: Derivation-based Tensor Program Optimizer. CoRR abs/2208.02025 (2022) - [i9]Lunyiu Nie, Jiuding Sun, Yanlin Wang, Lun Du, Shi Han, Dongmei Zhang, Lei Hou, Juanzi Li, Jidong Zhai:
Guiding the PLMs with Semantic Anchors as Intermediate Supervision: Towards Interpretable Semantic Parsing. CoRR abs/2210.01425 (2022) - [i8]Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang:
GLM-130B: An Open Bilingual Pre-trained Model. CoRR abs/2210.02414 (2022) - 2021
- [j28]Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen:
AIPerf: Automated machine learning as an AI-HPC benchmark. Big Data Min. Anal. 4(3): 208-220 (2021) - [j27]Xian-He Sun, Dong Li, Wen-Guang Chen, Tao Li, Jiwu Shu, Bo Wu, Jin Xiong, Jinging Xue, Feng Zhang, Jidong Zhai, Zhiia Zhao:
Preface. J. Comput. Sci. Technol. 36(1): 1-3 (2021) - [j26]Xiongchao Tang, Chen Zhang, Jidong Zhai, Xuehai Qian, Wenguang Chen, Yong Jiang:
A Fast Lock for Explicit Message Passing Architectures. IEEE Trans. Computers 70(10): 1555-1568 (2021) - [j25]Feng Zhang, Jidong Zhai, Bo Wu, Bingsheng He, Wenguang Chen, Xiaoyong Du:
Automatic Irregularity-Aware Fine-Grained Workload Partitioning on Integrated Architectures. IEEE Trans. Knowl. Data Eng. 33(3): 867-881 (2021) - [j24]Teng Yu, Runxin Zhong, Vladimir Janjic, Pavlos Petoumenos, Jidong Zhai, Hugh Leather, John Thomson:
Collaborative Heterogeneity-Aware OS Scheduler for Asymmetric Multicore Processors. IEEE Trans. Parallel Distributed Syst. 32(5): 1224-1237 (2021) - [j23]Pavan Balaji, Jidong Zhai, Min Si:
Guest Editorial. IEEE Trans. Parallel Distributed Syst. 32(7): 1511-1512 (2021) - [j22]Feng Zhang, Zheng Chen, Chenyang Zhang, Amelie Chi Zhou, Jidong Zhai, Xiaoyong Du:
An Efficient Parallel Secure Machine Learning Framework on GPUs. IEEE Trans. Parallel Distributed Syst. 32(9): 2262-2276 (2021) - [j21]Chen Zhang, Chenggang Zhao, Jiaao He, Shengqi Chen, Liyan Zheng, Kezhao Huang, Wentao Han, Jidong Zhai:
Critique of "Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility" by SCC Team From Tsinghua University. IEEE Trans. Parallel Distributed Syst. 32(11): 2631-2634 (2021) - [j20]Feng Zhang, Jidong Zhai, Xipeng Shen, Dalin Wang, Zheng Chen, Onur Mutlu, Wenguang Chen, Xiaoyong Du:
TADOC: Text analytics directly on compression. VLDB J. 30(2): 163-188 (2021) - [c49]Hao Wu, Jiangming Jin, Jidong Zhai, Yifan Gong, Wei Liu:
Accelerating GPU Message Communication for Autonomous Navigation Systems. CLUSTER 2021: 181-191 - [c48]Lei Xie, Jidong Zhai, Weimin Zheng:
Mitigating Crosstalk in Quantum Computers through Commutativity-Based Instruction Reordering. DAC 2021: 445-450 - [c47]Feng Zhang, Zaifeng Pan, Yanliang Zhou, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression. ICDE 2021: 1679-1690 - [c46]Chen Zhang, Zeyu Song, Haojie Wang, Kaiyuan Rong, Jidong Zhai:
HyQuas: hybrid partitioner based quantum circuit simulation system on GPU. ICS 2021: 443-454 - [c45]Haojie Wang, Jidong Zhai, Mingyu Gao, Zixuan Ma, Shizhi Tang, Liyan Zheng, Yuanzhi Li, Kaiyuan Rong, Yuanyong Chen, Zhihao Jia:
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections. OSDI 2021: 37-54 - [c44]Kezhao Huang, Jidong Zhai, Zhen Zheng, Youngmin Yi, Xipeng Shen:
Understanding and bridging the gaps in current GNN performance optimizations. PPoPP 2021: 119-132 - [i7]Jiaao He, Jiezhong Qiu, Aohan Zeng, Zhilin Yang, Jidong Zhai, Jie Tang:
FastMoE: A Fast Mixture-of-Expert Training System. CoRR abs/2103.13262 (2021) - [i6]Feng Zhang, Zaifeng Pan, Yanliang Zhou, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression. CoRR abs/2106.06889 (2021) - 2020
- [j19]Ziyue Jiang, Yifan Gong, Jidong Zhai, Yu-Ping Wang, Wei Liu, Hao Wu, Jiangming Jin:
Message Passing Optimization in Robot Operating System. Int. J. Parallel Program. 48(1): 119-136 (2020) - [c43]Chanyoung Oh, Zhen Zheng, Xipeng Shen, Jidong Zhai, Youngmin Yi:
GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU. PACT 2020: 43-54 - [c42]Lei Xie, Jidong Zhai, Baodong Wu, Yuanbo Wang, Xingcheng Zhang, Peng Sun, Shengen Yan:
Elan: Towards Generic and Efficient Elastic Training for Deep Learning. ICDCS 2020: 78-88 - [c41]Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Xiaoyong Du:
Enabling Efficient Random Access to Hierarchically-Compressed Data. ICDE 2020: 1069-1080 - [c40]Zheng Chen, Feng Zhang, Amelie Chi Zhou, Jidong Zhai, Chenyang Zhang, Xiaoyong Du:
ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs. ICPP 2020: 22:1-22:11 - [c39]Wei Liu, Yifan Gong, Hao Wu, Jidong Zhai, Jiangming Jin:
Memory-Centric Communication Mechanism for Real-time Autonomous Navigation Applications. ICPP 2020: 33:1-33:11 - [c38]Xiaoyang Wang, Zhe Zhou, Ping Han, Tong Meng, Guangyu Sun, Jidong Zhai:
Edge-Stream: a Stream Processing Approach for Distributed Applications on a Hierarchical Edge-computing System. SEC 2020: 14-27 - [c37]Feng Zhang, Ningxuan Feng, Yani Liu, Cheng Yang, Jidong Zhai, Shuhao Zhang, Bingsheng He, Jiazao Lin, Xiaoyong Du:
PewLSTM: Periodic LSTM with Weather-Aware Gating Mechanism for Parking Behavior Prediction. IJCAI 2020: 4424-4430 - [c36]Qingyu Xu, Feng Zhang, Mingde Zhang, Jidong Zhai, Jiazao Lin, Haidi Liu, Xiaoyong Du:
Payment Behavior Prediction and Statistical Analysis for Shared Parking Lots. NPC 2020: 288-293 - [c35]Yuyang Jin, Haojie Wang, Xiongchao Tang, Torsten Hoefler, Xu Liu, Jidong Zhai:
Identifying scalability bottlenecks for large-scale parallel programs with graph analysis. PPoPP 2020: 409-410 - [c34]Yuyang Jin, Haojie Wang, Teng Yu, Xiongchao Tang, Torsten Hoefler, Xu Liu, Jidong Zhai:
ScalAna: automating scaling loss detection with graph analysis. SC 2020: 28 - [c33]Tianhui Shi, Mingshu Zhai, Yi Xu, Jidong Zhai:
GraphPi: high performance graph pattern matching through effective redundancy elimination. SC 2020: 100 - [i5]Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen:
AIPerf: Automated machine learning as an AI-HPC benchmark. CoRR abs/2008.07141 (2020) - [i4]Yuyang Jin, Haojie Wang, Teng Yu, Xiongchao Tang, Torsten Hoefler, Xu Liu, Jidong Zhai:
ScalAna: Automating Scaling Loss Detection with Graph Analysis. CoRR abs/2009.01692 (2020) - [i3]Feng Zhang, Jidong Zhai, Xipeng Shen, Dalin Wang, Zheng Chen, Onur Mutlu, Wenguang Chen, Xiaoyong Du:
TADOC: Text Analytics Directly on Compression. CoRR abs/2009.09442 (2020) - [i2]Tianhui Shi, Mingshu Zhai, Yi Xu, Jidong Zhai:
GraphPi: High Performance Graph Pattern Matching through Effective Redundancy Elimination. CoRR abs/2009.10955 (2020)
2010 – 2019
- 2019
- [j18]Feng Zhang, Weifeng Liu, Ningxuan Feng, Jidong Zhai, Xiaoyong Du:
Performance evaluation and analysis of sparse matrix and graph kernels on heterogeneous processors. CCF Trans. High Perform. Comput. 1(2): 131-143 (2019) - [j17]Feng Zhang, Jidong Zhai, Marc Snir, Hai Jin, Hironori Kasahara, Mateo Valero:
Guest Editorial: Special Issue on Network and Parallel Computing for Emerging Architectures and Applications. Int. J. Parallel Program. 47(3): 343-344 (2019) - [j16]Jiaao He, Chenggang Zhao, Jiping Yu, Xinjian Yu, Liyan Zheng, Chenyao Lou, Shizhi Tang, Wentao Han, Jidong Zhai:
Student Cluster Competition 2018, Team Tsinghua University: Reproducing performance of multi-physics simulations of the Tsunamigenic 2004 Sumatra megathrust earthquake on the Intel Skylake Architecture. Parallel Comput. 90 (2019) - [j15]Amelie Chi Zhou, Yao Xiao, Yifan Gong, Bingsheng He, Jidong Zhai, Rui Mao:
Privacy Regulation Aware Process Mapping in Geo-Distributed Cloud Data Centers. IEEE Trans. Parallel Distributed Syst. 30(8): 1872-1888 (2019) - [c32]Zhen Zheng, Chanyoung Oh, Jidong Zhai, Xipeng Shen, Youngmin Yi, Wenguang Chen:
HiWayLib: A Software Framework for Enabling High Performance Communications for Heterogeneous Pipeline Computations. ASPLOS 2019: 153-166 - [c31]Xiongchao Tang, Jidong Zhai, Xuehai Qian, Wenguang Chen:
pLock: A Fast Lock for Architectures with Explicit Inter-core Message Passing. ASPLOS 2019: 765-778 - [c30]Xu Ji, Bin Yang, Tianyu Zhang, Xiaosong Ma, Xiupeng Zhu, Xiyang Wang, Nosayba El-Sayed, Jidong Zhai, Weiguo Liu, Wei Xue:
Automatic, Application-Aware I/O Forwarding Resource Allocation. FAST 2019: 265-279 - [c29]Ningxuan Feng, Feng Zhang, Jiazao Lin, Jidong Zhai, Xiaoyong Du:
Statistical Analysis and Prediction of Parking Behavior. NPC 2019: 93-104 - [c28]Bin Yang, Xu Ji, Xiaosong Ma, Xiyang Wang, Tianyu Zhang, Xiupeng Zhu, Nosayba El-Sayed, Haidong Lan, Yibo Yang, Jidong Zhai, Weiguo Liu, Wei Xue:
End-to-end I/O Monitoring on a Leading Supercomputer. NSDI 2019: 379-394 - [c27]Chanyoung Oh, Zhen Zheng, Xipeng Shen, Jidong Zhai, Youngmin Yi:
GOPipe: a granularity-oblivious programming framework for pipelined stencil executions on GPU. PPoPP 2019: 431-432 - [c26]Xiongchao Tang, Haojie Wang, Xiaosong Ma, Nosayba El-Sayed, Jidong Zhai, Wenguang Chen, Ashraf Aboulnaga:
Spread-n-share: improving application performance and cluster throughput with resource-aware job placement. SC 2019: 12:1-12:15 - 2018
- [j14]Jidong Zhai, Wen-Guang Chen:
A vision of post-exascale programming. Frontiers Inf. Technol. Electron. Eng. 19(10): 1261-1266 (2018) - [j13]Ka Cheong Jason Lau, Yuxuan Li, Lei Xie, Qian Xie, Beichen Li, Yu Chen, Guanyu Feng, Jiping Yu, Xinjian Yu, Miao Wang, Wentao Han, Jidong Zhai:
Student cluster competition 2017, team Tsinghua University: Reproducing vectorization of the tersoff multi-body potential on the Intel Skylake and NVIDIA Volta architectures. Parallel Comput. 78: 47-53 (2018) - [j12]Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Wenguang Chen:
Efficient Document Analytics on Compressed Data: Method, Challenges, Algorithms, Insights. Proc. VLDB Endow. 11(11): 1522-1535 (2018) - [j11]Feng Zhang, Heng Lin, Jidong Zhai, Jie Cheng, Dingyi Xiang, Jizhong Li, Yunpeng Chai, Xiaoyong Du:
An adaptive breadth-first search algorithm on integrated architectures. J. Supercomput. 74(11): 6135-6155 (2018) - [j10]Xiongchao Tang, Jidong Zhai, Bowen Yu, Wenguang Chen, Weimin Zheng, Keqin Li:
An Efficient In-Memory Checkpoint Method and its Practice on Fault-Tolerant HPL. IEEE Trans. Parallel Distributed Syst. 29(4): 758-771 (2018) - [c25]Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Wenguang Chen:
Zwift: A Programming Framework for High Performance Text Analytics on Compressed Data. ICS 2018: 195-206 - [c24]Yuwei Hu, Jidong Zhai, Dinghua Li, Yifan Gong, Yuhao Zhu, Wei Liu, Lei Su, Jiangming Jin:
BitFlow: Exploiting Vector Parallelism for Binary Neural Networks on CPU. IPDPS 2018: 244-253 - [c23]Youwei Zhuo, Jinglei Cheng, Qinyi Luo, Jidong Zhai, Yanzhi Wang, Zhongzhi Luan, Xuehai Qian:
CSE: Parallel Finite State Machines with Convergence Set Enumeration. MICRO 2018: 29-41 - [c22]Xiongchao Tang, Jidong Zhai, Xuehai Qian, Bingsheng He, Wei Xue, Wenguang Chen:
vSensor: leveraging fixed-workload snippets of programs for performance variance detection. PPoPP 2018: 124-136 - [c21]Haojie Wang, Jidong Zhai, Xiongchao Tang, Bowen Yu, Xiaosong Ma, Wenguang Chen:
Spindle: Informed Memory Access Monitoring. USENIX ATC 2018: 561-574 - [e1]Feng Zhang, Jidong Zhai, Marc Snir, Hai Jin, Hironori Kasahara, Mateo Valero:
Network and Parallel Computing - 15th IFIP WG 10.3 International Conference, NPC 2018, Muroran, Japan, November 29 - December 1, 2018, Proceedings. Lecture Notes in Computer Science 11276, Springer 2018, ISBN 978-3-030-05676-6 [contents] - 2017
- [j9]Feng Zhang, Jidong Zhai, Bingsheng He, Shuhao Zhang, Wenguang Chen:
Understanding Co-Running Behaviors on Integrated CPU/GPU Architectures. IEEE Trans. Parallel Distributed Syst. 28(3): 905-918 (2017) - [c20]Feng Zhang, Bo Wu, Jidong Zhai, Bingsheng He, Wenguang Chen:
FinePar: irregularity-aware fine-grained workload partitioning on integrated architectures. CGO 2017: 27-38 - [c19]Shuo Yang, Kai Wu, Yifan Qiao, Dong Li, Jidong Zhai:
Algorithm-Directed Crash Consistence in Non-volatile Memory for HPC. CLUSTER 2017: 475-486 - [c18]Heng Lin, Xiongchao Tang, Bowen Yu, Youwei Zhuo, Wenguang Chen, Jidong Zhai, Wanwang Yin, Weimin Zheng:
Scalable Graph Traversal on Sunway TaihuLight with Ten Million Cores. IPDPS 2017: 635-645 - [c17]Zhen Zheng, Chanyoung Oh, Jidong Zhai, Xipeng Shen, Youngmin Yi, Wenguang Chen:
Versapipe: a versatile programming framework for pipelined computing on GPU. MICRO 2017: 587-599 - [c16]Xiongchao Tang, Jidong Zhai, Bowen Yu, Wenguang Chen, Weimin Zheng:
Self-Checkpoint: An In-Memory Checkpoint Method Using Less Space and Its Practice on Fault-Tolerant HPL. PPoPP 2017: 401-413 - [c15]Amelie Chi Zhou, Yifan Gong, Bingsheng He, Jidong Zhai:
Efficient process mapping in geo-distributed cloud data centers. SC 2017: 16 - [i1]Shuo Yang, Kai Wu, Yifan Qiao, Dong Li, Jidong Zhai:
Algorithm-Directed Crash Consistence in Non-Volatile Memory for HPC. CoRR abs/1705.05541 (2017) - 2016
- [j8]Jidong Zhai, Feng Zhang, Qingwen Li, Wenguang Chen, Weimin Zheng:
Characterizing and optimizing TPC-C workloads on large-scale systems using SSD arrays. Sci. China Inf. Sci. 59(9): 92104 (2016) - [j7]Haibao Chen, Song Wu, Hai Jin, Wenguang Chen, Jidong Zhai, Yingwei Luo, Xiaolin Wang:
A survey of cloud resource management for complex engineering applications. Frontiers Comput. Sci. 10(3): 447-461 (2016) - [j6]Jidong Zhai, Wenguang Chen, Weimin Zheng, Keqin Li:
Performance Prediction for Large-Scale Parallel Applications Using Representative Replay. IEEE Trans. Computers 65(7): 2184-2198 (2016) - [j5]Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Xiongchao Tang, Wenguang Chen, Weimin Zheng:
Building Semi-Elastic Virtual Clusters for Cost-Effective HPC Cloud Resource Provisioning. IEEE Trans. Parallel Distributed Syst. 27(7): 1915-1928 (2016) - [c14]Xinliang Wang, Wei Xue, Jidong Zhai, Yangtong Xu, Weimin Zheng, Hai-Xiang Lin:
A Fast Tridiagonal Solver for Intel MIC Architecture. IPDPS 2016: 172-181 - 2015
- [j4]Ikjoon Kim, Jidong Zhai, Yan Li, Wenguang Chen:
Optimizing seam carving on multi-GPU systems for real-time content-aware image resizing. J. Supercomput. 71(9): 3500-3524 (2015) - [j3]Jidong Zhai, Mingliang Liu, Ye Jin, Xiaosong Ma, Wenguang Chen:
Automatic Cloud I/O Configurator for I/O Intensive Parallel Applications. IEEE Trans. Parallel Distributed Syst. 26(12): 3275-3288 (2015) - [c13]Yunyun Jiang, Tian Xiao, Jidong Zhai, Ying Zhao, Wenguang Chen:
A Power-Conserving Online Scheduling Scheme for Video Streaming Services. ICA3PP (1) 2015: 133-154 - [c12]Feng Zhang, Jidong Zhai, Wenguang Chen, Bingsheng He, Shuhao Zhang:
To Co-run, or Not to Co-run: A Performance Study on Integrated Architectures. MASCOTS 2015: 89-92 - 2014
- [c11]Ikjoon Kim, Jidong Zhai, Yan Li, Wenguang Chen:
Optimizing Seam Carving on multi-GPU systems for real-time image resizing. ICPADS 2014: 616-623 - [c10]Jidong Zhai, Jianfei Hu, Xiongchao Tang, Xiaosong Ma, Wenguang Chen:
CYPRESS: Combining Static and Dynamic Analysis for Top-Down Communication Trace Compression. SC 2014: 143-153 - 2013
- [c9]Mingliang Liu, Ye Jin, Jidong Zhai, Yan Zhai, Qianqian Shi, Xiaosong Ma, Wenguang Chen:
ACIC: automatic cloud I/O configurator for parallel applications. HPDC 2013: 111-112 - [c8]Mingliang Liu, Ye Jin, Jidong Zhai, Yan Zhai, Qianqian Shi, Xiaosong Ma, Wenguang Chen:
ACIC: automatic cloud I/O configurator for HPC applications. SC 2013: 38:1-38:12 - [c7]Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Xiongchao Tang, Wenguang Chen:
Cost-effective cloud HPC resource provisioning by building semi-elastic virtual clusters. SC 2013: 56:1-56:12 - 2012
- [c6]Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Mingliang Liu, Yan Zhai, Wenguang Chen, Weimin Zheng:
Employing Checkpoint to Improve Job Scheduling in Large-Scale Systems. JSSPP 2012: 36-55 - 2011
- [j2]Jidong Zhai, Tianwei Sheng, Jiangzhou He, Wenguang Chen, Weimin Zheng:
Efficiently Acquiring Communication Traces for Large-Scale Parallel Applications. IEEE Trans. Parallel Distributed Syst. 22(11): 1862-1870 (2011) - [c5]Mingliang Liu, Jidong Zhai, Yan Zhai, Xiaosong Ma, Wenguang Chen:
One optimized I/O configuration per HPC application: leveraging the configurability of cloud. APSys 2011: 15 - [c4]Yan Zhai, Mingliang Liu, Jidong Zhai, Xiaosong Ma, Wenguang Chen:
Cloud versus in-house cluster: evaluating Amazon cluster compute instances for running MPI applications. SC State of the Practice Reports 2011: 11:1-11:10 - 2010
- [c3]Jidong Zhai, Wenguang Chen, Weimin Zheng:
PHANTOM: predicting performance of parallel applications on large-scale parallel machines using a single node. PPoPP 2010: 305-314
2000 – 2009
- 2009
- [j1]Wenguang Chen, Jidong Zhai, Jin Zhang, Weimin Zheng:
LogGPO: An accurate communication model for performance prediction of MPI programs. Sci. China Ser. F Inf. Sci. 52(10): 1785-1791 (2009) - [c2]Jin Zhang, Jidong Zhai, Wenguang Chen, Weimin Zheng:
Process Mapping for MPI Collective Communications. Euro-Par 2009: 81-92 - [c1]Jidong Zhai, Tianwei Sheng, Jiangzhou He, Wenguang Chen, Weimin Zheng:
FACT: fast communication trace collection for parallel applications through program slicing. SC 2009
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-22 19:01 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint