Location via proxy:   
[Report a bug]   [Manage cookies]                

Xiaodong GU


Associate Professor

School of Software, Shanghai Jiao Tong University

Contact:

Room 1208, Software Building, No.800 Dongchuan Road, Shanghai, China
Email:

Research Interest:

My research focuses on deep learning for natural and programming languages. I develop efficient machine learning methodologies for software code. My research topics are:

  • Large Language Models of Code
  • Program Generation
  • Natural Language Processing
Github 

Selected Publications

[Full List] [Google Scholar]

Between Lines of Code: Unraveling the Distinct Patterns of Machine and Human Programmers
Yuling Shi, Hongyu Zhang, Chengcheng Wan and Xiaodong Gu*
In Proceedings of the 47th International Conference on Software Engineering (ICSE 2025). Ottawa, Ontario, Canada, April 27 - May 3, 2025. (CCF-A)
[paper] [code] [bibtex]

VarGAN: Adversarial Learning of Variable Semantic Representations
Yalan Lin, Chengcheng Wan, Shuwen Bai, Xiaodong Gu*
IEEE Transactions on Software Engineering (TSE 2024) (CCF-A)
[paper] [code]

On the Evaluation of Neural Code Translation: Taxonomy and Benchmark
Mingsheng Jiao, Tingrui Yu, Xuan Li, Guanjie Qiu, Xiaodong Gu*, Beijun Shen
In Proceedings of the 38th International Conference on Automated Software Engineering (ASE 2023), Kirchberg, Luxembourg, Sept. 11-15, 2023 (CCF-A)
[paper] [slides] [code]

InfeRE: Step-by-Step Regex Generation via Chain of Inference
Shuai Zhang, Xiaodong Gu, Yuting Chen, Beijun Shen
In Proceedings of the 38th International Conference on Automated Software Engineering (ASE 2023), Kirchberg, Luxembourg, Sept. 11-15, 2023 (CCF-A)
[paper] [slides] [code] [bibtex]

Self-Supervised Query Reformulation for Code Search
Yuetian Mao, Chengcheng Wan, Yuze Jiang, Xiaodong Gu*
In Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023), San Francisco, California, United States, Dec 3-9, 2023 (CCF-A)
[paper] [slides] [code] [bibtex]

Diet Code Is Healthy: Simplifying Programs for Pre-Trained Models of Code
Zhaowei Zhang, Hongyu Zhang, Beijun Shen, Xiaodong Gu*
In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2022), Singapore, Nov 14-18, 2022 (CCF-A)
[paper] [slides] [code] [bibtex]

Zero-Shot Program Representation Learning
Nan Cui, Yuze Jiang, Xiaodong Gu, Beijun Shen,
In Proceedings of the 30th IEEE/ACM International Conference on Program Comprehension (ICPC 2022), Virtual, May 16-17, 2022 (CCF-B)
[paper] [code] [slides] [bibtex]

Self-Supervised Learning of Smart Contract Representations
Shouliang Yang, Xiaodong Gu, Beijun Shen
In Proceedings of the 30th IEEE/ACM International Conference on Program Comprehension (ICPC 2022), Virtual, May 16-17, 2022 (CCF-B)
[code] [slides] [bibtex]

Cross-Domain Deep Code Search with Meta Learning
Yitian Chai, Hongyu Zhang, Beijun Shen and Xiaodong Gu*
In Proceedings of the 44th International Conference on Software Engineering (ICSE 2022) (CCF-A)
[paper] [code] [slides] [bibtex]

DialogBERT: Discource Aware Response Generation via Learning to Recover and Rank Utterances
Xiaodong Gu, Kang Min Yoo, and Jung-Woo Ha
In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2021) (CCF-A)

Deep Code Search
Xiaodong Gu, Hongyu Zhang, Sunghun Kim
In Proceedings of the 40th International Conference on Software Engineering (ICSE 2018), Gothenburg, Sweden, May 27 - June 3, 2018 (CCF-A)
[paper] [code] [slides] [bibtex]

Deep API Learning
Xiaodong Gu, Hongyu Zhang, Dongmei Zhang, Sunghun Kim
In Proceedings of the 24th ACM SIGSOFT International Symposium on the Foundations of Software Engineering (FSE 2016), Seattle, WA, USA, November 13–18, 2016 (CCF-A)
[project website] [paper] [slides] [code] [bibtex]

Teaching

  • SE3332 Machine Learning (Fall 2020, 2021, 2022, 2023)
  • SE2324 Mathematic Foundation for Computer Science (Spring 2021, 2022, 2023, 2024)

Students

I am grateful to the wonderful students I have been collaborating with

  • Yuling SHI (PhD, 2022-)
  • Yalan LIN (Master, 2022-)
  • Chao HU (Master, 2023-)
  • Guangzhan Wang (Master, 2023-)
  • Wenhao Zeng (Master, 2024-)

Alumni

    Zhaowei Zhang (Master 2023, Unity), Yitian CHAI (Master 2024, Tencent), Meng CHEN (Master 2024, Xiaohongshu)

Grants

  • 宁德时代,基于大模型的嵌入式软件可靠性保障技术,2024.5.6-2025.9.30,主持
  • 国家重点研发计划,面向场景计算的低代码开发方法与环境,2023.12-2026.12,参与
  • 中国航空无线电电子研究所,民机软件研制过程辅助系统,2022.12-2026.6,主持
  • 横向课题,基于大模型的恶意代码样本生成,2023.5.1-2024.4.31,主持
  • CCF-腾讯犀牛鸟基金,特定领域程序自动生成,2022.10.1-2023.12.31,主持
  • CCF-百度松果基金,基于预训练模型的程序表征,2021.9.1-2022.8.30,主持
  • 国家自然科学基金青年基金,基于小样本学习的跨语言程序自动生成,2022.1.1-2024.12.31,主持

Services

Program Committee ACL (2023), EMNLP (2021, 2022, 2023), COLING (2020, 2022), IJCAI (2023), EACL (2023)
Reviewer Board Automated Software Engineering (AUSE), Empirical Software Engineering (EMSE)
Journal Reviewer TSE, TOSEM, EMSE, IST, JSS, FCS

 

@2023 Xiaodong Gu's Personal Website. All rights reserved.