Research article · DOI: 10.1145/3460945.3464951

Generating bug-fixes using pretrained transformers

Published: 20 June 2021

Abstract

Detecting and fixing bugs are two of the most important yet frustrating parts of the software development cycle. Existing bug detection tools are based mainly on static analyzers, which rely on mathematical logic and symbolic reasoning about the program execution to detect common types of bugs. Fixing bugs is typically left to the developer. In this work, we introduce DeepDebug: a data-driven program repair approach which learns to detect and fix bugs in Java methods mined from real-world GitHub repositories. We frame bug-patching as a sequence-to-sequence learning task consisting of two steps: (i) denoising pretraining, and (ii) supervised finetuning on the target translation task. We show that pretraining on source code increases the number of patches found by 33% compared to supervised training from scratch, while domain-adaptive pretraining from natural language to code further improves accuracy by another 32%. We refine the standard accuracy evaluation metric into non-deletion and deletion-only fixes, and show that our best model generates 75% more non-deletion fixes than the previous state of the art. In contrast to prior work, we attain our best results when generating raw code, as opposed to working with abstracted code, which tends to benefit only smaller-capacity models. Finally, we observe a subtle improvement from adding syntax embeddings along with the standard positional embeddings, as well as from adding an auxiliary task to predict each token's syntactic class. Despite focusing on Java, our approach is language-agnostic, requiring only a general-purpose parser such as tree-sitter.
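
To make the two-step recipe concrete, the sketch below shows one way the pretrain-then-finetune framing could be reproduced with off-the-shelf tooling. It is an illustrative approximation, not the authors' released pipeline: it assumes the Hugging Face transformers library with PyTorch, uses the public facebook/bart-base denoising checkpoint as a stand-in for the paper's pretrained encoder-decoder, and trains on a hypothetical in-memory list of (buggy, fixed) Java method pairs.

```python
# Illustrative sketch only (not the authors' released code). Assumes the
# Hugging Face `transformers` library and PyTorch; `facebook/bart-base` is a
# stand-in for a denoising-pretrained encoder-decoder, and `pairs` is a
# hypothetical placeholder for (buggy, fixed) Java methods mined from GitHub
# commit histories.
import torch
from torch.optim import AdamW
from transformers import BartForConditionalGeneration, BartTokenizerFast

tokenizer = BartTokenizerFast.from_pretrained("facebook/bart-base")
model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")

# Supervised finetuning data: the buggy method is the source sequence and the
# fixed method is the target sequence of the translation task.
pairs = [
    ("int max(int a, int b) { return a < b ? a : b; }",
     "int max(int a, int b) { return a < b ? b : a; }"),
]

optimizer = AdamW(model.parameters(), lr=3e-5)
model.train()
for buggy, fixed in pairs:
    inputs = tokenizer(buggy, return_tensors="pt", truncation=True, max_length=512)
    labels = tokenizer(fixed, return_tensors="pt", truncation=True, max_length=512).input_ids
    loss = model(**inputs, labels=labels).loss  # cross-entropy against the fixed code
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()

# At inference time, beam search proposes candidate patches for a buggy method;
# in practice these would then be ranked or validated (e.g. by compiling and
# running tests).
model.eval()
encoded = tokenizer(pairs[0][0], return_tensors="pt")
with torch.no_grad():
    candidates = model.generate(encoded.input_ids, num_beams=5,
                                num_return_sequences=5, max_length=128)
for c in candidates:
    print(tokenizer.decode(c, skip_special_tokens=True))
```

The paper's additional ingredients, syntax embeddings added alongside the standard positional embeddings and an auxiliary objective of predicting each token's syntactic class (obtainable from a general-purpose parser such as tree-sitter), would sit on top of this basic sequence-to-sequence loop and are omitted from the sketch.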


Published In

MAPS 2021: Proceedings of the 5th ACM SIGPLAN International Symposium on Machine Programming
June 2021, 52 pages
ISBN: 9781450384674
DOI: 10.1145/3460945

Publisher

Association for Computing Machinery, New York, NY, United States

Author Tags

  1. bug-fixing
  2. bug-patching
  3. program repair
  4. transformers

Conference

PLDI '21
