research-article

Open access

Semantic Debugging

Authors:

Andreas ZellerAuthors Info & Claims

ESEC/FSE 2023: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering

Pages 438 - 449

https://doi.org/10.1145/3611643.3616296

Published: 30 November 2023 Publication History

PDF eReader

Abstract

Why does my program fail? We present a novel and general technique to automatically determine failure causes and conditions, using logical properties over input elements: “The program fails if and only if int(<length>) > len(<payload>) holds—that is, the given <length> is larger than the <payload> length.” Our AVICENNA prototype uses modern techniques for inferring properties of passing and failing inputs and validating and refining hypotheses by having a constraint solver generate supporting test cases to obtain such diagnoses. As a result, AVICENNA produces crisp and expressive diagnoses even for complex failure conditions, considerably improving over the state of the art with diagnoses close to those of human experts.

Supplementary Material

Video (fse23main-p446-p-video.mp4)

"Automatic query reformulation is a widely used technology to enhance code search results, by formulating as a machine translation problem of rewriting a query into a more comprehensive alternative. While showing promising results, it typically requires a large parallel corpus of query pairs (i.e., the original query and a reformulated query) that are confidential and unpublished by online code search engines. This restricts its practicality in software development. In this paper, we propose SSQR, a self-supervised query reformulation method that does not rely on any parallel query corpus. Inspired by pre-trained models, SSQR treats query reformulation as a masked language modeling task over a large-scale unlabelled corpus of queries. SSQR extends T5 (a sequence-to-sequence model based on Transformer) with a new pre-training objective named corrupted query completion (CQC), which randomly masks words from a complete query and asks T5 to predict the masked content. Then, for a given query to be reformulated, SSQR enumerates candidate positions to be expanded and employs the pre-trained T5 model to generate the content to fill the spans. Finally, SSQR selects expansions that have the most information gain. Our evaluation shows that SSQR significantly outperforms unsupervised baselines and gains competitive performance over supervised methods."

Download
88.27 MB

References

[1]

Remita Amine. 2021. youtube-dl. https://github.com/ytdl-org/youtube-dl

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Using likely invariants for automated software fault localization

Explaining and debugging pathological program behavior

Using likely invariants for automated software fault localization

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Badges

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

PDF

eReader

Get Access

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations