PERFUME: Programmatic Extraction and Refinement for Usability of Mathematical Expression

Authors:

Luis Garcia

Checkmate '21: Proceedings of the 2021 Research on offensive and defensive techniques in the Context of Man At The End (MATE) Attacks

Pages 59-69

https://doi.org/10.1145/3465413.3488575

Published: 15 November 2021

Abstract

Algorithm identification is central to several binary analysis applications, including malware analysis, vulnerability discovery, and embedded firmware reverse engineering. However, data-driven and signature-based approaches often break down when they encounter outlier realizations of a particular algorithm. Moreover, reverse engineering domain-specific binaries often requires collaborative analysis between reverse engineers and domain experts. Communicating the behavior of an unidentified binary program to people who are not reverse engineers requires recovering its algorithmic semantics in a human-digestible form. This paper presents PERFUME, a framework that extracts symbolic math expressions from low-level binary representations of an algorithm. PERFUME works by translating a symbolic output representation of a binary function into a high-level mathematical expression. In particular, we detail how source and target representations are generated for training a machine translation model. We integrate PERFUME as a plug-in for Ghidra, an open-source reverse engineering framework. We present our preliminary findings for domain-specific use cases and formalize open challenges in mathematical expression extraction from algorithmic implementations.
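
To make the idea of paired source and target representations concrete, the following is a minimal sketch of how such training pairs might be synthesized. It is not the paper's implementation. It assumes that the source side can be approximated by a prefix-form token stream, standing in for the symbolic output that a real pipeline would obtain by symbolically executing a compiled function, and that the target side is a parenthesized infix expression. The operator names (`fadd`, `fsub`, `fmul`, `fdiv`), the variable set, and every function name are hypothetical and chosen only for illustration.

```python
import random

# Hypothetical operator table (an assumption, not taken from the paper):
# a low-level, solver-style operator name for the source side, mapped to
# the infix symbol a domain expert would read on the target side.
OPS = {"fadd": "+", "fsub": "-", "fmul": "*", "fdiv": "/"}
VARS = ["x", "y", "z"]


def random_expr(rng, depth):
    """Build a random expression tree as nested tuples (op, left, right)."""
    if depth == 0 or rng.random() < 0.3:
        return rng.choice(VARS)
    op = rng.choice(sorted(OPS))
    return (op, random_expr(rng, depth - 1), random_expr(rng, depth - 1))


def to_source(expr):
    """Render the tree as prefix tokens, a stand-in for a symbolic output."""
    if isinstance(expr, str):
        return [expr]
    op, left, right = expr
    return [op] + to_source(left) + to_source(right)


def to_target(expr):
    """Render the same tree as a fully parenthesized infix expression."""
    if isinstance(expr, str):
        return expr
    op, left, right = expr
    return f"({to_target(left)} {OPS[op]} {to_target(right)})"


def make_pairs(n, seed=0, depth=3):
    """Produce n (source string, target string) pairs from random trees."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        expr = random_expr(rng, depth)
        pairs.append((" ".join(to_source(expr)), to_target(expr)))
    return pairs


if __name__ == "__main__":
    for src, tgt in make_pairs(3):
        print(src, "=>", tgt)
```

Because both strings are rendered from the same tree, each pair is aligned by construction, which is the property a sequence-to-sequence translation model needs from its training data. A real source side would be far noisier than this prefix form, since it would reflect compiler and architecture details.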

Supplementary Material

MP4 File (check069-weidemanA.mp4)

In this video, we present the paper "PERFUME: Programmatic Extraction and Refinement For Usability of Mathematical Expressions", which addresses extracting mathematical expressions from binary programs. We discuss the PERFUME pipeline, namely symbolic execution, data synthesis, and machine translation, and explain each step with examples. We motivate the effectiveness of PERFUME with our experimental results and discuss remaining challenges and future work.
