Leverage the Explainability of Transformer Models to Improve the DNA 5-Methylcytosine Identification (Student Abstract)

Authors

  • Wenhuan Zeng, University of Tübingen
  • Daniel H. Huson, University of Tübingen

DOI:

https://doi.org/10.1609/aaai.v38i21.30533

Keywords:

Natural Language Processing, Transfer Learning, DNA Methylation, Transformer-based Language Model

Abstract

DNA methylation is an epigenetic mechanism for regulating gene expression, and it plays an important role in many biological processes. While methylation sites can be identified using laboratory techniques, considerable effort is going into computational approaches based on machine learning. Here, we present a deep-learning algorithm for determining the 5-methylcytosine status of a DNA sequence. We propose an ensemble framework that treats self-attention scores as an explicit feature, combining them with the encoder-layer representations produced by fine-tuned language models. We evaluate the performance of the model under different data distribution scenarios.
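
The abstract only sketches the architecture, so the following is a minimal illustration (assuming PyTorch and Hugging Face transformers) of how self-attention scores could be exposed as an explicit feature alongside the encoder output of a fine-tuned DNA language model. The checkpoint name, pooling choices, fixed sequence length, and classifier head are assumptions for illustration, not the authors' exact design.

import torch
import torch.nn as nn
from transformers import AutoModel


class AttentionAugmentedClassifier(nn.Module):
    """Combine encoder embeddings with self-attention scores as an explicit feature."""

    def __init__(self, checkpoint="dna-language-model", max_len=512):
        # "dna-language-model" is a placeholder; any fine-tuned transformer
        # encoder over DNA k-mer tokens could be plugged in here.
        super().__init__()
        self.encoder = AutoModel.from_pretrained(checkpoint, output_attentions=True)
        hidden = self.encoder.config.hidden_size
        # Classifier over the concatenation of the pooled encoder state and a
        # per-position attention feature; inputs are assumed padded to max_len.
        self.head = nn.Sequential(
            nn.Linear(hidden + max_len, 256),
            nn.ReLU(),
            nn.Linear(256, 2),  # methylated vs. unmethylated cytosine
        )

    def forward(self, input_ids, attention_mask):
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        # Mean-pooled token embeddings from the last encoder layer: (batch, hidden)
        emb = out.last_hidden_state.mean(dim=1)
        # Last-layer attention, averaged over heads and query positions, gives one
        # "attention received" score per token position: (batch, max_len)
        attn = out.attentions[-1].mean(dim=1).mean(dim=1)
        return self.head(torch.cat([emb, attn], dim=-1))

In the ensemble framework described in the abstract, several such fine-tuned models would be combined; the snippet only illustrates how the attention feature might be constructed for a single model.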

Published

2024-03-24

How to Cite

Zeng, W., & Huson, D. H. (2024). Leverage the Explainability of Transformer Models to Improve the DNA 5-Methylcytosine Identification (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), 23703-23704. https://doi.org/10.1609/aaai.v38i21.30533