Prior and Prediction Inverse Kernel Transformer for Single Image Defocus Deblurring

Authors

  • Peng Tang MBZUAI Techinical University of Munich
  • Zhiqiang Xu MBZUAI
  • Chunlai Zhou Renmin University of China
  • Pengfei Wei AI Lab, Bytedance
  • Peng Han University of Electronic Science and Technology of China
  • Xin Cao University of New South Wales
  • Tobias Lasser Technical University of Munich

DOI:

https://doi.org/10.1609/aaai.v38i6.28320

Keywords:

CV: Low Level & Physics-based Vision, CV: Other Foundations of Computer Vision

Abstract

Defocus blur, due to spatially-varying sizes and shapes, is hard to remove. Existing methods either are unable to effectively handle irregular defocus blur or fail to generalize well on other datasets. In this work, we propose a divide-and-conquer approach to tackling this issue, which gives rise to a novel end-to-end deep learning method, called prior-and-prediction inverse kernel transformer (P2IKT), for single image defocus deblurring. Since most defocus blur can be approximated as Gaussian blur or its variants, we construct an inverse Gaussian kernel module in our method to enhance its generalization ability. At the same time, an inverse kernel prediction module is introduced in order to flexibly address the irregular blur that cannot be approximated by Gaussian blur. We further design a scale recurrent transformer, which estimates mixing coefficients for adaptively combining the results from the two modules and runs the scale recurrent ``coarse-to-fine" procedure for progressive defocus deblurring. Extensive experimental results demonstrate that our P2IKT outperforms previous methods in terms of PSNR on multiple defocus deblurring datasets.

Published

2024-03-24

How to Cite

Tang, P., Xu, Z., Zhou, C., Wei, P., Han, P., Cao, X., & Lasser, T. (2024). Prior and Prediction Inverse Kernel Transformer for Single Image Defocus Deblurring. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5145-5153. https://doi.org/10.1609/aaai.v38i6.28320

Issue

Section

AAAI Technical Track on Computer Vision V