Deep Learning-Based Dynamic Region of Interest Autofocus Method for Grayscale Image

Yao Wang; Chuan Wu; Yunlong Gao; Huiying Liu

doi:10.3390/s24134336

Sensors (Basel). 2024 Jul 4;24(13):4336. doi: 10.3390/s24134336.

Authors

Yao Wang¹,², Chuan Wu¹, Yunlong Gao¹, Huiying Liu¹,²

Affiliations

¹ Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China.
² University of Chinese Academy of Sciences, Beijing 100049, China.

Abstract

Passive focusing methods are widely used in optical autofocus systems because they are cost-effective, but their fixed focusing windows and evaluation functions can still cause focusing failures in certain scenarios. In addition, the lack of datasets has limited research on deep learning methods for autofocus. In this work, we propose a neural network autofocus method that dynamically selects the region of interest (ROI). Our main contributions are as follows: first, we construct a dataset for autofocus on grayscale images; second, we recast autofocus as an ordinal regression problem and propose two focusing strategies, full-stack search and single-frame prediction; and third, we build a MobileViT network with a linear self-attention mechanism to achieve autofocus on dynamic regions of interest. Experiments verify the effectiveness of the proposed method: the focusing mean absolute error (MAE) of full-stack search can be as low as 0.094 with a focusing time of 27.8 ms, and that of single-frame prediction can be as low as 0.142 with a focusing time of 27.5 ms.
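To illustrate what treating autofocus as ordinal regression can look like, here is a minimal sketch in plain Python. It is not the authors' implementation. The abstract does not state the label encoding, the number of focus positions, or the unit of the MAE, so the cumulative binary encoding, the six positions, and the function names `ordinal_encode`, `ordinal_decode`, and `mean_absolute_error` are all assumptions made for illustration.

```python
from typing import List, Sequence


def ordinal_encode(label: int, num_positions: int) -> List[int]:
    """Encode a focus-position index as cumulative binary targets.

    Assumed scheme (not confirmed by the abstract): target k is 1 when
    the true position index is greater than k, giving num_positions - 1
    binary targets that preserve the ordering of positions.
    """
    if not 0 <= label < num_positions:
        raise ValueError("label must lie in [0, num_positions)")
    return [1 if label > k else 0 for k in range(num_positions - 1)]


def ordinal_decode(probs: Sequence[float], threshold: float = 0.5) -> int:
    """Decode per-threshold probabilities back to a position index.

    The predicted index is the number of thresholds whose probability
    exceeds `threshold`.
    """
    return sum(1 for p in probs if p > threshold)


def mean_absolute_error(pred: Sequence[int], true: Sequence[int]) -> float:
    """Mean absolute difference between predicted and true indices."""
    if len(pred) != len(true) or not pred:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - t) for p, t in zip(pred, true)) / len(pred)


# Hypothetical example with 6 focus positions.
targets = ordinal_encode(3, 6)                          # [1, 1, 1, 0, 0]
decoded = ordinal_decode([0.9, 0.8, 0.6, 0.3, 0.1])     # 3
mae = mean_absolute_error([3, 5, 0, 2], [3, 4, 0, 4])   # 0.75
```

Unlike a plain classification target, this encoding penalizes a prediction that is off by two positions more than one that is off by one. That makes it a natural match for an error measure such as MAE, assuming the MAE is computed over position indices.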

Keywords: autofocus; dataset; deep learning; lightweight network; ordinal regression.

Grants and funding

This research received no external funding.