Pavement Crack Detection Algorithm Based On Densely Connected and

1. The document proposes a new deep learning method called a densely connected and deeply supervised network (DCDSN) for pixel-level pavement crack detection. 2. DCDSN uses densely connected convolution layers to extract crack features more effectively. It also uses deep supervision modules to constrain hidden layers and extract multi-scale crack features. 3. Feature maps from different scales are fused to obtain the final crack detection results. A class-balanced loss function is also used to address the imbalance between crack and non-crack pixels.

Uploaded by

Tanujaa Shri

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views

Pavement Crack Detection Algorithm Based On Densely Connected and

Uploaded by

Tanujaa Shri

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Received December 8, 2020, accepted December 16, 2020, date of publication January 11, 2021, date of current version

January 22, 2021.

Digital Object Identifier 10.1109/ACCESS.2021.3050401

Pavement Crack Detection Algorithm Based

on Densely Connected and Deeply
Supervised Network
HAIFENG LI 1, JIANPING ZONG1 , JINGJING NIE2 , ZHILONG WU2 , AND HONGYANG HAN1
1 College
of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, China
2 Chengdu Tianfu International Airport, Chengdu 641419, China

Corresponding author: Haifeng Li (lihf_cauc@126.com)

This work was supported by the National Key Research and Development Project of China under Grant 2019YFB1310601.

ABSTRACT In order to improve the accuracy and robustness of existing automated crack detection methods,
a fully convolutional neural network for pixel-level detection based on densely connected and deeply
supervised network is proposed. First, the densely connected layers are applied for enhancing the propagation
and reuse of crack features. Then, the deeply supervised modules are designed to make network extract
more significant features through multi-scale levels. Finally, the feature maps from different scales are fused
to achieve complementarity at different levels. In addition, a class-balanced cross-entropy loss function is
designed to balance backgrounds and cracks by increasing the weight of crack pixel loss. The proposed
method is tested on three public datasets, and the experiments show that our method is superior to state-of-
the-art methods in accuracy, speed and robustness.

INDEX TERMS Crack detection, deep learning, densely connected network, deeply supervised network.

I. INTRODUCTION detect cracks on airport runway surface. Wei et al. [3] adopt
In recent years, highway and airport constructions are boom- gray difference and Hough transform to realize automatic
ing all over the world, especially in the developing countries. detection of small cracks. Kapela et al. [4] utilize Hough
To keep good condition of infrastructure, prompt and efficient transform feature (HTF) and local binary pattern (LBP) to
maintenance of pavement surface has become an important extract the edge direction and texture features of cracks
issue in the field of transportation industry. Cracks are the respectively. Qu et al. [5]employ structural forest edge detec-
very early forms of most diseases on pavement surfaces. tor to extract crack edge, and seepage model to complete
Prompt and accurate detection of cracks could minimize denoising. Amhaz et al. [6] propose an automatic detec-
maintenance costs and improve efficiency. However, nowa- tion algorithm of two-dimensional pavement cracks based
days manual inspection shows the disadvantages of poor on minimum path location. The crack detection algorithms
accuracy, high subjectivity and inefficiency, which cannot based on traditional digital image processing transform or
satisfy the needs of rapid highway construction. Thus, effi- map the original image to a specific space, and obtain the final
cient and automated crack detection has become a research detection result by learning the structure of shallow crack
hotspot. features. However, due to the complexity of real pavement
Numerous efforts have been applied on traditional digital conditions and the various uncertainties of environmental
image processing techniques to detect cracks, such as thresh- impacts, such as texture diversity, strong noise interference,
old segmentation, feature extraction, edge detection, filter irregular crack direction and so on, these algorithms are easy
and minimum path methods. Oliveira and Correia [1] extract to be interfered by environmental factors, and cannot meet
crack feature with the combination of connected compo- the needs of accuracy and speed at the same time. Therefore,
nent and automatic threshold segmentation. Li et al. [2] use the efficient and robust crack detection algorithms still need
improved OTSU threshold and adaptive iterative threshold to to be studied.
Since the cracks and edges have similar characteristics
The associate editor coordinating the review of this manuscript and in shape, structure and thickness, it is practicable to apply
approving it for publication was Tomasz Trzcinski . edge detection method to detect cracks. Based on structural

This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
VOLUME 9, 2021 11835
H. Li et al.: Pavement Crack Detection Algorithm Based on Densely Connected and Deeply Supervised Network

forest [7], Shi et al. [8] propose CrackForest algorithm us an efficient way for feature extraction. However, although
to detect pavement cracks by the combination of comple- the DenseNet based algorithms have achieved superior per-
mentary features of cracks, and the result is more accu- formance for feature extraction, due to the semantic fea-
rate than Free-Form Antioxidant (FFA) [9] and Minimal ture distribution of cracks, and the imbalance of foreground
Path Selection (MPS) [6]. However, the algorithm is still and background ratio in crack detection, it is necessary to
based on the human-selected features of crack, which have supervise and fuse the features from different scales when
weak adaptability and poor robustness in complex back- adopting DenseNet, which induces to our work in this paper.
ground. Richer Convolutional Feature (RCF) [10], as one of Since deeply-supervised nets (DSN) method simultaneously
the most advanced edge detection algorithms, can produce minimizes classification error while making the learning pro-
high-quality edges efficiently by combining multiscale and cess of hidden layers direct and transparent, it provides the
multilevel information of objects. But the backbone of RCF potential to supervise the feature extraction with DenseNet in
is only composed of multiple convolution layers, and the our crack detection applications.
high-level convolution layer only uses the feature map which To overcome the difficulties in crack detection due to its
is transmitted from the previous layer, and it leads to that the very thin shape and semantic feature distribution, we propose
high-level convolution neglects many crack features even if a fully convolutional neural networks for pixel-level detection
the final fusion combines the results of all scales. Thus, RCF based on densely connected and deeply supervised network.
is not fully applicable to crack detection. The main contributions are listed as follows.
Deep learning has been widely used in the field of com- 1) The dense connection module is designed for extracting
puter vision. Some studies have been committed to apply the feature map from the image at various scales. Densely
deep learning to detection and recognition of pavement sur- connected convolution is used to extract the features of cracks
face cracks. Eisenbach et al. [11] propose a road disease more sufficiently.
dataset for training deep learning networks, and evaluate the 2) The deep supervision module is used to constraint mul-
current situation of pavement disease detection technology tiple hidden layers and extract multiscale detail features of
for the first time. Zhang et al. [12] apply a convolution crack.
neural network to the classification of fracture panel and 3) The multiscale information of crack features generated
non-fracture panel, and prove the advantage of deep learning from all the deep supervision modules are fused by the fusion
in fracture detection. Li et al. [13] propose a classification module to obtain the final crack detection results.
model based on convolutional neural network, Deep Bridge 4) To deal with the imbalance of crack and non-crack pix-
Crack Classify (DBCC), and conduct optimized sliding win- els, a class balanced cross entropy loss function is designed to
dow algorithm to detect bridge cracks. The above methods obtain more stable training results by dynamic adjusting the
regard crack detection as a task of image block classifica- weight of crack pixel loss.
tion based on deep neural network. Besides, those methods The proposed method is tested on three public datasets:
neglect the spatial relationship between crack pixels which AEL [16], Crack500 [18] and Cracktree200 [19]. The
causes the lack of global crack features. Inspired by Fully experiment results validate our method.
Convolutional Networks (FCN) [14], some studies have been
devoted to apply semantic segmentation for crack detection. II. OVERVIEW OF METHODS
Schmugge et al. [15] propose a remote video crack detection The main structure of our proposed network is shown
method based on semantic segmentation network. Wei [16] in Fig. 1. The network is composed of convolution modules,
applies semantic segmentation method to automatically learn dense connection modules, conversion modules, deep super-
the linear, direction and edge features of cracks for pixel vision modules, deconvolution layers and fusion module. The
classification. Li et al. [17] develop a lightweight seman- input of the network is a road surface image, while the output
tic segmentation model based on crack characteristics, and is a crack prediction map with the same size as the input,
obtained the average crack width using the axis skeleton algo- and the crack pixels have higher probability than non-crack
rithm. However, since the features generated by deep-level pixels.
layers are abstract semantic features, the general CNN based Given an image into the network, firstly the multiscale
semantic segmentation methods may miss the detail feature feature maps are extracted by the convolution modules and
of cracks and lead to inaccuracy detection results. In addition, dense connection modules, then the dense connection mod-
with growing depth of neural network structures and increas- ules are connected by the conversion modules which mainly
ing number of layers, the extraction of crack feature could compresses the dense features from the previous modules to
be more difficult, and the gradients are going to vanishing. alleviate the feature redundancy. Following each convolution
In 2017, Gao, et al. [20] proposed a classification network, module and dense connection module, a deep supervision
DenseNet, to strengthen feature propagation and alleviate module is connected. Each convolution module and dense
the vanishing-gradient problem. In DenseNet, each layer has connection module extracts a feature map for deep supervi-
direct access to the gradients from the loss function and the sion module, and each deep supervision module generates a
original input signal, leading to an implicit deep supervision. prediction map with loss function. During training, the loss
By densely connecting the feature maps, DenseNet provides function of the feature maps generated by deep supervision