1. Introduction
A complex infrastructure network above and under the ground can determine the economic prosperity of modern cities. Water supply systems (WSS) and sewage are the core of urban underground networks [
1,
2]. Siphons can transport sewage and water run-off under deep obstacles without a continuous gradient at the intersection of pipelines, such as railway lines, high-rise buildings, and bridges. Siphons are often vulnerable to damage due to their long service life, corrosion of aging, structural complexity, and external load impact. There are two types of damage during the service life of siphons, one is the structural defects caused by the quality of siphons or external influences, including deformation, leaking, corrosion, and foreign material [
3]. The other is caused by changes in the physical and chemical environments in siphons, including functional defects resulting from the intrusion of external substances, such as scale, deposits, obstacles, and scum [
4]. Therefore, it is especially necessary to identify siphon conditions to ensure the safety and reliability of buried siphons. The unqualified environment in siphons decreases the efficiency of manual operation and increases the safety risk, and it is of considerable significance to investigate the methods of assessing urban buried siphons in order to minimize the use of manual detection.
Recently, a series of methods have been developed for blockage or leakage detection in pipes, such as piezoelectric ultrasonic sensors technology [
5], eddy current probes [
6], closed-circuit television (CCTV) [
7], sewer scanner evaluation technology (SSET) [
8], and ground-penetrating radar (GPR) [
9,
10]. Piezoelectric ultrasonic sensors can detect minimal defects, and they have the advantages of being portable, non-polluting, and highly accurate for pipeline damage detection. However, piezoelectric ultrasonic testing is relatively difficult, tedious, and expensive, and its use is limited by the operating frequency. Eddy current probe detection has several limitations, and it can only be used to detect the surface or near-surface defects in magnetic or non-magnetic conductive siphons. In CCTV and SSET, the camera is installed on a mobile robot, which passes through the siphon to obtain the video data. The images can be used to manually or automatically evaluate the health or structure of siphons. Nevertheless, the detection is expensive and limited to relatively small portions of the sewer system.
In contrast, acoustic detection is a non-destructive testing method with the advantages of simple implementation, low cost and high detection efficiency. The detection results do not depend on the subjectivity of the inspectors. Acoustic detection is suitable for inspecting pipes with different shapes (straight, curved, and siphon) and fluids of different densities (low, medium, and high). The changes in acoustic wave forms are related to the system characteristics (such as blockage and leakage) [
11,
12]. According to recent study results, the detection methods can detect sewer blockages and leakages based on acoustic signals. Consequently, the US Environmental Protection Agency’s Office of Research and Development developed the portable and rapidly deployable sewer line rapid assessment tools SL-RAT
TM [
13] and Sewer-Batt
TM system [
14] and conducted a series of experiments. The SL-RAT
TM system predicts the blockage degree in pipes by transmitting a multi-tone acoustic signal to the pipes and measuring the attenuation of the transmission signal. The Sewer-Batt
TM system determines the acoustic response according to the intensity of the reflected signal, and works well for detecting pipe blockages with or without side gates. However, a CCTV system is required for further detection due to its limitations with respect to multiple defect recognition.
In this paper, we propose a detection method based on acoustic sensors. Compared with ultrasonic signals, low-frequency acoustic signals with less attenuation are actively transmitted into siphons.
The active acoustic detection method was adopted. A sinusoidal sweep signal was used as the excitation signal, and the sound pressure data were collected at the receiving end to measure the acoustic impulse response of the pipe.
VMD was employed to decompose acoustic signal components and obtain the band-limited intrinsic mode function (BIMF). Mutual information was introduced to quantitatively characterize the BIMF components. The components containing the most information about the siphon conditions were chosen, so that the effective feature information can be kept. According to the mutual information screening coefficient, the effective BIMF components were selected to reconstruct the signal. The root means square error of the reconstructed signal and the original signal was calculated. Moreover, the optimal result of component number k is determined using the minimum root mean square error.
The BIMF component was conducted with sound pressure level (SPL) transformation, and the acoustic energy flow density value of the filtered BIMF component was extracted as the classification feature. Finally, the eigenvalues were input to three classifiers, namely, random forest (RF), k-nearest neighbor (KNN), and support vector machine (SVM). The operation accuracy of the three classifiers for identifying the siphons conditions is higher than 94.4%, which verifies the effectiveness of identifying siphon conditions based on acoustic features.
2. Related Theories
A siphon is composed of two elbows (B, D) and three straight pipes (A, C, and E), as shown in
Figure 1. The material of siphon pipes determines the hydraulic characteristics of the composite elbow. Because of the density difference between solid particles and water in a siphon, the centrifugal force also acts on the siphon, the pressure on the outer wall rises, and the pressure on the inner wall decreases. Due to the lower pressure on the outer wall correspondingly and the higher pressure on the inner wall, an eddy current zone is formed near the inner wall, causing three-dimensional diffusion. The particles enter elbow B and come into contact with the pipe wall outside the elbow. Subsequently, the particles are bounced into water, collide with the pipe wall a second time, and flow with the water in the elbow. However, a particle-free area is formed on the inner side of elbow B, and a large number of particles are accumulated on the outer side of the elbow, thereby reducing the effective area of the mainstream, causing accumulation of solid particle waste. Moreover, blockages are formed in section C of the siphon. The undulations continue and the residual water remains in front of the blockage, which affects the safe operation of the drainage pipe and increases the hidden danger of the siphon.
The joint between the elbow and the straight of the siphon is connected by bonding. When there is water in the siphon, a stress joint will be generated near the bonding seam of the bend. The stress joint is near the bend, and the corrosion in the pipe quickly leads to stress corrosion leaking at the bonding location. During the operation of the siphon, the bonding site is continuously impacted by granular impurities. In the case of low velocity, the particles have less impact on the pipe wall, and the wear on the pipe wall is small. As the increase of the velocity, the radial velocity gradient of the particles in the pipe is higher, and the collision between the particles and the pipe wall is intensified, which enhances the wear degree of the pipe wall. As a result, an unsealed joint is likely to occur.
According to the pipe acoustics theory, as the siphon has a linear-elastic stress–strain relationship and a constant boundary stat, the incident sound wave propagates in the siphon due to vibration and the velocity is the same at all locations [
15]. The changes in acoustic impedance caused by the abrupt change in the cross-sectional area will result in interface reflection, interference, and diffraction, thereby increasing the propagation path of the sound wave. More time will be taken for sound transmission, and the energy dissipation (amplitude reduction) is increased [
16,
17]. Research on siphon damage has shown that the change in stiffness is closely related to the type and degree of damage. Damage degree can be determined by monitoring the change in stiffness. However, it is difficult to directly measure the stiffness, which is generally reflected by the dynamic response characteristics of the structure. Consequently, the acoustic signal transmitted in the pipe can reflect the stiffness change and determine the damage status [
15].
The radius of the pipe is
a, the length is
L, and the plane wave
x is incident along the direction of the plane. The angle between the wave vector
x and the axis is
θi. We assume that both ends of the cylindrical pipe with a finite length meet the boundary conditions, and the influence of sound scattering of the cylindrical pipe is ignored. There is a blockage in the pipe, or the pipe wall is damaged. The sound field of the cylindrical pipe is:
where
is the incident sound field,
,
is the wave number of the sound wave in the water in the pipe.
is the scattering sound field caused by a rigid structure, and
is the scattering sound field caused by a flexible fabric. In the siphon, the pipe wall has a rigid structure, and the plug has an elastic structure.
In a normal condition, the material of the pipe is even and dense. The sound wave is refracted and scattered in the pipe, and it propagates from the transmitting end to the receiving end in the shortest time. In this case, the attenuation of sound wave energy is the least. In the case of a blockage in the siphon, the sound wave is refracted and scattered, and the blockage is diffracted. Since the blockage is generally filled with air or water vapor, the characteristic impedance of the blockage and the pipe wall, air, or water vapor are significantly different. The low-frequency sound wave is diffused and propagated bypassing the blockage. The propagation path is longer, the sound becomes more extensive, and the waveform changes. The sound wave is received by the receiving end after passing through the blockage. When a blockage exists in the pipe, sound pressure and reverberation scattering are generated, thereby increasing the amplitude of the sound wave. The backscattered sound field caused by the blockage is as below [
18]:
The scattered sound field created by the tube wall is [
18]:
During the propagation of an acoustic wave in the pipe, as the propagation distance increases, the energy is attenuated because the sound waves are scattered and absorbed. If there is a leak in the siphon, the acoustic energy dissipation increases. When congestion occurs in the siphon, the energy is absorbed and the amplitude of the wave decreases, resulting in the fluctuation of siphon energy. As blockage or leakage occurs in the siphon, the sound energy changes, which is reflected as the changes in the sound energy flow density per unit time through the single-plane area perpendicular to the direction of energy propagation.
In an ideal medium, the sound energy per unit volume is the sound energy density, and its expression is as follows:
where
is the propagation velocity of the sound wave in the medium,
is the density of the medium,
is the sound pressure, and
is the particle vibration velocity.
In the present study, the acoustic energy flow density is calculated by using the root-mean-square method. The sound wave is a circularly symmetric beam in the pipe. The acoustic energy flow density is defined as follows:
where
is the characteristic acoustic impedance of water.
is the instantaneous sound pressure at the time, and
is the positive time integral limit.
is the time between the first time when the sound pressure of the emitted sound wave exceeds 10% of the maximum value and the first time when it falls to 10% of the maximum value.
is the voltage generated by the hydrophone at time
,
is the root mean square of the voltage in the time period
,
, and
M is the sensitivity of the hydrophone. Equation (6) is obtained after discretization:
3. Experiments
A 450-mm diameter concrete siphon constructed in the Hydraulics Laboratory at the University of Bradford was used to carry out the experiments [
16]. The acoustic signal acquisition system consists of hydrophones, a loudspeaker, a power amplifier, and a computer running WinMLS software. As shown in
Figure 2a, a siphon with the length of 420 mm, height of 200 mm high, and diameter of 450 mm was installed on a layer of fine sand 500-mm thick in an open-top box made of 12 mm plywood. In all experiments, the siphon was fully surrounded by dry sand and was filled with water to simulate real live sewer conditions, the water level of 900 mm below the top rims of the vertical parts (
Figure 2b).
Four hydrophones were used for acoustic detection. As shown in
Figure 1, hydrophone 1 was driven by a 100–6000 Hz sinusoidal sweep, which was generated by a computer running WinMLS software to control the sound card. Unwanted low-frequency machinery noise was removed from the signals received from the hydrophones H2–H4 by using three 8-channel high-pass hydrophone filters. A B&K Type 2610 amplifier and a dual variable Kemo VBF 10M filter were used to process and filter the signal received on the reference hydrophone in the 100–4000 Hz range. In addition, the underwater speaker was driven by a B&K Type 2708 power amplifier. The quality of the signal generated by the underwater speaker was subjectively controlled by a Rotel Type RA-9708 X Stereo amplifier and headphones.
The operator used a computer running WinMLS software to control the sound card to generate 10 s of a sine sweep signal in the frequency range of 50–6000 Hz. The acoustic signals propagating in the siphon were received by hydrophones (type SQ31) installed on the left side of the siphon. A K50WP speaker was installed on the right side of the siphon to test the effectiveness of the acoustic signal generated by Hydrophone 1. The sampling frequency was 22,050 Hz. WinMLS software was used to de-convolve the original acoustic signals received by hydrophones H1–H3 to obtain the acoustic pressure response. The original acoustic signals received on hydrophones H1–H3 were de-convolved with WinMLS software to obtain the acoustic pressure response.
This experiment was carried out in order to investigate the influence of blockage, leakage and joint unsealed in the siphon. Several possible conditions were designed, as shown in
Figure 3a. Ten 5 kg acoustically transparent bags were prepared and filled with fine sand to simulate the blockage in the siphon. The maximum cross-sectional area of a sandbag corresponded to approximately 20% of the siphon’s cross-section. Several bags were tied to a 9-m long rope with a distance of 300 mm. The number of bags deposited in the siphon varied from 1 to 10.
Before additional measurements were made, the acoustic sensors were removed and re-installed when the bags were deposited or removed from the siphon. Subsequently, artificial cuts were made at top of the horizontal section of the siphon to simulate leaks. The siphon remained exposed and was surrounded by water up to the level matching the reference level inside the siphon after the damages were done. Measurements were taken in seven conditions: (1) clean siphon; (2) siphon with a controlled amount of blockage (
Figure 3a); (3) siphon with a 50 mm longitudinal leak (
Figure 3b); (4) siphon with a 100 mm longitudinal leak; (5) siphon with 200 mm longitudinal and 150-mm transversal leaks; (6) siphon with a 200mm longitudinal leak and a 120 mm × 70 mm hole (
Figure 3c); and (7) joint unsealed (
Figure 3d).
4. Proposed Feature Extraction Method
There are several intuitive methods to identify the siphon conditions from the collected sound pressure signals, i.e., (1) waveform distortion, (2) resonance peak in the frequency domain, and (3) waveform fluctuation characteristics. Due to the differences in siphon conditions, the sound pressure signal exhibits different fluctuations in amplitude, phase, intensity, extension width, azimuth, phase, etc. However, during the production, processing, and transmission of acoustic signals, the signal is affected by different types of noise. Therefore, to obtain the characteristic information of blockage or leakage in siphons from the acoustic signal, it is necessary to decompose a complex acoustic signal into several regular simple modes, which can be easily analyzed in the time and frequency domains. The purpose of feature extraction is to further classify the conditions.
Pattern recognition can extract feature information from a limited inspection record, allowing for the prediction of a siphon condition that has not been inspected [
17]. Regardless of the method used for signal processing, it is necessary to accurately identify the frequency band containing siphon conditions in the acoustic signal. To date, feature extraction in signal processing has been conducted in the time domain, frequency domain, and time-frequency domain. Time-frequency domain methods which can locate both frequency and time have been extensively used, such as local mean decomposition (LMD), integrated empirical mode decomposition (EEMD), empirical mode decomposition (EMD), empirical wavelet transform (EWT), wavelet packet transform (WPT), wavelet transform (WT), and short-time Fourier transform (STFT) and [
19,
20]. Dragomiretskiy et al. proposed an adaptive decomposition method called variational mode decomposition (VMD) in 2014 [
21]. This method has a solid foundation of mathematical theory. Compared with adaptive methods such as EMD and LMD, the VMD has fast convergence and good noise robustness. A Lagrange multiplication operator, a quadratic penalty factor, and a constrained variational model expression are used in the decomposition method. By transforming the binding variational problem into a non-binding variational problem, the mathematical significance is obtained, and the modal aliasing of the band-limited intrinsic mode function is minimized. However, the decomposition result of the VMD is affected by the number of modes
k and the quadratic penalty factors
α.
VMD and acoustic energy flow density are applied to assess and classify water-filled underground siphon conditions. The steps are as follows:
4.1. Step 1: Acoustic Noise Reduction
Due to the non-linearity and non-stationarity of acoustic signals, it is difficult to identify useful signals and denoise the data using a traditional Fourier transform. As an improvement of the Fourier transform, the WPT (wavelet packet transform) provides a more precise signal analysis method, which is more suitable for time-frequency analysis. Three-layer wavelet packet decomposition was used to decompose the acoustic signal. The appropriate wavelet packet frequency band was selected to reconstruct the signal by referring to the background noise signals of different siphon conditions, thus removing noise from acoustic signals.
The wavelet packet transform process is shown in Step 1 in
Figure 4:
The layer number of decomposing the original signal is decomposed using an appropriate wavelet basis function;
An appropriate threshold value is selected to process the wavelet coefficients of each layer, and the processed wavelet coefficients are determined to obtain the denoised signal.
In this study, the sampling frequency of the acoustic pressure response is 22050 Hz, and the decomposition scale of the wavelet packet is 3. The wavelet packet is decomposed into eight signals in different frequency bands, including [0–2756], [2756–5512], [5512–8268], [8268–11024], [11024–13780], [13780–16536], [16536–19292], and [19292–22050]. The process of wavelet packet transform is shown in Step 1 of
Figure 4, where
is the original signal, and
is the wavelet steamed stuffed bun space (
is the number of decomposed layers,
is the subspace sequence of the first layer). The high-frequency part can be further decomposed through wavelet packet decomposition, and the original signal is decomposed into
wavelet steamed stuffed bun space (
is the wavelet packet decomposition scale), forming a complete binary tree structure.
4.2. Step 2: Sound Pressure Level
At high amplitude component, the SPL [
22] increases the lower amplitude component of acoustic signals, so that the characteristic signal hidden in the low-amplitude noise can be observed. In the present study, this method was used to amplify the content of the acoustic signal, thereby increasing the differences between different blockage degrees and facilitating the extraction of different siphon conditions in the subsequent decomposition. The calculation equation for the SPL is as follows:
where
is the practical value of the sound pressure of the original sound wave signal, and
is the practical value of the reference sound pressure. In this paper,
. The SPL transform process is shown in Step 2 in
Figure 4.
4.3. Step3: VMD Decomposition
VMD has been used to extract features from non-stationary signals. The number of modes (k) and the quadratic penalty factors () should be selected empirically. One objective of this study is to select k according to the mutual information and the root mean square error. Mutual information, which is defined as the difference between two random variables in uncertainty, is used to measure the statistical correlation of two random variables. The advantage of mutual information is that it describes both the linear correlation and nonlinear correlation between two variables.
The steps of selecting VMD parameters are as follows:
The preset scale K1 is initially determined by observing the original signal of EMD (Empirical Mode Decomposition) decomposition and obtaining the maximum value for the first time. In this paper, the maximum value of K is 15.
The scope and step length of k and are determined. In this study, the number of model k is an integer ranging from 2 to 15, and .
First, the acoustic signal is decomposed using VMD for each group of k values. Second, the signal is decomposed into k BIMF components. Third, the mutual information between every BIMF component and the original signal is calculated. Finally, the mutual information is normalized.
According to the value of mutual information, the BIMF components are sequenced, and the screening coefficient is calculated according to the screening principle of BIMF components. The BIMF component larger than the screening factor is selected for signal reconstruction, and the BIMF component larger than the screening factor is the effective BIMF component. The mutual information screening coefficient is calculated as follows:
where max
mutual information is the maximum mutual information value between all BIMF components. The reconstruction signal is reconstructed by summing the BIMF components with a mutual information value greater than
I.
The root mean square error of the reconstructed signal and the original signal is calculated for different k values. The minimum root mean square error is determined, and k is used as the optimum result. The optimal value of the VMD decomposition parameter k is obtained based on the minimum root mean square error, and the effective BIMF components are obtained according to the mutual information and the mutual information screening coefficient. The SPL is used to transform the decomposed BIMF components.
4.4. Step 4: Acoustic Energy Flow Density
The acoustic energy flow density was extracted as eigenvalue vectors from the signal by the SPL.
4.5. Step 5: Classification and Recognition
The acoustic energy flux density eigenvectors were input to three different classifiers (SVM, KNN, and RF) to classify the siphon conditions.
6. Classification Results and Discussion
In order to verify the suitability of the VMD and acoustic energy flux density for identifying the siphon condition, three classification methods were presented, namely SVM, KNN, and RF. The experimental results listed in
Table 1 were validated by 10-fold cross-validation of the average partition. The average accuracy of the 10-fold cross-validation was calculated. In the unfiltered VMD, all features representing the 8 BIMF components were input into the classifier, and the components BIMF1, BIMF2, BIMF3, BIMF5, and BIMF8 were filtered by the mutual information.
According to the experimental results, the model based on mutual information screening of the VMD components has a recognition accuracy of 96.5%, and the lowest accuracy is 94.4%. The unscreened model has the highest recognition accuracy of 87.8%. For the same classifier, the model results of the screened BIMF components based on mutual information are better than those of the unscreened BIMF components. The classification results show that the model which screens the BIMF component based on mutual information can accurately identify the siphon conditions. Therefore, our classification results indicate that the use of acoustic features and a suitable feature selection method result in better classification performance.
As shown in
Figure 12, in the conditions of clean siphon and joint unsealed, the classification accuracy and the recognition accuracy are 100%. The classification accuracy in the conditions of the 200 mm longitudinal, 150 mm transversal, and 200 mm longitudinal leaks and the 120 mm × 70 mm hole is the lowest. Due to the complexity of the siphon topology, the testing environment, the siphon environment, the blockage form, and the diversity of acoustic propagation characteristics, further research is required if multiple siphon leakages are present. Future research should consider the transmission characteristics of sound waves in different siphons and the attenuation behaviors of the acoustic wave propagation model using a blockage probability prediction model. Moreover, it is necessary to investigate data mining and to explore multiple closings of siphons and data pre-processing, data filtering, data selection, and other methods. In the acoustic data set in this study, important characteristics of the leakage degree and the conditions were extracted effectively and accurately.