Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A mixture of shallow neural networks for virtual sensing: : Could perform better than deep neural networks

Published: 05 December 2024 Publication History

Abstract

Owing to outstanding representation abilities, deep neural networks (DNNs) have recently been extensively studied and attracted increasing attention in virtual sensing of key industrial variables. Although various learning algorithms have been developed to train the DNNs, the complexities of industrial data, such as scarcity of labeled samples, inevitable measurement noise, and multi-modality, may prevent the DNNs from achieving the best performance. This paper aims to deal with the limitations of the DNNs in an alternative way, by fully exploiting the potentials of shallow NNs. Specifically, a Bayesian model structure of a semi-supervised mixture of multiple single-hidden layer NNs (BSsMSLNN), each of which serves as an expert for a local region, is first proposed. Then, a variational inference- and gradient ascent-based training algorithm is developed, which allows parameter learning and mode identification to be completed in a unified framework. The performance of the BSsMSLNN is evaluated using both synthetic and real industrial cases for virtual sensing, demonstrating the effectiveness and superiority of the proposed schemes.

Highlights

A novel model BSsMSLNN is proposed, exploiting full potentials of shallow networks.
A unified training framework based on VI and SGD is developed for the BSsMSLNN.
Thorough performance evaluations show the superiorities of the BSsMSLNN over DNNs.

References

[1]
Bishop C., Pattern recognition and machine learning, 1st ed., Springer-Verlag, New York, USA, 2006.
[2]
Chang P., Li Z., Over-complete deep recurrent neutral network based on wastewater treatment process soft sensor application, Applied Soft Computing 105 (2021) Paper ID: 107227.
[3]
Gao S., Li X., Zhang Y., et al., A soft-sensor model of VCM rectification concentration based on an improved WOA-RBFNN, Measurement Science & Technology 32 (8) (2021) Paper ID: 085104.
[4]
Geng Z., Shi C., Han Y., Intelligent small sample defect detection of water walls in power plants using novel deep learning integrating deep convolutional GAN, IEEE Transactions on Industrial Informatics 19 (6) (2023) 7489–7497.
[5]
Guo R., Liu H., Semisupervised dynamic soft sensor based on complementary ensemble empirical mode decomposition and deep learning, Measurement 183 (2021) Paper ID: 109788.
[6]
Herceg S., Andrijic Z.U., Bolf N., Support vector machine-based soft sensors in the isomerisation process, Chemical and Biochemical Engineering Quarterly 34 (4) (2021) 243–255.
[7]
Hu B., Zhang H., Wang X., et al., DHESN: A deep hierarchical echo state network approach for algal bloom prediction, Expert Systems with Applications 239 (2024) Paper ID: 122329.
[8]
Jiang Q., Wang Z., Yan S., et al., Data-driven soft sensing for batch processes using neural network-based deep quality-relevant representation learning, IEEE Transactions on Artificial Intelligence 4 (4) (2022) 602–611.
[9]
Jiang Y., Yin S., Dong J., et al., A review on soft sensors for monitoring, control, and optimization of industrial processes, IEEE Sensors Journal 21 (11) (2021) 12868–12881.
[10]
Keskar N.S., Mudigere D., Nocedal J., et al., On large-batch training for deep learning: Generalization gap and sharp minima, 2016, arXiv preprint arXiv:1609.04836.
[11]
Lei Y., Karimi H., Chen X., A novel self-supervised deep LSTM network for industrial temperature prediction in aluminum processes application, Neurocomputing 502 (2022) 177–185.
[12]
Lemos T., Campos L.F., Melo A., et al., Echo state network based soft sensor for monitoring and fault detection of industrial processes, Computers & Chemical Engineering 155 (2021) Paper ID: 107512.
[13]
Li, H., Zhang, H., & Wang, Y. (2010). Study on soft sensor for lysine fermentation based on BP neural network. In 8th world congress on intelligent control and automation (pp. 4120–4124). Jinan, China: Jul. 7–Jul.9.
[14]
Liu K., Shao W., Chen G., Autoencoder-based nonlinear Bayesian locally weighted regression for soft sensor development, ISA Transactions 103 (2020) 143–155.
[15]
Ma L., Wang M., Peng K., A missing manufacturing process data imputation framework for nonlinear dynamic soft sensor modeling and its application, Expert Systems with Applications 237 (2024) Paper ID: 121428.
[16]
Ou C., Zhu H., Shardt Y., et al., Quality-driven regularization for deep learning networks and its application to industrial soft sensors, IEEE Transactions on Neural Networks and Learning Systems (2022) 1–11,.
[17]
Panja P., Jia W., McPherson B., Prediction of well performance in SACROC field using stacked Long Short-Term Memory (LSTM) network, Expert Systems with Applications (2022) Paper ID: 117670.
[18]
Pradeep T., Bardhan A., Burman A., et al., Rock strain prediction using deep neural network and hybrid models of ANFIS and meta-heuristic optimization algorithms, Infrastructures 6 (2021) Paper ID: 129.
[19]
Qiu K., Wang J., Zhou X., et al., Soft sensor based on localized semi-supervised relevance vector machine for penicillin fermentation process with asymmetric data, Measurement 202 (2022) Paper ID: 111823.
[20]
Shao W., Ge Z., Song Z., Semisupervised Bayesian Gaussian mixture models for non-Gaussian soft sensor, IEEE Transactions on Cybernetics 51 (7) (2021) 3455–3468.
[21]
Shao W., Ge Z., Song Z., et al., Semisupervised robust modeling of multimode industrial processes for quality variable prediction based on Student’s t mixture model, IEEE Transactions on Industrial Informatics 16 (5) (2020) 2965–2976.
[22]
Shao. L. Yao W., Ge Z., et al., Parallel computing and SGD-based DPMM for soft sensor development with large-scale semisupervised data, IEEE Transactions on Industrial Electronics 66 (8) (2019) 6362–6373.
[23]
Shi X., Kang Q., An J., et al., Novel L1 regularized extreme learning machine for soft-sensing of an industrial process, IEEE Transactions on Industrial Informatics 18 (2) (2022) 1009–1017.
[24]
Sun Q., Ge Z., A survey on deep learning for data-driven soft sensors, IEEE Transactions on Industrial Informatics 17 (9) (2021) 5853–5866.
[25]
Sun K., Sui L., Wang H., et al., Design of an adaptive nonnegative garrote algorithm for multi-layer perceptron-based soft sensor, IEEE Sensors Journal 21 (19) (2021) 21808–21816.
[26]
Wang J., Wang Y., Yao Y., et al., Stacked autoencoder for operation prediction of coke dry quenching process, Control Engineering Practice 88 (2019) 110–118.
[27]
Wang J., Yao L., Xiong W., Novel semi-supervised deep probabilistic slow feature extraction for online chemical process soft sensing application, IEEE Transactions on Instrumentation and Measurement 73 (2024) Paper ID: 2517711.
[28]
Xie R., Hao K., Huang B., et al., Data-driven modeling based on two-stream lambda gated recurrent unit network with soft sensor application, IEEE Transactions on Industrial Electronics 67 (8) (2020) 7034–7043.
[29]
Xie R., Jan N., Hao K., et al., Supervised variational autoencoders for soft sensor modeling with missing data, IEEE Transactions on Industrial Informatics 16 (4) (2020) 2820–2828.
[30]
Yan W., Xu R., Wang K., et al., Soft sensor modeling method based on semisupervised deep learning and its application to wastewater treatment plant, Industrial & Engineering Chemistry Research 59 (10) (2020) 4589–4601.
[31]
Yuan X., Qi S., Soft sensor model for dynamic processes based on multichannel convolutional neural network, Chemometrics and Intelligent Laboratory Systems 203 (2020) Paper ID: 104050.
[32]
Zhou J., Wang X., Yang C., et al., A novel soft sensor modeling approach based on difference-LSTM for complex industrial process, IEEE Transactions on Industrial Informatics 18 (5) (2022) 2955–2964.
[33]
Zhuang, Y., Zhou, Z., Alakent, B., et al. (2023). Semi-supervised Variational Autoencoders for Regression: Application to Soft Sensors. In 2023 IEEE 21st international conference on industrial informatics INDIN, (pp. 1–8). Lemgo, Jermany: Jul. 17–Jul. 20.

Index Terms

  1. A mixture of shallow neural networks for virtual sensing: Could perform better than deep neural networks
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image Expert Systems with Applications: An International Journal
        Expert Systems with Applications: An International Journal  Volume 256, Issue C
        Dec 2024
        1582 pages

        Publisher

        Pergamon Press, Inc.

        United States

        Publication History

        Published: 05 December 2024

        Author Tags

        1. Virtual sensing
        2. Deep neural networks
        3. Semi-supervised learning
        4. Mixture model
        5. Error backpropagation

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 0
          Total Downloads
        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 12 Feb 2025

        Other Metrics

        Citations

        View Options

        View options

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media