A Review of Machine Learning for Near-Infrared Spectroscopy
Abstract
:1. Introduction
2. Machine Learning-Based NIR Spectroscopy Analysis System
3. Public Datasets
4. Data Preprocessing
4.1. Mean Centering and Standard Normal Variate (SNV)
4.2. Multiplicative Scatter Correction (MSC)
4.3. Extended Multiplicative Scatter Correction (EMSC)
4.4. Inverse Scatter Correction (ISC)
Savitzky–Golay
4.5. Discussion
5. Feature Selection
5.1. Particle Swarm Optimization (PSO) and Binary Particle Swarm Optimization (BPSO)
5.2. Genetic Algorithms (GAs)
5.3. Covariance Selection
5.4. Variable Combination Population Analysis (VCPA)
5.5. Variable Iterative Space Shrinkage Approach (VISSA)
5.6. Bootstrapping Soft Shrinkage (BOSS)
5.7. Iteratively Retaining Informative Variables (IRIV)
5.8. Competitive Adaptive Reweighted Sampling (CARS)
5.9. Successive Projection Algorithm (SPA)
5.10. Uninformative Variable Elimination (UVE)
5.11. Monte Carlo Uninformative Variable Elimination (MCUVE)
5.12. Randomization Test (RT)
5.13. Variable Importance in the Projection (VIP)
5.14. Jackknife Procedure
5.15. Minimal Redundancy Maximal Relevance (mRMR)
5.16. Correlation-Based Feature Selection (CFS)
5.17. LASSO and Elastic Net
5.18. Discussion
6. Traditional Machine Learning Methods for NIR
6.1. Partial Least Square (PLS)
6.2. Extreme Learning Machine (ELM)
Ref. Publish Date | NIR Task | Models | Merits | Limitation |
---|---|---|---|---|
Apr. 2007, [58] | Wine | Principal component analysis (PCA)+partial least squares (PLS) | Can effectively calibrate. | The creation of NIR calibrations for wine compositional parameters was not the aim of this study. |
May 2018, [73] | Olive oils | Partial least squares regression (PLSR) | The major and minor components of olive oils can be simply, quickly, and simultaneously quantified. | The performance of individual sterol form-prediction models was subpar. |
Jun. 2017, [74] | -tocopherol and total tocopherol contents | PLS and discriminant analysis (PLS-DA) | Quick and practical techniques used in the industry for sorting olive oils. | The number of samples were limited. |
Mar. 2021, [75] | Moisture, protein, and fat in meat | Orthogonalization (SPORT)/orthogonalization (PORTO)+PLSR | Reduced the error and bias by up to 52% and 84%, respectively. | A combination of data from various scatter-correction techniques was required. |
Apr. 2019, [39] | Rice-grain moisture | PLS+competitive adaptive reweighted squares (CARS) | Rapid determination of rice-grain moisture. | The results of stability and transitivity verification experiments for models were not provided. |
Dec. 2019, [76] | Rice flour types | PLS-DA+support vector machines (SVM) | High level of accuracy. | The robustness of the model needs to be verified further. |
May 2020, [77] | Multiple adulterations of flaxseed oil. | Orthogonal partial least squares–one-class partial least squares (OPLS-OCPLS) | Can effectively detect single, dual, or multiple adulterants with high accuracy; can rapidly detect multivariate adulteration of known targets. | The types of actual adulterated flaxseed oils were insufficient and the recognition accuracy of 95.8% still needs to be improved. |
Aug. 2020, [78] | Milk powder | Multivariate curve resolution–alternating least squares (MCR-ALS) | Can correctly identify. | Inadequate milk samples were tainted with melamine and sucrose. |
Dec. 2021, [79] | Hemoglobin concentration of blood | Monte Carlo+least absolute shrinkage and selection operator+extreme learning machine (MC-LASSO-ELM) | Better stability and the highest accuracy. | The model operation procedure was complicated and the MC results for a subset of samples had a direct impact on the results of the complete model. |
Sep. 2017, [80] | Osteoarthritis | PCA+SVM+PLS | Demonstrated the capacity of NIR spectroscopy to monitor changes in the articular cartilage matrix. | The ability to assess the capacity of NIR spectroscopy to estimate collagen-related information was not provided. |
Mar. 2020, [81] | Soil organic matter (SOM) | Savitzky–Golay (SG)+standard normal variate (SNV)+first derivative (FD)+PLSR | Rapid test; a simple and nondestructive analytical method. | The preprocessing procedure was complicated; preliminary experimental results. |
Mar. 2017, [82] | Coffee | Genetic algorithm+SVM | A fast and effective method without the production of chemical wastes. | Sample selection was required. |
Apr. 2022, [83] | Sulfur hexafluoride | GA-ELM | Higher prediction accuracy; operating efficiency; better stability; generalization performance. | It was challenging to effectively extract features using the GA algorithm. |
Oct. 2015, [84] | Seed oil | MLR+SVR+ANN | Fast, simple, and lower prediction error. | Better wavelength selection methods for use as input signals should be addressed. |
Jun. 2017, [33] | Acid value in peanut oil | GA-Si-PLS | Simultaneous and rapid measurement of acid value in peanut oil. | All of the algorithms compared were simple PLS-based algorithms. |
Sep. 2017, [85] | The rancidity of perilla oil | ANN multivariate analysis methods | ANN models produced the best prediction results. | Only PCR and PLSR were used to compare the experimental results and the model’s parameters can be further optimized. |
Sep. 2017, [86] | Oil, phenols, glucosinolates, and fatty acid content in the intact seeds of oilseed Brassica species | Modified partial least squares (MLPS) | Higher prediction accuracy. | The NIRS-based equation should be improved further by including samples from various environments with an even greater range of values. |
Nov. 2017, [87] | Copaiba oils | PLSR | Fast; no sample preparation was required; reliability. | More algorithms must be compared to demonstrate the superiority of PLSR. |
Jun. 2019, [36] | Olive Oil | BOSS-PLS | Rapid quantitative analysis. | The experimental samples were not diverse enough. |
Nov. 2019, [6] | Sugar content estimation of citrus | Stepwise multiple linear regression (SMLR) | Higher detection efficiency. | Online citrus experiment detection was required. |
Aug. 2018, [14] | Chicken meat | SVM; Decision trees | Avoided complex configurations or need for expertise in a particular technique. | Accuracy could be further improved. |
Apr. 2016, [88] | Sesame seeds | Multi-elemental discriminant analysis | Classification accuracies of more than 90%. | Further seed sample analysis was necessary in order to increase the discrimination accuracy. |
6.3. Support Vector Machine (SVM) and Support Vector Regression (SVR)
6.4. Single-Layer Feed-Forward Network (SLFN)
6.5. Decision Tree (DT) and Random Forest (RF)
6.6. Discussion
7. Deep Architectures for NIR
7.1. Stacked Autoencoder (SAE) and Variational Autoencoder (VAE)
Ref. Publish Date | NIR Task | Models | Merits | Limitation |
---|---|---|---|---|
Dec. 2022, [125] | Bright-blue pigment in cream | Autoencoder–deep learning (AE-DN) | Lower calculation costs, high accuracy, and faster speed with samples undamaged. | Redundant signal preprocessing steps. |
Sep. 2019, [126] | Aristolochic acids (AAS) | 1D CNN | Without feature extraction, could effectively, nondestructively, and rapidly identify. | The experimental data sample was limited and no comparisons to other deep learning methods were made. |
Jun. 2020, [127] | Drugs | CNN-based transfer learning | Higher classification accuracy with fewer training data. | Validation was performed with small experimental datasets; does not compare with state-of-the-art transfer learning models. |
Aug. 2020, [128] | Salmon, tuna, and beef delicacies | CNN-based machine learning | With a shift-invariant feature, the variation caused by the use of multiple devices in a real-world setting can be minimized. | The types of freshness recognition must be expanded, real-world scene applicability must be improved, and recognition accuracy can still be improved. |
Sep. 2021, [129] | Fresh fruit | Multi-output 1-dimensional convolutional neural network | Lower RMSE; easily adaptable to multi-response modeling by altering the output of the fully connected layers. | The use of transfer learning to process, update, and transfer a single model to integrate multiple responses was not discussed. |
Jun. 2019, [130] | Soil | Convolutional neural network | Multitask learning ability; multidimensional input utilization; higher performance; interpretability of the important wavelength variables used to predict soil properties through sensitivity analysis. | Data hungry; many hyperparameters; requires more advanced computing hardware. |
Nov. 2021, [131] | Tea | Standard normal variate (SNV)+TeaNet; SNV+TeaResNet; SNV+TeaMobilenet | A quick, non-intrusive, and environmentally friendly solution with 100% accuracy. | Various NIR data types necessitated the selection of the best data preprocessing method. |
May 2021, [132] | Dried mango | Chemometric approaches + DL models | Improved the predictive performance of DL models; achieved the lowest RMSEP. | The use of large datasets is required. |
Dec. 2020, [13] | Soil total nitrogen (STN) content | Three different structured CNN models+inception | Good performance and robust generalization. | A sufficient number of the same types of soil samples with similar physical structures were required; the experimental results of the algorithm were heavily influenced by preprocessing such as feature selection. |
Oct. 2020, [133] | Coal | Improved coyote optimization algorithm (I-COA) +local receptive field-based–extreme learning machine (LRF–ELM) | Improved the economy, speed, and accuracy while more effectively extending the spectral properties of coal. | Long training time. |
Dec. 2020, [134] | Soil | A joint convolutional neural network and recurrent neural network architecture (CCNVR) | A significant improvement in prediction accuracy and a better ability to migrate. | With fewer training samples, the model’s robustness and accuracy will significantly decrease. |
Jun. 2021, [135] | Salt content in saline-alkali soil (SAS) | Convolutional neural network–gravitational reservoir-computing–extreme learning machine (CNN-GRC-ELM). | A fast, low-cost, and accurate method. | Does not compare to other state-of-the art deep learning models; the experimental sample data were insufficient. |
Jun. 2022, [136] | Polyethylene | -variational autoencoder (-VAE) | Improved ability to analyze spectroscopic data from complex heterogeneous systems. | More algorithms needed to be compared than with the PCA algorithm. |
Oct. 2020, [137] | Water pollution | An improved convolutional neural network (CNN)+decision tree | Improved NIR prediction accuracy; rapid determination. | Only preliminary experimental results; the decision tree’s parameters had an impact on the model’s performance. |
Jan. 2020, [138] | Cereal | Stacked sparse autoencoder (SSAE)+affine transformation (AT)+extreme learning machine (ELM) | A quick, effective, and economical method for analyzing cereal characteristics with good prediction results. | The test samples were insufficient in terms of quantity and variety. |
May. 2020, [139] | Cells | Mie extinction–extended multiplicative signal correction (ME-EMSC) | In terms of the speed, robustness, and noise levels, the DSAE performed better than Mie extinction–extended multiplicative signal correction (ME-EMSC). | In the experimental preliminary results, a sizable number of additional experimental samples were required. |
Nov. 2020, [140] | Soil | CNN | When the number of calibration samples exceeded 2000, the CNN was more accurate than the machine learning models. | Larger datasets should be explored to test the generalization of the accuracy vs. sample size and explore whether the deep learning CNN model ever reaches a plateau in accuracy. |
Nov. 2020, [141] | Physical distortions | Extended multiplicative signal augmentation (EMSA)+SpectraVGG | The convergence occurred much more quickly and the results were better. | The final model results were strongly influenced by the methods used for data augmentation and signal preprocessing. |
7.2. Convolutional Neural Network (CNN)
7.3. Recurrent Neural Networks (RNNs) and Attention
7.4. Deep Extreme Learning Machine Architectures
7.5. Generative Adversarial Networks (GAN)
7.6. Discussion
8. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Smith, B.C. The benzene fingers, part I: Overtone and combination bands. Spectroscopy 2016, 31, 30–34. [Google Scholar]
- Ozaki, Y. Infrared Spectroscopy—Mid-infrared, Near-infrared, and Far-infrared/Terahertz Spectroscopy. Anal. Sci. 2021, 37, 1193–1212. [Google Scholar] [CrossRef] [PubMed]
- Norris, K.H.; Hart, J.R. 4. Direct Spectrophotometric Determination of Moisture Content of Grain and Seeds. J. Near Infrared Spectrosc. 1996, 4, 23–30. [Google Scholar] [CrossRef]
- Ben-Gera, I.; Norris, K.H. Direct spectrophotometric determination of fat and moisture in meat products. J. Food Sci. 1968, 33, 64–67. [Google Scholar] [CrossRef]
- Skvaril, J.; Kyprianidis, K.G.; Dahlquist, E. Applications of near infrared spectroscopy (NIRS) in biomass energy conversion processes: A review. Appl. Spectrosc. Rev. 2017, 52, 675–728. [Google Scholar] [CrossRef]
- Fu, X.P.; Wang, X.Y.; Rao, X.Q. An LED-based spectrally tuneable light source for visible and near-infrared spectroscopy analysis: A case study for sugar content estimation of citrus. Biosyst. Eng. 2017, 163, 87–93. [Google Scholar] [CrossRef]
- Xie, L.J.; Wang, A.C.; Xu, H.R.; Fu, X.P.; Ying, Y.B. Applications of near-infrared systems for quality evaluation of fruits: A review. Trans. ASABE 2016, 59, 399–419. [Google Scholar]
- Pasquini, C. Near infrared spectroscopy: Fundamentals, practical aspects and analytical applications. J. Braz. Chem. Soc. 2003, 14, 198–219. [Google Scholar] [CrossRef] [Green Version]
- Haghi, R.K.; Yang, J.; Tohidi, B. Fourier Transform Near-Infrared (FTNIR) Spectroscopy and Partial Least-Squares (PLS) Algorithm for Monitoring Compositional Changes in Hydrocarbon Gases under In Situ Pressure. Energy Fuels 2017, 31, 198–219. [Google Scholar] [CrossRef]
- Rinnan, R.; Rinnan, Å. Application of near infrared reflectance (NIR) and fluorescence spectroscopy to analysis of microbiological and chemical properties of arctic soil. Soil Biol. Biochem. 2007, 39, 1664–1673. [Google Scholar] [CrossRef]
- Nikzad-Langerodi, R.; Zellinger, W.; Lughofer, E.; Saminger-Platz, S. Domain-invariant partial-least-squares regression. Anal. Chem. 2018, 90, 6693–6701. [Google Scholar] [CrossRef]
- Sandak, J.; Sandak, A.; Zitek, A.; Hintestoisser, B.; Picchi, G. Development of low-cost portable spectrometers for detection of wood defects. Sensors 2020, 20, 545. [Google Scholar] [CrossRef] [PubMed]
- Wang, Y.T.; Li, M.Z.; Ji, R.H.; Wang, M.J.; Zheng, L.H. Comparison of soil total nitrogen content prediction models based on Vis-NIR spectroscopy. Sensors 2020, 20, 7078. [Google Scholar] [CrossRef]
- Barbon, S.; Costa Barbon, A.P.A.d.; Mantovani, R.G.; Barbin, D.F. Machine Learning Applied to Near-Infrared Spectra for Chicken Meat Classification. Asian J. Spectrosc. 2018, 2018, 8949741. [Google Scholar] [CrossRef]
- Nikzad-Langerodi, R.; Zellinger, W.; Saminger-Platz, S.; Moser, B.A. Domain adaptation for regression under Beer–Lambert’s law. Knowl.-Based Syst. 2020, 210, 106447. [Google Scholar] [CrossRef]
- Gogé, F.; Thuriès, L.; Fouad, Y.; Damay, N.; Davrieux, F. Dataset of chemical and near-infrared spectroscopy measurements of fresh and dried poultry and cattle manure. Data Brief 2021, 34, 106447. [Google Scholar] [CrossRef]
- Schoot, M.; Kapper, C. Investigating the need for preprocessing of near-infrared spectroscopic data as a function of sample size. Chemom. Intell. Lab. Syst. 2020, 204, 104105. [Google Scholar] [CrossRef]
- Guo, R.; Wang, J.S.; Jin, H.L.; Luo, L.; Yan, L.H.; Xie, A.G. Measurement of protein content in sesame by near-infrared spectroscopy technique. In Proceedings of the 2011 International Conference on New Technology of Agricultural, Zhengzhou, China, 22–25 October 2011. [Google Scholar]
- Geladi, P.; MacDougall, D.; Martens, H. Linearization and scatter-correction for near-infrared reflectance spectra of meat. Appl. Spectrosc. 1985, 39, 491–500. [Google Scholar] [CrossRef]
- Martens, H.; Stark, E. Extended multiplicative signal correction and spectral interference subtraction: New preprocessing methods for near infrared spectroscopy. J. Pharm. Biomed. Anal. 1991, 9, 625–635. [Google Scholar] [CrossRef]
- Helland, I.S.; Næs, T.; Isaksson, T. Related versions of the multiplicative scatter correction method for preprocessing spectroscopic data. Chemom. Intell. Lab. Syst. 1995, 29, 233–241. [Google Scholar] [CrossRef]
- Savitzky, A.; Golay, M.J.E. Smoothing and Differentiation of Data by Simplified Least Squares Procedures. Anal. Chem. 1964, 36, 1627–1639. [Google Scholar] [CrossRef]
- Rinnan, Å.; Van Den Berg, F.; Engelsen, S. Review of the most common pre-processing techniques for near-infrared spectra. Trac-Trends Anal. Chem. 2009, 28, 1201–1222. [Google Scholar] [CrossRef]
- Chen, H.Z.; Song, Q.Q.; Tang, G.Q.; Feng, Q.X.; Lin, L.L. The combined optimization of Savitzky-Golay smoothing and multiplicative scatter correction for FT-NIR PLS models. Anal. Chem. 2013, 2013, 642190. [Google Scholar] [CrossRef] [Green Version]
- Jiao, Y.P.; Li, Z.C.; Chen, X.S.; Fei, S.M. Preprocessing methods for near-infrared spectrum calibration. J. Chemom. 2020, 34, e3306. [Google Scholar] [CrossRef]
- Heil, K.; Schmidhalter, U. An Evaluation of Different NIR-Spectral Pre-Treatments to Derive the Soil Parameters C and N of a Humus-Clay-Rich Soil. Sensors 2021, 21, 1423. [Google Scholar] [CrossRef]
- Gu, X.Y.; Guo, J.C.; Xiao, L.J.; Li, C.Y. Conditional mutual information-based feature selection algorithm for maximal relevance minimal redundancy. Appl. Intell. 2022, 52, 1436–1447. [Google Scholar] [CrossRef]
- Bai, X.; Zhang, L.; Kang, C.; Quan, B. Near-infrared spectroscopy and machine learning-based technique to predict quality-related parameters in instant tea. Sci. Rep. 2022, 12, 3833. [Google Scholar] [CrossRef]
- Yang, M.H.; Xu, D.Y.; Chen, S.C.; Li, H.Y.; Shi, Z. Evaluation of machine learning approaches to predict soil organic matter and pH using Vis-NIR spectra. Sensors 2019, 19, 263. [Google Scholar] [CrossRef] [Green Version]
- Bossard, J.A.; Yun, S.; Werner, D.H.; Mayer, T.S. Synthesizing low loss negative index metamaterial stacks for the mid-infrared using genetic algorithms. Opt. Express 2009, 17, 14771–14779. [Google Scholar] [CrossRef]
- Silva, L.G.; Péres, A.F.S.; Freitas, D.L.D. ATR-FTIR spectroscopy in blood plasma combined with multivariate analysis to detect HIV infection in pregnant women. Sci. Rep. 2020, 10, 20156. [Google Scholar] [CrossRef]
- Bonah, E.; Huang, X.; Aheto, J.H.; Yi, S.; Tu, H. Comparison of variable selection algorithms on vis nir hyperspectral imaging spectra for quantitative monitoring and visualization of bacterial foodborne pathogens in fresh pork muscles. Infrared Phys. Technol. 2020, 107, 103327. [Google Scholar] [CrossRef]
- Yang, M.X.; Chen, Q.S.; Kutsanedzie, F.Y.H.; Yang, X.J.; Guo, Z.M.; Qin, O.Y. Portable spectroscopy system determination of acid value in peanut oil based on variables selection algorithms. Measurement 2017, 103, 179–185. [Google Scholar] [CrossRef]
- Mishra, P.; Woltering, E.J. Identifying key wavenumbers that improve prediction of amylose in rice samples utilizing advanced wavenumber selection techniques. Talanta 2021, 224, 121908. [Google Scholar] [CrossRef]
- Tian, Y.; Sun, J.; Zhou, X.; Wu, X.H.; Lu, B.; Dai, C.X. Research on apple origin classification based on variable iterative space shrinkage approach with stepwise regression-support vector machine algorithm and visible-near infrared hyperspectral imaging. J. Food Process Eng. 2020, 43, e13432. [Google Scholar] [CrossRef]
- Jiang, H.; Chen, Q.S. Determination of adulteration content in extra virgin olive oil using FT-NIR spectroscopy combined with the BOSS–PLS algorithm. Molecules 2019, 24, 2134. [Google Scholar] [CrossRef] [Green Version]
- Kong, W.W.; Liu, F.; Zhang, J.F.; Feng, H.L. Non-destructive determination of malondialdehyde (mda) distribution in oilseed rape leaves by laboratory scale nir hyperspectral imaging. Sci. Rep. 2016, 6, 35393. [Google Scholar] [CrossRef]
- Lin, L.; He, Y.; Xiao, Z.T.; Zhao, K.; Dong, T.; Nie, P.C. Rapid-Detection Sensor for Rice Grain Moisture Based on NIR Spectroscopy. Appl. Sci. 2019, 9, 1654. [Google Scholar] [CrossRef] [Green Version]
- Wang, H.Q.; Rehmetulla, A.G.; Guo, S.S.; Kong, X. Machine learning based on structural and FTIR spectroscopic datasets for seed autoclassification. RSC Adv. 2022, 12, 11413–11419. [Google Scholar] [CrossRef]
- Yang, L. Rapid Quality Discrimination of Grape Seed Oil Using an Extreme Machine Learning Approach with Near-Infrared (NIR) Spectroscopy. Spectroscopy 2021, 36, 14–20. [Google Scholar]
- Han, Q.J.; Wu, H.L.; Cai, C.B.; Xu, L.; Yu, R.Q. An ensemble of Monte Carlo uninformative variable elimination for wavelength selection. Anal. Chim. Acta 2008, 612, 121–125. [Google Scholar] [CrossRef]
- Sampaio, P.S.; Soares, A.; Castanho, A.; Almeida, A.S.; Oliverira, J.; Brites, C. Optimization of rice amylose determination by NIR-spectroscopy using PLS chemometrics algorithms. Food Chem. 2018, 242, 196–204. [Google Scholar] [CrossRef] [PubMed]
- Liu, Y.; Wang, Y.; Xia, Z.; Wang, Y.; Wu, Y.; Gong, Z. Rapid determination of phytosterols by NIRS and chemometric methods. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2019, 211, 336–341. [Google Scholar] [CrossRef] [PubMed]
- Borraz-Martínez, S.; Simó, J.; Gras, A.; Mestre, M.; Boqué, R. Multivariate classification of prunus dulcis varieties using leaves of nursery plants and near-infrared spectroscopy. Sci. Rep. 2019, 9, 19810. [Google Scholar] [CrossRef] [PubMed]
- Shukla, A.; Bhatt, H.; Pani, A.K. Variable selection and modeling from NIR spectra data: A case study of diesel quality prediction using LASSO and Regression Tree. In Proceedings of the 2nd International Conference on Data, Engineering and Applications (IDEA), Bhopal, India, 28–29 February 2020. [Google Scholar]
- Zhao, Z.; Wang, K.; Wang, S.; Xiang, Y.; Bian, X. Sense the Real Change: Proceedings of the 20th International Conference on Near Infrared Spectroscopy; ICNIR 2021; Springer: Singapore, 2022; pp. 291–300. [Google Scholar]
- Craig, A.P.; Wang, K.; Franca, A.S.; Oliveira, L.S.; Irudayaraj, J.; Ileleji, K. Application of elastic net and infrared spectroscopy in the discrimination between defective and non-defective roasted coffees. Talanta 2014, 128, 393–400. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Hu, L.Q.; Yin, C.L.; Ma, S.; Liu, Z.M. Vis-NIR spectroscopy combined with wavelengths selection by PSO optimization algorithm for simultaneous determination of four quality parameters and classification of soy sauce. Food Anal. Meth. 2019, 12, 633–643. [Google Scholar] [CrossRef]
- Nturambirwe, J.F.I.; Nieuwoudt, H.H.; Perold, W.J.; Opara, U.L. Non-destructive measurement of internal quality of apple fruit by a contactless NIR spectrometer with genetic algorithm model optimization. Sci. Afr. 2019, 3, e00051. [Google Scholar] [CrossRef]
- Hadoux, X.; Kant Kumar, D.; Sarossy, M.G.; Roger, J.M.; Gorretta, N. Application-Dedicated Selection of Filters (ADSF) using covariance maximization and orthogonal projection. Anal. Chim. Acta 2016, 921, 1–12. [Google Scholar] [CrossRef]
- Krepper, G.; Romeo, F. Determination of fat content in chicken hamburgers using NIR spectroscopy and the Successive Projections Algorithm for interval selection in PLS regression (iSPA-PLS). Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2018, 189, 300–306. [Google Scholar] [CrossRef]
- Ye, S.F.; Wang, D.; Min, S.G. Successive projections algorithm combined with uninformative variable elimination for spectral variable selection. Chemom. Intell. Lab. Syst. 2008, 91, 194–199. [Google Scholar] [CrossRef]
- Wang, L.S.; Wang, R.J. Determination of soil pH from Vis-NIR spectroscopy by extreme learning machine and variable selection: A case study in lime concretion black soil. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2022, 283, 121707. [Google Scholar] [CrossRef]
- Zhang, J.; Cui, X.Y.; Cai, W.S.; Shao, X.G. A variable importance criterion for variable selection in near-infrared spectral analysis. Sci. China-Chem. 2019, 62, 271–279. [Google Scholar] [CrossRef]
- Xu, H.; Liu, Z.C.; Cai, W.S.; Shao, X.G. A wavelength selection method based on randomization test for near-infrared spectral analysis. Chemom. Intell. Lab. Syst. 2009, 97, 189–193. [Google Scholar] [CrossRef]
- Dias, R.; Garcia, N.L.; Ludwig, G.; Saraiva, M.A. Aggregated functional data model for near-infrared spectroscopy calibration and prediction. J. Appl. Stat. 2015, 42, 127–143. [Google Scholar] [CrossRef]
- Jiang, Y.; Li, C.Y. mRMR-based feature selection for classification of cotton foreign matter using hyperspectral imaging. Comput. Electron. Agric. 2015, 119, 191–200. [Google Scholar] [CrossRef]
- Cozzolino, D.; Liu, L.; Cynkar, W.U.; Dambergs, R.G.; Janik, L.; Colby, C.B.; Gishen, M. Effect of temperature variation on the visible and near infrared spectra of wine and the consequences on the partial least square calibrations developed to measure chemical composition. Anal. Chim. Acta 2007, 588, 224–230. [Google Scholar] [CrossRef]
- Wold, H. Estimation of principal components and related models by iterative least squares. J. Multivar. Anal. 1966, 1, 391–420. [Google Scholar]
- Geladi, P.; Kowalski, B.R. Partial least-squares regression: A tutorial. Anal. Chim. Acta 1986, 185, 1–17. [Google Scholar] [CrossRef]
- Huang, G.B.; Siew, C.K. Extreme learning machine: RBF network case. In Proceedings of the ICARCV 2004 8th Control, Automation, Robotics and Vision Conference, Kunming, China, 6–9 December 2004. [Google Scholar]
- Huang, G.B.; Zhu, Q.Y.; Siew, C.K. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501. [Google Scholar] [CrossRef]
- Huang, G.B.; Zhu, H.M.; Ding, X.J.; Zhang, R. Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern. Part B Cybern. 2011, 42, 513–529. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Drucke, H.; Burges, C.J.; Kaufman, L.; Smola, A.; Vapnik, V. Support vector regression machines. Adv. Neural Inf. Process. Syst. 1997, 9, 155–161. [Google Scholar]
- Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
- Huang, G.B.; Chen, L.; Siew, C.K. Universal approximation using incremental constructive feedforward networks with random hidden nodes. IEEE Trans. Neural Netw. 2006, 17, 879–892. [Google Scholar] [CrossRef] [Green Version]
- Trygg, J.; Wold, S. Orthogonal projections to latent structures (O-PLS). J. Chemom. 2002, 16, 119–128. [Google Scholar] [CrossRef]
- Chun, H.; Keleş, S. Sparse partial least squares regression for simultaneous dimension reduction and variable selection. J. R. Stat. Soc. B 2010, 72, 3–25. [Google Scholar] [CrossRef] [Green Version]
- Rosipal, R.; Keleş, S. Overview and recent advances in partial least squares. In Proceedings of the Subspace, Latent Structure and Feature Selection, Statistical and Optimization, Perspectives Workshop (SLSFS 2005), Bohinj, Slovenia, 23–25 February 2005. [Google Scholar]
- Camacho, W.; Karlsson, S. Quantification of antioxidants in polyethylene by near infrared (NIR) analysis and partial least squares (PLS) regression. Int. J. Polym. Anal. Charact. 2002, 7, 41–51. [Google Scholar] [CrossRef]
- Huang, G.; Huang, G.B.; Song, S.J.; You, K.Y. Trends in extreme learning machines: A review. Neural Netw. 2015, 61, 32–48. [Google Scholar] [CrossRef]
- Li, R.D.; Huang, W.Y.; Shang, G.L.; Zhang, X.B.; Wang, X.; Liu, J.G.; Wang, Y.; Qiao, J.F.; Fan, X.; Wu, K. Rapid Recognizing the Producing Area of a Tobacco Leaf Using Near-Infrared Technology and a Multi-Layer Extreme Learning Machine Algorithm. J. Braz. Chem. Soc. 2022, 33, 251–259. [Google Scholar] [CrossRef]
- Özdemir, İ.S.; Dağ, Ç. Quantification of sterols and fatty acids of extra virgin olive oils by FT-NIR spectroscopy and multivariate statistical analyses. LWT-Food Sci. Technol. 2018, 91, 125–132. [Google Scholar] [CrossRef]
- Cayuela, J.A.; García, J.F. Sorting olive oil based on alpha-tocopherol and total tocopherol content using near-infra-red spectroscopy (NIRS) analysis. J. Food Eng. 2017, 202, 79–88. [Google Scholar] [CrossRef] [Green Version]
- Minshra, P.; Verkleij, T.; Klont, R. Improved prediction of minced pork meat chemical properties with near-infrared spectroscopy by a fusion of scatter-correction techniquesy. Infrared Phys. Technol. 2021, 113, 103643. [Google Scholar] [CrossRef]
- Sampaio, P.S.; Castanho, A.; Almeida, A.S.; Oliveira, J.; Brites, C. Identification of rice flour types with near-infrared spectroscopy associated with PLS-DA and SVM methods. Eur. Food Res. Technol. 2020, 246, 527–537. [Google Scholar] [CrossRef]
- Yuan, Z.; Zhang, L.X.; Wang, D.; Jiang, J.; de B. Harrington, P.; Mao, J.; Zhang, Q.; Li, P.W. Detection of flaxseed oil multiple adulteration by near-infrared spectroscopy and nonlinear one class partial least squares discriminant analysis. LWT-Food Sci. Technol. 2020, 125, 109247. [Google Scholar] [CrossRef]
- Mazivila, S.J.; Páscoa, R.N.M.J.; Castro, R.C.; Ribeiro, D.S.M.; Santos, J.L.M. Detection of melamine and sucrose as adulterants in milk powder using near-infrared spectroscopy with DD-SIMCA as one-class classifier and MCR-ALS as a means to provide pure profiles of milk and of both adulterants with forensic evidence: A short communication. Talanta 2020, 216, 120937. [Google Scholar] [PubMed]
- Wang, K.J.; Bian, X.H.; Zheng, M.; Liu, P.; Lin, L.G.; Tan, X.Y. Rapid determination of hemoglobin concentration by a novel ensemble extreme learning machine method combined with near-infrared spectroscopy. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2021, 263, 120138. [Google Scholar] [CrossRef] [PubMed]
- Afara, I.O.; Prasadam, I.; Arabshahi, Z.; Xiao, Y.; Oloyede, A. Monitoring osteoarthritis progression using near infrared (NIR) spectroscopy. Sci. Rep. 2017, 7, 11463. [Google Scholar] [CrossRef]
- Ba, Y.L.; Liu, J.B.; Han, J.C.; Zhang, X.C. Application of Vis-NIR spectroscopy for determination the content of organic matter in saline-alkali soils. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2020, 229, 117863. [Google Scholar] [CrossRef]
- Bona, E.; Marquetti, I.; Link, J.V.; Makimori, G.Y.F.; da Costa Arca, V.; Guimarães Lemes, A.L.; Ferreira, J.M.G.; dos Santos Scholz, M.B.; Valderrama, P.; Poppi, R.J. Support vector machines in tandem with infrared spectroscopy for geographical classification of green arabica coffee. LWT-Food Sci. Technol. 2017, 76, 330–336. [Google Scholar] [CrossRef]
- Liu, H.; Zhu, J.; Yin, H.; Yan, Q.Q.; Liu, H.; Guan, S.X.; Cai, Q.S.; Sun, J.W.; Yao, S.; Wei, R.Y. Extreme learning machine and genetic algorithm in quantitative analysis of sulfur hexafluoride by infrared spectroscopy. Appl. Opt. 2020, 61, 2834–2841. [Google Scholar] [CrossRef]
- Olivos-Trujillo, M.; Gajardo, H.A.; Salvo, S.; González, A.; Muñoz, C. Assessing the stability of parameters estimation and prediction accuracy in regression methods for estimating seed oil content in Brassica napus L. using NIR spectroscopy. In Proceedings of the 2015 CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON), Santiago, Chile, 28–30 October 2015. [Google Scholar]
- Hong, S.J.; Rho, S.J.; Lee, A.Y.; Park, H.; Cui, J.S.; Park, J.M.; Hong, S.J.; Kim, Y.R.; Kim, G. Rancidity estimation of perilla seed oil by using near-infrared spectroscopy and multivariate analysis techniques. J. Spectrosc. 2017, 2017, 1082612. [Google Scholar] [CrossRef] [Green Version]
- Rahul, S.; Sanjula, S.; Gurpreet, K.; Surinder, S.B. Near-infrared reflectance spectroscopy calibrations for assessment of oil, phenols, glucosinolates and fatty acid content in the intact seeds of oilseed Brassica species. J. Sci. Food Agric. 2018, 98, 4050–4057. [Google Scholar]
- de Oliveira Moreira, A.C.; de Lira Machado, A.H.; de Almeida, F.V.; Braga, J.W.B. Rapid purity determination of copaiba oils by a portable NIR spectrometer and PLSR. Food Anal. Meth. 2018, 11, 1867–1877. [Google Scholar] [CrossRef]
- Choi, Y.H.; Hong, C.K.; Park, G.Y.; Kim, C.K.; Kim, J.H.; Jung, K.; Kwon, J.H. A nondestructive approach for discrimination of the origin of sesame seeds using ED-XRF and NIR spectrometry with chemometrics. Food Sci. Biotechnol. 2016, 25, 433–438. [Google Scholar] [CrossRef]
- Kumar, B.; Vyas, O.P.; Vyas, R.J. A comprehensive review on the variants of support vector machines. Mod. Phys. Lett. B 2019, 33, 1950303. [Google Scholar] [CrossRef]
- Pierna, J.A.F.; Baeten, V.; Renier, A.M.; Cogdill, R.P.; Dardenne, P. Combination of support vector machines (SVM) and near-infrared (NIR) imaging spectroscopy for the detection of meat and bone meal (MBM) in compound feeds. J. Chemom. 2004, 18, 341–349. [Google Scholar] [CrossRef]
- Revilla, I.; Vivar-Quintana, A.M.; María Inmaculada, G.; Miriam, H.; Iván, M.; Pedro, H. NIR Spectroscopy for Discriminating and Predicting the Sensory Profile of Dry-Cured Beef “Cecina”. Sensors 2020, 20, 6892. [Google Scholar] [CrossRef]
- Hyafil, L.; Ronald, L.R. Constructing optimal binary decision trees is NP-complete. Inform. Process. Lett. 1976, 5, 15–17. [Google Scholar] [CrossRef]
- Kucheryavskiy, S. Analysis of NIR spectroscopic data using decision trees and their ensembles. J. Anal. Test. 2018, 2, 274–289. [Google Scholar] [CrossRef]
- Ho, T.K. Random decision forests. In Proceedings of the 3rd International Conference on Document Analysis and Recognition, Montreal, QC, Canada, 14–16 August 1995; Volume 1, pp. 278–282. [Google Scholar]
- Zhang, S.P.; Tan, Z.L.; Liu, J.; Xu, Z.H.; Du, Z. Determination of the food dye indigotine in cream by near-infrared spectroscopy technology combined with random forest model. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2020, 227, 117551. [Google Scholar] [CrossRef]
- Kahrıman, F.; Onaç, İ.; Mert Türk, F.; Öner, F.; Egesel, C.Ö. Determination of carotenoid and tocopherol content inmaize flour and oil samples using nearinfrared spectroscopy. Spectr. Lett. 2019, 52, 473–481. [Google Scholar] [CrossRef]
- Cadet, X.F.; Lo-Thong, O.; Bureau, S.; Dehak, R.; Bessafi, M. Use of machine learning and infrared spectra for rheological characterization and application to the apricot. Sci. Rep. 2019, 9, 19197. [Google Scholar] [CrossRef] [Green Version]
- Liu, P.; Wang, J.; Li, Q.; Gao, J.; Tan, X.Y.; Bian, X.H. Rapid identification and quantification of Panax notoginseng with its adulterants by near infrared spectroscopy combined with chemometrics. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2019, 206, 23–30. [Google Scholar] [CrossRef] [PubMed]
- Sun, X.F.; Li, H.L.; Yi, Y.; Hua, H.M.; Guan, Y.; Chen, C. Rapid detection and quantification of adulteration in Chinese hawthorn fruits powder by near-infrared spectroscopy combined with chemometrics. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2021, 250, 119346. [Google Scholar] [CrossRef]
- Martinez-Castillo, C.; Astray, G.; Mejuto, J.C.; Simal-Gandara, J. Random forest, artificial neural network, and support vector machine models for honey classification. eFood 2020, 1, 69–76. [Google Scholar] [CrossRef] [Green Version]
- Lei, M.; Zhang, L.; Li, M.; Chen, H.Y.; Zhang, X. Near-Infrared Spectrum of Coal Origin Identification Based on SVM Algorithm. In Proceedings of the 2018 37th Chinese Control Conference (CCC), Wuhan, China, 25–27 July 2018. [Google Scholar]
- Kirkbright, G.F.; Menon, K.R. The determination of combined vinyl acetate in polyvinyl chloride/polyvinyl acetate copolymer by near-infrared photo-acoustic spectrometry and diffuse reflectance spectrometry. Anal. Chim. Acta 1982, 136, 373–377. [Google Scholar] [CrossRef]
- Chen, H.Z.; Xu, L.L.; Ai, W.; Lin, B.; Feng, Q.X.; Cai, K. Kernel functions embedded in support vector machine learning models for rapid water pollution assessment via near-infrared spectroscopy. Sci. Total Environ. 2020, 714, 136765. [Google Scholar] [CrossRef]
- Câmara, A.B.F.; de Carvalho, L.S.; de Morais, C.L.M.; de Lima, L.A.S.; de Araújo, H.O.M.; de Oliveira, F.M.; de Lima, K.M.G. MCR-ALS and PLS coupled to NIR/MIR spectroscopies for quantification and identification of adulterant in biodiesel-diesel blends. Fuel 2017, 210, 497–506. [Google Scholar] [CrossRef]
- Gupa, O.; Das, A.J.; Hellerstein, J.; Raskar, R. Machine learning approaches for large scale classification of produce. Sci. Rep. 2018, 8, 5226. [Google Scholar] [CrossRef] [Green Version]
- Guinda, A.; Rada, M.; Benaissa, M.; Ourrach, I.; Cayuela, J.A. Controlling argan seed quality by NIR. J. Am. Oil Chem. Soc. 2015, 92, 1143–1151. [Google Scholar] [CrossRef] [Green Version]
- Ahmadi, P.; Muharam, F.M.; Ahmad, K.; Mansor, S.; Abu Seman, I. Early detection of Ganoderma basal stem rot of oil palms using artificial neural network spectral analysis. Plant Dis. 2017, 101, 1009–1016. [Google Scholar] [CrossRef] [Green Version]
- Chen, H.; Tan, C.; Lin, Z. Ensemble of extreme learning machines for multivariate calibration of near-infrared spectroscopy. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2020, 229, 117982. [Google Scholar] [CrossRef]
- Bian, X.H.; Zhang, C.X.; Tan, X.Y.; Dymek, M.; Guo, Y.G.; Lin, L.G.; Cheng, B.; Hu, X.Y. A boosting extreme learning machine for near-infrared spectral quantitative analysis of diesel fuel and edible blend oil samples. Anal. Methods 2017, 9, 2983–2989. [Google Scholar] [CrossRef]
- Devos, O.; Ruckebusch, C.; Durand, A.; Duponchel, L.; Huvenne, J.P. Support vector machines (SVM) in near infrared (NIR) spectroscopy: Focus on parameters optimization and model interpretation. Chemom. Intell. Lab. Syst. 2009, 96, 27–33. [Google Scholar] [CrossRef]
- Huang, F.R.; Song, H.; Guo, L.; Guang, P.W.; Yang, X.H.; Li, L.Q.; Zhao, H.X.; Yang, M.X. Detection of adulteration in Chinese honey using NIR and ATR-FTIR spectral data fusion. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2020, 235, 118297. [Google Scholar] [CrossRef]
- Xu, Z.P.; Cheng, W.M.; Fan, S.; Liu, J.; Wang, H.P.; Li, X.H.; Liu, B.M.; Wu, Y.J.; Zhang, P.F.; Wang, Q. Data fusion of near-infrared diffuse reflectance spectra and transmittance spectra for the accurate determination of rice flour constituents. Anal. Chim. Acta 2022, 1193, 339384. [Google Scholar] [CrossRef]
- Malegori, C.; Buratti, S.; Oliveri, P.; Ratti, S.; Cappa, C.; Lucisano, M. A modified mid-level data fusion approach on electronic nose and FT-NIR data for evaluating the effect of different storage conditions on rice germ shelf life. Talanta 2020, 206, 120208. [Google Scholar] [CrossRef]
- Wang, Y.J.; Li, L.Q.; Liu, Y.; Cui, Q.Q.; Ning, J.M.; Zhang, Z.Z. Enhanced quality monitoring during black tea processing by the fusion of nirs and computer vision. J. Food Eng. 2021, 304, 110599. [Google Scholar] [CrossRef]
- Medeiros, A.D.D.; Silva, L.J.D.; Ribeiro, J.P.O.; Ferreira, K.C.; Rosas, J.T.F.; Santos, A.A.; Silva, C.B.D. Machine learning for seed quality classification: An advanced approach using merger data from FT-NIR spectroscopy and X-ray imaging. Sensors 2020, 20, 4319. [Google Scholar] [CrossRef]
- Mishra, P.; Marini, F.; Brouwer, B.; Roger, J.M.; Biancolillo, A.; Woltering, E.; Hogeveen-van Echtelt, E. Sequential fusion of information from two portable spectrometers for improved prediction of moisture and soluble solids content in pear fruit. Sensors 2021, 223, 121733. [Google Scholar] [CrossRef]
- Hinton, G.E.; Osindero, S.; Teh, Y.W. A Fast Learning Algorithm for Deep Belief Nets. Neural Comput. 2006, 18, 1527–1554. [Google Scholar] [CrossRef]
- King, D.P.; Welling, M. Auto-encoding variational bayes. In Proceedings of the 2nd International Conference on Learning Representations (ICLR), Banff, AB, Canada, 14–16 April 2014. [Google Scholar]
- Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef] [Green Version]
- Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
- Deng, B.H.; Zhang, X.M.; Gong, W.Y.; Shang, D.P. An Overview of Extreme Learning Machine. In Proceedings of the 2019 4th International Conference on Control, Robotics and Cybernetics (CRC), Tokyo, Japan, 27–30 September 2019. [Google Scholar]
- Huang, G.B.; Zuo, B.; Gong, W.Y.; Kasun, L.L.C.; Vong, C.M. Local receptive fields based extreme learning machine. IEEE Comput. Intell. Mag. 2015, 10, 18–29. [Google Scholar] [CrossRef]
- Gruszczyńsk, S.; Gruszczyński, W. Supporting soil and land assessment with machine learning models using the Vis-NIR spectral response. Geoderma 2022, 25, 115451. [Google Scholar] [CrossRef]
- Fu, L.H.; Sun, J.; Wang, S.M.; Xu, M.; Yao, K.S.; Gao, Y.; Tang, N.Q. Identification of maize seed varieties based on stacked sparse autoencoder and near-infrared hyperspectral imaging technology. J. Food Process Eng. 2022, 45, e14120. [Google Scholar] [CrossRef]
- Liu, J.; Zhang, J.X.; Tan, Z.L.; Hou, Q.; Liu, R.R. Detecting the content of the bright blue pigment in cream based on deep learning and near-infrared spectroscopy. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2022, 270, 120757. [Google Scholar] [CrossRef] [PubMed]
- Chen, X.Y.; Chai, Q.Q.; Lin, N.; Li, X.H.; Wang, W. 1D convolutional neural network for the discrimination of aristolochic acids and their analogues based on near-infrared spectroscopy. Anal. Methods 2019, 11, 5118–5125. [Google Scholar] [CrossRef]
- Li, L.Q.; Pan, X.P.; Chen, W.L.; Wei, M.M.; Feng, Y.C.; Yin, L.H.; Hu, C.Q.; Yang, H.H. Multi-manufacturer drug identification based on near infrared spectroscopy and deep transfer learning. J. Innov. Opt. Health Sci. 2020, 13, 2050016. [Google Scholar] [CrossRef]
- Moon, E.J.; Kim, Y.; Xu, Y.; Na, Y.; Giacci, A.J.; Lee, J.H. Evaluation of salmon, tuna, and beef freshness using a portable spectrometer. Sensors 2020, 20, 4299. [Google Scholar] [CrossRef]
- Mishra, P.; Passos, D. Multi-output 1-dimensional convolutional neural networks for simultaneous prediction of different traits of fruit based on near-infrared spectroscopy. Postharvest Biol. Technol. 2022, 183, 111741. [Google Scholar] [CrossRef]
- Ng, W.; Minasny, B.; Montazerolghaem, M.; Padarian, J.; Ferguson, R.; Bailey, S.; McBratney, A.B. Convolutional neural network for simultaneous prediction of several soil properties using visible/near-infrared, mid-infrared, and their combined spectra. Geoderma 2019, 352, 251–267. [Google Scholar] [CrossRef]
- Yang, J.R.; Wang, J.; Lu, G.D.; Fei, S.M.; Yan, T.; Zhang, C.; Lu, X.H.; Yu, Z.Y.; Li, W.C.; Tang, X.L. TeaNet: Deep learning on Near-Infrared Spectroscopy (NIR) data for the assurance of tea quality. Comput. Electron. Agric. 2021, 190, 106431. [Google Scholar] [CrossRef]
- Mishra, P.; Passos, D. A synergistic use of chemometrics and deep learning improved the predictive performance of near-infrared spectroscopy models for dry matter prediction in mango fruit. Chemom. Intell. Lab. Syst. 2021, 212, 104287. [Google Scholar] [CrossRef]
- Xiao, D.; Li, H.Z.; Sun, X.Y. Coal Classification Method Based on Improved Local Receptive Field-Based Extreme Learning Machine Algorithm and Visible–Infrared Spectroscopy. ACS Omega 2020, 5, 25772–25783. [Google Scholar] [CrossRef]
- Yang, J.C.; Wang, X.L.; Wang, R.H.; Wang, H.J. Combination of Convolutional Neural Networks and Recurrent Neural Networks for predicting soil properties using Vis–NIR spectroscopy. Geoderma 2020, 380, 114616. [Google Scholar] [CrossRef]
- Xiao, D.; Vu, Q.H.; Le, B.T. Salt content in saline-alkali soil detection using visible-near infrared spectroscopy and a 2D deep learning. Microchem J. 2021, 165, 106182. [Google Scholar] [CrossRef]
- Grossutti, M.; D’Amico, J.; Quintal, J.; MacFarlane, H.; Quirk, A.; Dutcher, J.R. Deep Learning and Infrared Spectroscopy: Representation Learning with a β-Variational Autoencoder. J. Phys. Chem. Lett. 2022, 13, 5787–5793. [Google Scholar] [CrossRef]
- Chen, H.Z.; Chen, A.; Xu, L.L.; Xie, H.; Qiao, H.L.; Lin, Q.Y.; Cai, K. A deep learning CNN architecture applied in smart near-infrared analysis of water pollution for agricultural irrigation resources. Agric. Water Manag. 2020, 240, 106303. [Google Scholar] [CrossRef]
- Le, B.T. Application of deep learning and near infrared spectroscopy in cereal analysis. Vib. Spectrosc. 2020, 106, 103009. [Google Scholar] [CrossRef]
- Magnussen, E.A.; Solheim, J.H.; Blazhko, U.; Tafintseva, V.; Tøndel, K.; Liland, K.H.; Dzurendova, S.; Shapaval, V.; Sandt, C.; Borondics, F.; et al. Deep convolutional neural network recovers pure absorbance spectra from highly scatter-distorted spectra of cells. Chemom. Intell. Lab. Syst. 2020, 13, e202000204. [Google Scholar] [CrossRef]
- Ng, W.; Minasny, B.; Mendes, W.D.S.; Demattê, J.A.M. The influence of training sample size on the accuracy of deep learning models for the prediction of soil properties with nearinfrared spectroscopy data. Soil 2020, 6, 565–578. [Google Scholar] [CrossRef]
- Blazhko, U.; Shapaval, V.; Kovalev, V.; Kohler, A. Comparison of augmentation and pre-processing for deep learning and chemometric classification of infrared spectra. Chemom. Intell. Lab. Syst. 2021, 215, 104367. [Google Scholar] [CrossRef]
- Rifai, S.; Vincent, P.; Muller, X.; Glorot, X.; Bengio, Y. Contractive auto-encoders: Explicit invariance during feature extraction. In Proceedings of the 28th International Conference on Machine Learning, Washington, DC, USA, 28 June–2 July 2011. [Google Scholar]
- Vincent, P.; Larochelle, H.; Lajoie, I.; Bengio, Y.; Manzagol, P.-A.; Bottou, L. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. J. Mach. Learn. Res. 2010, 11, 3371–3408. [Google Scholar]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef] [Green Version]
- Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. In Proceedings of the 3rd International Conference on Learning Representations, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
- He, K.M.; Zhang, X.Y.; Ren, S.Q.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer vision and Pattern Recognition, Las Vegas, NV, USA, 26 June–1 July 2016. [Google Scholar]
- Zagoruyko, S.; Komodakis, N. Wide Residual Networks. In Proceedings of the British Machine Vision Conference (BMVC), York, UK, 19–22 September 2016. [Google Scholar]
- Wang, D.; Wang, X.D.; Lv, S.H. An overview of end-to-end automatic speech recognition. Symmetry 2019, 11, 1018. [Google Scholar] [CrossRef] [Green Version]
- Zhang, X.L.; Lin, T.; Xu, J.F.; Luo, X.; Ying, Y.B. DeepSpectra: An end-to-end deep learning approach for quantitative spectral analysis. Anal. Chim. Acta 2019, 1058, 48–57. [Google Scholar] [CrossRef]
- Zhang, W.W.; Xiang, H.T.; Wang, Y.X.; Bi, X.; Zhang, Y.Z.; Zhang, P.J.; Chen, J.; Wang, L.; Zheng, Y.J. A Signal Response Visualization Gas Recognition Algorithm Based on a Wavelet Transform Coefficient Map-Capsule Network for Artificial Olfaction. IEEE Sens. J. 2022, 22, 14717–14726. [Google Scholar] [CrossRef]
- Gers, F.A.; Schraudolph, N.N.; Nicol, N.; Schmidhuber, J. Learning precise timing with LSTM recurrent networks. J. Mach. Learn. Res. 2002, 3, 115–143. [Google Scholar]
- Heck, J.C.; Salem, F.M. Simplified minimal gated unit variations for recurrent neural networks. In Proceedings of the 2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS), Boston, MA, USA, 6–9 August 2017. [Google Scholar]
- Zuo, E.G.; Sun, L.; Yan, J.Y.; Chen, C.; Chen, C.; Lv, X.Y. Rapidly detecting fennel origin of the near-infrared spectroscopy based on extreme learning machine. J. Sci. Rep. 2022, 12, 13593. [Google Scholar] [CrossRef]
- He, K.X.; Liu, J.J.; Li, Z. Application of Generative Adversarial Network for the Prediction of Gasoline Properties. Chem. Eng. Trans. 2020, 81, 907–912. [Google Scholar]
- Zheng, A.B.; Yang, H.H.; Pan, X.P.; Yin, L.H.; Feng, Y.C. Identification of Multi-Class Drugs Based on Near Infrared Spectroscopy and Bidirectional Generative Adversarial Networks. Sensors 2021, 21, 1088. [Google Scholar] [CrossRef]
- Yang, B.; Chen, C.; Chen, F.F.; Chen, C.; Tang, J.; Gao, R.; Lv, X.Y. Identification of cumin and fennel from different regions based on generative adversarial networks and near infrared spectroscopy. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2021, 21, 119956. [Google Scholar] [CrossRef]
- Zhang, W.D.; Li, L.Q.; Hu, J.Q.; Feng, Y.C.; Yin, L.H.; Hu, C.Q.; Yang, H.H. Drug discrimination by near infrared spectroscopy based on stacked sparse auto-encoders combined with kernel extreme learning machine. Chin. J. Anal. Chem. 2018, 46, 1146–1454. [Google Scholar]
- Mao, Y.C.; Le, B.T.; Xiao, D.; He, D.K.; Liu, C.M.; Jiang, L.Q.; Yu, Z.C.; Yang, F.H.; Liu, X.X. Coal classification method based on visible-infrared spectroscopy and an improved multilayer extreme learning machine. Opt. Laser Technol. 2019, 114, 10–15. [Google Scholar] [CrossRef]
- Xia, J.J.; Du, X.Y.; Xu, W.X.; Wei, Y.; Xiong, Y.M.; Min, S.G. Non-destructive analysis the dating of paper based on convolutional neural network. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2021, 248, 119290. [Google Scholar] [CrossRef]
- Sarin, J.K.; Te Moller, N.; Mohammadi, A.; Prakash, M.; Torniainen, J.; Brommer, H.; Nippolainen, E.; Shaikh, R.; Mäkelä, J.; Korhonen, R.K.; et al. A deep learning CNN architecture applied in smart near-infrared analysis of water pollution for agricultural irrigation resources. Osteoarthr. Cartil. 2021, 29, 423–432. [Google Scholar] [CrossRef]
Instrument | Cost | Speed | Signal to Ratio |
---|---|---|---|
LED | very low | moderate | moderate |
AOTF | moderate | very fast | low |
Dispersive | low | slow | high |
Fourier | high | fast | very low |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhang, W.; Kasun, L.C.; Wang, Q.J.; Zheng, Y.; Lin, Z. A Review of Machine Learning for Near-Infrared Spectroscopy. Sensors 2022, 22, 9764. https://doi.org/10.3390/s22249764
Zhang W, Kasun LC, Wang QJ, Zheng Y, Lin Z. A Review of Machine Learning for Near-Infrared Spectroscopy. Sensors. 2022; 22(24):9764. https://doi.org/10.3390/s22249764
Chicago/Turabian StyleZhang, Wenwen, Liyanaarachchi Chamara Kasun, Qi Jie Wang, Yuanjin Zheng, and Zhiping Lin. 2022. "A Review of Machine Learning for Near-Infrared Spectroscopy" Sensors 22, no. 24: 9764. https://doi.org/10.3390/s22249764