Detecting Weak Signals of the Future: A System Implementation Based on Text Mining and Natural Language Processing
Abstract
:1. Introduction
1.1. Weak Signals
1.2. Background and Related Work
2. Description of the Proposed System
2.1. Stage 1: Definition of the Input Data Sources
2.2. Stage 2: Creation of an Input Dataset
2.3. Stage 3: Extract, Transform and Load (ETL)
2.4. Stage 4: Category Assignation
2.5. Stage 5: Text Mining
2.6. Stage 6: Natural Language Processing (NLP): Multi-Word Expressions
2.7. Stage 7: Interpretation, Evaluation and Decision-Making
- A list of potential weak signals represented in the Keyword Issue Map, depending on their Degree of Diffusion and Degree of Transmission.
- A list of potential weak signals represented in the Keyword Emergence Map, depending on their Degree of Visibility and Degree of Transmission.
- A ranking of all the keywords present in both graphs, which are more likely to be connected to weak signals.
- The results of the multi-word analysis, providing more accurate results to discard false signs.
3. Experimental Setup
3.1. Definition of the Experiment for Remote Sensing Sector
3.2. Definition of the Evaluation Methods
4. Results
4.1. Keyword Issue Map (KIM) for Remote Sensing
4.2. Keyword Emergence Map (KEM) for Remote Sensing
4.3. Detected Terms as Potential Weak Signals
- Keywords related to environmental, sustainability and geographical factors: Africa, alluvial, asteroids, attenuation, bedrock, Canadian, curvature, depression, desertification, disaster, diurnal, ENSO, extinction, foliar, forestry, Italy, Miocene, multitemporal, observatory, oceanography, pollen, rainforest, rangeland, southeast, sprawl, threat, topsoil, waste, weed and Wuhan.
- Keywords related to business needs: adjacent, archival, breaking, care, check, consumption, diagnosis, forward, guidance, indirect, interior, intervention, invariant, kernel, maximization, mega, native, NOAA, physiological, plantation, preference, probabilistic, rational, residential, stakeholder, super, supervised, triggering, uptake, vibration and wild.
- Keywords related to product/technological components: actuator, adaptative, array, bathymetry, cassini, clay, color, converter, endmember, excitation, gamma, hitran, inorganic, InSAR, oblique, passage, photometry, pigments, Rosetta, sounder, SRTM, stepwise, unmanned, UVSQ, volatile and voxel.
4.4. Results of the Multi-Word Analysis
4.5. Evaluation of the Results
- The growth of remote sensing services is attributed to the effective and flexible data-gathering, thanks to highest resolutions of the metrics, cloud computing software and machine learning techniques. Several terms, such as “adaptative encoding” or “voxel”, were detected as related to weak signals.
- Among the outstanding applications, agriculture and especially desertification, are areas in which remote sensors will be more relevant. Desertification and other terms related to agriculture are keywords that the algorithm identified is related to weak signals.
- Interferometric synthetic aperture radar, abbreviated “InSAR”, which is a radar technique used in geodesy and remote sensing, is becoming more and more important. InSAR is a keyword that the algorithm identified as related to weak signals.
5. Discussion
5.1. Main Findings
5.2. Limitations
6. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
Appendix A
- CPU: Intel core i7 7500U (Dual Core)
- GPU: NVidia Geforce GT 650M 1024 MB GDDR5—900MHz—384 CUDA Cores
- RAM: 16 GB DD4
Stage | CPU | GPU |
---|---|---|
Data Warehouse creation | 6257 min | 688 min |
Category assignation | 68 min | 9 min |
Text mining | 124 min | 11 min |
Multi-word expressions | 43 min | 6 min |
Operation | Oracle | MySQL | MsSql | MongoDB | Redis | GraphQL | Cassandra |
---|---|---|---|---|---|---|---|
INSERT | 0.076 | 0.093 | 0.093 | 0.005 | 0.009 | 0.008 | 0.011 |
SELECT | 0.025 | 0.093 | 0.062 | 0.009 | 0.016 | 0.010 | 0.014 |
Appendix B
References
- Eisenhardt, K.M.; Brown, S.L. Patching: Restitching business portfolios in dynamic markets. Harv. Bus. Rev. 1999, 77, 72–82. [Google Scholar] [PubMed]
- Zahra, S.A.; Gedajlovic, E.; Neubaum, D.O.; Shulman, J.M. A typology of social entrepreneurs: Motives, search processes and ethical challenges. J. Bus. Ventur. 2009, 24, 519–532. [Google Scholar] [CrossRef]
- Choo, C.W.; Auster, E. Environmental scanning: Acquisition and use of information by managers. Ann. Rev. Inf. Sci. Technol. 1993, 28, 279–314. [Google Scholar]
- Ansoff, H.I.; McDonnell, E.J. Implanting Strategic Management; Prentice Hall: Cambridge, MA, USA, 1990; pp. 27–34. [Google Scholar]
- Ansoff, H.I. Managing Strategic Surprise by Response to Weak Signals. Calif. Manag. Rev. 1975, 18, 21–33. [Google Scholar] [CrossRef]
- Cooper, A.; Voigt, C.; Unterfrauner, E.; Kravcik, M.; Pawlowski, J.; Pirkkalainen, H. Report on Weak Signals Collection. TELMAP, European Commission Seventh Framework Project (IST-257822). 2011. Deliverable D4.1. pp. 6–7. Available online: https://cordis.europa.eu/docs/projects/cnect/2/257822/080/deliverables/001-D41Weaksignalscollectionfinal.doc (accessed on 25 April 2020).
- Coffman, B. Part I. Introduction. In Weak Signal Research; MG Taylor Corporation: Louisville, KY, USA, 1997. [Google Scholar]
- Godet, M. From Anticipation to Action, a Handbook of Strategic Prospective; UNESCO Publishing: Paris, France, 1994; p. 59. [Google Scholar]
- Molitor, G.T. Molitor Forecasting Model: Key Dimensions for Plotting the Patterns of Change. J. Future Stud. 2003, 8, 61–72. [Google Scholar]
- Dator, J. Futures Studies as Applied Knowledge. In New Thinking for a New Millennium; Routledge: London, UK, 1996; pp. 66–74. [Google Scholar]
- Dator, J. Universities without quality and quality without universities. Horizon 2005, 13, 199–215. [Google Scholar] [CrossRef]
- Nikander, I.O. Early Warnings, a Phenomenon in Project Management. Ph.D. Thesis, Helsinki University of Technology, Helsinki, Finland, 2002. [Google Scholar]
- Mannermaa, M. Tulevaisuuden Hallinta Skenaariot Strategiatyoskentelyssa. (Managing the Future, Scenarios in Strategy Work); WSOY: Porvoo, Finland, 1999; p. 227. [Google Scholar]
- Hiltunen, E. The future sign and its three dimensions. Futures 2007, 40, 247–260. [Google Scholar] [CrossRef]
- Peirce, C.S. Some Consequences of Four Incapacities. J. Specul. Philos. 1868, 2, 140–157. [Google Scholar]
- Han, J.; Kamber, M. Data Mining: Concepts and Techniques; Morgan Kaufmann Publishers: Burlington, MA, USA, 2001. [Google Scholar]
- Fischler, M.A.; Firschein, O. Intelligence: The Eye, the Brain and the Computer; Addison-Wesley: Boston, MA, USA, 1987; p. 221. [Google Scholar]
- Hong, S.W.; Kim, Y.E.; Bae, K.J.; Park, Y.W.; Park, J.K. Development of analysis model for R&D environment change in search of the weak signal. J. Korea Technol. Innov. Soc. 2009, 12, 189–211. [Google Scholar]
- Thorleuchter, D.; Scheja, T.; Van den Poel, D. Semantic weak signal tracing. Expert Syst. Appl. 2014, 41, 5009–5016. [Google Scholar] [CrossRef]
- Julien, P.A.; Andriambeloson, E.; Ramangalahy, C. Networks, weak signals and technological innovations among SMEs in the land-based transportation equipment sector. Entrep. Reg. Dev. 2004, 16, 251–269. [Google Scholar] [CrossRef]
- Wu, X.; Zhu, X.; Wu, G.Q.; Ding, W. Data mining with big data. IEEE Trans. Knowl. Data Eng. 2013, 26, 97–107. [Google Scholar] [CrossRef]
- Koivisto, R.; Kulmala, I.; Gotcheva, N. Weak signals and damage scenarios—Systematics to identify weak signals and their sources related to mass transport attacks. Technol. Forecast. Soc. Chang. 2016, 104, 180–190. [Google Scholar] [CrossRef]
- Davis, J.; Groves, C. City/future in the making: Masterplanning London’s Olympic legacy as anticipatory assemblage. Futures 2019, 109, 13–23. [Google Scholar] [CrossRef] [Green Version]
- Irvine, N.; Nugent, C.; Zhang, S.; Wang, H.; Ng, W.W.Y. Neural Network Ensembles for Sensor-Based Human Activity Recognition Within Smart Environments. Sensors 2020, 20, 216. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Huang, M.; Liu, Z. Research on Mechanical Fault Prediction Method Based on Multifeature Fusion of Vibration Sensing Data. Sensors 2020, 20, 6. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Awan, F.M.; Saleem, Y.; Minerva, R.; Crespi, N. A Comparative Analysis of Machine/Deep Learning Models for Parking Space Availability Prediction. Sensors 2020, 20, 322. [Google Scholar] [CrossRef] [Green Version]
- Baghmolaei, R.M.; Mozafari, N.; Hamzeh, A. Continuous states latency aware influence maximization in social networks. AI Commun. 2017, 30, 99–116. [Google Scholar]
- McGrath, J.; Fischetti, J. What if compulsory schooling was a 21st century invention? Weak signals from a systematic review of the literature. Int. J. Educ. Res. 2019, 95, 212–226. [Google Scholar] [CrossRef]
- Chao, W.; Jiang, X.; Luo, Z.; Hu, Y.; Ma, W. Interpretable Charge Prediction for Criminal Cases with Dynamic Rationale Attention. J. Artif. Intell. Res. 2019, 66, 743–764. [Google Scholar] [CrossRef]
- Van Veen, B.L.; Ortt, R.; Badke-Schaub, P. Compensating for perceptual filters in weak signal assessments. Futures 2019, 108, 1–11. [Google Scholar] [CrossRef]
- Thorleuchter, D.; Van den Poel, D. Idea mining for webbased weak signal detection. Futures 2015, 66, 25–34. [Google Scholar] [CrossRef]
- Rowea, E.; Wrightb, G.; Derbyshirec, J. Enhancing horizon scanning by utilizing pre-developed scenarios: Analysis of current practice and specification of a process improvement to aid the identification of important ‘weak signals’. Technol. Forecast. Soc. Chang. 2017, 125, 224–235. [Google Scholar] [CrossRef]
- Yoon, J. Detecting weak signals for long-term business opportunities using text mining of Web news. Expert Syst. Appl. 2012, 39, 12543–12550. [Google Scholar] [CrossRef]
- Yoo, S.H.; Won, D. Simulation of Weak Signals of Nanotechnology Innovation in Complex System. Sustainability 2018, 10, 486. [Google Scholar]
- Suh, J.H. Generating Future-Oriented Energy Policies and Technologies from the Multidisciplinary Group Discussions by Text-Mining-Based Identification of Topics and Experts. Sustainability 2018, 10, 3709. [Google Scholar] [CrossRef] [Green Version]
- Kwon, L.-N.; Park, J.-H.; Moon, Y.-H.; Lee, B.; Shin, Y.; Kim, Y.-K. Weak signal detecting of industry convergence using information of products and services of global listed companies—Focusing on growth engine industry in South Korea. J. Open Innov. Technol. Mark. Complex. 2018, 4, 10. [Google Scholar] [CrossRef] [Green Version]
- Ben-Porat, O.; Hirsch, S.; Kuchy, L.; Elad, G.; Reichart, R.; Tennenholtz, M. Predicting Strategic Behavior from Free Text. J. Artif. Intell. Res. 2020, 68, 413–445. [Google Scholar] [CrossRef]
- Fink, L.; Yogev, N.; Even, A. Business intelligence and organizational learning: An empirical investigation of value creation processes. Inf. Manag. 2017, 54, 38–56. [Google Scholar] [CrossRef]
- Ilmola, L.; Kuusi, O. Filters of weak signals hinder foresight: Monitoring weak signals efficiently in corporate decision-making. Futures 2006, 38, 908–924. [Google Scholar] [CrossRef]
- Weng, J.; Bu-Sung, L. Event detection in twitter. In Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media, Barcelona, Spain, 17–21 July 2011. [Google Scholar]
- Doulamis, N. Event detection in twitter microblogging. IEEE Trans. Cybern. 2015, 46, 2810–2824. [Google Scholar] [CrossRef] [PubMed]
- Atefeh, F.; Wael, K. A survey of techniques for event detection in twitter. Comput. Intell. 2013, 31, 132–164. [Google Scholar] [CrossRef]
- Mehmood, N.; Culmone, R.; Mostarda, L. Modeling temporal aspects of sensor data for MongoDB NoSql database. J. Big Data 2017, 4, 15. [Google Scholar] [CrossRef]
- Bjeladinovic, S. A fresh approach for hybrid Sql/NoSql database design based on data structuredness. Enterp. Inf. Syst. 2018, 12, 1202–1220. [Google Scholar] [CrossRef]
- Čerešňák, R.; Kvet, M. Comparison of query performance in relational a non-relation databases. Transp. Res. Procedia 2019, 40, 170–177. [Google Scholar] [CrossRef]
- Yangui, R.; Nabli, A.; Gargouri, F. Automatic Transformation of Data Warehouse Schema to NoSQL Data Base: Comparative Study. Procedia Comput. Sci. 2016, 96, 255–264. [Google Scholar] [CrossRef] [Green Version]
- Inmon, W.H. Building the Data Warehouse, 4th ed.; John Wiley and Sons: Hoboken, NJ, USA, 2005; p. 156. [Google Scholar]
- Willett, P. The Porter stemming algorithm: Then and now. Program 2006, 40, 219–223. [Google Scholar] [CrossRef]
- Griol-Barres, I.; Milla, S.; Millet, J. Implementación de un sistema de detección de señales débiles de futuro mediante técnicas de minería de textos. (Implementation of a weak signal detection system by text mining techniques). Rev. Esp. Doc. Cient. 2019, 42, 234. [Google Scholar] [CrossRef] [Green Version]
- Kim, J.; Han, M.; Lee, Y.; Park, Y. Futuristic datadriven scenario building: Incorporating text mining and fuzzy association rule mining into fuzzy cognitive map. Expert Syst. Appl. 2016, 57, 311–323. [Google Scholar] [CrossRef]
- Mendonca, S.; Cunha, M.P.; Kaivo-Oja, J.; Ruff, F. Wild Cards, Weak Signals and Organizational Improvisation. Futures 2004, 36, 201–218. [Google Scholar] [CrossRef]
- Ishikiriyama, C.S.; Miro, D.; Gomesa, C.F.S. Text Mining Business Intelligence: A small sample of what words can say. Procedia Comput. Sci. 2015, 55, 261–267. [Google Scholar] [CrossRef] [Green Version]
- Yuen, J. Comparison of Impact Factor, Eigenfactor Metrics, and SCImago Journal Rank Indicator and h-index for Neurosurgical and Spinal Surgical Journals. World Neurosurg. 2018, 119, e328–e337. [Google Scholar] [CrossRef] [PubMed]
- Thomason, J.; Padmakumar, A.; Sinapov, J.; Walker, N.; Jiang, Y.; Yedidsion, H.; Hart, J.; Stone, P.; Mooney, R.J. Jointly Improving Parsing and Perception for Natural Language Commands through Human-Robot Dialog. J. Artif. Intell. Res. 2020, 67, 327–374. [Google Scholar] [CrossRef] [Green Version]
- Guralnik, V.; Srivastava, J. Event detection from time series data. In Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, 15–18 August 1999; pp. 33–42. [Google Scholar]
- Tseng, Y.-H.; Lin, C.-J.; Lin, Y.-I. Text mining techniques for patent analysis. Inf. Process. Manag. 2007, 43, 1216–1247. [Google Scholar] [CrossRef]
- Wood, L. Satellite Remote Sensing—Market Analysis, Trends, and Forecasts; Global Industry Analysts Inc.: San Jose, CA, USA, 2019; p. 12. [Google Scholar]
- Bindzarova-Gergelova, M.; Labant, S.; Kuzevic, S.; Kuzevicova, Z.; Pavolova, H. Identification of Roof Surfaces from LiDAR Cloud Points by GIS Tools: A Case Study of Lučenec, Slovakia. Sustainability 2020, 12, 6847. [Google Scholar] [CrossRef]
- Sugla, S.; Dhum, N. Remote Sensing Services Market by Platform (Satellites, UAVs, Manned Aircraft, and Ground), End User (Defense and Commercial), Resolution (Spatial, Spectral, Radiometric, and Temporal), and Region—Global Forecast to 2022; Markets and Markets: Dublin, Ireland, 2017; pp. 5–25. [Google Scholar]
- Badmos, O.S.; Rienow, A.; Callo-Concha, D.; Greve, K.; Jürgens, C. Urban Development in West Africa—Monitoring and Intensity Analysis of Slum Growth in Lagos: Linking Pattern and Process. Remote Sens. 2018, 10, 1044. [Google Scholar] [CrossRef] [Green Version]
- Thomson, E.R.; Malhi, Y.; Bartholomeus, H.; Oliveras, L.; Gvozdevaite, A. Mapping the Leaf Economic Spectrum across West African Tropical Forests Using UAV-Acquired Hyperspectral Imagery. Remote Sens. 2018, 10, 1532. [Google Scholar] [CrossRef] [Green Version]
- Samasse, K.; Hanan, N.P.; Tappan, G.; Diallo, Y. Assessing Cropland Area in West Africa for Agricultural Yield Analysis. Remote Sens. 2018, 10, 1785. [Google Scholar] [CrossRef] [Green Version]
- Anchang, J.Y.; Prihodko, L.; Kaptué, A.T.; Ross, C.W.; Ji, W.; Kumar, S.S. Trends in Woody and Herbaceous Vegetation in the Savannas of West Africa. Remote Sens. 2019, 11, 576. [Google Scholar] [CrossRef] [Green Version]
- Jung, H.C.; Getirana, A.; Arsenault, K.R.; Holmes, T.R.H.; McNally, A. Uncertainties in Evapotranspiration Estimates over West Africa. Remote Sens. 2019, 11, 892. [Google Scholar] [CrossRef] [Green Version]
- Mondal, P.; Liu, X.; Fatoyinbo, T.E.; Lagomasino, D. Evaluating Combinations of Sentinel-2 Data and Machine-Learning Algorithms for Mangrove Mapping in West Africa. Remote Sens. 2019, 11, 2928. [Google Scholar] [CrossRef] [Green Version]
- Meftah, M.; Damé, L.; Keckhut, P.; Bekki, S.; Sarkissian, A.; Hauchecorne, A.; Bertran, E.; Carta, J.P. UVSQ-SAT, a Pathfinder CubeSat Mission for Observing Essential Climate Variables. Remote Sens. 2020, 12, 92. [Google Scholar] [CrossRef] [Green Version]
- Zhang, W.; Yoshida, T.; Tang, X. Text classification based on multi-word with support vector machine. Knowl. Based Syst. 2008, 21, 879–886. [Google Scholar] [CrossRef]
- Griol, I.; Milla, S.; Millet, J. Improving strategic decision making by the detection of weak signals in heterogeneous documents by text mining techniques. AI Commun. 2019, 32, 347–360. [Google Scholar] [CrossRef]
- Griol, I.; Milla, S.; Millet, J. System Implementation for the Detection of Weak Signals of the Future in Heterogeneous Documents by Text Mining and Natural Language Processing Techniques. In Proceedings of the 11th International Conference on Agents and Artificial Intelligence, Prague, Czech Republic, 19–21 February 2019; pp. 631–638. [Google Scholar]
- Dzedzickis, A.; Kaklauskas, A.; Bucinskas, V. Human Emotion Recognition: Review of Sensors and Methods. Sensors 2020, 20, 592. [Google Scholar] [CrossRef] [Green Version]
- Haegeman, K.; Marinelli, E.; Scapolo, F.; Ricci, A.; Sokolov, A. Quantitative and qualitative approaches in future oriented technology analysis (FTA): From combination to integration. Technol. Forecast. Soc. Chang. 2013, 80, 386–397. [Google Scholar] [CrossRef]
- Silva, V.O.; Martins, C.; Ekel, P. An Efficient Parallel Implementation of an Optimized Simplex Method in GPU-CUDA. IEEE Lat. Am. Trans. 2018, 16, 564–573. [Google Scholar] [CrossRef]
Available Systems | Implemented System |
---|---|
Mainly qualitative analysis | Quantified analysis |
Specific model for a specific topic | Model only dependent on the input dataset |
Pre-determined keywords | All words and multi-words expressions are keywords |
One single data source and/or expert opinion | Three different types of data sources |
Mainly structured data sources | Unstructured data sources (documents and NLP 1) |
Keyword | DoD | Incr Rate | DoV | Incr Rate | DoT | Automatic Category |
---|---|---|---|---|---|---|
Business Needs | ||||||
consumption | 96.73 | 0.0975 | 579.36 | 0.0479 | 6.39 | Agricultural and Forest Meteorology |
diagnosis | 91.18 | 0.1079 | 566.45 | 0.0472 | 6.39 | Space Research, Water Research |
kernel | 96.64 | 0.07 | 540.09 | 0.0384 | 6.16 | Space Research, Water Resources |
noaa | 84.73 | 0.0839 | 531.27 | 0.0473 | 2.38 | Climate Change, Space Research, Wind power |
physiological | 93.91 | 0.0718 | 576.73 | 0.0363 | 6.39 | Radiology, Climate Change |
residential | 92.91 | 0.0813 | 536.64 | 0.0463 | 6.39 | Climate Change, Applied Geography, Water Research |
Environmental/Sustainability Factors | ||||||
asteroids | 88.64 | 0.0796 | 582.64 | 0.0479 | 6.84 | Space Research |
bedrock | 78.64 | 0.1007 | 568.55 | 0.0647 | 13.21 | Space Research, Particle Physics |
Africa | 93.64 | 0.0699 | 528.45 | 0.0678 | 7.03 | Climate Change, Water Research |
canadian | 85.36 | 0.0781 | 545.09 | 0.0427 | 6.76 | Space Research, Agriculture |
desertification | 93.91 | 0.0671 | 604.45 | 0.0357 | 6.39 | Climate Change, Space Research |
disaster | 102.91 | 0.0712 | 559 | 0.045 | 6.39 | Astronautics, Climate Change, Ecosystems |
enso | 86.91 | 0.0667 | 572.91 | 0.04 | 0.32 | Agricultural and Forest Meteorology |
extinction | 83 | 0.0863 | 593.45 | 0.052 | 132.52 | Space Research, Chemistry |
Product/Technological Components | ||||||
gamma | 88.27 | 0.1081 | 528 | 0.0533 | 5.6 | Sea Research, Space Research |
hitran | 95.73 | 0.0683 | 577.36 | 0.0684 | 6.39 | Chemistry, Molecular Spectroscopy Research |
insar | 87 | 0.1393 | 87 | 0.1393 | 0.95 | Space Research, Water Research |
UVSQ | 94.18 | 0.0831 | 555.64 | 0.0491 | 127.36 | Aerospace Science, Aeronautics |
srtm | 85.36 | 0.0617 | 527.73 | 0.0425 | 3.41 | Wind power, Applied Geography, Biology |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Griol-Barres, I.; Milla, S.; Cebrián, A.; Fan, H.; Millet, J. Detecting Weak Signals of the Future: A System Implementation Based on Text Mining and Natural Language Processing. Sustainability 2020, 12, 7848. https://doi.org/10.3390/su12197848
Griol-Barres I, Milla S, Cebrián A, Fan H, Millet J. Detecting Weak Signals of the Future: A System Implementation Based on Text Mining and Natural Language Processing. Sustainability. 2020; 12(19):7848. https://doi.org/10.3390/su12197848
Chicago/Turabian StyleGriol-Barres, Israel, Sergio Milla, Antonio Cebrián, Huaan Fan, and Jose Millet. 2020. "Detecting Weak Signals of the Future: A System Implementation Based on Text Mining and Natural Language Processing" Sustainability 12, no. 19: 7848. https://doi.org/10.3390/su12197848