Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (81)

Search Parameters:
Keywords = long-term coherent processing

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
14 pages, 936 KiB  
Review
Application of Artificial Intelligence Models to Predict the Onset or Recurrence of Neovascular Age-Related Macular Degeneration
by Francesco Saverio Sorrentino, Marco Zeppieri, Carola Culiersi, Antonio Florido, Katia De Nadai, Ginevra Giovanna Adamo, Marco Pellegrini, Francesco Nasini, Chiara Vivarelli, Marco Mura and Francesco Parmeggiani
Pharmaceuticals 2024, 17(11), 1440; https://doi.org/10.3390/ph17111440 - 28 Oct 2024
Viewed by 482
Abstract
Neovascular age-related macular degeneration (nAMD) is one of the major causes of vision impairment that affect millions of people worldwide. Early detection of nAMD is crucial because, if untreated, it can lead to blindness. Software and algorithms that utilize artificial intelligence (AI) have [...] Read more.
Neovascular age-related macular degeneration (nAMD) is one of the major causes of vision impairment that affect millions of people worldwide. Early detection of nAMD is crucial because, if untreated, it can lead to blindness. Software and algorithms that utilize artificial intelligence (AI) have become valuable tools for early detection, assisting doctors in diagnosing and facilitating differential diagnosis. AI is particularly important for remote or isolated communities, as it allows patients to endure tests and receive rapid initial diagnoses without the necessity of extensive travel and long wait times for medical consultations. Similarly, AI is notable also in big hubs because cutting-edge technologies and networking help and speed processes such as detection, diagnosis, and follow-up times. The automatic detection of retinal changes might be optimized by AI, allowing one to choose the most effective treatment for nAMD. The complex retinal tissue is well-suited for scanning and easily accessible by modern AI-assisted multi-imaging techniques. AI enables us to enhance patient management by effectively evaluating extensive data, facilitating timely diagnosis and long-term prognosis. Novel applications of AI to nAMD have focused on image analysis, specifically for the automated segmentation, extraction, and quantification of imaging-based features included within optical coherence tomography (OCT) pictures. To date, we cannot state that AI could accurately forecast the therapy that would be necessary for a single patient to achieve the best visual outcome. A small number of large datasets with high-quality OCT, lack of data about alternative treatment strategies, and absence of OCT standards are the challenges for the development of AI models for nAMD. Full article
Show Figures

Figure 1

23 pages, 12829 KiB  
Article
Analysis of the Response of Shallow Groundwater Levels to Precipitation Based on Different Wavelet Scales—A Case Study of the Datong Basin, Shanxi
by Hongyue Zhang, Xiaoping Rui, Ye Zhou, Wen Sun, Weiyi Xie, Chaojie Gao and Yingchao Ren
Water 2024, 16(20), 2920; https://doi.org/10.3390/w16202920 - 14 Oct 2024
Viewed by 502
Abstract
The rise in shallow groundwater levels is typically triggered by precipitation recharge, exhibiting a certain lag relative to precipitation changes. Therefore, identifying the response mechanism of shallow groundwater levels to precipitation is crucial for clarifying the interaction between precipitation and groundwater. However, the [...] Read more.
The rise in shallow groundwater levels is typically triggered by precipitation recharge, exhibiting a certain lag relative to precipitation changes. Therefore, identifying the response mechanism of shallow groundwater levels to precipitation is crucial for clarifying the interaction between precipitation and groundwater. However, the response mechanism of groundwater levels to precipitation is complex and variable, influenced by various hydrogeological and geographical conditions, and often exhibits significant nonlinear characteristics. To address this issue, this study employs methods such as continuous wavelet transform, cross wavelet transform, and wavelet coherence to analyze the response patterns of groundwater levels to precipitation at different wavelet scales in the Datong Basin from 2013 to 2022: (i) At short wavelet scales (10.33~61.96 d), the groundwater level dynamics respond almost instantaneously to extreme rainfall; (ii) At medium wavelet scales(61.96~247.83 d), the precipitation-groundwater recharge process shows characteristics of either rapid recovery or significant delay; (iii) At long wavelet scales (247.83~495.67 d), three potential groundwater processes were identified in the Datong Basin, exhibiting long-term lag responses throughout this study period, with lag times of 11.18 days, 148.75 days, and 151.49 days, respectively. Furthermore, the results indicate that the lag response time of shallow groundwater levels to precipitation is not only related to the wavelet scale but also to the identified depth conditions of different groundwater regions, groundwater extraction intensity, precipitation intensity, and aquifer lithology. This study distinguishes the temporal and spatial response mechanisms of shallow groundwater to precipitation at different wavelet scales, and this information may further aid in understanding the interaction between precipitation and groundwater levels. Full article
Show Figures

Figure 1

24 pages, 350 KiB  
Article
Evidence Preservation in Digital Forensics: An Approach Using Blockchain and LSTM-Based Steganography
by Mohammad AlKhanafseh and Ola Surakhi
Electronics 2024, 13(18), 3729; https://doi.org/10.3390/electronics13183729 - 20 Sep 2024
Viewed by 2269
Abstract
As digital crime continues to rise, the preservation of digital evidence has become a critical phase in digital forensic investigations. This phase focuses on securing and maintaining the integrity of evidence for legal proceedings. Existing solutions for evidence preservation, such as centralized storage [...] Read more.
As digital crime continues to rise, the preservation of digital evidence has become a critical phase in digital forensic investigations. This phase focuses on securing and maintaining the integrity of evidence for legal proceedings. Existing solutions for evidence preservation, such as centralized storage systems and cloud frameworks, present challenges related to security and collaboration. In this paper, we propose a novel framework that addresses these challenges in the preservation phase of forensics. Our framework employs a combination of advanced technologies, including the following: (1) Segmenting evidence into smaller components for improved security and manageability, (2) Utilizing steganography for covert evidence preservation, and (3) Implementing blockchain to ensure the integrity and immutability of evidence. Additionally, we incorporate Long Short-Term Memory (LSTM) networks to enhance steganography in the evidence preservation process. This approach aims to provide a secure, scalable, and reliable solution for preserving digital evidence, contributing to the effectiveness of digital forensic investigations. An experiment using linguistic steganography showed that the LSTM autoencoder effectively generates coherent text from bit streams, with low perplexity and high accuracy. Our solution outperforms existing methods across multiple datasets, providing a secure and scalable approach for digital evidence preservation. Full article
(This article belongs to the Special Issue Network and Mobile Systems Security, Privacy and Forensics)
Show Figures

Figure 1

24 pages, 60637 KiB  
Article
SAR-NTV-YOLOv8: A Neural Network Aircraft Detection Method in SAR Images Based on Despeckling Preprocessing
by Xiaomeng Guo and Baoyi Xu
Remote Sens. 2024, 16(18), 3420; https://doi.org/10.3390/rs16183420 - 14 Sep 2024
Viewed by 822
Abstract
Monitoring aircraft using synthetic aperture radar (SAR) images is a very important task. Given its coherent imaging characteristics, there is a large amount of speckle interference in the image. This phenomenon leads to the scattering information of aircraft targets being masked in SAR [...] Read more.
Monitoring aircraft using synthetic aperture radar (SAR) images is a very important task. Given its coherent imaging characteristics, there is a large amount of speckle interference in the image. This phenomenon leads to the scattering information of aircraft targets being masked in SAR images, which is easily confused with background scattering points. Therefore, automatic detection of aircraft targets in SAR images remains a challenging task. For this task, this paper proposes a framework for speckle reduction preprocessing of SAR images, followed by the use of an improved deep learning method to detect aircraft in SAR images. Firstly, to improve the problem of introducing artifacts or excessive smoothing in speckle reduction using total variation (TV) methods, this paper proposes a new nonconvex total variation (NTV) method. This method aims to ensure the effectiveness of speckle reduction while preserving the original scattering information as much as possible. Next, we present a framework for aircraft detection based on You Only Look Once v8 (YOLOv8) for SAR images. Therefore, the complete framework is called SAR-NTV-YOLOv8. Meanwhile, a high-resolution small target feature head is proposed to mitigate the impact of scale changes and loss of depth feature details on detection accuracy. Then, an efficient multi-scale attention module was proposed, aimed at effectively establishing short-term and long-term dependencies between feature grouping and multi-scale structures. In addition, the progressive feature pyramid network was chosen to avoid information loss or degradation in multi-level transmission during the bottom-up feature extraction process in Backbone. Sufficient comparative experiments, speckle reduction experiments, and ablation experiments are conducted on the SAR-Aircraft-1.0 and SADD datasets. The results have demonstrated the effectiveness of SAR-NTV-YOLOv8, which has the most advanced performance compared to other mainstream algorithms. Full article
Show Figures

Figure 1

21 pages, 7018 KiB  
Article
Digital Horizons in Construction: A Comprehensive System for Excellence in Project Management
by Salazar Santos Fonseca, Patricia Aguilera Benito and Carolina Piña Ramírez
Buildings 2024, 14(7), 2228; https://doi.org/10.3390/buildings14072228 - 19 Jul 2024
Cited by 1 | Viewed by 1320
Abstract
In today’s competitive construction industry, companies are under increasing pressure to enhance efficiency and productivity. This research examines how digitalization can address issues such as market instability, low productivity, lack of investment in innovation, workforce issues, and management deficiencies. It explores the potential [...] Read more.
In today’s competitive construction industry, companies are under increasing pressure to enhance efficiency and productivity. This research examines how digitalization can address issues such as market instability, low productivity, lack of investment in innovation, workforce issues, and management deficiencies. It explores the potential of technologies like Building Information Modeling (BIM) and Lean Construction (LC) to improve project management. The “House of COANFI” framework, integrating Lean principles with strategy, process, projects, and people, is proposed as a solution for enhancing project management, promoting organizational coherence, continuous improvement, and technological adoption. The methodology includes a literature survey, stakeholder workshops, developing an information system, and validation through case studies. Key findings highlight the benefits of COANFI implementation, including better data management, improved productivity, collaborative integration, and organizational learning. However, challenges such as resistance to change, data quality issues, and integration complexity must be addressed. The study concludes that digitalization, supported by frameworks like COANFI, can significantly enhance efficiency and competitiveness. Future research should validate these methodologies in real-world applications, explore strategies for managing organizational change, and investigate the impact of digital technologies on sustainability, helping the construction sector achieve long-term growth and sustainability. Full article
Show Figures

Figure 1

14 pages, 3096 KiB  
Article
Tracking Metabolite Variations during the Degradation of Vegetables in Rice Bran Bed with Intact-State Nuclear Magnetic Resonance Spectroscopy
by Kengo Ito, Ryusei Yamamoto and Yasuyo Sekiyama
Metabolites 2024, 14(7), 391; https://doi.org/10.3390/metabo14070391 - 19 Jul 2024
Viewed by 878
Abstract
Fermentation—a process of compound degradation by microorganisms—is a traditional food processing method utilized worldwide for the long-term preservation of fresh foods. In recent years, fermented foods have gained attention as health foods. Fermentation increases the nutritional value of ingredients, producing complex flavors and [...] Read more.
Fermentation—a process of compound degradation by microorganisms—is a traditional food processing method utilized worldwide for the long-term preservation of fresh foods. In recent years, fermented foods have gained attention as health foods. Fermentation increases the nutritional value of ingredients, producing complex flavors and aromas. To identify unknown components in fermented foods, it is necessary to analyze compounds and conditions nondestructively and comprehensively. We performed intact-state nuclear magnetic resonance (NMR) spectroscopy using intermolecular single quantum coherence (iSQC) to detect the degradation of vegetables directly and nondestructively. We used two types of vegetables and a rice bran bed (nukazuke), which is used for traditional vegetable fermentation in Japan. Major metabolites such as saccharides, organic acids, and amino acids were identified in iSQC-sliced spectra. Comparing NMR signal intensities during degradation revealed the transition of metabolites characteristic of lactic acid fermentation. A pathway-based network analysis showed pathways involved in amino acid metabolism and lactic acid fermentation. Our analytical approach with intact-state NMR spectroscopy using iSQC demonstrated that it may be effective in other experimental systems, allowing for the evaluation of phenomena that have been conventionally overlooked in their true state. Full article
(This article belongs to the Special Issue Emerging Applications of Metabolomics in Fermented Food)
Show Figures

Graphical abstract

13 pages, 3551 KiB  
Article
Effects of Acute Mental Stress on Choroidal Thickness
by Jiechun Lin, Yingxiang Han, Meng Liu and Xiaofei Wang
Bioengineering 2024, 11(7), 684; https://doi.org/10.3390/bioengineering11070684 - 5 Jul 2024
Viewed by 882
Abstract
Purpose: Previous studies have indicated an association between education and myopia, suggesting that numerous stress events during the educational process may influence eye health. This study aimed to investigate the impact of mental stress induced by mental arithmetic (MA) on choroidal thickness (CT). [...] Read more.
Purpose: Previous studies have indicated an association between education and myopia, suggesting that numerous stress events during the educational process may influence eye health. This study aimed to investigate the impact of mental stress induced by mental arithmetic (MA) on choroidal thickness (CT). Methods: This study included 33 participants aged between 19 and 29 years. Swept-source optical coherence tomography (SS-OCT) was used to capture images of the posterior segment of the left eye during baseline and MA to assess changes in the CT. After denoising and compensation, the baseline images and MA images that had been rigidly registered and resampled to the baseline images were segmented using a deep learning-based method. Based on the segmentation results, the CT within the regions of 1 mm and 3 mm diameter centered at the lowest point of the fovea was calculated. Results: Significant increases were observed in both CT1mm and CT3mm during MA, with mean changes of 2.742 ± 7.098 μm (p = 0.034) and 3.326 ± 6.143 μm (p < 0.001), respectively. Conclusions: Thickening of the choroid has been observed during acute mental stress. We speculate that long-term or chronic mental stress could have a potential adverse impact on myopia progression. Full article
(This article belongs to the Special Issue Ophthalmic Engineering (2nd Edition))
Show Figures

Graphical abstract

17 pages, 10624 KiB  
Article
Application of the Data-Driven Method and Hydrochemistry Analysis to Predict Groundwater Level Change Induced by the Mining Activities: A Case Study of Yili Coalfield in Xinjiang, Norwest China
by Ankun Luo, Shuning Dong, Hao Wang, Haidong Cao, Tiantian Wang, Xiaoyu Hu, Chenyu Wang, Shouchuan Zhang and Shen Qu
Water 2024, 16(11), 1611; https://doi.org/10.3390/w16111611 - 5 Jun 2024
Viewed by 857
Abstract
As the medium of geological information, groundwater provides an indirect method to solve the secondary disasters of mining activities. Identifying the groundwater regime of overburden aquifers induced by the mining disturbance is significant in mining safety and geological environment protection. This study proposes [...] Read more.
As the medium of geological information, groundwater provides an indirect method to solve the secondary disasters of mining activities. Identifying the groundwater regime of overburden aquifers induced by the mining disturbance is significant in mining safety and geological environment protection. This study proposes the novel data-driven algorithm based on the combination of machine learning methods and hydrochemical analyses to predict anomalous changes in groundwater levels within the mine and its neighboring areas induced after mining activities accurately. The hydrochemistry analysis reveals that the dissolution of carbonate and evaporite and the cation exchange function are the main hydrochemical process for controlling the groundwater environment. The anomalous change in the hydrochemistry characteristic in different aquifers reveals that the hydraulic connection between different aquifers is enhanced by mining activities. The continuous wavelet coherence is used to reveal the nonlinear relationship between the groundwater level change and external influencing factors. Based on the above analysis, the groundwater level, precipitation, mine water inflow, and unit goal area could be considered as the input variables of the hydrological model. Two different data-driven algorithms, the Decision Tree and the Long Short-Term Memory (LSTM) neural network, are introduced to construct the hydrological prediction model. Four error metrics (MAPE, RMSE, NSE and R2) are applied for evaluating the performance of hydrological model. For the NSE value, the predictive accuracy of the hydrological model constructed using LSTM is 8% higher than that of Decision Tree algorithm. Accurately predicting the anomalous change in groundwater level caused by the mining activities could ensure the safety of coal mining and prevent the secondary disaster of mining activities. Full article
Show Figures

Figure 1

16 pages, 4149 KiB  
Article
AK-MADDPG-Based Antijamming Strategy Design Method for Frequency Agile Radar
by Zhidong Zhu, Xiaoying Deng, Jian Dong, Cheng Feng and Xiongjun Fu
Sensors 2024, 24(11), 3445; https://doi.org/10.3390/s24113445 - 27 May 2024
Viewed by 675
Abstract
Frequency agility refers to the rapid variation of the carrier frequency of adjacent pulses, which is an effective radar active antijamming method against frequency spot jamming. Variation patterns of traditional pseudo-random frequency hopping methods are susceptible to analysis and decryption, rendering them ineffective [...] Read more.
Frequency agility refers to the rapid variation of the carrier frequency of adjacent pulses, which is an effective radar active antijamming method against frequency spot jamming. Variation patterns of traditional pseudo-random frequency hopping methods are susceptible to analysis and decryption, rendering them ineffective against increasingly sophisticated jamming strategies. Although existing reinforcement learning-based methods can adaptively optimize frequency hopping strategies, they are limited in adapting to the diversity and dynamics of jamming strategies, resulting in poor performance in the face of complex unknown jamming strategies. This paper proposes an AK-MADDPG (Adaptive K-th order history-based Multi-Agent Deep Deterministic Policy Gradient) method for designing frequency hopping strategies in frequency agile radar. Signal pulses within a coherent processing interval are treated as agents, learning to optimize their hopping strategies in the case of unknown jamming strategies. Agents dynamically adjust their carrier frequencies to evade jamming and collaborate with others to enhance antijamming efficacy. This approach exploits cooperative relationships among the pulses, providing additional information for optimized frequency hopping strategies. In addition, an adaptive K-th order history method has been introduced into the algorithm to capture long-term dependencies in sequential data. Simulation results demonstrate the superior performance of the proposed method. Full article
Show Figures

Figure 1

19 pages, 27087 KiB  
Article
Bridge Monitoring Strategies for Sustainable Development with Microwave Radar Interferometry
by Lilong Zou, Weike Feng, Olimpia Masci, Giovanni Nico, Amir M. Alani and Motoyuki Sato
Sustainability 2024, 16(7), 2607; https://doi.org/10.3390/su16072607 - 22 Mar 2024
Cited by 2 | Viewed by 1246
Abstract
The potential of a coherent microwave radar for infrastructure health monitoring has been investigated over the past decade. Microwave radar measuring based on interferometry processing is a non-invasive technique that can measure the line-of-sight (LOS) displacements of large infrastructure with sub-millimeter precision and [...] Read more.
The potential of a coherent microwave radar for infrastructure health monitoring has been investigated over the past decade. Microwave radar measuring based on interferometry processing is a non-invasive technique that can measure the line-of-sight (LOS) displacements of large infrastructure with sub-millimeter precision and provide the corresponding frequency spectrum. It has the capability to estimate infrastructure vibration simultaneously and remotely with high accuracy and repeatability, which serves the long-term serviceability of bridge structures within the context of the long-term sustainability of civil engineering infrastructure management. In this paper, we present three types of microwave radar systems employed to monitor the displacement of bridges in Japan and Italy. A technique that fuses polarimetric analysis and the interferometry technique for bridge monitoring is proposed. Monitoring results achieved with full polarimetric real aperture radar (RAR), step-frequency continuous-wave (SFCW)-based linear synthetic aperture, and multi-input multi-output (MIMO) array sensors are also presented. The results reveal bridge dynamic responses under different loading conditions, including wind, vehicular traffic, and passing trains, and show that microwave sensor interferometry can be utilized to monitor the dynamics of bridge structures with unprecedented spatial and temporal resolution. This paper demonstrates that microwave sensor interferometry with efficient, cost-effective, and non-destructive properties is a serious contender to employment as a sustainable infrastructure monitoring technology serving the sustainable development agenda. Full article
(This article belongs to the Section Sustainable Engineering and Science)
Show Figures

Figure 1

17 pages, 5407 KiB  
Article
Determination of Jupiter’s Pole Orientation from Juno Radio Science Data
by Giacomo Lari, Marco Zannoni, Daniele Durante, Ryan S. Park and Giacomo Tommei
Aerospace 2024, 11(2), 124; https://doi.org/10.3390/aerospace11020124 - 31 Jan 2024
Cited by 1 | Viewed by 1251
Abstract
The extreme accuracy of Juno radio science data allows us to perform very precise orbit determination experiments. While previous works focused on the estimation of the gravitational field of Jupiter, in this article, we aim to accurately determine the planet’s orientation in space. [...] Read more.
The extreme accuracy of Juno radio science data allows us to perform very precise orbit determination experiments. While previous works focused on the estimation of the gravitational field of Jupiter, in this article, we aim to accurately determine the planet’s orientation in space. For this purpose, we implement a rotational model of Jupiter, taking into account also its main deformations, as they affect the planet’s inertia components. Rotation parameters are estimated simultaneously with all other parameters (especially gravity and tides), in order to obtain a global and coherent solution. In our experiments, we find that Juno data manage to constrain Jupiter’s pole direction with an accuracy of around 107 radians for the whole duration of the mission, allowing us to improve its long-term ephemerides. Moreover, Juno data provide an upper bound on the maximum displacement between Jupiter’s pole and spin axis of less than 10 m, which allows us to investigate possible short-period nutation effects due to, for example, atmospheric and interior processes of the planet. Full article
Show Figures

Figure 1

20 pages, 2767 KiB  
Article
A Robust Chinese Named Entity Recognition Method Based on Integrating Dual-Layer Features and CSBERT
by Yingjie Xu, Xiaobo Tan, Xin Tong and Wenbo Zhang
Appl. Sci. 2024, 14(3), 1060; https://doi.org/10.3390/app14031060 - 26 Jan 2024
Cited by 3 | Viewed by 1308
Abstract
In the rapidly evolving field of cybersecurity, the integration of multi-source, heterogeneous, and fragmented data into a coherent knowledge graph has garnered considerable attention. Such a graph elucidates semantic interconnections, thereby facilitating sophisticated analytical decision support. Central to the construction of a cybersecurity [...] Read more.
In the rapidly evolving field of cybersecurity, the integration of multi-source, heterogeneous, and fragmented data into a coherent knowledge graph has garnered considerable attention. Such a graph elucidates semantic interconnections, thereby facilitating sophisticated analytical decision support. Central to the construction of a cybersecurity knowledge graph is Named Entity Recognition (NER), a critical technology that converts unstructured text into structured data. The efficacy of NER is pivotal, as it directly influences the integrity of the knowledge graph. The task of NER in cybersecurity, particularly within the Chinese linguistic context, presents distinct challenges. Chinese text lacks explicit space delimiters and features complex contextual dependencies, exacerbating the difficulty in discerning and categorizing named entities. These linguistic characteristics contribute to errors in word segmentation and semantic ambiguities, impeding NER accuracy. This paper introduces a novel NER methodology tailored for the Chinese cybersecurity corpus, termed CSBERT-IDCNN-BiLSTM-CRF. This approach harnesses Iterative Dilated Convolutional Neural Networks (IDCNN) for extracting local features, and Bi-directional Long Short-Term Memory networks (BiLSTM) for contextual understanding. It incorporates CSBERT, a pre-trained model adept at processing few-shot data, to derive input feature representations. The process culminates with Conditional Random Fields (CRF) for precise sequence labeling. To compensate for the scarcity of publicly accessible Chinese cybersecurity datasets, this paper synthesizes a bespoke dataset, authenticated by data from the China National Vulnerability Database, processed via the YEDDA annotation tool. Empirical analysis affirms that the proposed CSBERT-IDCNN-BiLSTM-CRF model surpasses existing Chinese NER frameworks, with an F1-score of 87.30% and a precision rate of 85.89%. This marks a significant advancement in the accurate identification of cybersecurity entities in Chinese text, reflecting the model’s robust capability to address the unique challenges presented by the language’s structural intricacies. Full article
(This article belongs to the Special Issue Natural Language Processing (NLP) and Applications—2nd Edition)
Show Figures

Figure 1

30 pages, 4582 KiB  
Article
Lip2Speech: Lightweight Multi-Speaker Speech Reconstruction with Gabor Features
by Zhongping Dong, Yan Xu, Andrew Abel and Dong Wang
Appl. Sci. 2024, 14(2), 798; https://doi.org/10.3390/app14020798 - 17 Jan 2024
Viewed by 1635
Abstract
In environments characterised by noise or the absence of audio signals, visual cues, notably facial and lip movements, serve as valuable substitutes for missing or corrupted speech signals. In these scenarios, speech reconstruction can potentially generate speech from visual data. Recent advancements in [...] Read more.
In environments characterised by noise or the absence of audio signals, visual cues, notably facial and lip movements, serve as valuable substitutes for missing or corrupted speech signals. In these scenarios, speech reconstruction can potentially generate speech from visual data. Recent advancements in this domain have predominantly relied on end-to-end deep learning models, like Convolutional Neural Networks (CNN) or Generative Adversarial Networks (GAN). However, these models are encumbered by their intricate and opaque architectures, coupled with their lack of speaker independence. Consequently, achieving multi-speaker speech reconstruction without supplementary information is challenging. This research introduces an innovative Gabor-based speech reconstruction system tailored for lightweight and efficient multi-speaker speech restoration. Using our Gabor feature extraction technique, we propose two novel models: GaborCNN2Speech and GaborFea2Speech. These models employ a rapid Gabor feature extraction method to derive lowdimensional mouth region features, encompassing filtered Gabor mouth images and low-dimensional Gabor features as visual inputs. An encoded spectrogram serves as the audio target, and a Long Short-Term Memory (LSTM)-based model is harnessed to generate coherent speech output. Through comprehensive experiments conducted on the GRID corpus, our proposed Gabor-based models have showcased superior performance in sentence and vocabulary reconstruction when compared to traditional end-to-end CNN models. These models stand out for their lightweight design and rapid processing capabilities. Notably, the GaborFea2Speech model presented in this study achieves robust multi-speaker speech reconstruction without necessitating supplementary information, thereby marking a significant milestone in the field of speech reconstruction. Full article
(This article belongs to the Special Issue Advanced Technology in Speech and Acoustic Signal Processing)
Show Figures

Figure 1

22 pages, 1139 KiB  
Article
Indicators for Takt Production Performance Assessment—A Conceptual Study
by Kimmo Keskiniva, Arto Saari and Juha-Matti Junnonen
Buildings 2024, 14(1), 50; https://doi.org/10.3390/buildings14010050 - 23 Dec 2023
Viewed by 1088
Abstract
This conceptual study aims to produce rough analysis methods and visualizations for production data (formatted in time, location, and work) that can be collected from construction sites that utilize takt production. The scope is on creating methods for evaluating the soundness of the [...] Read more.
This conceptual study aims to produce rough analysis methods and visualizations for production data (formatted in time, location, and work) that can be collected from construction sites that utilize takt production. The scope is on creating methods for evaluating the soundness of the takt plan and its execution. Relevant production literature regarding takt production management and data collection are utilized in the production of the methods and visualizations. However, only imaginary production data are utilized in this study to keep the indicators as simplified and clear as possible. A total of seven indicators with varying levels of novelty are provided in the study. The proposed indicators emphasize punctual adherence to the takt schedule, homogenous production pace, avoiding trade overlapping in locations, steady work in process, and coherent short and long-term production targets. Both as-planned and as-built perspectives are considered. The proposed indicators are argued to be valuable for production management and research and development processes since they provide status information and document the progression of the production for later indicators purposes. This study also acts as a foundation for further empirical studies regarding takt production data utilization. Full article
(This article belongs to the Section Construction Management, and Computers & Digitization)
Show Figures

Figure 1

22 pages, 2347 KiB  
Article
Dementia Detection from Speech: What If Language Models Are Not the Answer?
by Mondher Bouazizi, Chuheng Zheng, Siyuan Yang and Tomoaki Ohtsuki
Information 2024, 15(1), 2; https://doi.org/10.3390/info15010002 - 19 Dec 2023
Cited by 2 | Viewed by 2266
Abstract
A growing focus among scientists has been on researching the techniques of automatic detection of dementia that can be applied to the speech samples of individuals with dementia. Leveraging the rapid advancements in Deep Learning (DL) and Natural Language Processing (NLP), these techniques [...] Read more.
A growing focus among scientists has been on researching the techniques of automatic detection of dementia that can be applied to the speech samples of individuals with dementia. Leveraging the rapid advancements in Deep Learning (DL) and Natural Language Processing (NLP), these techniques have shown great potential in dementia detection. In this context, this paper proposes a method for dementia detection from the transcribed speech of subjects. Unlike conventional methods that rely on advanced language models to address the ability of the subject to make coherent and meaningful sentences, our approach relies on the center of focus of the subjects and how it changes over time as the subject describes the content of the cookie theft image, a commonly used image for evaluating one’s cognitive abilities. To do so, we divide the cookie theft image into regions of interest, and identify, in each sentence spoken by the subject, which regions are being talked about. We employed a Long Short-Term Memory (LSTM) neural network to learn different patterns of dementia subjects and control ones and used it to perform a 10-fold cross validation-based classification. Our experimental results on the Pitt corpus from the DementiaBank resulted in a 82.9% accuracy at the subject level and 81.0% at the sample level. By employing data-augmentation techniques, the accuracy at both levels was increased to 83.6% and 82.1%, respectively. The performance of our proposed method outperforms most of the conventional methods, which reach, at best, an accuracy equal to 81.5% at the subject level. Full article
(This article belongs to the Section Information Applications)
Show Figures

Figure 1

Back to TopTop