Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
 
 

Topic Editors

Dr. Jiahui Yu
Binjiang Institute, Zhejiang University, Hangzhou 310053, China
Department of Computer Science, University of Liverpool, Liverpool L69 3BX, UK
Institute of Cyberspace Security, Zhejiang University of Technology, Hangzhou 310014, China
School of Computing, University of Portsmouth, Portsmouth, UK
School of Computing, University of Portsmouth, Portsmouth PO1 3HE, UK
Binjiang Institute, Zhejiang University, Hangzhou 310053, China

Theoretical and Applied Problems in Human-Computer Intelligent Systems

Abstract submission deadline
31 March 2025
Manuscript submission deadline
30 June 2025
Viewed by
11998

Topic Information

Dear Colleagues,

The advancement in human–computer interaction technology has been a driving force in developing robotics, intelligent systems, and medical applications. The ability to understand human–computer interaction through visual and text analyses, as well as ensuring security, is critical for the success of intelligent systems. To achieve this, it is necessary to extract valuable insights from visual and text information and to implement robust security measures to protect against potential attacks. Despite these advancements, many challenges remain to be addressed in this field. Recently, deep learning, AI defense and attack, real-time learning, robot control, and other cutting-edge technologies have been explored and applied to intelligent systems. The increasing demand and complex real-world problems have encouraged the growth of academic research in human–computer interaction. This Topic brings together experts, engineers, and researchers worldwide to present their latest findings and advancements in human–computer interaction. The focus is on innovative theories related to intelligent systems and machine learning, emphasizing applications involving human–machine interaction through visual and text information.

Dr. Jiahui Yu
Prof. Dr. Charlie Yang
Prof. Dr. Zhenyu Wen
Dr. Dalin Zhou
Dr. Dongxu Gao
Dr. Changting Lin
Topic Editors

Keywords

  • visual-based interaction
  • data analysis
  • robustness and security
  • machine learning
  • computer-aided medical analysis
  • robotic control and design
  • machine learning in signal transmission

Participating Journals

Journal Name Impact Factor CiteScore Launched Year First Decision (median) APC
Applied Sciences
applsci
2.5 5.3 2011 17.8 Days CHF 2400 Submit
Electronics
electronics
2.6 5.3 2012 16.8 Days CHF 2400 Submit
Future Internet
futureinternet
2.8 7.1 2009 13.1 Days CHF 1600 Submit
Machines
machines
2.1 3.0 2013 15.6 Days CHF 2400 Submit
Systems
systems
2.3 2.8 2013 17.3 Days CHF 2400 Submit
Technologies
technologies
4.2 6.7 2013 24.6 Days CHF 1600 Submit
Biomimetics
biomimetics
3.4 3.5 2016 20.3 Days CHF 2200 Submit

Preprints.org is a multidiscipline platform providing preprint service that is dedicated to sharing your research from the start and empowering your research journey.

MDPI Topics is cooperating with Preprints.org and has built a direct connection between MDPI journals and Preprints.org. Authors are encouraged to enjoy the benefits by posting a preprint at Preprints.org prior to publication:

  1. Immediately share your ideas ahead of publication and establish your research priority;
  2. Protect your idea from being stolen with this time-stamped preprint article;
  3. Enhance the exposure and impact of your research;
  4. Receive feedback from your peers in advance;
  5. Have it indexed in Web of Science (Preprint Citation Index), Google Scholar, Crossref, SHARE, PrePubMed, Scilit and Europe PMC.

Published Papers (11 papers)

Order results
Result details
Journals
Select all
Export citation of selected articles as:
23 pages, 3648 KiB  
Article
Probabilistic Confusion Matrix: A Novel Method for Machine Learning Algorithm Generalized Performance Analysis
by Ioannis Markoulidakis and Georgios Markoulidakis
Technologies 2024, 12(7), 113; https://doi.org/10.3390/technologies12070113 - 13 Jul 2024
Viewed by 402
Abstract
The paper addresses the issue of classification machine learning algorithm performance based on a novel probabilistic confusion matrix concept. The paper develops a theoretical framework which associates the proposed confusion matrix and the resulting performance metrics with the regular confusion matrix. The theoretical [...] Read more.
The paper addresses the issue of classification machine learning algorithm performance based on a novel probabilistic confusion matrix concept. The paper develops a theoretical framework which associates the proposed confusion matrix and the resulting performance metrics with the regular confusion matrix. The theoretical results are verified based on a wide variety of real-world classification problems and state-of-the-art machine learning algorithms. Based on the properties of the probabilistic confusion matrix, the paper then highlights the benefits of using the proposed concept both during the training phase and the application phase of a classification machine learning algorithm. Full article
Show Figures

Figure 1

17 pages, 9779 KiB  
Article
Optimizing Speech Emotion Recognition with Machine Learning Based Advanced Audio Cue Analysis
by Nuwan Pallewela, Damminda Alahakoon, Achini Adikari, John E. Pierce and Miranda L. Rose
Technologies 2024, 12(7), 111; https://doi.org/10.3390/technologies12070111 - 11 Jul 2024
Viewed by 324
Abstract
In today’s fast-paced and interconnected world, where human–computer interaction is an integral component of daily life, the ability to recognize and understand human emotions has emerged as a crucial facet of technological advancement. However, human emotion, a complex interplay of physiological, psychological, and [...] Read more.
In today’s fast-paced and interconnected world, where human–computer interaction is an integral component of daily life, the ability to recognize and understand human emotions has emerged as a crucial facet of technological advancement. However, human emotion, a complex interplay of physiological, psychological, and social factors, poses a formidable challenge even for other humans to comprehend accurately. With the emergence of voice assistants and other speech-based applications, it has become essential to improve audio-based emotion expression. However, there is a lack of specificity and agreement in current emotion annotation practice, as evidenced by conflicting labels in many human-annotated emotional datasets for the same speech segments. Previous studies have had to filter out these conflicts and, therefore, a large portion of the collected data has been considered unusable. In this study, we aimed to improve the accuracy of computational prediction of uncertain emotion labels by utilizing high-confidence emotion labelled speech segments from the IEMOCAP emotion dataset. We implemented an audio-based emotion recognition model using bag of audio word encoding (BoAW) to obtain a representation of audio aspects of emotion in speech with state-of-the-art recurrent neural network models. Our approach improved the state-of-the-art audio-based emotion recognition with a 61.09% accuracy rate, an improvement of 1.02% over the BiDialogueRNN model and 1.72% over the EmoCaps multi-modal emotion recognition models. In comparison to human annotation, our approach achieved similar results in identifying positive and negative emotions. Furthermore, it has proven effective in accurately recognizing the sentiment of uncertain emotion segments that were previously considered unusable in other studies. Improvements in audio emotion recognition could have implications in voice-based assistants, healthcare, and other industrial applications that benefit from automated communication. Full article
Show Figures

Figure 1

32 pages, 7283 KiB  
Technical Note
Research on the Training and Application Methods of a Lightweight Agricultural Domain-Specific Large Language Model Supporting Mandarin Chinese and Uyghur
by Kun Pan, Xiaogang Zhang and Liping Chen
Appl. Sci. 2024, 14(13), 5764; https://doi.org/10.3390/app14135764 - 1 Jul 2024
Viewed by 490
Abstract
In the field of Natural Language Processing (NLP), the lack of support for minority languages, especially Uyghur, the scarcity of Uyghur language corpora in the agricultural domain, and the lightweight nature of large language models remain prominent issues. This study proposes a method [...] Read more.
In the field of Natural Language Processing (NLP), the lack of support for minority languages, especially Uyghur, the scarcity of Uyghur language corpora in the agricultural domain, and the lightweight nature of large language models remain prominent issues. This study proposes a method for constructing a bilingual (Uyghur and Chinese) lightweight specialized large language model for the agricultural domain. By utilizing a mixed training approach of Uyghur and Chinese, we extracted Chinese corpus text from agricultural-themed books in PDF format using OCR (Optical Character Recognition) technology, converted the Chinese text corpus into a Uyghur corpus using a rapid translation API, and constructed a bilingual mixed vocabulary. We applied the parameterized Transformer model algorithm to train the model for the agricultural domain in both Chinese and Uyghur. Furthermore, we introduced a context detection and fail-safe mechanism for the generated text. The constructed model possesses the ability to support bilingual reasoning in Uyghur and Chinese in the agricultural domain, with higher accuracy and a smaller size that requires less hardware. It (our work) addresses issues such as the scarcity of Uyghur corpora in the agricultural domain, mixed word segmentation and word vector modeling in Uyghur for widespread agricultural languages, model lightweighting and deployment, and the fragmentation of non-relevant texts during knowledge extraction from small-scale corpora. The lightweight design of the model reduces hardware requirements, facilitating deployment in resource-constrained environments. This advancement promotes agricultural intelligence, aids in the development of specific applications and minority languages (such as agriculture and Uyghur), and contributes to rural revitalization. Full article
Show Figures

Figure 1

25 pages, 5092 KiB  
Article
A Computational Framework for Enhancing Industrial Operations and Electric Network Management: A Case Study
by André F. V. Pedroso, Francisco J. G. Silva, Raul D. S. G. Campilho, Rita C. M. Sales-Contini, Arnaldo G. Pinto and Renato R. Moreira
Technologies 2024, 12(6), 91; https://doi.org/10.3390/technologies12060091 - 19 Jun 2024
Viewed by 486
Abstract
Automotive industries require constant technological development and the capacity to adapt to market needs. Hence, component suppliers must be able to adapt to persistent trend changes and technical improvements, acting in response to customers’ expectations and developing their manufacturing methods to be as [...] Read more.
Automotive industries require constant technological development and the capacity to adapt to market needs. Hence, component suppliers must be able to adapt to persistent trend changes and technical improvements, acting in response to customers’ expectations and developing their manufacturing methods to be as flexible as possible. Concepts such as layout flexibility, management of industrial facilities, and building information modeling (BIM) are becoming ever more addressed within the automotive industry in order to envision and select the necessary information exchanges. Given this question and based on the gap in the literature regarding this subject, this work proposes a solution, developing a novel tool that allows the monitoring and assignment of newer/relocated equipment to the switchboards within a given industrial plant. The solution intends to increase the flexibility of production lines through the assessment, analysis, improvement, and reorganization of the electrical load distribution to develop projects accurately implying layout changes. The tool is validated with an automotive manufacturer. With the implementation of this open-source tool, a detailed electrical flow management system is accomplished, and it has proven successful and essential in raising levels of organizational flexibility. This has guaranteed the company’s competitiveness with effective integrated administration methods and tools, such as a much easier study upon inserting new/relocated equipment without production line breaks. Full article
Show Figures

Figure 1

14 pages, 7066 KiB  
Article
Improved Particle Filter in Machine Learning-Based BLE Fingerprinting Method to Reduce Indoor Location Estimation Errors
by Jingshi Qian, Jiahe Li, Nobuyoshi Komuro, Won-Suk Kim and Younghwan Yoo
Future Internet 2024, 16(6), 211; https://doi.org/10.3390/fi16060211 - 15 Jun 2024
Viewed by 528
Abstract
Indoor position fingerprint-based location estimation methods have been widely used by applications on smartphones. In these localization estimation methods, it is very popular to use the RSSI (Received Signal Strength Indication) of signals to represent the position fingerprint. This paper proposes the design [...] Read more.
Indoor position fingerprint-based location estimation methods have been widely used by applications on smartphones. In these localization estimation methods, it is very popular to use the RSSI (Received Signal Strength Indication) of signals to represent the position fingerprint. This paper proposes the design of a particle filter for reducing the estimation error of the machine learning-based indoor BLE location fingerprinting method. Unlike the general particle filter, taking into account the distance, the proposed system designs improved likelihood functions, considering the coordinates based on fingerprint points using mean and variance of RSSI values, combining the particle filter with the k-NN (k-Nearest Neighbor) algorithm to realize the reduction in indoor positioning error. The initial position is estimated by the position fingerprinting method based on the machine learning method. By comparing the fingerprint method based on k-NN with general particle filter processing, and the fingerprint estimation method based on only k-NN or SVM (Support Vector Machine), experiment results showed that the proposed method has a smaller minimum error and a better average error than the conventional method. Full article
Show Figures

Figure 1

19 pages, 16794 KiB  
Article
A Benchmark for UAV-View Natural Language-Guided Tracking
by Hengyou Li, Xinyan Liu and Guorong Li
Electronics 2024, 13(9), 1706; https://doi.org/10.3390/electronics13091706 - 28 Apr 2024
Viewed by 717
Abstract
We propose a new benchmark, UAVNLT (Unmanned Aerial Vehicle Natural Language Tracking), for the UAV-view natural language-guided tracking task. UAVNLT consists of videos taken from UAV cameras from four cities for vehicles on city roads. For each video, vehicles’ bounding boxes, trajectories, and [...] Read more.
We propose a new benchmark, UAVNLT (Unmanned Aerial Vehicle Natural Language Tracking), for the UAV-view natural language-guided tracking task. UAVNLT consists of videos taken from UAV cameras from four cities for vehicles on city roads. For each video, vehicles’ bounding boxes, trajectories, and natural language are carefully annotated. Compared to the existing data sets, which are only annotated with bounding boxes, the natural language sentences in our data set can be more suitable for many application fields where humans take part in the system for that language, being not only more friendly for human–computer interaction but also capable of overcoming the appearance features’ low uniqueness for tracking. We tested several existing methods on our new benchmarks and found that the performance of the existing methods was not satisfactory. To pave the way for future work, we propose a baseline method suitable for this task, achieving state-of-the-art performance. We believe our new data set and proposed baseline method will be helpful in many fields, such as smart city, smart transportation, vehicle management, etc. Full article
Show Figures

Figure 1

20 pages, 1541 KiB  
Article
Developing a Performance Evaluation Framework Structural Model for Educational Metaverse
by Elena Tsappi, Ioannis Deliyannis and George Nathaniel Papageorgiou
Technologies 2024, 12(4), 53; https://doi.org/10.3390/technologies12040053 - 16 Apr 2024
Viewed by 1434
Abstract
In response to the transformative impact of digital technology on education, this study introduces a novel performance management framework for virtual learning environments suitable for the metaverse era. Based on the Structural Equation Modeling (SEM) approach, this paper proposes a comprehensive evaluative model, [...] Read more.
In response to the transformative impact of digital technology on education, this study introduces a novel performance management framework for virtual learning environments suitable for the metaverse era. Based on the Structural Equation Modeling (SEM) approach, this paper proposes a comprehensive evaluative model, anchored on the integration of the Theory of Planned Behavior (TPB), the Unified Theory of Acceptance and Use of Technology (UTAUT), and the Community of Inquiry Framework (CoI). The model synthesizes five Key Performance Indicators (KPIs)—content delivery, student engagement, metaverse tool utilization, student performance, and adaptability—to intricately assess academic avatar performances in virtual educational settings. This theoretical approach marks a significant stride in understanding and enhancing avatar efficacy in the metaverse environment. It enriches the discourse on performance management in digital education and sets a foundation for future empirical studies. As virtual online environments gain prominence in education and training, this research study establishes the basic principles and highlights the key points for further empirical research in the new era of the metaverse educational environment. Full article
Show Figures

Figure 1

32 pages, 24494 KiB  
Article
Interdisciplinary Dynamics in COVID-19 Research: Examining the Role of Computer Science and Collaboration Patterns
by Yunfan Li, Shiyong Liu, An Zeng, Jun Wu, Jiayu Zhang, Weiwei Zhang and Sheng Li
Systems 2024, 12(4), 113; https://doi.org/10.3390/systems12040113 - 28 Mar 2024
Viewed by 1732
Abstract
In academia, it is rare for an event or issue to foster the extensive participation of multiple disciplines. Research related to COVID-19 has undeniably yielded a wealth of valuable insights and impetus for the progress of interdisciplinary research, encompassing concepts, methodologies, intellectual approaches, [...] Read more.
In academia, it is rare for an event or issue to foster the extensive participation of multiple disciplines. Research related to COVID-19 has undeniably yielded a wealth of valuable insights and impetus for the progress of interdisciplinary research, encompassing concepts, methodologies, intellectual approaches, theories, frameworks, data integration and analysis, and pertinent considerations. In the academic community, there is a widespread expectation that as science and technology continue to progress, the convergence of medicine with various other fields will gain momentum. Fields like computer science are anticipated to see expanded applications in domains such as medicine, vaccine research, disease diagnosis, and more. This study aims to examine interdisciplinary approaches in health-related research, particularly in the context of COVID-19. The goal is to analyze and comprehend the involvement and collaboration patterns of various disciplines in pandemic research, with a specific emphasis on the role and integration level of computer science. This study analyzed 240,509 COVID-19 related articles published from December 2019 to September 2022 using methods such as chord diagrams, modularity analysis, and eigenvector centrality analysis in Social Networking Analysis (SNA). The findings revealed an emerging trend of integration trend between Humanities & Social Sciences and Natural Sciences. Expectations that computer science would prominently feature in pandemic research during this technology-driven era haven’t materialized. While it maintains links with engineering, it hasn’t formed strong connections with medicine. This indicates a gap between computer science and core medical research in large-scale health crises, where COVID-19 research remains centered on medicine with varying interdisciplinary collaboration, and high-tech disciplines like computer science have not achieved their expected influence in these studies. Full article
Show Figures

Figure 1

20 pages, 4632 KiB  
Article
Predicting Maps Using In-Vehicle Cameras for Data-Driven Intelligent Transport
by Zhiguo Ma, Yutong Zhang and Meng Han
Electronics 2023, 12(24), 5017; https://doi.org/10.3390/electronics12245017 - 15 Dec 2023
Viewed by 925
Abstract
Bird’s eye view (BEV) semantic maps have evolved into a crucial element of urban intelligent traffic management and monitoring, offering invaluable visual and significant data representations for informed intelligent city decision making. Nevertheless, current methodologies continue underutilizing the temporal information embedded within dynamic [...] Read more.
Bird’s eye view (BEV) semantic maps have evolved into a crucial element of urban intelligent traffic management and monitoring, offering invaluable visual and significant data representations for informed intelligent city decision making. Nevertheless, current methodologies continue underutilizing the temporal information embedded within dynamic frames throughout the BEV feature transformation process. This limitation results in decreased accuracy when mapping high-speed moving objects, particularly in capturing their shape and dynamic trajectory. A framework is proposed for cross-view semantic segmentation to address this challenge, leveraging simulated environments as a starting point before applying it to real-life urban imaginative transportation scenarios. The view converter module is thoughtfully designed to collate information from multiple initial view observations captured from various angles and modes. This module outputs a top-down view semantic graph characterized by its object space layout to preserve beneficial temporal information in BEV transformation. The NuScenes dataset is used to evaluate model effectiveness. A novel application is also devised that harnesses transformer networks to map images and video sequences into top-down or comprehensive bird’s-eye views. By combining physics-based and constraint-based formulations and conducting ablation studies, the approach has been substantiated, highlighting the significance of context above and below a given point in generating these maps. This innovative method has been thoroughly validated on the NuScenes dataset. Notably, it has yielded state-of-the-art instantaneous mapping results, with particular benefits observed for smaller dynamic category displays. The experimental findings include comparing axial attention with the state-of-the-art (SOTA) model, demonstrating the performance enhancement associated with temporal awareness. Full article
Show Figures

Figure 1

30 pages, 17684 KiB  
Article
Designing for Intergenerational Communication among Older Adults: A Systematic Inquiry in Old Residential Communities of China’s Yangtze River Delta
by Cun Li and Ming Cao
Systems 2023, 11(11), 528; https://doi.org/10.3390/systems11110528 - 29 Oct 2023
Viewed by 2459
Abstract
Presently, a substantial majority of older individuals in urban regions of China prefer to inhabit older residential communities over newer counterparts. Within these aging communities, the intricate matter of intergenerational communication among older adults presents a complex and multifaceted issue that warrants comprehensive [...] Read more.
Presently, a substantial majority of older individuals in urban regions of China prefer to inhabit older residential communities over newer counterparts. Within these aging communities, the intricate matter of intergenerational communication among older adults presents a complex and multifaceted issue that warrants comprehensive investigation from a systematic perspective. This paper first employs the observational method to study multiple old residential communities in a city in the Yangtze River Delta region of China. The POEMS framework and the AEIOU framework are applied, focusing on the analysis of individuals and the interaction between individuals and objects, respectively. Semistructured interviews are then conducted with three groups of people, emphasizing community participation by older adults, intergenerational interaction from the perspective of older adults, and intergenerational interaction from the perspective of young people. Finally, the paper categorizes the types and characteristics of individuals in the old communities, identifying the intersections between these groups. The current social situation of older adults and young people is summarized, including behavioral and psychological characteristics and social interaction challenges. Based on these findings, ten system design directions to enhance intergenerational interaction in old communities are proposed, and three of these system design directions are further developed. Full article
Show Figures

Figure 1

20 pages, 1017 KiB  
Article
RAdam-DA-NLSTM: A Nested LSTM-Based Time Series Prediction Method for Human–Computer Intelligent Systems
by Banteng Liu, Wei Chen, Zhangquan Wang, Seyedamin Pouriyeh and Meng Han
Electronics 2023, 12(14), 3084; https://doi.org/10.3390/electronics12143084 - 16 Jul 2023
Cited by 2 | Viewed by 1114
Abstract
At present, time series prediction methods are widely applied for Human–Computer Intelligent Systems in various fields such as Finance, Meteorology, and Medicine. To enhance the accuracy and stability of the prediction model, this paper proposes a time series prediction method called RAdam-Dual stage [...] Read more.
At present, time series prediction methods are widely applied for Human–Computer Intelligent Systems in various fields such as Finance, Meteorology, and Medicine. To enhance the accuracy and stability of the prediction model, this paper proposes a time series prediction method called RAdam-Dual stage Attention mechanism-Nested Long Short-Term Memory (RAdam-DA-NLSTM). First, we design a Nested LSTM (NLSTM), which adopts a new internal LSTM unit structure as the memory cell of LSTM to guide memory forgetting and memory selection. Then, we design a self-encoder network based on the Dual stage Attention mechanism (DA-NLSTM), which uses the NLSTM encoder based on the input attention mechanism, and uses the NLSTM decoder based on the time attention mechanism. Additionally, we adopt the RAdam optimizer to solve the objective function, which dynamically selects Adam and SGD optimizers according to the variance dispersion and constructs the rectifier term to fully express the adaptive momentum. Finally, we use multiple datasets, such as PM2.5 data set, stock data set, traffic data set, and biological signals, to analyze and test this method, and the experimental results show that RAdam-DA-NLSTM has higher prediction accuracy and stability compared with other traditional methods. Full article
Show Figures

Figure 1

Back to TopTop