Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (2,677)

Search Parameters:
Keywords = K-means clustering

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
14 pages, 3633 KiB  
Article
Delineating Regional BES–ELM Neural Networks for Studying Indoor Visible Light Positioning
by Jiaming Zhang and Xizheng Ke
Photonics 2024, 11(10), 910; https://doi.org/10.3390/photonics11100910 (registering DOI) - 27 Sep 2024
Abstract
This paper introduces a single LED and four photodetectors (PDs) as a visible light system structure and collects the received signal strength values and corresponding physical coordinates at the PD receiving end, establishing a comprehensive dataset. The K-means clustering algorithm is employed to [...] Read more.
This paper introduces a single LED and four photodetectors (PDs) as a visible light system structure and collects the received signal strength values and corresponding physical coordinates at the PD receiving end, establishing a comprehensive dataset. The K-means clustering algorithm is employed to separate the room into center and boundary areas through the fingerprint database. The bald eagle search (BES) algorithm is employed to optimize the initial parameters, specifically the weights and thresholds, in the extreme learning machine (ELM) neural network, and the BES–ELM indoor positioning model is established by region to improve positioning accuracy. Due to the impact exerted by the ambient environment, there are fluctuations in the positioning accuracy of the center and edge regions, and the positioning of the edge region needs to be further improved. To address this, it is proposed to use the enhanced weighted K-nearest neighbor (EWKNN) algorithm based on the BES–ELM neural network to correct the prediction points with higher-than-average positioning errors, achieving precise edge positioning. The simulation demonstrates that within an indoor space measuring 5 m × 5 m × 3 m, the algorithm achieves an average positioning error of 2.93 cm, and the positioning accuracy is improved by 86.07% relative to conventional BP neural networks. Full article
Show Figures

Figure 1

16 pages, 2152 KiB  
Article
A Study of GGDP Transition Impact on the Sustainable Development by Mathematical Modelling Investigation
by Nuoya Yue and Junjun Hou
Mathematics 2024, 12(19), 3005; https://doi.org/10.3390/math12193005 - 26 Sep 2024
Abstract
GDP is a common and essential indicator for evaluating a country’s overall economy. However, environmental issues may be overlooked in the pursuit of GDP growth for some countries. It may be beneficial to adopt more sustainable criteria for assessing economic health. In this [...] Read more.
GDP is a common and essential indicator for evaluating a country’s overall economy. However, environmental issues may be overlooked in the pursuit of GDP growth for some countries. It may be beneficial to adopt more sustainable criteria for assessing economic health. In this study, green GDP (GGDP) is discussed using mathematical approaches. Multiple dataset indicators were selected for the evaluation of GGDP and its impact on climate mitigation. The k-means clustering algorithm was utilized to classify 16 countries into three distinct categories for specific analysis. The potential impact of transitioning to GGDP was investigated through changes in a quantitative parameter, the climate impact factor. Ridge regression was applied to predict the impact of switching to GGDP for the three country categories. The consequences of transitioning to GGDP on the quantified improvement of climate indicators were graphically demonstrated over time on a global scale. The entropy weight method (EWM) and TOPSIS were used to obtain the value. Countries in category 2, as divided by k-means clustering, were predicted to show a greater improvement in scores as one of the world’s largest carbon emitters, China, which belongs to category 2 countries, plays a significant role in global climate governance. A specific analysis of China was performed after obtaining the EWM-TOPSIS results. Gray relational analysis and Pearson correlation were carried out to analyze the relationships between specific indicators, followed by a prediction of CO2 emissions based on the analyzed critical indicators. Full article
(This article belongs to the Special Issue Financial Mathematics and Sustainability)
Show Figures

Figure 1

17 pages, 6508 KiB  
Article
RNA-Seq Analysis and Candidate Gene Mining of Gossypium hirsutum Stressed by Verticillium dahliae Cultured at Different Temperatures
by Ni Yang, Zhaolong Gong, Yajun Liang, Shiwei Geng, Fenglei Sun, Xueyuan Li, Shuaishuai Qian, Chengxia Lai, Mayila Yusuyin, Junduo Wang and Juyun Zheng
Plants 2024, 13(19), 2688; https://doi.org/10.3390/plants13192688 - 25 Sep 2024
Abstract
The occurrence and spread of Verticillium dahliae (V. dahliae) in cotton depends on the combined effects of pathogens, host plants, and the environment, among which temperature is one of the most important environmental factors. Studying how temperature impacts the occurrence of [...] Read more.
The occurrence and spread of Verticillium dahliae (V. dahliae) in cotton depends on the combined effects of pathogens, host plants, and the environment, among which temperature is one of the most important environmental factors. Studying how temperature impacts the occurrence of V. dahliae in cotton and the mechanisms governing host defense responses is crucial for disease prevention and control. Understanding the dual effects of temperature on both pathogens and hosts can provide valuable insights for developing effective strategies to manage this destructive fungal infection in cotton. This study was based on the deciduous V. dahliae Vd-3. Through cultivation at different temperatures, Vd-3 formed the most microsclerotia and had the largest colony diameter at 25 °C. Endospore toxins were extracted, and 48 h was determined to be the best pathogenic time point for endotoxins to infect cotton leaves through a chlorophyll fluorescence imaging system and phenotypic evaluation. Transcriptome sequencing was performed on cotton leaves infected with Vd-3 endotoxins for 48 h at different culture temperatures. A total of 34,955 differentially expressed genes (DEGs) were identified between each temperature and CK (no pathogen inoculation), including 17,422 common DEGs. The results of the enrichment analysis revealed that all the DEGs were involved mainly in photosynthesis and sugar metabolism. Among the 34,955 DEGs, genes in the biosynthesis and signal transduction pathways of jasmonic acid (JA), salicylic acid (SA), and ethylene (ET) were identified, and their expression patterns were determined. A total of 5652 unique DEGs were clustered into six clusters using the k-means clustering algorithm, and the functions and main transcription factors (TFs) of each cluster were subsequently annotated. In addition, we constructed a gene regulatory network via weighted correlation network analysis (WGCNA) and identified twelve key genes related to cotton defense against V. dahliae at different temperatures, including four genes encoding transcription factors. These findings provide a theoretical foundation for investigating temperature regulation in V. dahliae infecting cotton and introduce novel genetic resources for enhancing resistance to this disease in cotton plants. Full article
(This article belongs to the Section Plant Genetics, Genomics and Biotechnology)
Show Figures

Figure 1

20 pages, 10618 KiB  
Article
Combining UAV Multi-Source Remote Sensing Data with CPO-SVR to Estimate Seedling Emergence in Breeding Sunflowers
by Shuailing Zhang, Hailin Yu, Bingquan Tian, Xiaoli Wang, Wenhao Cui, Lei Yang, Jingqian Li, Huihui Gong, Junsheng Zhao, Liqun Lu, Jing Zhao and Yubin Lan
Agronomy 2024, 14(10), 2205; https://doi.org/10.3390/agronomy14102205 - 25 Sep 2024
Abstract
In order to accurately obtain the seedling emergence rate of breeding sunflower and to assess the quality of sowing as well as the merit of sunflower varieties, a method of extracting the sunflower seedling emergence rate using multi-source remote sensing information from unmanned [...] Read more.
In order to accurately obtain the seedling emergence rate of breeding sunflower and to assess the quality of sowing as well as the merit of sunflower varieties, a method of extracting the sunflower seedling emergence rate using multi-source remote sensing information from unmanned aerial vehicles is proposed. Visible and multispectral images of sunflower seedlings were acquired using a UAV. The thresholding method was used to segment the excess green image of the visible image into vegetation and non-vegetation, to obtain the center point of the vegetation to generate a buffer, and to mask the visible image to achieve weed removal. The components of color models such as the hue–saturation value (HSV), green-relative color space (YCbCr), cyan-magenta-yellow-black (CMYK), and CIELAB color space (L*A*B) models were compared and analyzed. The A component of the L*A*B model was preferred for the optimization of K-means clustering to segment sunflower seedlings and mulch using the genetic algorithm, and the segmentation accuracy was improved by 4.6% compared with the K-means clustering algorithm. All told, 10 geometric features of sunflower seedlings were extracted using segmented images, and 10 vegetation indices and 48 texture features of sunflower seedlings were calculated based on multispectral images. The Pearson’s correlation coefficient method was used to filter the three types of features, and the geometric feature set, the vegetation index set, the texture feature set, and the preferred feature set were constructed. The construction of a sunflower plant number estimation model using the crested porcupine optimizer–support vector machine is proposed and compared with the sunflower plant number estimation models constructed based on decision tree regression, BP neural network, and support vector machine regression. The results show that the accuracy of the model based on the preferred feature set is higher than that of the other three feature sets, indicating that feature screening can improve the accuracy and stability of models; assessed using the CPO-SVR model, the accuracy of the preferred feature set was the highest, with an R² of 0.94, an RMSE of 5.16, and an MAE of 3.03. Compared to the SVR model, the value of the R2 is improved by 3.3%, the RMSE decreased by 18.3%, and the MAE decreased by 18.1%. The results of the study can be cost-effective, accurate, and reliable in terms of obtaining the seedling emergence rate of sunflower field breeding. Full article
(This article belongs to the Special Issue AI, Sensors and Robotics for Smart Agriculture—2nd Edition)
Show Figures

Figure 1

21 pages, 10847 KiB  
Article
Construction and Comparison of Machine-Learning Forecast Models of Albacore Thunnus alalunga Fishing Grounds in the South Pacific Ocean
by Jianxiong Li, Feng Chen, Qian Dai, Wenbin Zhu, Dewei Li, Wei Yu and Weifeng Zhou
Fishes 2024, 9(10), 375; https://doi.org/10.3390/fishes9100375 - 25 Sep 2024
Abstract
The traditional methods for predicting the distribution of albacore (Thunnus alalunga) fishing grounds have low performance and accuracy. Uneven sampling can result in unreasonable evaluation indicators. To address these issues, three methods, equi-frequency, K-means clustering algorithm, and 1-R split, were applied [...] Read more.
The traditional methods for predicting the distribution of albacore (Thunnus alalunga) fishing grounds have low performance and accuracy. Uneven sampling can result in unreasonable evaluation indicators. To address these issues, three methods, equi-frequency, K-means clustering algorithm, and 1-R split, were applied to discretize the catch per unit effort (CPUE) of albacore in the South Pacific from 2016 to 2021 and partition the fishing grounds into abundance levels. Eight machine learning models were used to predict the fishing grounds. In addition to the traditional evaluation index based on confusion matrix, top-k index was also used to evaluate the accuracy of fishery abundance predictions. The results showed that (1) When sampling is unbalanced, the reported accuracy does not fully represent the actual performance of the model in predicting the abundance of albacore in the fishing ground. F1 value can be used as the index of the model effect and stability. (2) In binary classification, the quartile stacking algorithm has the best stacking performance, with F1 0.89. (3) The top-1 prediction accuracy of three-category fishery forecasting is the highest at 0.74, and the top-1 prediction accuracy of five-category fishery forecasting is the highest at 0.54. (4) The top-k accuracy of classification of fisheries with multiple abundance using K-means is significantly better than that of equal frequency discretization (p < 0.001). The top-k evaluation index was used to predict the fishing grounds of albacore across multiple abundance levels for the first time in this study, which is significant for pioneering a new method for this application and which provides a demonstration of the development of artificial intelligence techniques for fisheries in the future. Full article
(This article belongs to the Special Issue Biodiversity and Spatial Distribution of Fishes)
Show Figures

Figure 1

18 pages, 2062 KiB  
Article
Double-Layer Distributed and Integrated Fault Detection Strategy for Non-Gaussian Dynamic Industrial Systems
by Shengli Dong, Xinghan Xu, Yuhang Chen, Yifang Zhang and Shengzheng Wang
Entropy 2024, 26(10), 815; https://doi.org/10.3390/e26100815 - 25 Sep 2024
Abstract
Currently, with the increasing scale of industrial systems, multisensor monitoring data exhibit large-scale dynamic Gaussian and non-Gaussian concurrent complex characteristics. However, the traditional principal component analysis method is based on Gaussian distribution and uncorrelated assumptions, which are greatly limited in practice. Therefore, developing [...] Read more.
Currently, with the increasing scale of industrial systems, multisensor monitoring data exhibit large-scale dynamic Gaussian and non-Gaussian concurrent complex characteristics. However, the traditional principal component analysis method is based on Gaussian distribution and uncorrelated assumptions, which are greatly limited in practice. Therefore, developing a new fault detection method for large-scale Gaussian and non-Gaussian concurrent dynamic systems is one of the urgent challenges to be addressed. To this end, a double-layer distributed and integrated data-driven strategy based on Laplacian score weighting and integrated Bayesian inference is proposed. Specifically, in the first layer of the distributed strategy, we design a Jarque–Bera test module to divide all multisensor monitoring variables into Gaussian and non-Gaussian blocks, successfully solving the problem of different data distributions. In the second layer of the distributed strategy, we design a dynamic augmentation module to solve dynamic problems, a K-means clustering module to mine local similarity information of variables, and a Laplace scoring module to quantitatively evaluate the structural retention ability of variables. Therefore, this double-layer distributed strategy can simultaneously combine the different distribution characteristics, dynamism, local similarity, and importance of variables, comprehensively mining the local information of the multisensor data. In addition, we develop an integrated Bayesian inference strategy based on detection performance weighting, which can emphasize the differential contribution of local models. Finally, the fault detection results for the Tennessee Eastman production system and a diesel engine working system validate the superiority of the proposed method. Full article
(This article belongs to the Section Multidisciplinary Applications)
Show Figures

Figure 1

29 pages, 1354 KiB  
Article
An Approach for Multi-Item Product Sales Forecasting Based on Advancing the BCG Matrix with Matrix-Clustering and Time Modeling Techniques
by Che-Yu Hung and Chien-Chih Wang
Systems 2024, 12(10), 388; https://doi.org/10.3390/systems12100388 - 25 Sep 2024
Abstract
Customized production has greatly diversified product categories, which has altered product life cycles and added complexity to business management. This paper introduces a matrix-clustering technique that integrates k-means clustering with the BCG Matrix, enhanced by time modeling, to offer a comprehensive framework for [...] Read more.
Customized production has greatly diversified product categories, which has altered product life cycles and added complexity to business management. This paper introduces a matrix-clustering technique that integrates k-means clustering with the BCG Matrix, enhanced by time modeling, to offer a comprehensive framework for multi-item product sales forecasting. The approach builds upon existing BCG Matrix outcomes, re-clustering high-selling products more precisely and redefining their relationship with other product lines more objectively. This method addresses the challenge of forecasting situations with limited historical data, providing more accurate sales predictions. Using Taiwan’s sales data, an empirical study on integrated circuit tray products demonstrated the effectiveness of the matrix clustering technique. The results showed improved data utilization, increasing from 35.93% with the original BCG analysis to 52.43% with the combined matrix-clustering and time modeling methods. This study contributes to academic research by presenting a portfolio analysis approach rooted in matrix clustering, which systematically enhances traditional BCG Matrix methods. The proposed framework is adaptable to the unique traits of different portfolios, offering businesses workflows that are efficient, reliable, sustainable, and scalable. Full article
Show Figures

Figure 1

12 pages, 3245 KiB  
Proceeding Paper
A Recommendation System for E-Commerce Products Using Collaborative Filtering Approaches
by Neelamadhab Padhy, Sridev Suman, T Sanam Priyadarshini and Subhalaxmi Mallick
Eng. Proc. 2024, 67(1), 50; https://doi.org/10.3390/engproc2024067050 - 24 Sep 2024
Abstract
The objective of this article is to recommend products using association rule mining from an E-commerce site. This helps us to recommend products through utilizing the filtering concept. In this article, we use the Apriori and FP-Growth algorithms. Our model not only suggests [...] Read more.
The objective of this article is to recommend products using association rule mining from an E-commerce site. This helps us to recommend products through utilizing the filtering concept. In this article, we use the Apriori and FP-Growth algorithms. Our model not only suggests products but also gives tips on how to make strong suggestion systems that can deal with a lot of data and give quick responses. Our objective is to predict ratings so that the users could be recommended and buy products. There are 1,048,100 records in the dataset. This dataset consists of four features, and these are are follows: {user-id, productid, Ratings, and timing}. Here, we consider the rating as our dependent attribute, and others factors are independent features. In this article, we use collaborative filtering algorithms (SVD, SVD+, and ALS) and also item-based filtering techniques (KNNBasic) to recommend products. Apart from these, sssociation rule mining, hybridization of Apriori, and FP-Growth are used. K-means clustering is used to identify anomalies as well as to create a dashboard, using Power BI for data visualization. Apart from these, we have also developed a hybridization algorithm using Apriori and FP-Growth. Among all the recommendation algorithms, SVD outperforms in recommending the product, and the average RMSE and MAE values are 1.31, and 1.04, respectively. Full article
(This article belongs to the Proceedings of The 3rd International Electronic Conference on Processes)
Show Figures

Figure 1

16 pages, 5276 KiB  
Article
Comparative Study of Lightweight Target Detection Methods for Unmanned Aerial Vehicle-Based Road Distress Survey
by Feifei Xu, Yan Wan, Zhipeng Ning and Hui Wang
Sensors 2024, 24(18), 6159; https://doi.org/10.3390/s24186159 - 23 Sep 2024
Abstract
Unmanned aerial vehicles (UAVs) are effective tools for identifying road anomalies with limited detection coverage due to the discrete spatial distribution of roads. Despite computational, storage, and transmission challenges, existing detection algorithms can be improved to support this task with robustness and efficiency. [...] Read more.
Unmanned aerial vehicles (UAVs) are effective tools for identifying road anomalies with limited detection coverage due to the discrete spatial distribution of roads. Despite computational, storage, and transmission challenges, existing detection algorithms can be improved to support this task with robustness and efficiency. In this study, the K-means clustering algorithm was used to calculate the best prior anchor boxes; Faster R-CNN (region-based convolutional neural network), YOLOX-s (You Only Look Once version X-small), YOLOv5-s, YOLOv7-tiny, YOLO-MobileNet, and YOLO-RDD models were built based on image data collected by UAVs. YOLO-MobileNet has the most lightweight model but performed worst in accuracy, but greatly reduces detection accuracy. YOLO-RDD (road distress detection) performed best with a mean average precision (mAP) of 0.701 above the Intersection over Union (IoU) value of 0.5 and achieved relatively high accuracy in detecting all four types of distress. The YOLO-RDD model most successfully detected potholes with an AP of 0.790. Significant or severe distresses were better identified, and minor cracks were relatively poorly identified. The YOLO-RDD model achieved an 85% computational reduction compared to YOLOv7-tiny while maintaining high detection accuracy. Full article
Show Figures

Figure 1

17 pages, 1602 KiB  
Article
Achieving the Best Symmetry by Finding the Optimal Clustering Filters for Specific Lighting Conditions
by Volodymyr Hrytsyk, Anton Borkivskyi and Taras Oliinyk
Symmetry 2024, 16(9), 1247; https://doi.org/10.3390/sym16091247 - 23 Sep 2024
Abstract
This article explores the efficiency of various clustering methods for image segmentation under different luminosity conditions. Image segmentation plays a crucial role in computer vision applications, and clustering algorithms are commonly used for this purpose. The search for an adaptive clustering mechanism aims [...] Read more.
This article explores the efficiency of various clustering methods for image segmentation under different luminosity conditions. Image segmentation plays a crucial role in computer vision applications, and clustering algorithms are commonly used for this purpose. The search for an adaptive clustering mechanism aims to ensure the maximum symmetry of real objects with objects/segments in their digital representations. However, clustering method performances can fluctuate with varying lighting conditions during image capture. Therefore, we assess the performance of several clustering algorithms—including K-Means, K-Medoids, Fuzzy C-Means, Possibilistic C-Means, Gustafson–Kessel, Entropy-based Fuzzy, Ridler–Calvard, Kohonen Self-Organizing Maps and MeanShift—across images captured under different illumination conditions. Additionally, we develop an adaptive image segmentation system utilizing empirical data. Conducted experiments highlight varied performances among clustering methods under different luminosity conditions. This research enhances a better understanding of luminosity’s impact on image segmentation and aids the method selection for diverse lighting scenarios. Full article
(This article belongs to the Special Issue Image Processing and Symmetry: Topics and Applications)
Show Figures

Figure 1

18 pages, 3036 KiB  
Article
Factors Affecting the Happiness of Learners in Higher Education: Attitude, Grade Point Average, and Time Management
by Nattaporn Thongsri, Jariya Seksan and Pattaraporn Warintarawej
Sustainability 2024, 16(18), 8214; https://doi.org/10.3390/su16188214 - 21 Sep 2024
Abstract
Student well-being is essential for academic achievement and personal growth. Fostering happiness among university students is crucial for individual development, strong family bonds, a harmonious society, and national progress. This study aimed to identify key determinants of student happiness in higher education. Eight [...] Read more.
Student well-being is essential for academic achievement and personal growth. Fostering happiness among university students is crucial for individual development, strong family bonds, a harmonious society, and national progress. This study aimed to identify key determinants of student happiness in higher education. Eight factors, including GPA, workload, family support, university environment, attitude, motivation, time management, and social relationships, were examined among 388 Thai students using an online survey. Students were categorized into distinct groups based on these factors using k-means clustering. ANOVA was employed to assess whether these factors significantly differentiated the groups, and significant factors were further analyzed using regression analysis to confirm their impact on student happiness. A neural network analysis was also utilized to evaluate the relative importance of each factor. The results revealed that attitude, GPA, and time management significantly affected student happiness. A positive attitude fosters a sense of opportunity and achievement, a high GPA reflects academic success and enhances self-confidence, and effective time management reduces stress while allowing more time for enjoyable activities. Full article
Show Figures

Figure 1

20 pages, 7057 KiB  
Article
Weather Condition Clustering for Improvement of Photovoltaic Power Plant Generation Forecasting Accuracy
by Kristina I. Haljasmaa, Andrey M. Bramm, Pavel V. Matrenin and Stanislav A. Eroshenko
Algorithms 2024, 17(9), 419; https://doi.org/10.3390/a17090419 - 20 Sep 2024
Abstract
Together with the growing interest towards renewable energy sources within the framework of different strategies of various countries, the number of solar power plants keeps growing. However, managing optimal power generation for solar power plants has its own challenges. First comes the problem [...] Read more.
Together with the growing interest towards renewable energy sources within the framework of different strategies of various countries, the number of solar power plants keeps growing. However, managing optimal power generation for solar power plants has its own challenges. First comes the problem of work interruption and reduction in power generation. As the system must be tolerant to the faults, the relevance and significance of short-term forecasting of solar power generation becomes crucial. Within the framework of this research, the applicability of different forecasting methods for short-time forecasting is explained. The main goal of the research is to show an approach regarding how to make the forecast more accurate and overcome the above-mentioned challenges using opensource data as features. The data clustering algorithm based on KMeans is proposed to train unique models for specific groups of data samples to improve the generation forecast accuracy. Based on practical calculations, machine learning models based on Random Forest algorithm are selected which have been proven to have higher efficiency in predicting the generation of solar power plants. The proposed algorithm was successfully tested in practice, with an achieved accuracy near to 90%. Full article
(This article belongs to the Special Issue Algorithms for Time Series Forecasting and Classification)
Show Figures

Graphical abstract

37 pages, 2367 KiB  
Article
Waste Management and Innovation: Insights from Europe
by Lucio Laureti, Alberto Costantiello, Fabio Anobile, Angelo Leogrande and Cosimo Magazzino
Recycling 2024, 9(5), 82; https://doi.org/10.3390/recycling9050082 - 19 Sep 2024
Abstract
This paper analyzes the relationship between urban waste recycling and innovation systems in Europe. Data from the Global Innovation Index for 34 European countries in the period 2013–2022 were used. To analyze the characteristics of European countries in terms of waste recycling capacity, [...] Read more.
This paper analyzes the relationship between urban waste recycling and innovation systems in Europe. Data from the Global Innovation Index for 34 European countries in the period 2013–2022 were used. To analyze the characteristics of European countries in terms of waste recycling capacity, the k-Means algorithm optimized with the Elbow method and the Silhouette Coefficient was used. The results show that the optimal number of clusters is three. Panel data results show that waste recycling increases with domestic market scale, gross capital formation, and the diffusion of Information and Communication Technologies (ICTs), while it decreases with the infrastructure index, business sophistication index, and the average expenditure on research and development of large companies. Full article
Show Figures

Figure 1

14 pages, 914 KiB  
Article
Identifying Key Factors for Securing a Champions League Position in French Ligue 1 Using Explainable Machine Learning Techniques
by Spyridon Plakias, Christos Kokkotis, Michalis Mitrotasios, Vasileios Armatas, Themistoklis Tsatalas and Giannis Giakas
Appl. Sci. 2024, 14(18), 8375; https://doi.org/10.3390/app14188375 - 18 Sep 2024
Abstract
Introduction: Performance analysis is essential for coaches and a topic of extensive research. The advancement of technology and Artificial Intelligence (AI) techniques has revolutionized sports analytics. Aim: The primary aim of this article is to present a robust, explainable machine learning (ML) model [...] Read more.
Introduction: Performance analysis is essential for coaches and a topic of extensive research. The advancement of technology and Artificial Intelligence (AI) techniques has revolutionized sports analytics. Aim: The primary aim of this article is to present a robust, explainable machine learning (ML) model that identifies the key factors that contribute to securing one of the top three positions in the standings of the French Ligue 1, ensuring participation in the UEFA Champions League for the following season. Materials and Methods: This retrospective observational study analyzed data from all 380 matches of the 2022–23 French Ligue 1 season. The data were obtained from the publicly-accessed website “whoscored” and included 34 performance indicators. This study employed Sequential Forward Feature Selection (SFFS) and various ML algorithms, including XGBoost, Support Vector Machine (SVM), and Logistic Regression (LR), to create a robust, explainable model. The SHAP (SHapley Additive Explanations) model was used to enhance model interpretability. Results: The K-means Cluster Analysis categorized teams into groups (TOP TEAMS, 3 teams/REST TEAMS, 17 teams), and the ML models provided significant insights into the factors influencing league standings. The LR classifier was the best-performing classifier, achieving an accuracy of 75.13%, a recall of 76.32%, an F1-score of 48.03%, and a precision of 35.17%. “SHORT PASSES” and “THROUGH BALLS” were features found to positively influence the model’s predictions, while “TACKLES ATTEMPTED” and “LONG BALLS” had a negative impact. Conclusions: Our model provided satisfactory predictive accuracy and clear interpretability of results, which gave useful information to stakeholders. Specifically, our model suggests adopting a strategy during the ball possession phase that relies on short passes (avoiding long ones) and aiming to enter the attacking third and the opponent’s penalty area with through balls. Full article
Show Figures

Figure 1

20 pages, 3793 KiB  
Article
Travel Time Variability in Urban Mobility: Exploring Transportation System Reliability Performance Using Ridesharing Data
by Yuxin Sun and Ying Chen
Sustainability 2024, 16(18), 8103; https://doi.org/10.3390/su16188103 - 17 Sep 2024
Abstract
Travel time variability (TTV) is a crucial indicator of transportation network performance, assessing travel time reliability and delays. This study investigates TTV metrics within the context of shared mobility using probe data from transportation network companies (TNCs) in Chicago, Los Angeles, and Dallas–Fort [...] Read more.
Travel time variability (TTV) is a crucial indicator of transportation network performance, assessing travel time reliability and delays. This study investigates TTV metrics within the context of shared mobility using probe data from transportation network companies (TNCs) in Chicago, Los Angeles, and Dallas–Fort Worth. Eight reliability metrics are analyzed and compared for each origin–destination (OD) pair in the network, including standard deviation (SD), the Planning Time Index (PTI), the Travel Time Index (TTI), the Buffer Index (BI), On-time Measures PR (alpha), and the Misery Index (MI), to evaluate their effectiveness in clustering OD pairs using K-means clustering. The findings confirm that SD, PTI, and MI are particularly effective in measuring travel time reliability and clustering within urban systems. This study identifies the most unbalanced supply–demand OD pairs/regions in each city, noting that low/medium-SD clusters around metropolitan airports indicate stable travel times even in high-demand zones, while high-SD clusters in downtown areas reveal significant traffic demands and unreliability. These patterns become more pronounced in study areas with multiple city centers. This study highlights the need for targeted strategies to enhance travel time reliability, particularly in regions like Dallas–Fort Worth, where public transportation alternatives are limited. Full article
(This article belongs to the Section Sustainable Transportation)
Show Figures

Figure 1

Back to TopTop