Version 1
: Received: 17 July 2024 / Approved: 18 July 2024 / Online: 18 July 2024 (08:48:13 CEST)
How to cite:
Liu, C.; Xu, S.; Wang, Z.; Chen, Y.; Chao, L.; Lin, Y.; Yan, H. Research on Students’ Utilization of Artificial Intelligence Based on Random Forest Model and PCA-K-means Algorithm. Preprints2024, 2024071477. https://doi.org/10.20944/preprints202407.1477.v1
Liu, C.; Xu, S.; Wang, Z.; Chen, Y.; Chao, L.; Lin, Y.; Yan, H. Research on Students’ Utilization of Artificial Intelligence Based on Random Forest Model and PCA-K-means Algorithm. Preprints 2024, 2024071477. https://doi.org/10.20944/preprints202407.1477.v1
Liu, C.; Xu, S.; Wang, Z.; Chen, Y.; Chao, L.; Lin, Y.; Yan, H. Research on Students’ Utilization of Artificial Intelligence Based on Random Forest Model and PCA-K-means Algorithm. Preprints2024, 2024071477. https://doi.org/10.20944/preprints202407.1477.v1
APA Style
Liu, C., Xu, S., Wang, Z., Chen, Y., Chao, L., Lin, Y., & Yan, H. (2024). Research on Students’ Utilization of Artificial Intelligence Based on Random Forest Model and PCA-K-means Algorithm. Preprints. https://doi.org/10.20944/preprints202407.1477.v1
Chicago/Turabian Style
Liu, C., Yang Lin and Hao Yan. 2024 "Research on Students’ Utilization of Artificial Intelligence Based on Random Forest Model and PCA-K-means Algorithm" Preprints. https://doi.org/10.20944/preprints202407.1477.v1
Abstract
This study delves into the impact of artificial intelligence on students’ learning.The paper pioneers the use of kmeans classification to reduce the dimensionality of features before the classification process, which minimizes redundant information and computational overhead. This step is not commonly found in prior research.Firstly, a random forest model is established with “whether AI assistance was used to complete the paper” as the output, and various features as inputs. The analysis of the predictive accuracy when changing input features reveals the degree of influence of each feature on using AI to complete the paper. After feature selection, the remaining 6 features can still effectively predict the use of AI to complete the paper, indicating the representativeness of the extracted indicators. Subsequently, the PCA-K-means algorithm is employed to classify the use of artificial intelligence. PCA algorithm is used for dimensionality reduction of the extracted features to better characterize students’ use of AI. Three representative variables are selected, and the contribution of each feature is computed based on the factor loading matrix. Finally, the K-means algorithm is used to classify students based on the three dimensional features, showing that students can be effectively classified into 2 categories. The results indicate that the majority of students support the use of AI technology to enhance their abilities, although some students may only intend to use AI to cope with exams and similar tasks.
Keywords
Artificial intelligence; Random forest model; PCA algorithm; K-means algorithm
Subject
Computer Science and Mathematics, Computer Science
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.