Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleApril 2024
Novel parameter-free and parametric same degree distribution-based dimensionality reduction algorithms for trustworthy data structure preserving
Information Sciences: an International Journal (ISCI), Volume 661, Issue Chttps://doi.org/10.1016/j.ins.2023.120030AbstractAs an effective dimensionality reduction method, Same Degree Distribution (SDD) has been demonstrated to be able to maintain better data structure than other dimensionality reduction methods, including Principal Component Analysis (PCA), ...
- research-articleMay 2024
Application of non parametric Bayesian methods in high dimensional data
Journal of Computational Methods in Sciences and Engineering (JOCMSE), Volume 24, Issue 2Pages 731–743https://doi.org/10.3233/JCM-237104With the development of technology and the widespread collection of data, high-dimensional data analysis has become a research hotspot in many fields. Traditional parameter methods often face problems such as dimensional disasters in high-dimensional ...
- research-articleDecember 2023
High dimensional controlled variable selection with model-X knockoffs in the AFT model
Computational Statistics (CSTAT), Volume 39, Issue 4Pages 1993–2009https://doi.org/10.1007/s00180-023-01426-5AbstractInterpretability and stability are two important characteristics required for the application of high dimensional data in statistics. Although the former has been favored by many existing forecasting methods to some extent, the latter in the sense ...
- research-articleDecember 2023
High Dimensional Data Classification Approach with Deep Learning and Tucker Decomposition
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication TechnologyPages 363–370https://doi.org/10.1145/3628797.3628824Along with the success of deep learning are extensive models and huge amounts of data. Therefore, efficiency in deep learning is one of the most concerning problems. Many methods have been proposed to reduce the complexity of models and have achieved ...
- ArticleSeptember 2023
Relative Intrinsic Dimensionality Is Intrinsic to Learning
Artificial Neural Networks and Machine Learning – ICANN 2023Pages 516–529https://doi.org/10.1007/978-3-031-44207-0_43AbstractHigh dimensional data can have a surprising property: pairs of data points may be easily separated from each other, or even from arbitrary subsets, with high probability using just simple linear classifiers. However, this is more of a rule of ...
-
- ArticleAugust 2023
A Geometric View on the Role of Nonlinear Feature Maps in Few-Shot Learning
AbstractWe investigate the problem of successfully learning from just a few examples of data points in a binary classification problem, and present a brief overview of some recent results on the role of nonlinear feature maps in this challenging task. Our ...
- research-articleApril 2023
Robust and sparse multinomial regression in high dimensions
Data Mining and Knowledge Discovery (DMKD), Volume 37, Issue 4Pages 1609–1629https://doi.org/10.1007/s10618-023-00936-6AbstractA robust and sparse estimator for multinomial regression is proposed for high dimensional data. Robustness of the estimator is achieved by trimming the observations, and sparsity of the estimator is obtained by the elastic net penalty. In contrast ...
- research-articleApril 2023
Learning from high dimensional data based on weighted feature importance in decision tree ensembles
Computational Statistics (CSTAT), Volume 39, Issue 1Pages 313–342https://doi.org/10.1007/s00180-023-01347-3AbstractLearning from high dimensional data has been utilized in various applications such as computational biology, image classification, and finance. Most classical machine learning algorithms fail to give accurate predictions in high dimensional ...
- research-articleMarch 2023
Deep neural networks with L1 and L2 regularization for high dimensional corporate credit risk prediction
Expert Systems with Applications: An International Journal (EXWA), Volume 213, Issue PAhttps://doi.org/10.1016/j.eswa.2022.118873Highlights- This study adds more external information to predict corporate credit risk (CCR).
Accurate credit risk prediction can help companies avoid bankruptcies and make adjustments ahead of time. There is a tendency in corporate credit risk prediction that more and more features are considered in the prediction system. ...
- research-articleFebruary 2023
OLP++: An online local classifier for high dimensional data
Information Fusion (INFU), Volume 90, Issue CPages 120–137https://doi.org/10.1016/j.inffus.2022.09.010AbstractEnsemble diversity is an important characteristic of Multiple Classifier Systems (MCS), which aim at improving the overall performance of a classification system by combining the response of several models. While diversity may be ...
Highlights- Local-based ensemble methods produce and/or exploit models specialized in each area.
- research-articleJanuary 2023
A batch process for high dimensional imputation
Computational Statistics (CSTAT), Volume 39, Issue 2Pages 781–802https://doi.org/10.1007/s00180-023-01325-9AbstractThis paper describes a correlation-based batch process for addressing high dimensional imputation problems. There are relatively few algorithms designed to efficiently handle imputation of missing data in high dimensional contexts. Fewer still are ...
- research-articleJanuary 2023
Robust inference for change points in high dimension
Journal of Multivariate Analysis (JMUL), Volume 193, Issue Chttps://doi.org/10.1016/j.jmva.2022.105114AbstractThis paper proposes a new test for a change point in the mean of high-dimensional data based on the spatial sign and self-normalization. The test is easy to implement with no tuning parameters, robust to heavy-tailedness and ...
- ArticleNovember 2022
Binary Gravitational Subspace Search for Outlier Detection in High Dimensional Data Streams
AbstractIn recent years, technology has continued to rapidly evolve, resulting in the generation of high-dimensional data streams. Combining the streaming scenario and high dimensionality is a particularly complex task specifically for outlier detection. ...
- research-articleNovember 2022
Subspace-based outlier detection using linear programming and heuristic techniques
Expert Systems with Applications: An International Journal (EXWA), Volume 207, Issue Chttps://doi.org/10.1016/j.eswa.2022.117955AbstractA useful strategy to perform outlier detection (OD) in highdimensional data, especially in the presence of multiple classes of outliers, is to decompose the outlier detection problem into a set of relevant subspace selection (RSS) ...
- research-articleNovember 2022
Design of fuzzy rule-based models with fuzzy relational factorization
Expert Systems with Applications: An International Journal (EXWA), Volume 206, Issue Chttps://doi.org/10.1016/j.eswa.2022.117904Highlights- Granulation of data using different encoding mechanisms.
- Design of fuzzy rule-...
This study opens an original avenue of designing interpretable fuzzy rule-based models realized in the presence of data characterized by a large number of attributes (input variables). In the presence of such high-dimensional data, the ...
- research-articleAugust 2022
Mixture of von Mises-Fisher distribution with sparse prototypes
AbstractMixtures of von Mises-Fisher distributions can be used to cluster data on the unit hypersphere. This is particularly adapted for high-dimensional directional data such as texts. We propose in this article to estimate a von Mises ...
- research-articleJuly 2022
Binary Golden Eagle Optimizer with Time-Varying Flight Length for feature selection
AbstractThe concept of any method to resolve feature selection issues is to identify a subset of the original features. However, determining a minimal feature subset is considered an NP-hard problem. Many existing feature selection methods, ...
- research-articleJune 2022
Robust logistic zero-sum regression for microbiome compositional data
Advances in Data Analysis and Classification (SPADAC), Volume 16, Issue 2Pages 301–324https://doi.org/10.1007/s11634-021-00465-4AbstractWe introduce the Robust Logistic Zero-Sum Regression (RobLZS) estimator, which can be used for a two-class problem with high-dimensional compositional covariates. Since the log-contrast model is employed, the estimator is able to do feature ...
- research-articleJune 2022
Online active classification via margin-based and feature-based label queries
Machine Language (MALE), Volume 111, Issue 6Pages 2323–2348https://doi.org/10.1007/s10994-022-06133-8AbstractIn the paradigm of online active classification, the learner not only has to predict the label of each incoming instance, but also must decide whether the true label of that instance should be supplied, or not. The overall goal is to minimize the ...
- ArticleMay 2022
Benchmarking Penalized Regression Methods in Machine Learning for Single Cell RNA Sequencing Data
AbstractSingle Cell RNA Sequencing (scRNA-seq) technology has enabled the biological research community to explore gene expression at a single-cell resolution. By studying differences in gene expression, it is possible to differentiate cell clusters and ...