The concept and data drift problems have received much attention in recent years. This aspect is ... more The concept and data drift problems have received much attention in recent years. This aspect is crucial in many domains exhibiting non-stationary and cyclical patterns affecting their generative processes. Drift detection can be treated as a supervised task, with labeled data constantly used to validate the learned model. From a practical point of view, this is an impractical task because labeling is complex, costly, and time-consuming. On the other hand, unsupervised change detection techniques are cumbersome in applications because they generate many false alarms. The paper presents a new concept drift detection method based on feature analysis. Stream of data carries information about the distribution patterns that reflect different concepts that may be hidden in the data. The essential features are searched and ranked by LASSO. The rank of features and statistics are employed to feature drift detection. The proposed approach was experimentally checked based on synthetic and nat...
Keystroke Dynamics - database contains free-typing keystroke dynamics, correlated with user IDs. ... more Keystroke Dynamics - database contains free-typing keystroke dynamics, correlated with user IDs. 15000 records for keystroke dynamics. 150 users, 100 records have been obtained per user. Data of user "X" is in file "user_X". Each row of the file represents a user keystroke dynamics vector, each column represents one feature attribute. The last column contains class labels.
Advances in intelligent systems and computing, Nov 3, 2015
Liver fibrosis is a common disease of the European population (but not only them). It may have ma... more Liver fibrosis is a common disease of the European population (but not only them). It may have many backgrounds and may develop with a different rapidity—it may stay hidden for many years or rapidly develop into terminal stage called cirrhosis, where liver can no longer fulfill its function. Unfortunately, current methods of diagnosis are either connected with a potential risk for a patient and require a hospitalization or are expensive and not very accurate. This paper presents a comparative study of various feature selection algorithms combined with selected machine learning algorithms which may be used to build an advanced liver fibrosis diagnosis support system based on a nonexpensive and safe routine blood tests. Experiments carried out on a dataset collected by authors, proved usability and satisfactory accuracy of the presented algorithms.
The concept and data drift problems have received much attention in recent years. This aspect is ... more The concept and data drift problems have received much attention in recent years. This aspect is crucial in many domains exhibiting non-stationary and cyclical patterns affecting their generative processes. Drift detection can be treated as a supervised task, with labeled data constantly used to validate the learned model. From a practical point of view, this is an impractical task because labeling is complex, costly, and time-consuming. On the other hand, unsupervised change detection techniques are cumbersome in applications because they generate many false alarms. The paper presents a new concept drift detection method based on feature analysis. Stream of data carries information about the distribution patterns that reflect different concepts that may be hidden in the data. The essential features are searched and ranked by LASSO. The rank of features and statistics are employed to feature drift detection. The proposed approach was experimentally checked based on synthetic and nat...
Keystroke Dynamics - database contains free-typing keystroke dynamics, correlated with user IDs. ... more Keystroke Dynamics - database contains free-typing keystroke dynamics, correlated with user IDs. 15000 records for keystroke dynamics. 150 users, 100 records have been obtained per user. Data of user "X" is in file "user_X". Each row of the file represents a user keystroke dynamics vector, each column represents one feature attribute. The last column contains class labels.
Advances in intelligent systems and computing, Nov 3, 2015
Liver fibrosis is a common disease of the European population (but not only them). It may have ma... more Liver fibrosis is a common disease of the European population (but not only them). It may have many backgrounds and may develop with a different rapidity—it may stay hidden for many years or rapidly develop into terminal stage called cirrhosis, where liver can no longer fulfill its function. Unfortunately, current methods of diagnosis are either connected with a potential risk for a patient and require a hospitalization or are expensive and not very accurate. This paper presents a comparative study of various feature selection algorithms combined with selected machine learning algorithms which may be used to build an advanced liver fibrosis diagnosis support system based on a nonexpensive and safe routine blood tests. Experiments carried out on a dataset collected by authors, proved usability and satisfactory accuracy of the presented algorithms.
Uploads
Papers by Piotr Porwik