Communications in Computer and Information Science, 2015
Security informatics and computational intelligence are gaining importance in detecting terrorist activities, as extremist groups misuse many of the available Internet services to incite violence and hatred. However, the inadequate performance of statistical computational intelligence methods reduces the efficiency of intelligent techniques in supporting counterterrorism efforts and limits the opportunities for early detection of potential terrorist activities. In this paper, we propose a feature set hybridization method, based on feature selection and extraction methods, for accurate content classification in Arabic dark web pages. The proposed method hybridizes the feature sets so that the generated feature set contains fewer features that are capable of achieving higher classification performance. A selected dataset from the Dark Web Forum Portal (DWFP) is used to test the performance of the proposed method, which is based on Term Frequency - Inverse Document Frequency (TFIDF) as the feature selection method on the one hand, and Random Projection (RP) and Principal Component Analysis (PCA) as feature extraction methods on the other. Classification results using the Support Vector Machine (SVM) classifier show that high classification performance is achieved by the hybridization of TFIDF and PCA, with F1 and accuracy both reaching 99%.
2020 International Conference on Promising Electronic Technologies (ICPET), 2020
The genetic algorithm (GA) is a well-known adaptive, nature-inspired technique that is utilized in various applications and research domains. This research proposes an Enhanced Genetic Algorithm (EGA) inspired by genetic engineering. In EGA, the process of chromosome generation is intervened in, based on the intercorrelation between genes, such that highly correlated genes are treated. The proposed EGA was employed to optimize the classification results of a Support Vector Machine (SVM) on the popular "Spambase" dataset. Experimental results show that the classification results were better optimized using EGA than using basic GA optimization.
The role of intelligence and security informatics based on statistical computations is becoming more significant in proactively detecting terrorism activities, as extremist groups misuse many of the facilities available on the Internet to incite violence and hatred. However, the performance of statistical methods is reported to be limited, because these methods cannot comprehend the meaning of texts created by humans and therefore produce inadequate accuracy. Misclassification of actual terrorism web content as non-terrorism, or vice versa, reduces the usefulness of intelligent techniques in supporting efforts against potential threats and limits the opportunities for the effective use of intelligence and security informatics in the early detection of terrorist activities. In this paper, we propose a hybridized method based on the basic term-weighting techniques for accurate detection of terrorism activities in textual contexts. The proposed method combines the fea...
A hospital website should be easy to use and understandable by patients and the general public. A hospital website that meets usability standards can reduce the gap between patients themselves and improve user satisfaction significantly. As more people turn to the Internet to seek health information, health-care websites containing relevant and useful health content are the need of the day. The current study is part of research that aims to understand and highlight usability issues and to propose design rules for developers so that hospital websites in Pakistan are produced in accordance with usability standards. The websites of four public sector hospitals operating in Pakistan were evaluated with the objective of identifying usability issues. These issues were used to form design rules. Twenty-six participants took part in the study and performed user tasks. Data was collected through questionnaire and direct observation techniques. Average scores calculated against each website and ...
Thesauri are used in many Information Retrieval (IR) applications such as data integration, data warehousing, semantic query processing and classifiers. They have also been utilized to solve the problem of schema matching. Given that many thesauri exist for a given area of knowledge, the quality of schema matching results when using different thesauri in the same field is not predictable. In this paper, we propose a methodology to study the performance of a thesaurus in solving schema matching. The paper also presents the results of experiments using different thesauri. Precision, recall, F-measure, and similarity average were calculated to show that the quality of matching changed according to the thesaurus used.
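The precision, recall and F-measure named in this abstract can be illustrated with a minimal sketch. It assumes that a matcher's output and a hand-made reference alignment are both available as sets of attribute pairs; the function name `matching_quality` and the sample attribute names are hypothetical and do not come from the paper. The paper's "similarity average" is not defined in the abstract, so it is left out.

```python
def matching_quality(found, reference):
    """Return (precision, recall, F-measure) of found pairs against reference pairs."""
    found, reference = set(found), set(reference)
    true_positives = len(found & reference)  # pairs that are both found and correct
    precision = true_positives / len(found) if found else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    total = precision + recall
    # F-measure is the harmonic mean of precision and recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


# Hypothetical reference alignment between two schemas (illustration only)
reference = {
    ("cust_name", "customer"),
    ("addr", "address"),
    ("tel", "phone"),
    ("zip", "postcode"),
}
# Hypothetical output of a matcher using one particular thesaurus
found = {
    ("cust_name", "customer"),
    ("addr", "address"),
    ("tel", "fax"),
}

precision, recall, f_measure = matching_quality(found, reference)
print(f"precision={precision:.3f} recall={recall:.3f} F={f_measure:.3f}")
```

Running the same function on the output obtained with each thesaurus gives comparable scores against one reference alignment, which is the kind of comparison the abstract describes.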
Automatic text classification is an effective solution for sorting the increasing amount of online textual content. However, high dimensionality remains a considerable impediment in the text classification field, and although many statistical methods are available to address this issue, none has proved effective enough in solving it. This paper proposes a machine learning based feature ranking and selection method named Support Vector Machine based Feature Ranking Method (SVM-FRM). The proposed method utilizes the Support Vector Machine (SVM) learning algorithm to weight and select the significant features in order to obtain better classification performance. Hybridization techniques are then applied to enhance the performance of the SVM-FRM method in some experimental situations. The proposed SVM-FRM method and its enhancement are tested using three public text classification datasets. The achieved results are com...
International Journal of Distributed Sensor Networks, 2016
Sensor nodes in wireless sensor networks are deployed to observe the surroundings for some phenomenon of interest. The fundamental issue in observing such environments is area coverage, which reflects how well the region is monitored. The nonuniform distribution of sensor nodes in a region caused by random deployment might lead to coverage holes or gaps in the network. One solution for improving area coverage after initial deployment is sensor node mobility. However, the main challenge in this approach is how to increase area coverage with the least energy consumption. This research work aims to improve area coverage with minimal energy consumption and a faster convergence rate. The Edge Based Centroid (EBC) algorithm is presented to improve area coverage with a faster convergence rate in a distributed network. The simulation based performance evaluations of the proposed algorithms are carried out in terms of area coverage, convergence rate, and energy efficiency. Comp...