Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Hoda Mohamed

The advances in processing and communication techniques resulted in a multitude of emerging applications that interact with streams of data. Traditional data mining systems store arriving data, collect them for later mining, and make... more
The advances in processing and communication techniques resulted in a multitude of emerging applications that interact with streams of data. Traditional data mining systems store arriving data, collect them for later mining, and make multiple passes over the collected data. Unfortunately, these systems are prohibitively slow when they deal with data streams with massive amounts of data arriving at high rates. This paper introduces a new model for mining sequential patterns on distributed data streams environments. It focuses on evolving data streams that originate from multiple distributed sources. Moreover, the mining process is achieved without compromising the privacy of the individual data streams of the participant nodes. Simulation results show that the proposed model scales linearly with the number of distributed nodes. In addition, it reduces the overhead in the distributed mining process.
ABSTRACT This paper proposes a methodology to prepare corpora in Arabic language from online social network (OSN) and review site for Sentiment Analysis (SA) task. The paper also proposes a methodology for generating a stopword list from... more
ABSTRACT This paper proposes a methodology to prepare corpora in Arabic language from online social network (OSN) and review site for Sentiment Analysis (SA) task. The paper also proposes a methodology for generating a stopword list from the prepared corpora. The aim of the paper is to investigate the effect of removing stopwords on the SA task. The problem is that the stopwords lists generated before were on Modern Standard Arabic (MSA) which is not the common language used in OSN. We have generated a stopword list of Egyptian dialect and a corpus-based list to be used with the OSN corpora. We compare the efficiency of text classification when using the generated lists along with previously generated lists of MSA and combining the Egyptian dialect list with the MSA list. The text classification was performed using Na\"ive Bayes and Decision Tree classifiers and two feature selection approaches, unigrams and bigram. The experiments show that the general lists containing the Egyptian dialects words give better performance than using lists of MSA stopwords only.
Page 1. General architecture of multi 1 General architecture of multi agent intelligent active learning system Dr. Hoda K. Mohamed, Dr. Hassan S. Bedor Department of Computers & Systems, Cairo, Egypt Eng. Howida A. Shedeed ...
ABSTRACT Data mining in databases is an important issue in the development of data and knowledge-base systems. It facilitates querying database knowledge and semantic query optimization. The aim of the work is to use data mining tools for... more
ABSTRACT Data mining in databases is an important issue in the development of data and knowledge-base systems. It facilitates querying database knowledge and semantic query optimization. The aim of the work is to use data mining tools for intelligent query answering in database systems, which include generalization, data summarization and rule discovery. We used a model for a knowledge-rich database, which consists not only of the components from a deductive database but also the components relevant to knowledge discovery tools. The discovered rule set constitutes a graph whose edges are the rules. This condition dependency graph provides a map of possible query reformulation operations so that semantic query optimization could be performed. Semantic knowledge is needed to optimize queries. Semantic knowledge may be in the form of generalized rules or association rules. Both will be applied to semantic query optimization system.
Page 1. Collective Sequential Pattern Mining in Distributed Evolving Data Streams Amany F. Soliman1,a+ , Gamal A. Ebrahim1,b and Hoda K. Mohammed1,c 1 Computer and Systems Engineering Department, Faculty of Engineering, Ain Shams... more
Page 1. Collective Sequential Pattern Mining in Distributed Evolving Data Streams Amany F. Soliman1,a+ , Gamal A. Ebrahim1,b and Hoda K. Mohammed1,c 1 Computer and Systems Engineering Department, Faculty of Engineering, Ain Shams University, Cairo, Egypt ...
ABSTRACT A new image inpainting technique is developed to fit perfectly for special categories of images that contain mainly buildings. This technique handles the need to obtain an image of building free from parking cars along sides of... more
ABSTRACT A new image inpainting technique is developed to fit perfectly for special categories of images that contain mainly buildings. This technique handles the need to obtain an image of building free from parking cars along sides of the roads. To do that, one needs to carefully inpaint the roads and the missing parts of images. This can be done by combining vanishing points detections and image segmentation. After detecting vanishing points, it is possible to draw the line dividing the missing regions into two parts: the road part and the building part. Each part should be then inpainted independently using a different source region for each one. A segmentation technique, which is based on color and texture features, is employed to extract a source region for road part inpainting. Simple geometric calculations are employed to detect the source region for building part inpainting.
ABSTRACT Cloud computing is offering utility oriented IT services to users worldwide. Based on a pay per use model, it provides a variety of computing resources, enterprise applications while enabling their hosting from consumer,... more
ABSTRACT Cloud computing is offering utility oriented IT services to users worldwide. Based on a pay per use model, it provides a variety of computing resources, enterprise applications while enabling their hosting from consumer, scientific and business domains through a three layered architecture and different cloud types. The proliferation of cloud computing has resulted in the establishment of large-scale data centers around the world containing thousands of computing nodes which consume huge amounts of energy, contributing to high operational costs and carbon footprints to the environment. Energy consumption is not only determined by hardware efficiency, but it also depends on the resource management system deployed on the infrastructure and the efficiency of applications running in the system. The challenge is addressed in finding cloud computing solutions that not only save energy for the environment but also reduce operational costs. Our Fuzzy based contribution improves power efficiency with around 40 % than other policies.
Page 1. A GENERAL ARCHITECTURE OF STUDENT MODELTO ASSESS THE LEARNING PERFORMANCE IN INTELLIGENT TUTORING SYSTEMS Dr. Hassan S. Bedor, Dr. Hoda K. Mohamed Ain Shams University, Faculty ...
Page 1. Automatic Documents Classification Hoda K. Mohamed E-mail: hodakm2002@yahoo.com Ain Shams U., Faculty of Eng., Computer & System Eng. Dept., Cairo, Egypt Abstract-Automatic document classification is of ...
ABSTRACT Intrusion detection systems (IDSs) have become a widely used measure for security systems. The main problem for those systems results is the irrelevant alerts on those results. We will propose a data mining based method for... more
ABSTRACT Intrusion detection systems (IDSs) have become a widely used measure for security systems. The main problem for those systems results is the irrelevant alerts on those results. We will propose a data mining based method for classification to distinguish serious alerts and irrelevant one with a performance of 99.9% which is better in comparison with the other recent data mining methods that have reached the performance of 97%. A ranked alerts list also created according to alerts importance to minimize human interventions.
The paper presents the design of a model for forecasting long-term electricity load. The model uses data mining techniques. The paper defines the load forecast and the summary of the most important factors affecting the load forecast in... more
The paper presents the design of a model for forecasting long-term electricity load. The model uses data mining techniques. The paper defines the load forecast and the summary of the most important factors affecting the load forecast in Egyptian electricity network. The steps needed for the knowledge discovery process is implemented to the time series data. Preprocessing the data in