Data Streams are unbounded, sequential data instances that are generated very rapidly. The storag... more Data Streams are unbounded, sequential data instances that are generated very rapidly. The storage, querying and mining of such rapid flows of data is computationally very challenging. Data Stream Mining (DSM) is concerned with the mining of such data streams in real-time using techniques that require only one pass through the data. DSM techniques need to be adaptive to reflect changes of the pattern encoded in the stream (concept drift). The relevance of features for a DSM classification task may change due to concept drifts and this paper describes the first step towards a concept drift detection method with online feature tracking capabilities.
Abstract - Nowadays, many users use web search engines to find and gather information. User faces... more Abstract - Nowadays, many users use web search engines to find and gather information. User faces an increasing amount of various semi-structured information sources. The issue of correlating, integrating and presenting related information to users becomes important. ...
Eastern-European Journal of Enterprise Technologies, 2014
Abstract. This article aims to automate the extraction of information from semi-structured web do... more Abstract. This article aims to automate the extraction of information from semi-structured web documents by minimizing the amount of hand coding. Ex-traction of information from the WWW can be used to structure the huge amount of data buried in web documents, so that data ...
Abstract. This article aims to automate the extraction of information from semi-structured web do... more Abstract. This article aims to automate the extraction of information from semi-structured web documents by minimizing the amount of hand coding. Ex-traction of information from the WWW can be used to structure the huge amount of data buried in web documents, so that data ...
Purpose – The aim of this paper is to propose a strategy for extracting information from web tabl... more Purpose – The aim of this paper is to propose a strategy for extracting information from web tables.
Design/methodology/approach – The paper presents a strategy for extracting information from web tables of semi-structured web pages (WPs) by handling the issue of synonym which emerges as these WPs have been designed and created without referring to any standards or guidelines.
Findings – The paper finds that this strategy extracts information with high precision, and extracts the attributes besides the sub-attributes that describe the extracted attributes and values of the sub-attributes.
Practical implications – Experiment conducted on the Nokia products domain demonstrated that the proposed strategy extracts information from web tables with high precision which is 98.98 percent.
Originality/value – This paper contributes to the research on extracting information.
Data Streams are unbounded, sequential data instances that are generated very rapidly. The storag... more Data Streams are unbounded, sequential data instances that are generated very rapidly. The storage, querying and mining of such rapid flows of data is computationally very challenging. Data Stream Mining (DSM) is concerned with the mining of such data streams in real-time using techniques that require only one pass through the data. DSM techniques need to be adaptive to reflect changes of the pattern encoded in the stream (concept drift). The relevance of features for a DSM classification task may change due to concept drifts and this paper describes the first step towards a concept drift detection method with online feature tracking capabilities.
Abstract - Nowadays, many users use web search engines to find and gather information. User faces... more Abstract - Nowadays, many users use web search engines to find and gather information. User faces an increasing amount of various semi-structured information sources. The issue of correlating, integrating and presenting related information to users becomes important. ...
Eastern-European Journal of Enterprise Technologies, 2014
Abstract. This article aims to automate the extraction of information from semi-structured web do... more Abstract. This article aims to automate the extraction of information from semi-structured web documents by minimizing the amount of hand coding. Ex-traction of information from the WWW can be used to structure the huge amount of data buried in web documents, so that data ...
Abstract. This article aims to automate the extraction of information from semi-structured web do... more Abstract. This article aims to automate the extraction of information from semi-structured web documents by minimizing the amount of hand coding. Ex-traction of information from the WWW can be used to structure the huge amount of data buried in web documents, so that data ...
Purpose – The aim of this paper is to propose a strategy for extracting information from web tabl... more Purpose – The aim of this paper is to propose a strategy for extracting information from web tables.
Design/methodology/approach – The paper presents a strategy for extracting information from web tables of semi-structured web pages (WPs) by handling the issue of synonym which emerges as these WPs have been designed and created without referring to any standards or guidelines.
Findings – The paper finds that this strategy extracts information with high precision, and extracts the attributes besides the sub-attributes that describe the extracted attributes and values of the sub-attributes.
Practical implications – Experiment conducted on the Nokia products domain demonstrated that the proposed strategy extracts information from web tables with high precision which is 98.98 percent.
Originality/value – This paper contributes to the research on extracting information.
Uploads
Papers by Mahmood S Hammoodi
Design/methodology/approach – The paper presents a strategy for extracting information from web tables of semi-structured web pages (WPs) by handling the issue of synonym which emerges as these WPs have been designed and created without referring to any standards or guidelines.
Findings – The paper finds that this strategy extracts information with high precision, and extracts the attributes besides the sub-attributes that describe the extracted attributes and values of the sub-attributes.
Practical implications – Experiment conducted on the Nokia products domain demonstrated that the proposed strategy extracts information from web tables with high precision which is 98.98 percent.
Originality/value – This paper contributes to the research on extracting information.
Design/methodology/approach – The paper presents a strategy for extracting information from web tables of semi-structured web pages (WPs) by handling the issue of synonym which emerges as these WPs have been designed and created without referring to any standards or guidelines.
Findings – The paper finds that this strategy extracts information with high precision, and extracts the attributes besides the sub-attributes that describe the extracted attributes and values of the sub-attributes.
Practical implications – Experiment conducted on the Nokia products domain demonstrated that the proposed strategy extracts information from web tables with high precision which is 98.98 percent.
Originality/value – This paper contributes to the research on extracting information.