Computer Science and Information Systems 2011 Volume 8, Issue 3, Pages: 673-692
https://doi.org/10.2298/CSIS101011023K
Full text ( 524 KB)
Cited by
Data extraction and annotation based on domain-specific ontology evolution for deep web
Kerui Chen (College of Computer Science and Technology, Jilin University, Changchun, China + School of Computer Science and Technology, Changchun University of Science and Technology, Changchun, China)
Zuo Wanli (College of Computer Science and Technology, Jilin University, Changchun, China)
He Fengling (College of Computer Science and Technology, Jilin University, Changchun, China)
Chen Yongheng (College of Computer Science and Technology, Jilin University, Changchun, China)
Wang Ying (College of Computer Science and Technology, Jilin University, Changchun, China)
Deep web respond to a user query result records encoded in HTML files. Data
extraction and data annotation, which are important for many applications,
extracts and annotates the record from the HTML pages. We proposed an
domain-specific ontology based data extraction and annotation technique; we
first construct mini-ontology for specific domain according to information of
query interface and query result pages; then, use constructed mini-ontology
for identifying data areas and mapping data annotations in data extraction;
in order to adapt to new sample set, mini-ontology will evolve dynamically
based on data extraction and data annotation. Experimental results
demonstrate that this method has higher precision and recall in data
extraction and data annotation.
Keywords: Deep Web, Data Extraction, Data Annotation, Domain Ontology, Ontology Evolution