OntoExtractor: A Fuzzy-Based Approach in Clustering Semi-structured Data Sources and Metadata Generation

Cui, Zhan; Damiani, Ernesto; Leida, Marcello; Viviani, Marco

doi:10.1007/11552413_17

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3681)

Included in the following conference series:

International Conference on Knowledge-Based and Intelligent Information and Engineering Systems

Abstract

This paper describes a theoretical approach to data mining and information classification, and gives an overview of our OntoExtractor application, which analyzes an incoming data flow and generates metadata structures.

To help the user classify a large and varied collection of data, we propose using fuzzy-based techniques to compare and classify the data.

Before the elements can be compared, the incoming flow of information has to be converted into a common structured format such as XML.

With these structured documents, we can compare and cluster the data and generate a metadata structure describing the data repository.
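The pipeline outlined in the abstract (convert to XML, compare with a fuzzy measure, cluster, derive metadata) can be illustrated with a minimal Python sketch. The abstract does not specify the encoding, the similarity measure, or the clustering algorithm, so everything below is an assumption made for illustration and not OntoExtractor's actual method: each document becomes a fuzzy set of tag names whose membership degree decays with nesting depth (the hypothetical `decay` parameter), documents are compared with a fuzzy Jaccard ratio, and clusters are formed by a greedy single pass against a hypothetical `threshold`. All function names are invented for this example.

```python
import xml.etree.ElementTree as ET


def fuzzy_tag_bag(xml_text, decay=0.5):
    """Encode an XML document as a fuzzy set of tag names.

    Assumption: a tag's membership degree is decay**depth (root = depth 0),
    keeping the highest degree when a tag occurs more than once.
    """
    root = ET.fromstring(xml_text)
    bag = {}
    stack = [(root, 0)]
    while stack:
        node, depth = stack.pop()
        degree = decay ** depth
        bag[node.tag] = max(bag.get(node.tag, 0.0), degree)
        stack.extend((child, depth + 1) for child in node)
    return bag


def fuzzy_jaccard(a, b):
    """Fuzzy Jaccard similarity: sum of minima over sum of maxima."""
    tags = set(a) | set(b)
    inter = sum(min(a.get(t, 0.0), b.get(t, 0.0)) for t in tags)
    union = sum(max(a.get(t, 0.0), b.get(t, 0.0)) for t in tags)
    return inter / union if union else 1.0


def cluster(bags, threshold=0.6):
    """Greedy single-pass clustering over a list of fuzzy tag bags.

    A document joins the first cluster whose first member is at least
    `threshold` similar to it; otherwise it starts a new cluster.
    Returns a list of clusters, each a list of document indices.
    """
    clusters = []
    for i, bag in enumerate(bags):
        for members in clusters:
            if fuzzy_jaccard(bag, bags[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def shared_tags(bags, members):
    """Toy metadata for a cluster: the tags present in every member."""
    common = set(bags[members[0]])
    for i in members[1:]:
        common &= set(bags[i])
    return sorted(common)


docs = [
    "<book><title/><author/></book>",
    "<book><title/><author/><year/></book>",
    "<invoice><total/><customer/></invoice>",
]
bags = [fuzzy_tag_bag(d) for d in docs]
groups = cluster(bags)
for members in groups:
    print(members, shared_tags(bags, members))
```

With these three toy documents, the two `book` documents have a similarity of 0.8 and fall into one cluster, while the `invoice` document shares no tags with them and forms its own. The shared tag list per cluster stands in for the generated metadata structure; a real system would presumably use a richer description.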

Author information

Authors and Affiliations

Dipartimento di Tecnologie dell’Informazione, Università degli Studi di Milano, Via Bramante, 65, 26013, Crema, CR, Italy
Ernesto Damiani, Marcello Leida & Marco Viviani
Intelligent Systems Research Centre, BT Group, Orion Building – Adastral Park – Martlesham Heath, IP5 3RE, Ipswich, Suffolk, UK
Zhan Cui

Editor information

Editors and Affiliations

School of Business, La Trobe University, 3086, Melbourne, Victoria, Australia
Rajiv Khosla
Centre for SMART systems Engineering Research Centre, University of Brighton, Moulsecoomb, BN2 4GJ, Brighton, UK
Robert J. Howlett
School of Electrical and Information Engineering, Knowledge Based Intelligent Engineering Systems Centre, University of South Australia, 5095, Mawson Lakes, SA, Australia
Lakhmi C. Jain

About this paper

Cite this paper

Cui, Z., Damiani, E., Leida, M., Viviani, M. (2005). OntoExtractor: A Fuzzy-Based Approach in Clustering Semi-structured Data Sources and Metadata Generation. In: Khosla, R., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2005. Lecture Notes in Computer Science, vol 3681. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11552413_17

DOI: https://doi.org/10.1007/11552413_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28894-7
Online ISBN: 978-3-540-31983-2
eBook Packages: Computer Science, Computer Science (R0)
