Authors:
María G. Buey
1
;
Angel Luis Garrido
2
;
Carlos Bobed
2
and
Sergio Ilarri
2
Affiliations:
1
InSynergy Consulting S.A., Spain
;
2
University of Zaragoza, Spain
Keyword(s):
Information Extraction, Ontologies, Legal Documents.
Related
Ontology
Subjects/Areas/Topics:
Applications
;
Artificial Intelligence
;
e-Business
;
Enterprise Engineering
;
Enterprise Information Systems
;
Enterprise Ontologies
;
Formal Methods
;
Hybrid Intelligent Systems
;
Industrial Applications of AI
;
Knowledge Engineering and Ontology Development
;
Knowledge-Based Systems
;
Natural Language Processing
;
Ontologies
;
Pattern Recognition
;
Simulation and Modeling
;
Soft Computing
;
Symbolic Systems
Abstract:
In the legal field, it is a fact that a large number of documents are processed every day by management
companies with the purpose of extracting data that they consider most relevant in order to be stored in their
own databases. Despite technological advances, in many organizations, the task of examining these usually-extensive
documents for extracting just a few essential data is still performed manually by people, which is
expensive, time-consuming, and subject to human errors. Moreover, legal documents usually follow several
conventions in both structure and use of language, which, while not completely formal, can be exploited to
boost information extraction. In this work, we present an approach to obtain relevant information out from
these legal documents based on the use of ontologies to capture and take advantage of such structure and
language conventions. We have implemented our approach in a framework that allows to address different
types of documents with minimal e
ffort. Within this framework, we have also regarded one frequent problem
that is found in this kind of documentation: the presence of overlapping elements, such as stamps or signatures,
which greatly hinders the extraction work over scanned documents. Experimental results show promising
results, showing the feasibility of our approach.
(More)