Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/1123098.1123108acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesdg-oConference Proceedingsconference-collections
Article

Building automatically a business registration ontology

Published: 19 May 2002 Publication History

Abstract

We discuss a domain-independent, corpus based method for dictionary-less automatic extraction of ontological knowledge from domain-specific unannotated documents. We present the architecture, algorithms, and results for ONTOSTRUCT---a new system that uses machine learning and statistical techniques to analyze text sources, discover terms, link equivalent terms into concepts, learn both hierarchical and non-hierarchical conceptual relations, and build an extensive, semantically sound hierarchy of concepts. We report on ONTOSTRUCT's results in constructing a domain-specific ontology for the business registration domain, and evaluate the performance of two of its modules.

References

[1]
N. Adams, F. Artigas, V. Atluri, S. A. Chun, S. Colbert, M. Degeratu, A. Ebeid, V. Hatzivassiloglou, R. Holowczak, O. Marcopolus, P. Mazzoleni, W. Rayner, and Y. Yesha. E-Government: Human-Centered Systems for Business Services. In Proceedings of the First National Conference on Digital Government Research, 2000.
[2]
E. Agichtein and L. Gravano. Snowball: Extracting Relations from Large Plain-Text Collections. In Proceedings of the Fifth ACM International Conference on Digital Libraries, 2000.
[3]
G. Bisson, C. Nédellec, and D. Cañamero. Designing Clustering Methods for Ontology Building: the Mo'K Workbench. In Proceedings of the First Workshop on Ontology Learning (OL-2000) in conjunction with the Fourteenth European Conference on Artificial Intelligence (ECAI-2000), Berlin, 2000.
[4]
S. Finch and A. Mikheev. A Workbench for Finding Structure in Texts. In Proceedings of the Fifth Conference on Applied Natural Language Processing, April 1997.
[5]
J. S. Justeson and S. M. Katz. Technical terminology: Some Linguistic Properties and an Algorithm for Identification in Text. Natural Language Engineering, 1(1):9--27, 1995.
[6]
K. Knight and S. Luk. Building a Large Knowledge Base for Machine Translation. In Proceedings of the American Association of Artificial Intelligence Conference (AAAI-94), 1994.
[7]
G. N. Lance and W. T. Williams. A General Theory of Classification Sorting Strategies. Computer Journal, 9:373--380, 1967.
[8]
D. B. Lenat. CYC: A Large-Scale Investment in Knowledge Infrastrature. Communications of the ACM, 38(11):32--38, 1995.
[9]
G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. J. Miller. Introduction to WordNet: An On-Line Lexical Database. International Journal of Lexicography, 3(4):235--312, 1990.
[10]
M. F. Porter. An Algorithm for Suffix Stripping. Program, 14(3):130--137, 1980.
[11]
J. C. Reynar and A. Ratnaparkhi. A Maximum Entropy Approach to Identifying Sentence Boundaries. Proceedings of the Fifth Conference on Applied Natural Language Processing, April 1997.
[12]
M. Sanderson and B. W. Croft. Deriving Concept Hierarchies from Text. In Proceedings of the 22nd ACM SIGIR, pages 206--213, 1999.
[13]
S. Siegel and N. J. Castellan. Nonparametric Statistics for the Behavioural Sciences. McGraw-Hill, 2nd edition, 1988.

Cited By

View all
  • (2018)Ontology construction from Thailand labor protection actProceedings of the 10th International Conference on Management of Digital EcoSystems10.1145/3281375.3281410(47-54)Online publication date: 25-Sep-2018
  • (2008)Ontology generation for large email collectionsProceedings of the 2008 international conference on Digital government research10.5555/1367832.1367875(254-261)Online publication date: 18-May-2008
  • (2008)Learning the distance metric in a personal ontologyProceedings of the 2nd international workshop on Ontologies and information systems for the semantic web10.1145/1458484.1458488(17-24)Online publication date: 30-Oct-2008
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
dg.o '02: Proceedings of the 2002 annual national conference on Digital government research
May 2002
1234 pages

Publisher

Digital Government Society of North America

Publication History

Published: 19 May 2002

Check for updates

Qualifiers

  • Article

Conference

dg.o '02
dg.o '02: Digital government research
May 19 - 22, 2002
California, Los Angeles, USA

Acceptance Rates

Overall Acceptance Rate 150 of 271 submissions, 55%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Ontology construction from Thailand labor protection actProceedings of the 10th International Conference on Management of Digital EcoSystems10.1145/3281375.3281410(47-54)Online publication date: 25-Sep-2018
  • (2008)Ontology generation for large email collectionsProceedings of the 2008 international conference on Digital government research10.5555/1367832.1367875(254-261)Online publication date: 18-May-2008
  • (2008)Learning the distance metric in a personal ontologyProceedings of the 2nd international workshop on Ontologies and information systems for the semantic web10.1145/1458484.1458488(17-24)Online publication date: 30-Oct-2008
  • (2006)Integrated scoring for spelling error correction, abbreviation expansion and case restoration in dirty textProceedings of the fifth Australasian conference on Data mining and analystics - Volume 6110.5555/1273808.1273820(83-89)Online publication date: 1-Nov-2006

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media