Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/1687878.1687889dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
research-article
Free access

Topological field parsing of German

Published: 02 August 2009 Publication History

Abstract

Freer-word-order languages such as German exhibit linguistic phenomena that present unique challenges to traditional CFG parsing. Such phenomena produce discontinuous constituents, which are not naturally modelled by projective phrase structure trees. In this paper, we examine topological field parsing, a shallow form of parsing which identifies the major sections of a sentence in relation to the clausal main verb and the subordinating heads. We report the results of topological field parsing of German using the unlexicalized, latent variable-based Berkeley parser (Petrov et al., 2006) Without any language- or model-dependent adaptation, we achieve state-of-the-art results on the TüBa-D/Z corpus, and a modified NE-GRA corpus that has been automatically annotated with topological fields (Becker and Frank, 2002). We also perform a qualitative error analysis of the parser output, and discuss strategies to further improve the parsing results.

References

[1]
M. Becker and A. Frank. 2002. A stochastic topological parser for German. In Proceedings of the 19th International Conference on Computational Linguistics, pages 71--77.
[2]
S. Brants, S. Dipper, S. Hansen, W. Lezius, and G. Smith. 2002. The TIGER Treebank. In Proceedings of the Workshop on Treebanks and Linguistic Theories, pages 24--41.
[3]
U. Callmeier. 2000. PET--a platform for experimentation with efficient HPSG processing techniques. Natural Language Engineering, 6(01):99--107.
[4]
A. Dubey and F. Keller. 2003. Probabilistic parsing for German using sister-head dependencies. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, pages 96--103.
[5]
K. A. Foth, M. Daum, and W. Menzel. 2004. A broad-coverage parser for German based on defeasible constraints. Constraint Solving and Language Processing.
[6]
A. Frank, M. Becker, B. Crysmann, B. Kiefer, and U. Schaefer. 2003. Integrated shallow and deep parsing: TopP meets HPSG. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, pages 104--111.
[7]
W. Frey. 2004. Notes on the syntax and the pragmatics of German Left Dislocation. In H. Lohnstein and S. Trissler, editors, The Syntax and Semantics of the Left Periphery, pages 203--233. Mouton de Gruyter, Berlin.
[8]
J. Hockenmaier. 2006. Creating a CCGbank and a Wide-Coverage CCG Lexicon for German. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pages 505--512.
[9]
T. N. Höhle. 1983. Topologische Felder. Ph.D. thesis, Köln.
[10]
S. Kübler, E. W. Hinrichs, and W. Maier. 2006. Is it really that difficult to parse German? In Proceedings of EMNLP.
[11]
M. Liepert. 2003. Topological Fields Chunking for German with SVM's: Optimizing SVM-parameters with GA's. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP), Bulgaria.
[12]
G. Neumann, C. Braun, and J. Piskorski. 2000. A Divide-and-Conquer Strategy for Shallow Parsing of German Free Texts. In Proceedings of the sixth conference on Applied natural language processing, pages 239--246. Morgan Kaufmann Publishers Inc. San Francisco, CA, USA.
[13]
S. Petrov and D. Klein. 2008. Parsing German with Latent Variable Grammars. In Proceedings of the ACL-08: HLT Workshop on Parsing German (PaGe-08), pages 33--39.
[14]
S. Petrov, L. Barrett, R. Thibaux, and D. Klein. 2006. Learning accurate, compact, and interpretable tree annotation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pages 433--440, Sydney, Australia, July. Association for Computational Linguistics.
[15]
C. Rohrer and M. Forst. 2006. Improving coverage and parsing quality of a large-scale LFG for German. In Proceedings of the Language Resources and Evaluation Conference (LREC-2006), Genoa, Italy.
[16]
W. Skut, T. Brants, B. Krenn, and H. Uszkoreit. 1998. A Linguistically Interpreted Corpus of German Newssaper Text. Proceedings of the ESSLLI Workshop on Recent Advances in Corpus Annotation.
[17]
H. Telljohann, E. Hinrichs, and S. Kubler. 2004. The TüBa-D/Z treebank: Annotating German with a context-free backbone. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), pages 2229--2235.
[18]
H. Telljohann, E. W. Hinrichs, S. Kubler, and H. Zinsmeister. 2006. Stylebook for the Tubingen Treebank of Written German (TüBa-D/Z). Seminar fur Sprachwissenschaft, Universitat Tubingen, Tubingen, Germany.
[19]
T. Ule. 2003. Directed Treebank Refinement for PCFG Parsing. In Proceedings of Workshop on Treebanks and Linguistic Theories (TLT) 2003, pages 177--188.
[20]
J. Veenstra, F. H. Müller, and T. Ule. 2002. Topological field chunking for German. In Proceedings of the Sixth Conference on Natural Language Learning, pages 56--62.

Cited By

View all
  • (2010)Entity-based local coherence modelling using topological fieldsProceedings of the 48th Annual Meeting of the Association for Computational Linguistics10.5555/1858681.1858701(186-195)Online publication date: 11-Jul-2010

Recommendations

Comments

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '09: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
August 2009
572 pages
ISBN:9781932432459
  • General Chair:
  • Keh-Yih Su

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 02 August 2009

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)58
  • Downloads (Last 6 weeks)5
Reflects downloads up to 15 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2010)Entity-based local coherence modelling using topological fieldsProceedings of the 48th Annual Meeting of the Association for Computational Linguistics10.5555/1858681.1858701(186-195)Online publication date: 11-Jul-2010

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media