Comparative Error Analysis of Parser Outputs on Telugu Dependency Treebank

Kanneganti, Silpa; Chaudhry, Himani; Misra Sharma, Dipti

doi:10.1007/978-3-319-75477-2_28

Silpa Kanneganti¹⁴,
Himani Chaudhry¹⁴ &
Dipti Misra Sharma¹⁴

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9623))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

1385 Accesses

Abstract

We present a comparative error analysis of two parsers - MALT and MST on Telugu Dependency Treebank data. MALT and MST are currently two of the most dominant data-driven dependency parsers. We discuss the performances of both the parsers in relation to Telugu language. We also talk in detail about both the algorithmic issues of the parsers as well as the language specific constraints of Telugu. The purpose is, to better understand how to help the parsers deal with complex structures, make sense of implicit language specific cues and build a more informed Treebank.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Exploring Different Approaches for Parsing Telugu

Combining Dependency Parsers Using Error Rates

Coarse-Grained vs. Fine-Grained Lithuanian Dependency Parsing

Notes

1.
MALT version 1.8.1.
2.
MST version 0.5.0.
3.
http://homepages.inf.ed.ac.uk/lzhang10/maxent.html.
4.
LAS – Labeled Attachment Score.
5.
UAS – Unlabeled Attachment Score.
6.
LS - Labeled Score.

References

Nivre, J., Hall, J., Nilsson, J., Chanev, A., Eryigit, G., Kübler, S., Marinov, S., Marsi, E.: Maltparser: a language-independent system for data-driven dependency parsing. Nat. Lang. Eng. 13, 95–135 (2007)
Google Scholar
McDonald, R., Pereira, F., Ribarov, K., Hajič, J.: Non-projective dependency parsing using spanning tree algorithms. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing. HLT 2005, Stroudsburg, PA, USA, Association for Computational Linguistics,pp. 523–530 (2005)
Google Scholar
McDonald, R.T., Nivre, J.: Characterizing the errors of data-driven dependency parsing models. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning EMNLP-CoNLL, pp. 122–131 (2007)
Google Scholar
Husain, S., Agrawal, B.: Analyzing parser errors to improve parsing accuracy and to inform tree banking decisions. Linguistic Issues in Language Technology, 7 (2012)
Google Scholar
Vempaty, C., Naidu, V., Husain, S., Kiran, R., Bai, L., Sharma, D.M., Sangal, R.: Issues in analyzing Telugu sentences towards building a Telugu treebank. In: Gelbukh, A. (ed.) CICLing 2010. LNCS, vol. 6008, pp. 50–59. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-12116-6_5
Chapter Google Scholar
Nivre, J.: Inductive dependency parsing. Text, Speech and Language Technology, vol. 34. Springer, Netherlands (2006)
Google Scholar
Black, E., Jelinek, F., Lafferty, J., Magerman, D.M., Mercer, R., Roukos, S.: Towards history-based grammars: using richer models for probabilistic parsing. In: Proceedings of the Workshop on Speech and Natural Language. HLT 1991, Stroudsburg, PA, USA, Association for Computational Linguistics, pp. 134–139 (1992)
Google Scholar
Kudo, T., Matsumoto, Y.: Japanese dependency analysis using cascaded chunking. In: Proceedings of the 6th Conference on Natural Language Learning, vol. 20. COLING 2002, Stroudsburg, PA, USA. Association for Computational Linguistics, pp. 1–7 (2002)
Google Scholar
Chu, Y.J., Liu, T.H.: On shortest arborescence of a directed graph. Sci. Sinica 14, 1396 (1965)
MathSciNet MATH Google Scholar
Edmonds, J.: Optimum branchings. J. Res. Natil Bur. Stan. B 71, 233–240 (1967)
Article MathSciNet MATH Google Scholar
Eisner, J.M.: Three new probabilistic models for dependency parsing: an exploration. In: Proceedings of the 16th Conference on Computational Linguistics, vol. 1. Association for Computational Linguistics, pp. 340–345 (1996)
Google Scholar
McDonald, R., Crammer, K., Pereira, F.: Online large-margin training of dependency parsers. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics, pp. 91–98 (2005)
Google Scholar
Garapati, U.R., Koppaka, R., Addanki, S.: Dative case in Telugu: a parsing perspective. In: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL 2012), COLING 2012, pp. 123-132, Mumbai (2012)
Google Scholar
Husain, S., Mannem, P., Ambati, B.R., Gadde, P.: The ICON-2010 tools contest on Indian language dependency parsing. In: Proceedings of ICON-2010 Tools Contest on Indian Language Dependency Parsing, ICON, vol. 10, pp. 1-8. Citeseer (2010)
Google Scholar
Bharati, A., Chaitanya, V., Sangal, R., Ramakrishnamacharyulu, K.: Natural Language Processing: A Paninian Perspective. Prentice-Hall of India, New Delhi (1995)
Google Scholar
Chaudhry, H., Sharma, H., Sharma, D.M.: Divergences in English-Hindi parallel dependency treebanks. DepLing 2013, 33 (2013)
Google Scholar
Ambati, B.R., Husain, S., Nivre, J., Sangal, R.: On the role of morphosyntactic features in Hindi dependency parsing. In: Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages. Association for Computational Linguistics, pp. 94–102 (2010)
Google Scholar
Ambati, B.R., Gadde, P., Jindal, K.: Experiments in Indian language dependency parsing. In: Proceedings of the ICON-2009 NLP Tools Contest: Indian Language Dependency Parsing, pp. 32–37 (2009)
Google Scholar
Bhat, R.A., Sharma, D.M.: Non-projective structures in Indian language treebanks. In: The 11th International Workshop on Treebanks and Linguistic Theories, Edições Colibri, pp. 25–30 (2012)
Google Scholar

Download references

Acknowledgment

We thank Riyaz Ahmad Bhat, Vigneshwaran Muralidharan and Irshad Ahmad Bhat for their assistance and comments that greatly improved the manuscript.

Author information

Authors and Affiliations

Kohli Center on Intelligent Systems (KCIS), International Institute of Information Technology, Hyderabad (IIIT Hyderabad) Gachibowli, Hyderabad, 500032, Telangana, India
Silpa Kanneganti, Himani Chaudhry & Dipti Misra Sharma

Authors

Silpa Kanneganti
View author publications
You can also search for this author in PubMed Google Scholar
Himani Chaudhry
View author publications
You can also search for this author in PubMed Google Scholar
Dipti Misra Sharma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Silpa Kanneganti .

Editor information

Editors and Affiliations

CIC, Instituto Politécnico Nacional, Mexico City, Mexico
Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kanneganti, S., Chaudhry, H., Misra Sharma, D. (2018). Comparative Error Analysis of Parser Outputs on Telugu Dependency Treebank. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2016. Lecture Notes in Computer Science(), vol 9623. Springer, Cham. https://doi.org/10.1007/978-3-319-75477-2_28

Download citation

DOI: https://doi.org/10.1007/978-3-319-75477-2_28
Published: 21 March 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-75476-5
Online ISBN: 978-3-319-75477-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Comparative Error Analysis of Parser Outputs on Telugu Dependency Treebank

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Exploring Different Approaches for Parsing Telugu

Combining Dependency Parsers Using Error Rates

Coarse-Grained vs. Fine-Grained Lithuanian Dependency Parsing

Notes

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Comparative Error Analysis of Parser Outputs on Telugu Dependency Treebank

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Exploring Different Approaches for Parsing Telugu

Combining Dependency Parsers Using Error Rates

Coarse-Grained vs. Fine-Grained Lithuanian Dependency Parsing

Notes

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation