Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2390665.2390679dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
research-article
Free access

Pattern learning for relation extraction with a hierarchical topic model

Published: 08 July 2012 Publication History

Abstract

We describe the use of a hierarchical topic model for automatically identifying syntactic and lexical patterns that explicitly state ontological relations. We leverage distant supervision using relations from the knowledge base FreeBase, but do not require any manual heuristic nor manual seed list selections. Results show that the learned patterns can be used to extract new relations with good precision.

References

[1]
ACE. 2004. The automatic content extraction projects. http://projects.ldc.upenn.edu/ace.
[2]
Michele Banko, Michael J. Cafarella, Stephen Soderland, Matt Broadhead, and Oren Etzioni. 2007. Open information extraction from the web. In IJCAI'07.
[3]
D. T. Bollegala, Y. Matsuo, and M. Ishizuka. 2010. Relational duality: Unsupervised extraction of semantic relations between entities on the web. In Proceedings of the 19th international conference on World wide web, pages 151--160. ACM.
[4]
R. Bunescu and M. Pasca. 2006. Using encyclopedic knowledge for named entity disambiguation. In Proceedings of EACL, volume 6, pages 9--16.
[5]
S. Cucerzan. 2007. Large-scale named entity disambiguation based on wikipedia data. In Proceedings of EMNLP-CoNLL, volume 2007, pages 708--716.
[6]
G. Doddington, A. Mitchell, M. Przybocki, L. Ramshaw, S. Strassel, and R. Weischedel. 2004. The automatic content extraction (ace) program--tasks, data, and evaluation. In Proceedings of LREC, volume 4, pages 837--840. Citeseer.
[7]
A. Fader, S. Soderland, and O. Etzioni. 2011. Identifying relations for open information extraction. In Proceedings of Empirical Methods in Natural Language Processing.
[8]
A. Haghighi and L. Vanderwende. 2009. Exploring content models for multi-document summarization. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 362--370. Association for Computational Linguistics.
[9]
R. Hoffmann, C. Zhang, and D. S. Weld. 2010. Learning 5000 relational extractors. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 286--295. Association for Computational Linguistics.
[10]
R. Hoffmann, C. Zhang, X. Ling, L. Zettlemoyer, and D. S. Weld. 2011. Knowledge-based weak supervision for information extraction of overlapping relations. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, pages 541--550. Association for Computational Linguistics.
[11]
G. Kasneci, M. Ramanath, F. Suchanek, and G. Weikum. 2009. The yago-naga approach to knowledge discovery. ACM SIGMOD Record, 37(4): 41--47.
[12]
Z. Kozareva and E. Hovy. 2010. Learning arguments and supertypes of semantic relations using recursive patterns. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 1482--1491. Association for Computational Linguistics.
[13]
C. Kwok, O. Etzioni, and D. S. Weld. 2001. Scaling question answering to the web. ACM Transactions on Information Systems (TOIS), 19(3): 242--262.
[14]
D. Li, S. Somasundaran, and A. Chakraborty. 2011. A combination of topic models with max-margin learning for relation detection.
[15]
P. McNamee and H. T. Dang. 2009. Overview of the tac 2009 knowledge base population track. In Text Analysis Conference (TAC).
[16]
D. Milne and I. H. Witten. 2008. Learning to link with wikipedia. In Proceeding of the 17th ACM conference on Information and knowledge management, pages 509--518. ACM.
[17]
M. Mintz, S. Bills, R. Snow, and D. Jurafsky. 2009. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2-Volume 2, pages 1003--1011. Association for Computational Linguistics.
[18]
J. Nivre. 2006. Inductive dependency parsing. In Text, Speech and Language Technology, volume 34. Springer Verlag.
[19]
M. Pasca, D. Lin, J. Bigham, A. Lifchits, and A. Jain. 2006. Organizing and searching the world wide web of facts-step one: the one-million fact extraction challenge. In Proceedings of the National Conference on Artificial Intelligence, page 1400. Menlo Park, CA; Cambridge, MA; London; AAAI Press; MIT Press; 1999.
[20]
F. Pereira, A. Rajaraman, S. Sarawagi, W. Tunstall-Pedoe, G. Weikum, and A. Halevy. 2009. Answering web questions using structured data: dream or reality? Proceedings of the VLDB Endowment, 2(2): 1646--1646.
[21]
H. Poon and P. Domingos. 2009. Unsupervised semantic parsing. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-Volume 1, pages 1--10. Association for Computational Linguistics.
[22]
D. Ravichandran and E. Hovy. 2002. Learning surface text patterns for a question answering system. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pages 41--47. Association for Computational Linguistics.
[23]
S. Sekine. 2006. On-demand information extraction. In Proceedings of the COLING/ACL on Main conference poster sessions, pages 731--738. Association for Computational Linguistics.
[24]
Beth M. Sundheim and Nancy A. Chinchor. 1993. Survey of the message understanding conferences. In HLT'93.
[25]
I. Titov and A. Klementiev. 2011. A bayesian model for unsupervised semantic parsing. In The 49th Annual Meeting of the Association for Computational Linguistics.
[26]
C. Wang, J. Fan, A. Kalyanpur, and D. Gondek. 2011. Relation extraction with relation topics. In Proceedings of Empirical Methods in Natural Language Processing.
[27]
Daniel S. Weld, Fei Wu, Eytan Adar, Saleema Amershi, James Fogarty, Raphael Hoffmann, Kayur Patel, and Michael Skinner. 2008. Intelligence in wikipedia. In Proceedings of the 23rd national conference on Artificial intelligence, pages 1609--1614. AAAI Press.
[28]
F. Wu and D. S. Weld. 2010. Open information extraction using wikipedia. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 118--127. Association for Computational Linguistics.
[29]
L. Yao, A. Haghighi, S. Riedel, and A. McCallum. 2011. Structured relation discovery using generative models. In Empirical Methods in Natural Language Processing (EMNLP).
[30]
A. Yates, M. Cafarella, M. Banko, O. Etzioni, M. Broadhead, and S. Soderland. 2007. Textrunner: Open information extraction on the web. In Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, pages 25--26. Association for Computational Linguistics.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '12: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
July 2012
420 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 08 July 2012

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)40
  • Downloads (Last 6 weeks)10
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2019)Scalpel-CD: Leveraging Crowdsourcing and Deep Probabilistic Modeling for Debugging Noisy Training DataThe World Wide Web Conference10.1145/3308558.3313599(2158-2168)Online publication date: 13-May-2019
  • (2018)SnubaProceedings of the VLDB Endowment10.14778/3291264.329126812:3(223-236)Online publication date: 1-Nov-2018
  • (2018)Relation Extraction Using Distant SupervisionACM Computing Surveys10.1145/324174151:5(1-35)Online publication date: 19-Nov-2018
  • (2018)Weakly-supervised Contextualization of Knowledge Graph FactsThe 41st International ACM SIGIR Conference on Research & Development in Information Retrieval10.1145/3209978.3210031(765-774)Online publication date: 27-Jun-2018
  • (2017)Learning the structure of generative models without labeled dataProceedings of the 34th International Conference on Machine Learning - Volume 7010.5555/3305381.3305410(273-282)Online publication date: 6-Aug-2017
  • (2017)Inferring generative model structure with static analysisProceedings of the 31st International Conference on Neural Information Processing Systems10.5555/3294771.3294794(239-249)Online publication date: 4-Dec-2017
  • (2017)SnorkelProceedings of the VLDB Endowment10.14778/3157794.315779711:3(269-282)Online publication date: 1-Nov-2017
  • (2016)Data programmingProceedings of the 30th International Conference on Neural Information Processing Systems10.5555/3157382.3157497(3574-3582)Online publication date: 5-Dec-2016
  • (2015)Towards a Relation Extraction Framework for Cyber-Security ConceptsProceedings of the 10th Annual Cyber and Information Security Research Conference10.1145/2746266.2746277(1-4)Online publication date: 7-Apr-2015
  • (2014)Structured Information Extraction from Natural Disaster Events on TwitterProceedings of the 5th International Workshop on Web-scale Knowledge Representation Retrieval & Reasoning10.1145/2663792.2663794(1-8)Online publication date: 3-Nov-2014
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media