
Enhancing multi-lingual information extraction via cross-media inference and fusion

Published: 23 August 2010

Abstract

We describe a new information fusion approach that integrates facts extracted from cross-media objects (videos and texts) into a coherent common representation covering multi-level knowledge (concepts, relations, and events). Beyond standard information fusion, we exploited video extraction results to significantly improve text information extraction. We further extended our methods to a multi-lingual environment (English, Arabic, and Chinese) by presenting a case study on cross-lingual comparable corpora acquisition based on video comparison.


Published In

COLING '10: Proceedings of the 23rd International Conference on Computational Linguistics: Posters
August 2010
1588 pages

Publisher

Association for Computational Linguistics
United States

