Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1982185.1982440acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
poster

Representing a bilingual lexicon with suffix trees

Published: 21 March 2011 Publication History

Abstract

This paper presents a system based on generalized suffix trees that efficiently implements a set of operations over a bilingual lexicon. Besides the basic operations of adding and removing translations from the lexicon, the system provides two unique query functions that we refer to as monolingual and bilingual coverage. These two functions lay the foundation for higher-level mining operations, such as identification of translation patterns, that are the subject of ongoing research. Nevertheless, the system presented here is interesting in and by itself, for the novelty of the coverage functions and the potential of the whole data structure. We compare the performance of two implementations, one based on suffix trees and the other on suffix arrays.

References

[1]
Aires, G. P. Lopes, and L. Gomes. Phrase translation extraction from aligned parallel corpora using suffix arrays and related structures. In Progress in Artificial Intelligence, LNCS, pages 588--597, 2009.
[2]
L. Gomes. Parallel texts alignment. 2009.
[3]
L. Gomes, Aires, and G. P. Lopes. Parallel texts alignment. In EPIA 2009, Aveiro, pages 513--524. Universidade de Aveiro.
[4]
D. Gusfield. Algorithms on strings, trees, and sequences: computer science and computational biology. Cambridge Univ Pr, 1997.
[5]
L. Russo, G. Navarro, and A. Oliveira. Fully-compressed suffix trees. LATIN 2008: Theoretical Informatics, pages 362--373, 2008.
[6]
E. Ukkonen. On-line construction of suffix trees. Algorithmica, 14(3): 249--260, 1995.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SAC '11: Proceedings of the 2011 ACM Symposium on Applied Computing
March 2011
1868 pages
ISBN:9781450301138
DOI:10.1145/1982185

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 March 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. correspondence
  2. coverage
  3. lexicon
  4. translation pairs

Qualifiers

  • Poster

Conference

SAC'11
Sponsor:
SAC'11: The 2011 ACM Symposium on Applied Computing
March 21 - 24, 2011
TaiChung, Taiwan

Acceptance Rates

Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 77
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media