Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- articleDecember 2015
TweetNorm: a benchmark for lexical normalization of Spanish tweets
- Iñaki Alegria,
- Nora Aranberri,
- Pere R. Comas,
- Víctor Fresno,
- Pablo Gamallo,
- Lluis Padró,
- Iñaki San Vicente,
- Jordi Turmo,
- Arkaitz Zubiaga
Language Resources and Evaluation (SPLRE), Volume 49, Issue 4Pages 883–905https://doi.org/10.1007/s10579-015-9315-6The language used in social media is often characterized by the abundance of informal and non-standard writing. The normalization of this non-standard language can be crucial to facilitate the subsequent textual processing and to consequently help boost ...
- ArticleNovember 2015
TextServer: Cloud-Based Multilingual Natural Language Processing
ICDMW '15: Proceedings of the 2015 IEEE International Conference on Data Mining Workshop (ICDMW)Pages 1636–1639TextServer is an efficient language analysis platform which offers a variety of robust NLP services for a wide range of languages. Services can be easily accessed via a web interface, where document collections can either be uploaded and sent to batch ...
- articleJanuary 2015
Unsupervised ensemble minority clustering
Cluster analysis lies at the core of most unsupervised learning tasks. However, the majority of clustering algorithms depend on the all-in assumption, in which all objects belong to some cluster, and perform poorly on minority clustering tasks, in which ...
- posterJuly 2014
The TALP participation at ERD 2014
ERD '14: Proceedings of the first international workshop on Entity recognition & disambiguationPages 89–94https://doi.org/10.1145/2633211.2634359This document describes the work performed by the TALP Research Center, UPC in its first participation at ERD 2014 short text evaluation track. The objective of this evaluation track is to recognize mentions of entities in a given short text, ...
- articleDecember 2013
A constraint-based hypergraph partitioning approach to coreference resolution
Computational Linguistics (COLI), Volume 39, Issue 4Pages 847–884https://doi.org/10.1162/COLI_a_00151This work is focused on research in machine learning for coreference resolution. Coreference resolution is a natural language processing task that consists of determining the expressions in a discourse that refer to the same entity.
The main ...
-
- research-articleSeptember 2012
Sibyl, a factoid question-answering system for spoken documents
ACM Transactions on Information Systems (TOIS), Volume 30, Issue 3Article No.: 19, Pages 1–40https://doi.org/10.1145/2328967.2328972In this article, we present a factoid question-answering system, Sibyl, specifically tailored for question answering (QA) on spoken-word documents. This work explores, for the first time, which techniques can be robustly adapted from the usual QA on ...
- research-articleJune 2011
RelaxCor participation in CoNLL shared task on coreference resolution
This paper describes the participation of RelaxCor in the CoNLL-2011 shared task: "Modeling Unrestricted Coreference in Ontonotes". RELAXCOR is a constraint-based graph partitioning approach to coreference resolution solved by relaxation labeling. The ...
- research-articleAugust 2010
A global relaxation labeling approach to coreference resolution
COLING '10: Proceedings of the 23rd International Conference on Computational Linguistics: PostersPages 1086–1094This paper presents a constraint-based graph partitioning approach to coreference resolution solved by relaxation labeling. The approach combines the strengths of groupwise classifiers and chain formation methods in one global method. Experiments show ...
- research-articleJuly 2010
RelaxCor: A global relaxation labeling approach to coreference resolution
This paper describes the participation of RelaxCor in the Semeval-2010 task number 1: "Coreference Resolution in Multiple Languages". RelaxCor is a constraint-based graph partitioning approach to coreference resolution solved by relaxation labeling. The ...
- ArticleDecember 2009
Unsupervised Relation Extraction by Massive Clustering
ICDM '09: Proceedings of the 2009 Ninth IEEE International Conference on Data MiningPages 782–787https://doi.org/10.1109/ICDM.2009.81The goal of Information Extraction is to automatically generate structured pieces of information from the relevant information contained in text documents. Machine Learning techniques have been applied to reduce the cost of Information Extraction system ...
- ArticleSeptember 2009
Robust question answering for speech transcripts: UPC experience in QAst 2009
This paper describes the participation of the Technical University of Catalonia in the CLEF 2009 Question Answering on Speech Transcripts track. We have participated in the English and Spanish scenarios of QAST. For both manual and automatic transcripts ...
- ArticleSeptember 2009
Overview of QAST 2009
- Jordi Turmo,
- Pere R. Comas,
- Sophie Rosset,
- Olivier Galibert,
- Nicolas Moreau,
- Djamel Mostefa,
- Paolo Rosso,
- Davide Buscaldi
This paper describes the experience of QAST 2009, the third time a pilot track of CLEF has been held aiming to evaluate the task of Question Answering in Speech Transcripts. Four sites submitted results for at least one of the three scenarios (European ...
- research-articleJune 2009
An analysis of bootstrapping for the recognition of temporal expressions
We present a semi-supervised (bootstrapping) approach to the extraction of time expression mentions in large unlabelled corpora. Because the only supervision is in the form of seed examples, it becomes necessary to resort to heuristics to rank and ...
- ArticleSeptember 2008
Robust question answering for speech transcripts: UPC experience in QAst 2008
This paper describes the participation of the Technical University of Catalonia in the CLEF 2008 Question Answering on Speech Transcripts track. We have participated in the English and Spanish scenarios of QAst. For the processing of manual transcripts ...
- ArticleSeptember 2008
Overview of QAST 2008
This paper describes the experience of QAST 2008, the second time a pilot track of CLEF has been held aiming to evaluate the task of Question Answering in Speech Transcripts. Five sites submitted results for at least one of the five scenarios (lectures ...
- ArticleSeptember 2008
Spoken Document Retrieval Based on Approximated Sequence Alignment
TSD '08: Proceedings of the 11th international conference on Text, Speech and DialoguePages 285–292https://doi.org/10.1007/978-3-540-87391-4_37This paper presents a new approach to spoken document information retrieval for spontaneous speech corpora. The classical approach to this problem is the use of an automatic speech recognizer (ASR) combined with standard information retrieval techniques. ...
- ArticleAugust 2008
A Graph Partitioning Approach to Entity Disambiguation Using Uncertain Information
GoTAL '08: Proceedings of the 6th international conference on Advances in Natural Language ProcessingPages 428–439https://doi.org/10.1007/978-3-540-85287-2_41This paper presents a method for Entity Disambiguation in Information Extraction from different sources in the web. Once entities and relations between them are extracted, it is needed to determine which ones are referring to the same real-world entity. ...
- ArticleJune 2008
Comparing Non-parametric Ensemble Methods for Document Clustering
NLDB '08: Proceedings of the 13th international conference on Natural Language and Information Systems: Applications of Natural Language to Information SystemsPages 245–256https://doi.org/10.1007/978-3-540-69858-6_25The biases of individual algorithms for non-parametric document clustering can lead to non-optimal solutions. Ensemble clustering methods may overcome this limitation, but have not been applied to document collections. This paper presents a comparison of ...
- chapterMay 2008
Robust Question Answering for Speech Transcripts Using Minimal Syntactic Analysis
Advances in Multilingual and Multimodal Information RetrievalMay 2008, Pages 424–432https://doi.org/10.1007/978-3-540-85760-0_56This paper describes the participation of the Technical University of Catalonia in the CLEF 2007 Question Answering on Speech Transcripts track. For the processing of manual transcripts we have deployed a robust factual Question Answering that uses ...
- chapterMay 2008
Overview of QAST 2007
Advances in Multilingual and Multimodal Information RetrievalMay 2008, Pages 249–256https://doi.org/10.1007/978-3-540-85760-0_29This paper describes QAST, a pilot track of CLEF 2007 aimed at evaluating the task of Question Answering in Speech Transcripts. The paper summarizes the evaluation framework, the systems that participated and the results achieved. These results have ...