Author: Turmo, Jordi : Search

article

TweetNorm: a benchmark for lexical normalization of Spanish tweets

Language Resources and Evaluation (SPLRE), Volume 49, Issue 4, Pages 883–905, https://doi.org/10.1007/s10579-015-9315-6

The language used in social media is often characterized by the abundance of informal and non-standard writing. The normalization of this non-standard language can be crucial to facilitate the subsequent textual processing and to consequently help boost ...

Article

TextServer: Cloud-Based Multilingual Natural Language Processing

ICDMW '15: Proceedings of the 2015 IEEE International Conference on Data Mining Workshop (ICDMW), Pages 1636–1639

TextServer is an efficient language analysis platform which offers a variety of robust NLP services for a wide range of languages. Services can be easily accessed via a web interface, where document collections can either be uploaded and sent to batch ...

article

Unsupervised ensemble minority clustering

Machine Learning (MALE), Volume 98, Issue 1-2, Pages 217–268, https://doi.org/10.1007/s10994-013-5394-z

Cluster analysis lies at the core of most unsupervised learning tasks. However, the majority of clustering algorithms depend on the all-in assumption, in which all objects belong to some cluster, and perform poorly on minority clustering tasks, in which ...

poster

The TALP participation at ERD 2014

ERD '14: Proceedings of the first international workshop on Entity recognition & disambiguation, Pages 89–94, https://doi.org/10.1145/2633211.2634359

This document describes the work performed by the TALP Research Center, UPC in its first participation at ERD 2014 short text evaluation track. The objective of this evaluation track is to recognize mentions of entities in a given short text, ...

article

A constraint-based hypergraph partitioning approach to coreference resolution

Computational Linguistics (COLI), Volume 39, Issue 4, Pages 847–884, https://doi.org/10.1162/COLI_a_00151

This work is focused on research in machine learning for coreference resolution. Coreference resolution is a natural language processing task that consists of determining the expressions in a discourse that refer to the same entity. The main ...

research-article

Sibyl, a factoid question-answering system for spoken documents

ACM Transactions on Information Systems (TOIS), Volume 30, Issue 3, Article No.: 19, Pages 1–40, https://doi.org/10.1145/2328967.2328972

In this article, we present a factoid question-answering system, Sibyl, specifically tailored for question answering (QA) on spoken-word documents. This work explores, for the first time, which techniques can be robustly adapted from the usual QA on ...

research-article

RelaxCor participation in CoNLL shared task on coreference resolution

CONLL Shared Task '11: Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task, Pages 35–39

This paper describes the participation of RelaxCor in the CoNLL-2011 shared task: "Modeling Unrestricted Coreference in OntoNotes". RelaxCor is a constraint-based graph partitioning approach to coreference resolution solved by relaxation labeling. The ...

research-article

A global relaxation labeling approach to coreference resolution

COLING '10: Proceedings of the 23rd International Conference on Computational Linguistics: Posters, Pages 1086–1094

This paper presents a constraint-based graph partitioning approach to coreference resolution solved by relaxation labeling. The approach combines the strengths of groupwise classifiers and chain formation methods in one global method. Experiments show ...

research-article

RelaxCor: A global relaxation labeling approach to coreference resolution

SemEval '10: Proceedings of the 5th International Workshop on Semantic Evaluation, Pages 88–91

This paper describes the participation of RelaxCor in SemEval-2010 Task 1: "Coreference Resolution in Multiple Languages". RelaxCor is a constraint-based graph partitioning approach to coreference resolution solved by relaxation labeling. The ...

Article

Unsupervised Relation Extraction by Massive Clustering

ICDM '09: Proceedings of the 2009 Ninth IEEE International Conference on Data Mining, Pages 782–787, https://doi.org/10.1109/ICDM.2009.81

The goal of Information Extraction is to automatically generate structured pieces of information from the relevant information contained in text documents. Machine Learning techniques have been applied to reduce the cost of Information Extraction system ...

Article

Robust question answering for speech transcripts: UPC experience in QAst 2009

CLEF'09: Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments, Pages 297–304

This paper describes the participation of the Technical University of Catalonia in the CLEF 2009 Question Answering on Speech Transcripts track. We have participated in the English and Spanish scenarios of QAST. For both manual and automatic transcripts ...

Article

Overview of QAST 2009

CLEF'09: Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments, Pages 197–211

This paper describes the experience of QAST 2009, the third time a pilot track of CLEF has been held aiming to evaluate the task of Question Answering in Speech Transcripts. Four sites submitted results for at least one of the three scenarios (European ...

research-article

An analysis of bootstrapping for the recognition of temporal expressions

SemiSupLearn '09: Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing, Pages 49–57

We present a semi-supervised (bootstrapping) approach to the extraction of time expression mentions in large unlabelled corpora. Because the only supervision is in the form of seed examples, it becomes necessary to resort to heuristics to rank and ...

Article

Robust question answering for speech transcripts: UPC experience in QAst 2008

CLEF'08: Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access, Pages 492–499

This paper describes the participation of the Technical University of Catalonia in the CLEF 2008 Question Answering on Speech Transcripts track. We have participated in the English and Spanish scenarios of QAst. For the processing of manual transcripts ...

Article

Overview of QAST 2008

CLEF'08: Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access, Pages 314–324

This paper describes the experience of QAST 2008, the second time a pilot track of CLEF has been held aiming to evaluate the task of Question Answering in Speech Transcripts. Five sites submitted results for at least one of the five scenarios (lectures ...

Article

Spoken Document Retrieval Based on Approximated Sequence Alignment

TSD '08: Proceedings of the 11th international conference on Text, Speech and Dialogue, Pages 285–292, https://doi.org/10.1007/978-3-540-87391-4_37

This paper presents a new approach to spoken document information retrieval for spontaneous speech corpora. The classical approach to this problem is the use of an automatic speech recognizer (ASR) combined with standard information retrieval techniques. ...

Article

A Graph Partitioning Approach to Entity Disambiguation Using Uncertain Information

GoTAL '08: Proceedings of the 6th international conference on Advances in Natural Language Processing, Pages 428–439, https://doi.org/10.1007/978-3-540-85287-2_41

This paper presents a method for Entity Disambiguation in Information Extraction from different sources on the web. Once entities and the relations between them are extracted, it is necessary to determine which ones refer to the same real-world entity. ...

Article

Comparing Non-parametric Ensemble Methods for Document Clustering

NLDB '08: Proceedings of the 13th international conference on Natural Language and Information Systems: Applications of Natural Language to Information Systems, Pages 245–256, https://doi.org/10.1007/978-3-540-69858-6_25

The biases of individual algorithms for non-parametric document clustering can lead to non-optimal solutions. Ensemble clustering methods may overcome this limitation, but have not been applied to document collections. This paper presents a comparison of ...

chapter

Robust Question Answering for Speech Transcripts Using Minimal Syntactic Analysis

Advances in Multilingual and Multimodal Information Retrieval, May 2008, Pages 424–432, https://doi.org/10.1007/978-3-540-85760-0_56

This paper describes the participation of the Technical University of Catalonia in the CLEF 2007 Question Answering on Speech Transcripts track. For the processing of manual transcripts we have deployed a robust factual Question Answering that uses ...

chapter

Overview of QAST 2007

Advances in Multilingual and Multimodal Information Retrieval, May 2008, Pages 249–256, https://doi.org/10.1007/978-3-540-85760-0_29

This paper describes QAST, a pilot track of CLEF 2007 aimed at evaluating the task of Question Answering in Speech Transcripts. The paper summarizes the evaluation framework, the systems that participated and the results achieved. These results have ...
