Skip to main content

Naushad Uzzaman

Followers

38

Following

3

Co-authors

3

Public Views

Interests

Uploads

Papers by Naushad Uzzaman

Multimodal summarization of complex sentences

In this paper, we introduce the idea of automatically illustrating complex sentences as multimoda... more In this paper, we introduce the idea of automatically illustrating complex sentences as multimodal summaries that combine pictures, structure and simplified compressed text. By including text and structure in addition to pictures, multimodal summaries provide additional clues of what happened, who did it, to whom and how, to people who may have difficulty reading or who are looking to skim quickly. We present ROC-MMS, a system for automatically creating multimodal summaries (MMS) of complex sentences by generating pictures, textual summaries and structure. We show that pictures alone are insufficient to help people understand most sentences, especially for readers who are unfamiliar with the domain. An evaluation of ROC-MMS in the Wikipedia domain illustrates both the promise and challenge of automatically creating multimodal summaries.

SemEval-2013 Task 1: TempEval-3: Evaluating Time Expressions, Events, and Temporal Relations

by Hector Llorens, Naushad Uzzaman, and Marc Verhagen

TimeML-strict: clarifying temporal annotation

by Hector Llorens and Naushad Uzzaman

ABSTRACT TimeML is an XML-based schema for annotating temporal information over discourse. The st... more ABSTRACT TimeML is an XML-based schema for annotating temporal information over discourse. The standard has been used to annotate a variety of resources and is followed by a number of tools, the creation of which constitute hundreds of thousands of man-hours of research work. However, the current state of resources is such that many are not valid, or do not produce valid output, or contain ambiguous or custom additions and removals. Difficulties arising from these variances were highlighted in the TempEval-3 exercise, which included its own extra stipulations over conventional TimeML as a response. To unify the state of current resources, and to make progress toward easy adoption of its current incarnation ISO-TimeML, this paper introduces TimeML-strict: a valid, unambiguous, and easy-to-process subset of TimeML. We also introduce three resources -- a schema for TimeML-strict; a validator tool for TimeML-strict, so that one may ensure documents are in the correct form; and a repair tool that corrects common invalidating errors and adds disambiguating markup in order to convert documents from the laxer TimeML standard to TimeML-strict.

AnalysisofandObservationsfromaBangla��ewsCor pus

Analysis of N-Gram Based Text Categorization for Bangla in a Newspaper Corpus

by Naushad Uzzaman and Munirul Mansur

In this paper, we study the outcome of using n- gram based algorithm for Bangla text categorizati... more In this paper, we study the outcome of using n- gram based algorithm for Bangla text categorization. To analyze the efficiency of this methodology we used one year Prothom-Alo news corpus. Our results show that n-grams of length 2 or 3 are the most useful for categorization. Using gram lengths more than 3 reduces the performance of categorization.

Evaluating Temporal Information Understanding with Temporal Question Answering

by Hector Llorens and Naushad Uzzaman

2012 IEEE Sixth International Conference on Semantic Computing, 2012

ABSTRACT The temporal annotation scheme Time ML was developed to support research in complex temp... more ABSTRACT The temporal annotation scheme Time ML was developed to support research in complex temporal question answering (QA). Given the complexity of temporal QA, most of the efforts have focused, so far, on extracting temporal information, which has been evaluated with corpus-based evaluation. However, the QA task represents a natural way to evaluate temporal information understanding, and creating question sets is less costly for humans than manually annotating temporal information, which is required to perform corpus-based evaluation. Additionally, QA performance better captures the understanding of important temporal information as compared to corpus-based evaluation where all information is equally important for scoring. This paper presents a temporal QA system that performs temporal reasoning. It can be used to answer temporal questions (factoid, list and yes/no), about any document annotated in Time ML. In the paper, we show how this system can be used to evaluate automated temporal information understanding. Our QA-based evaluation results suggest that (i) the available temporal annotations are not complete, and (ii) QA provides a less costly and more reliable way of evaluating temporal understanding systems. To favour replicability, we made the temporal QA system and the question set used in the evaluation available.

Extracting Events and Temporal Expressions from Text

2010 IEEE Fourth International Conference on Semantic Computing, 2010

History (Forward N-Gram) or Future (Backward N-Gram)? Which Model to Consider for N-Gram analysis in Bangla?

This paper presents a directional advantage of n- gram modeling in terms of backward or forward n... more This paper presents a directional advantage of n- gram modeling in terms of backward or forward n- gram modeling in Bangla. The most commonly used n- gram analysis is predominantly a forward n-gram. However in Bangla it appears that a backward n- gram is repeatedly more successful and yields more grammatical results than a forward n-gram. This paper hypothesizes that

Merging Temporal Annotations

by Hector Llorens and Naushad Uzzaman

2012 19th International Symposium on Temporal Representation and Reasoning, 2012

Comparison of different POS Tagging Techniques (n-gram, HMM and Brill’s tagger) for Bangla

Advances and Innovations in Systems, Computing Sciences and Software Engineering, 2007

EVENT AND TEMPORAL EXPRESSION EXTRACTION FROM RAW TEXT: FIRST STEP TOWARDS A TEMPORALLY AWARE SYSTEM

International Journal of Semantic Computing, 2010

Temporal evaluation

TwitterPaul: Extracting and Aggregating Twitter Predictions

TRIOS-TimeBank Corpus: Extended TimeBank corpus with help of deep understanding of text

TRIPS and TRIOS system for TempEval-2: Extracting temporal information from text

A comprehensive Roman (English)-to-Bangla transliteration scheme

... Naushad UzZaman (naushad@bracuniversity.ac.bd), Arnab Zaheen (arnab@bracuniversity. ac.bd) an... more

Rule based automated pronunciation generator

Rule based Automated Pronunciation Generator Ayesha Binte Mosaddeque, Naushad UzZaman, and Mumit ... more

N-gram based statistical grammar checker for Bangla and English

Comparison of Unigram, Bigram, Hmm and Brill's Pos Tagging Approaches for Some South Asian Languages

proceedings of conference on …, 2007

Page 1. Comparison of Unigram, Bigram, HMM and Brill's POS Tagging Approaches for some South... more

Analysis of and Observations From a Bangla News Corpus

Proceedings of 9th …, 2006

A Corpus from linguistic point of view is defined as a collection of transcribed speech or writte... more

Multimodal summarization of complex sentences

In this paper, we introduce the idea of automatically illustrating complex sentences as multimoda... more In this paper, we introduce the idea of automatically illustrating complex sentences as multimodal summaries that combine pictures, structure and simplified compressed text. By including text and structure in addition to pictures, multimodal summaries provide additional clues of what happened, who did it, to whom and how, to people who may have difficulty reading or who are looking to skim quickly. We present ROC-MMS, a system for automatically creating multimodal summaries (MMS) of complex sentences by generating pictures, textual summaries and structure. We show that pictures alone are insufficient to help people understand most sentences, especially for readers who are unfamiliar with the domain. An evaluation of ROC-MMS in the Wikipedia domain illustrates both the promise and challenge of automatically creating multimodal summaries.

SemEval-2013 Task 1: TempEval-3: Evaluating Time Expressions, Events, and Temporal Relations

by Hector Llorens, Naushad Uzzaman, and Marc Verhagen

TimeML-strict: clarifying temporal annotation

by Hector Llorens and Naushad Uzzaman

ABSTRACT TimeML is an XML-based schema for annotating temporal information over discourse. The st... more ABSTRACT TimeML is an XML-based schema for annotating temporal information over discourse. The standard has been used to annotate a variety of resources and is followed by a number of tools, the creation of which constitute hundreds of thousands of man-hours of research work. However, the current state of resources is such that many are not valid, or do not produce valid output, or contain ambiguous or custom additions and removals. Difficulties arising from these variances were highlighted in the TempEval-3 exercise, which included its own extra stipulations over conventional TimeML as a response. To unify the state of current resources, and to make progress toward easy adoption of its current incarnation ISO-TimeML, this paper introduces TimeML-strict: a valid, unambiguous, and easy-to-process subset of TimeML. We also introduce three resources -- a schema for TimeML-strict; a validator tool for TimeML-strict, so that one may ensure documents are in the correct form; and a repair tool that corrects common invalidating errors and adds disambiguating markup in order to convert documents from the laxer TimeML standard to TimeML-strict.

AnalysisofandObservationsfromaBangla��ewsCor pus

Analysis of N-Gram Based Text Categorization for Bangla in a Newspaper Corpus

by Naushad Uzzaman and Munirul Mansur

In this paper, we study the outcome of using n- gram based algorithm for Bangla text categorizati... more In this paper, we study the outcome of using n- gram based algorithm for Bangla text categorization. To analyze the efficiency of this methodology we used one year Prothom-Alo news corpus. Our results show that n-grams of length 2 or 3 are the most useful for categorization. Using gram lengths more than 3 reduces the performance of categorization.

Evaluating Temporal Information Understanding with Temporal Question Answering

by Hector Llorens and Naushad Uzzaman

2012 IEEE Sixth International Conference on Semantic Computing, 2012

ABSTRACT The temporal annotation scheme Time ML was developed to support research in complex temp... more ABSTRACT The temporal annotation scheme Time ML was developed to support research in complex temporal question answering (QA). Given the complexity of temporal QA, most of the efforts have focused, so far, on extracting temporal information, which has been evaluated with corpus-based evaluation. However, the QA task represents a natural way to evaluate temporal information understanding, and creating question sets is less costly for humans than manually annotating temporal information, which is required to perform corpus-based evaluation. Additionally, QA performance better captures the understanding of important temporal information as compared to corpus-based evaluation where all information is equally important for scoring. This paper presents a temporal QA system that performs temporal reasoning. It can be used to answer temporal questions (factoid, list and yes/no), about any document annotated in Time ML. In the paper, we show how this system can be used to evaluate automated temporal information understanding. Our QA-based evaluation results suggest that (i) the available temporal annotations are not complete, and (ii) QA provides a less costly and more reliable way of evaluating temporal understanding systems. To favour replicability, we made the temporal QA system and the question set used in the evaluation available.

Extracting Events and Temporal Expressions from Text

2010 IEEE Fourth International Conference on Semantic Computing, 2010

History (Forward N-Gram) or Future (Backward N-Gram)? Which Model to Consider for N-Gram analysis in Bangla?

This paper presents a directional advantage of n- gram modeling in terms of backward or forward n... more This paper presents a directional advantage of n- gram modeling in terms of backward or forward n- gram modeling in Bangla. The most commonly used n- gram analysis is predominantly a forward n-gram. However in Bangla it appears that a backward n- gram is repeatedly more successful and yields more grammatical results than a forward n-gram. This paper hypothesizes that

Merging Temporal Annotations

by Hector Llorens and Naushad Uzzaman

2012 19th International Symposium on Temporal Representation and Reasoning, 2012

Comparison of different POS Tagging Techniques (n-gram, HMM and Brill’s tagger) for Bangla

Advances and Innovations in Systems, Computing Sciences and Software Engineering, 2007

EVENT AND TEMPORAL EXPRESSION EXTRACTION FROM RAW TEXT: FIRST STEP TOWARDS A TEMPORALLY AWARE SYSTEM

International Journal of Semantic Computing, 2010

Temporal evaluation

TwitterPaul: Extracting and Aggregating Twitter Predictions

TRIOS-TimeBank Corpus: Extended TimeBank corpus with help of deep understanding of text

TRIPS and TRIOS system for TempEval-2: Extracting temporal information from text

A comprehensive Roman (English)-to-Bangla transliteration scheme

... Naushad UzZaman (naushad@bracuniversity.ac.bd), Arnab Zaheen (arnab@bracuniversity. ac.bd) an... more

Rule based automated pronunciation generator

Rule based Automated Pronunciation Generator Ayesha Binte Mosaddeque, Naushad UzZaman, and Mumit ... more

N-gram based statistical grammar checker for Bangla and English

Comparison of Unigram, Bigram, Hmm and Brill's Pos Tagging Approaches for Some South Asian Languages

proceedings of conference on …, 2007

Page 1. Comparison of Unigram, Bigram, HMM and Brill's POS Tagging Approaches for some South... more

Analysis of and Observations From a Bangla News Corpus

Proceedings of 9th …, 2006

A Corpus from linguistic point of view is defined as a collection of transcribed speech or writte... more