Enya Kong Tang

Parallel texts or Bitexts - where the same content is available in several languages, due to document translation, are becoming plentiful and available, both in private data warehouses and on publicly accessible sites on the WWW

Publication Date: 2001

Download (.pdf)

Sourcing for large amount of text and translating them are some of the challenges in building an Example-Based Machine Translation (EBMT) system. These big amounts of translated texts are annotated into the S-SSTC format to cover an... more

Sourcing for large amount of text and translating them are some of the challenges in building an Example-Based Machine Translation (EBMT) system. These big amounts of translated texts are annotated into the S-SSTC format to cover an extensive vocabulary and sentence structures. However, the Bilingual Knowledge Bank (BKB), which is a collection of the S-SSTCs, will normally contain redundancy. Hence, the idea of an optimized BKB is born. An optimized BKB (redundancy reduced; is smaller in size but is as equally extensive in term of its sentence structure coverage compared to an un-optimized BKB. Therefore, an optimized BKB enhances the performance of the EBMT. In this paper, we introduce the idea of an optimized BKB and propose it to be re-used to effectively construct new BKBs in order to adapt an existing EBMT for new language pairs

Publication Date: Nov 1, 2007

Download (.pdf)

Long short term memory (LSTM) networks have been gaining popularity in modeling sequential data such as phoneme recognition, speech translation, language modeling, speech synthesis, chatbot-like dialog systems and others. This paper... more

Long short term memory (LSTM) networks have been gaining popularity in modeling sequential data such as phoneme recognition, speech translation, language modeling, speech synthesis, chatbot-like dialog systems and others. This paper investigates the attention-based encoder-decoder LSTM networks in Malay part-of-speech (POS) tagging when it is compared to weighted finite state transducer (WFST) and hidden Markov model (HMM). The attractiveness of LSTM networks is its strength in modeling long distance dependencies. Malay POS tagging is examined from two different conditions: with and without morphological information. The experiment results show that LSTM networks that are trained without any explicit morphological knowledge perform nearly equally with WFST but better than HMM approach that is trained with morphological information.

Publication Date: 2017

Publication Name: Journal of Telecommunication, Electronic and Computer Engineering

Research Interests:
Engineering, Computer Science, Natural Language Processing, Speech Recognition, and hidden Markov model

Download (.pdf)

Publisher: Elsevier BV

Publication Date: 2017

Publication Name: Expert Systems with Applications

Research Interests:
Computer Science, Artificial Intelligence, Natural Language Processing, Machine Translation, Semantic similarity, and 3 moreSemantics (Computer Science), Mathematical Sciences, and Example Based Machine Translation

Download (.pdf)

Publication Date: 2001

Publication Name: Webnet

Research Interests:
Computer Science

Publisher: Elsevier BV

Publication Date: 2016

Publication Name: Expert Systems with Applications

Research Interests:
Computer Science, Artificial Intelligence, Natural Language Processing, and Mathematical Sciences

Download (.pdf)

The search that involves structured web resources like XML data, services is still lagging of its own method and relying on contemporary search systems. This paper presents a method that learns semantics from structured information of... more

The search that involves structured web resources like XML data, services is still lagging of its own method and relying on contemporary search systems. This paper presents a method that learns semantics from structured information of these resources. Instead of committing the semantic meaning of resources to strict and formal vocabularies like ontology or data dictionary, we are interested to

Publication Date: 2006

Publication Name: 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06)

Research Interests:
Computer Science, Information Retrieval, Semantic Web, Web Intelligence, Social Semantic Web, and 4 moreResource Selection, World Wide Web, Semantic Analytics, and web search query

Structured retrieval aims at exploiting the structural information of documents when searching for documents. Structured retrieval makes use of both content and structure of documents to improve information retrieval. Therefore, the... more

Structured retrieval aims at exploiting the structural information of documents when searching for documents. Structured retrieval makes use of both content and structure of documents to improve information retrieval. Therefore, the availability of semantic structure in the documents is an important factor for the success of structured retrieval. However, the majority of documents in the Web still lack semantically-rich structure.

Publication Date: 2009

Publication Name: Proceeding of the 2nd ACM workshop on Social web search and mining - SWSM '09

Research Interests:
Computer Science, Information Retrieval, Unsupervised Learning, Semantic Information, Learning Process, and 3 moreDocument Structure, Web Documents, and Explicit semantic analysis ESA

The String-Tree Correspondence Grammar (STCG) [1] is a grammar formalism for defining: • a set of strings (a language), • a set of trees (valid representation/interpretation structures), • the mapping between the two (to be interpreted... more

The String-Tree Correspondence Grammar (STCG) [1] is a grammar formalism for defining: • a set of strings (a language), • a set of trees (valid representation/interpretation structures), • the mapping between the two (to be interpreted for analysis &amp; generation). The formalism is argued to be a totally declarative grammar formalism that can associate, to strings in a language, arbitrary tree structures as desired by the grammar writer to be the linguistic representation structures of the strings. More importantly is the facility to specify the correspondence between the string and the associated tree in a very natural manner. These features are very much desired in grammar writing, in particular for the treatment of certain linguistic phenomena which are &apos;non-standard&apos;, namely featurisation, lexicalisation and crossed dependencies [2,3]. Furthermore, a grammar written in this way naturally inherits the desired property of bi-directionality (in fact non-directionality [4]) such that the same grammar can be interpreted for both analysis and generation. In this paper, we investigate the properties of the STCG for interpretation towards analysis (as is understood within the context of Machine Translation (MT)). Other than using STCG

Publication Date: 1994

Research Interests:
Computer Science

A system was proposed to implement the phoneme segmentation for the Malay language connected words. The system consists of the front-end speech preprocessing part which focuses on the usage of zero crossing rates. The detection algorithm... more

A system was proposed to implement the phoneme segmentation for the Malay language connected words. The system consists of the front-end speech preprocessing part which focuses on the usage of zero crossing rates. The detection algorithm was used to determine the beginning and ending of the phonemes based on the silence intervals and valleys. Object-Oriented Programming (OOP) approach and Graphical

Research Interests:
Object Oriented Programming, Speech Segmentation, Graphic User Interface Design, Word Segmentation, Zero Crossing Rate, and 2 moreFront end and Detection Algorithm

Abstract. Word sense disambiguation (WSD) requires the establish-ment of a list of the different meanings of words. WSD efforts in ma-chine translation require, in addition, the equivalent translation words in target languages. To... more

Abstract. Word sense disambiguation (WSD) requires the establish-ment of a list of the different meanings of words. WSD efforts in ma-chine translation require, in addition, the equivalent translation words in target languages. To facilitate WSD in machine translation systems, we propose ...

Publisher: mti.ugm.ac.id

Publication Date: 2004

Publication Name: Unit Terjemahan Melalui Komputer, Universiti …

Research Interests:
Natural Language Processing

Download (.pdf)

Kamus Dewan is the authoritative dictionary for Bahasa Malaysia, containing a wealth of linguistic and cultural information about Bahasa Malaysia. It is currently available in print, as well as a searchable online dictionary. However, the... more

Kamus Dewan is the authoritative dictionary for Bahasa Malaysia, containing a wealth of linguistic and cultural information about Bahasa Malaysia. It is currently available in print, as well as a searchable online dictionary. However, the online dictionary lacks advanced search capabilities that target specific fields within each headword and lemma entry. For this information to be targeted and extracted efficiently by computers, the macro- and micro-structures of Kamus Dewan entries need to be first annotated or marked up explicitly. We describe how TEI-P5 guidelines have been applied in this endeavour to make the Kamus Dewan more machine-tractable. We also give some examples of how the machine-tractable data from Kamus Dewan can be used for linguistic research and analysis, as well as for producing other language resources.

Publisher: PeerJ

Publication Date: 2016

Research Interests:
Computer Science

Download (.pdf)

Categories are used to organize information and knowledge in directory system, folder etc. As the amount of information increase and the types of information diversify, it is common to have more categories created. As the number of... more

Categories are used to organize information and knowledge in directory system, folder etc. As the amount of information increase and the types of information diversify, it is common to have more categories created. As the number of categories increases, it becomes more difficult to organize, manage and look up information from existing categories. In this paper, categories are annotated with concept features to facilitate the access, retrieval and sharing of information in the categories. We have observed that training texts is crucial in learning the concept of a category and serves as a good measure to help human to construct the category model. Hence, we present a study on training texts selection and evaluate the effectiveness of training texts, as well as its capability to complement human's knowledge in constructing the category model. Experimental evaluation shows that using training texts approach in category model construction gives promising results in both effectiveness and complement measures

Publication Date: 2006

Publication Name: Proceedings of the 2006 Ieee Wic Acm International Conference on Web Intelligence

Research Interests:
Computer Science, Information Retrieval, Text Analysis, and Classification

EXTENDED ABSTRACT The retrieval of structured resources using unstructured queries is challenging as we need to deal with the matching between entities of two different types. Consider an unstructured query, “publications of K.H. Gan in... more

EXTENDED ABSTRACT The retrieval of structured resources using unstructured queries is challenging as we need to deal with the matching between entities of two different types. Consider an unstructured query, “publications of K.H. Gan in WI”, in a structured retrieval system. To match this query to structured resources, the system needs to transform it into a format that is comparable to the structure of the resources. As such, we develop a solution that automatically transform unstructured query to a mediated query which is enhanced with structural information. The mediated query is then matched against structured resources to obtain relevant results.

Publication Date: 2008

Publication Name: Research and Development in Information Retrieval

Research Interests:
XML retrieval

Automatic question answering (QA) is playing an increasingly important role in intelligent answer searching. Many approaches have been employed for retrieving answers to natural language questions with rule-based approach being one of... more

Automatic question answering (QA) is playing an increasingly important role in intelligent answer searching. Many approaches have been employed for retrieving answers to natural language questions with rule-based approach being one of them. Traditionally, rules for automatic QA have been generated manually which may be time consuming and limited in scope. To address this issue, we present a proposed automatic rule extraction approach to generate rules for QA from training data via structural clustering. Key words: Automatic question answering, rule extraction, structural clustering. 1.

Publication Date: 2008

Research Interests:
Natural language, Question Answering, Rule Extraction, and Rule Based

Download (.pdf)

This paper outlines the creation of an open combined semantic lexicon as a resource for the study of lexical semantics in the Malay languages (Malaysian and Indonesian). It is created by combining three earlier wordnets, each built using... more

This paper outlines the creation of an open combined semantic lexicon as a resource for the study of lexical semantics in the Malay languages (Malaysian and Indonesian). It is created by combining three earlier wordnets, each built using different resources and approaches: the Malay Wordnet (Lim & Hussein 2006), the Indonesian Wordnet (Riza, Budiono & Hakim 2010) and the Wordnet Bahasa (Nurril Hirfana, Sapuan & Bond 2011). The final wordnet has been validated and extended as part of sense annotation of the Indonesian portion of the NTU Multilingual Corpus (Tan & Bond 2012). The wordnet has over 48,000 concepts and 58,000 words for Indonesian and 38,000 concepts and 45,000 words for Malaysian.

Publication Date: 2014

Research Interests:
Computer Science, Indonesian Language, Language Resources, Malay Language, and Wordnet

Download (.pdf)

In this paper, we would like to present an approach to construct a huge Bilingual Knowledge Bank (BKB) from an English Malay bilingual dictionary based on the idea of synchronous Structured String-Tree Correspondence (SSTC). The SSTC is a... more

In this paper, we would like to present an approach to construct a huge Bilingual Knowledge Bank (BKB) from an English Malay bilingual dictionary based on the idea of synchronous Structured String-Tree Correspondence (SSTC). The SSTC is a general structure that can associate an arbitrary tree structure to string in a language as desired by the annotator to be the interpretation structure of the string, and more importantly is the facility to specify the correspondence between the string and the associated tree which can be non-projective. With this structure, we are able to match linguistic units at different inter levels of the structure (i.e. define the correspondence between substrings in the sentence, nodes in the tree, subtrees in the tree and sub-correspondences in the SSTC). This flexibility makes synchronous SSTC very well suited for the construction of a Bilingual Knowledge Bank we need for the English-Malay MT application.

Publisher: MTSUMMIT

Publication Date: 2001

Research Interests:
Tree Structure

Download (.pdf)

Kertas ini memperihal tentang pembinaan korpus pertuturan Bahasa Melayu untuk diguna dalam pembinaan sistem pertuturan Bahasa Melayu. Korpus pertuturan Bahasa Melayu ini diwakili dengan perwakilan struktur pokok sintaks-prosodi, yang... more

Kertas ini memperihal tentang pembinaan korpus pertuturan Bahasa Melayu untuk diguna dalam pembinaan sistem pertuturan Bahasa Melayu. Korpus pertuturan Bahasa Melayu ini diwakili dengan perwakilan struktur pokok sintaks-prosodi, yang diubah suai daripada struktur perwakilan Structured-String Correspondence (SSTC). Bagi membina korpus pertuturan Bahasa Melayu dalam perwakilan sintaks-prosodi, ayat teks yang sedia kala dalam perwakilan SSTC diguna sebagai skrip rakaman. Melalui rakaman suara berdasarkan skrip tersebut, fitur prosodi diekstrak keluar dan dianotasi pada struktur pokok SSTC, dan pada masa yang sama, fail bunyi dipaut pada nod struktur pohon SSTC. Pada akhir pemprosesan rakaman dan anotasi, mini korpus pertuturan yang diwakili dengan perwakilan sintaksis-prosodi yang mengandungi 422 ayat, 1720 frasa dan 6978 unit perkataan berjaya dihasil.

Publication Date: 2013

Download (.pdf)

We present the S-SSTC framework for machine translation (MT), introduced in 2002 and developed since as a set of working MT systems (SiSTeC-ebmt). Our approach is example-based, but differs from other EBMT approaches in that it uses... more

We present the S-SSTC framework for machine translation (MT), introduced in 2002 and developed since as a set of working MT systems (SiSTeC-ebmt). Our approach is example-based, but differs from other EBMT approaches in that it uses alignments of string-tree alignments, and in that supervised learning is an integral part of the approach. Our model directly deals with three main difficulties in the traditional treatment of MT that stem from its separation from the "translation task" (the 'world'). First, by allowing the system to learn from real translation examples directly, we avoid the need to indefinitely pursue the elusive goal of writing grammars to exactly describe intermediate syntacticosemantic monolingual representations and their correspondences. Second, we make explicit the dependence of the MT system performance on the input from the environment. That is possible only because the learning process uses feedback from the real translation knowledge when co...

Publisher: PACLIC

Publication Date: 2011

Research Interests:
Computer Science

Download (.pdf)

... Malaysia enyakong@mmu.edu.my Alvin Yeo Wee Universiti Malaysia Sarawak Faculty of Computer Science and Information Technology 94300 Kota Samarahan, Malaysia alvin@fit.unimas.my Wong Chui Yin Multimedia University ...

Publisher: portal.acm.org

Publication Date: 2009

Publication Name: Proceedings of the …

Research Interests:
Machine Translation and Shared Workspace

In this paper we sketch an approach for Natural Language parsing. Our approach is an example-based approach, which relies mainly on examples that already parsed to their representation structure, and on the knowledge that we can get from... more

In this paper we sketch an approach for Natural Language parsing. Our approach is an example-based approach, which relies mainly on examples that already parsed to their representation structure, and on the knowledge that we can get from these examples the required information to parse a new input sentence. In our approach, examples are annotated with the Structured String Tree Correspondence (SSTC) annotation schema where each SSTC describes a sentence, a representation tree as well as the correspondence between substrings in the sentence and subtrees in the representation tree. In the process of parsing, we first try to build subtrees for phrases in the input sentence which have been successfully found in the example-base - a bottom up approach. These subtrees will then be combined together to form a single rooted representation tree based on an example with similar representation structure - a top down approach.

Research Interests:
Natural language, Natural Language Parsing, Bottom Up, and Top Down

Download (.pdf)

Publication Date: 2009

Publisher: American Scientific Publishers

Publication Name: Advanced Science Letters

Research Interests:
Multidisciplinary

Publication Date: Feb 1, 2002

Download (.pdf)

Publisher: Centro de Innovacion y Desarrollo Tecnologico en Computo

Publication Date: 2011

Publication Name: Polibits

Research Interests:
Natural Language Processing

Download (.pdf)

Abstract. This paper presents a research proposal on user-oriented evaluation method to compare the usability of Internet search tools. Cognitive style and problem solving style are identified individual difference factors. Meta-search,... more

Abstract. This paper presents a research proposal on user-oriented evaluation method to compare the usability of Internet search tools. Cognitive style and problem solving style are identified individual difference factors. Meta-search, portal and individual search engines are Internet search tool available. Usability of each search tools based on relevancy and satisfaction is another factor of this study. The ultimate aim

Publication Date: 2000

Publication Name: Lecture Notes in Computer Science

Research Interests:
Information Retrieval, User Interface, Cognitive Style, Problem Solving, Search Engine, and 2 moreEcdl and Advanced Computing for Electrical Technology

Publication Date: 2014

Publication Name: Proceedings of the 5th International Workshop on Web-scale Knowledge Representation Retrieval & Reasoning - Web-KR '14

Publication Date: 2014

Publication Name: Proceedings of the 5th International Workshop on Web-scale Knowledge Representation Retrieval & Reasoning - Web-KR '14

Research Interests:
Sentiment Analysis

Publisher: Elsevier BV

Publication Date: 2011

Publication Name: Procedia - Social and Behavioral Sciences

Research Interests:
Parsing

Download (.pdf)

Publisher: IEEE

Publication Date: 2009

Publication Name: 2009 Oriental COCOSDA International Conference on Speech Database and Assessments

Research Interests:
Natural Language Processing, Speech Recognition, Web Pages, Broadcast news, Data Extraction, and Rule Based

Download (.pdf)

Publication Date: 2011

Research Interests:
Machine Translation, Procedia - Social and Behavioral Science, and Multilingual lexicon

Download (.pdf)

ABSTRACT This research work describes our approaches in using dependency parse tree information to derive useful hidden word statistics to improve the baseline system of Malay large vocabulary automatic speech recognition system. The... more

ABSTRACT This research work describes our approaches in using dependency parse tree information to derive useful hidden word statistics to improve the baseline system of Malay large vocabulary automatic speech recognition system. The traditional approaches to train language model are mainly based on Chomsky hierarchy type 3 that approximates natural language as regular language. This approach ignores the characteristics of natural language. Our work attempted to overcome these limitations by extending the approach to consider Chomsky hierarchy type 1 and type 2. We extracted the dependency tree based lexical information and incorporate the information into the language model. The second pass lattice rescoring was performed to produce better hypotheses for Malay large vocabulary continuous speech recognition system. The absolute WER reduction was 2.2% and 3.8% for MASS and MASS-NEWS Corpus, respectively.

Publication Date: 2013

Publication Name: 2013 International Conference on Asian Language Processing

Research Interests:
Natural Language Processing and Speech Recognition

ABSTRACT There have been many R&amp;D projects conducted under PPSKOMP (School of Computer Sciences, Universiti Sains Malaysia) since its establishment in 1995 until today. In PPSKOMP, there are eight major research groups... more

ABSTRACT There have been many R&amp;D projects conducted under PPSKOMP (School of Computer Sciences, Universiti Sains Malaysia) since its establishment in 1995 until today. In PPSKOMP, there are eight major research groups established, which are: Artificial Intelligence Lab, Computer Aided Translation Unit, Computer Vision Research Group, Health Information Research Group, Information Systems Engineering, Multimedia Research Group, Network Research Group, and Parallel and Distributed Computing. Many knowledge resources and processing components have been developed by researchers and available in each of this research group. However, these resources are resided and accessible only in respective research group and mostly developed using different methodologies, paradigm and platform. In this paper, we present a Service-Oriented Architecture (SOA) framework which capable to resolve this problem and enable the synergisation of research and development strengths in PPSKOMP.

Publication Date: 2010

Publication Name: 2010 International Symposium on Information Technology

Research Interests:
Business, Software Engineering, Computer Vision, Information Theory, Natural Language Processing, and 10 moreMachine Learning, Project Management, Service Oriented Architecture, Machine Translation, Text Mining, Software Architecture, Web Services, Mediation, Computers, and Information System

Publisher: IEEE

Publication Date: 2009

Publication Name: 2009 International Conference on Signal Acquisition and Processing

Research Interests:
Signal Processing, Speech Synthesis, Frequency, Degradation, Concatenated codes, and Natural Languages

Download (.pdf)

ABSTRACT On the web, most structured document collections consist of documents from different sources and marked up with different types of structures. The diversity of structures has led to the emergence of heterogeneous structured... more

ABSTRACT On the web, most structured document collections consist of documents from different sources and marked up with different types of structures. The diversity of structures has led to the emergence of heterogeneous structured documents. The heterogeneity of structured documents is one of the reason for query-document mismatch in structured document retrieval. In structured document retrieval, a user is assumed to have intimate knowledge of the document structures and is able to specify contextual constraints in their queries. However, it is impossible for the user to know all structures in heterogeneous structured document collections. In this paper, we propose to include similar correspondence relations in the representation model for structured document retrieval. The similar correspondences make the relations between similar contents explicit in order to improve structured document retrieval effectiveness. We introduce a generic and flexible structured document model to represent heterogeneous structured documents as well as the similar correspondences in the document collections. We also illustrate how the proposed model can be utilized in structured document retrieval.

Publication Date: 2011

Publication Name: 2011 International Conference on Semantic Technology and Information Retrieval

Research Interests:
Semantics, XML, Semantic Web, and Context Modeling

Publication Date: 2013

Publication Name: Lecture Notes in Computer Science

Download (.pdf)

Publication Date: 2012

Publication Name: 2012 International Conference on Asian Language Processing

Research Interests:
Natural Language Processing, Data Analysis, and Data acquisition

Download (.pdf)

Publication Date: 2014

Publication Name: 2014 International Conference on Asian Language Processing (IALP)

Publisher: Springer Nature

Publication Date: 2013

Publication Name: Language Resources and Evaluation

Research Interests:
Cognitive Science and Data Format

Download (.pdf)

This study focused on how human translators (HTs) performed translation task, which could contribute to a good start in designing and prototyping computer-aided translation (CAT) system. Data gathered from 20 subjects was analyzed with... more

This study focused on how human translators (HTs) performed translation task, which could contribute to a good start in designing and prototyping computer-aided translation (CAT) system. Data gathered from 20 subjects was analyzed with cognitive task analysis (CTA) technique. The user model derived from CTA was integrated into CAT system, where user modeling (UM) technique served in prototyping the adaptive and interactive system. UM would customize the properties of individual HTs with their task within the CAT system. HTs can use the help facilities available on the system to support their routine and non-routine tasks.

Publication Date: 2004

Research Interests:
Cognitive Task Analysis, Data Gathering, and user model

Publisher: paper.ijcsns.org

Publication Date: 2008

Publication Name: IJCSNS

Research Interests:
Natural language, Question Answering, Rule Extraction, and Rule Based

Download (.pdf)

During the improvement of Malay Speech Synthesizer ver2 (MSS ver2), we focused on how the selection of target syllable utterance is to be concatenated. The selection is based on the best match of phonetic context similarity between target... more

During the improvement of Malay Speech Synthesizer ver2 (MSS ver2), we focused on how the selection of target syllable utterance is to be concatenated. The selection is based on the best match of phonetic context similarity between target utterance and recorded ...

Publisher: Oriental COCOSDA

Publication Date: Dec 1, 2006

Publication Name: Citeseer

Research Interests:
Speech Synthesis and Speech Segmentation

Download (.pdf)

In this paper, we will give the update information on the existing speech synthesizer systems that our unit has, the limitation and also our future plan to enhance our system. ... Keywords Speech Synthesis, TTS engine, concatenative... more

In this paper, we will give the update information on the existing speech synthesizer systems that our unit has, the limitation and also our future plan to enhance our system. ... Keywords Speech Synthesis, TTS engine, concatenative synthesis, distortion, spectral discontinuity, ...

Publication Name: utmk.cs.usm.my

Research Interests:
Speech Synthesis and Unit Selection

Publisher: mti.ugm.ac.id

Publication Date: 2004

Research Interests:
Natural Language Processing

Download (.pdf)

This paper outlines the creation of an open combined semantic lexicon as a resource for the study of lexical semantics in the Malay languages (Malaysian and Indonesian). It is created by combining three earlier wordnets, each built using... more

This paper outlines the creation of an open combined semantic lexicon as a resource for the study of lexical semantics in the Malay languages (Malaysian and Indonesian). It is created by combining three earlier wordnets, each built using different resources and approaches: the Malay Wordnet (Lim & Hussein 2006), the Indonesian Wordnet (Riza, Budiono & Hakim 2010) and the Wordnet Bahasa (Nurril Hirfana, Sapuan & Bond 2011). The final wordnet has been validated and extended as part of sense annotation of the Indonesian portion of the NTU Multilingual Corpus (Tan & Bond 2012). The wordnet has over 48,000 concepts and 58,000 words for Indonesian and 38,000 concepts and 45,000 words for Malaysian.

Research Interests:
Indonesian Language, Language Resources, Malay Language, and Wordnet

Download (.pdf)

Publication Date: 2001

Publication Date: Nov 1, 2007

Publication Date: 2017

Publication Name: Journal of Telecommunication, Electronic and Computer Engineering

Research Interests: Engineering, Computer Science, Natural Language Processing, Speech Recognition, and hidden Markov model<div>()</div>

Publisher: Elsevier BV

Publication Date: 2017

Publication Name: Expert Systems with Applications

Publication Date: 2001

Publication Name: Webnet

Research Interests: Computer Science<div>()</div>

Publisher: Elsevier BV

Publication Date: 2016

Publication Name: Expert Systems with Applications

Research Interests: Computer Science, Artificial Intelligence, Natural Language Processing, and Mathematical Sciences<div>()</div>

Publication Date: 2006

Publication Name: 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06)

Publication Date: 2009

Publication Name: Proceeding of the 2nd ACM workshop on Social web search and mining - SWSM '09

Publication Date: 1994

Research Interests: Computer Science<div>()</div>

Publisher: mti.ugm.ac.id

Publication Date: 2004

Publication Name: Unit Terjemahan Melalui Komputer, Universiti …

Research Interests: Natural Language Processing<div>()</div>

Publisher: PeerJ

Publication Date: 2016

Research Interests: Computer Science<div>()</div>

Publication Date: 2006

Publication Name: Proceedings of the 2006 Ieee Wic Acm International Conference on Web Intelligence

Research Interests: Computer Science, Information Retrieval, Text Analysis, and Classification<div>()</div>

Publication Date: 2008

Publication Name: Research and Development in Information Retrieval

Research Interests: XML retrieval<div>()</div>

Publication Date: 2008

Research Interests: Natural language, Question Answering, Rule Extraction, and Rule Based<div>()</div>

Publication Date: 2014

Research Interests: Computer Science, Indonesian Language, Language Resources, Malay Language, and Wordnet<div>()</div>

Publisher: MTSUMMIT

Publication Date: 2001

Research Interests: Tree Structure<div>()</div>

Publication Date: 2013

Publisher: PACLIC

Publication Date: 2011

Research Interests: Computer Science<div>()</div>

Publisher: portal.acm.org

Publication Date: 2009

Publication Name: Proceedings of the …

Research Interests: Machine Translation and Shared Workspace <div>()</div>

Research Interests: Natural language, Natural Language Parsing, Bottom Up, and Top Down<div>()</div>

Publication Date: 2009

Publisher: American Scientific Publishers

Publication Name: Advanced Science Letters

Research Interests: Multidisciplinary<div>()</div>

Publication Date: Feb 1, 2002

Publisher: Centro de Innovacion y Desarrollo Tecnologico en Computo

Publication Date: 2011

Publication Name: Polibits

Research Interests: Natural Language Processing<div>()</div>

Publication Date: 2000

Publication Name: Lecture Notes in Computer Science

Publication Date: 2014

Publication Name: Proceedings of the 5th International Workshop on Web-scale Knowledge Representation Retrieval & Reasoning - Web-KR '14

Publication Date: 2014

Publication Name: Proceedings of the 5th International Workshop on Web-scale Knowledge Representation Retrieval & Reasoning - Web-KR '14

Research Interests: Sentiment Analysis<div>()</div>

Publisher: Elsevier BV

Publication Date: 2011

Publication Name: Procedia - Social and Behavioral Sciences

Research Interests: Parsing<div>()</div>

Publisher: IEEE

Publication Date: 2009

Publication Name: 2009 Oriental COCOSDA International Conference on Speech Database and Assessments

Research Interests: Natural Language Processing, Speech Recognition, Web Pages, Broadcast news, Data Extraction, and Rule Based<div>()</div>

Publication Date: 2011

Research Interests: Machine Translation, Procedia - Social and Behavioral Science, and Multilingual lexicon<div>()</div>

Publication Date: 2013

Publication Name: 2013 International Conference on Asian Language Processing

Research Interests: Natural Language Processing and Speech Recognition<div>()</div>

Research Interests:
Engineering, Computer Science, Natural Language Processing, Speech Recognition, and hidden Markov model

Research Interests:
Computer Science

Research Interests:
Computer Science, Artificial Intelligence, Natural Language Processing, and Mathematical Sciences

Research Interests:
Computer Science

Research Interests:
Natural Language Processing

Research Interests:
Computer Science

Research Interests:
Computer Science, Information Retrieval, Text Analysis, and Classification

Research Interests:
XML retrieval

Research Interests:
Natural language, Question Answering, Rule Extraction, and Rule Based

Research Interests:
Computer Science, Indonesian Language, Language Resources, Malay Language, and Wordnet

Research Interests:
Tree Structure

Research Interests:
Computer Science

Research Interests:
Machine Translation and Shared Workspace

Research Interests:
Natural language, Natural Language Parsing, Bottom Up, and Top Down

Research Interests:
Multidisciplinary

Research Interests:
Natural Language Processing

Research Interests:
Sentiment Analysis

Research Interests:
Parsing

Research Interests:
Natural Language Processing, Speech Recognition, Web Pages, Broadcast news, Data Extraction, and Rule Based

Research Interests:
Machine Translation, Procedia - Social and Behavioral Science, and Multilingual lexicon

Research Interests:
Natural Language Processing and Speech Recognition

Research Interests:
Signal Processing, Speech Synthesis, Frequency, Degradation, Concatenated codes, and Natural Languages

Research Interests:
Semantics, XML, Semantic Web, and Context Modeling

Research Interests:
Natural Language Processing, Data Analysis, and Data acquisition

Research Interests:
Cognitive Science and Data Format

Research Interests:
Cognitive Task Analysis, Data Gathering, and user model

Research Interests:
Natural language, Question Answering, Rule Extraction, and Rule Based

Research Interests:
Speech Synthesis and Speech Segmentation

Research Interests:
Speech Synthesis and Unit Selection

Research Interests:
Natural Language Processing

Research Interests:
Indonesian Language, Language Resources, Malay Language, and Wordnet