Named entity recognition in Vietnamese using classifier voting

PTX Thao, TQ Tri, D Dien, N Collier - ACM Transactions on Asian …, 2007 - dl.acm.org
PTX Thao, TQ Tri, D Dien, N Collier
ACM Transactions on Asian Language Information Processing (TALIP), 2007dl.acm.org
Named entity recognition (NER) is one of the fundamental tasks in natural-language
processing (NLP). Though the combination of different classifiers has been widely applied in
several well-studied languages, this is the first time this method has been applied to
Vietnamese. In this article, we describe how voting techniques can improve the performance
of Vietnamese NER. By combining several state-of-the-art machine-learning algorithms
using voting strategies, our final result outperforms individual algorithms and gained an F …
Named entity recognition (NER) is one of the fundamental tasks in natural-language processing (NLP). Though the combination of different classifiers has been widely applied in several well-studied languages, this is the first time this method has been applied to Vietnamese. In this article, we describe how voting techniques can improve the performance of Vietnamese NER. By combining several state-of-the-art machine-learning algorithms using voting strategies, our final result outperforms individual algorithms and gained an F-measure of 89.12. A detailed discussion about the challenges of NER in Vietnamese is also presented.
ACM Digital Library