[PDF][PDF] Fast methods for kernel-based text analysis

T Kudo, Y Matsumoto - Proceedings of the 41st Annual Meeting of …, 2003 - aclanthology.org
Proceedings of the 41st Annual Meeting of the Association for …, 2003aclanthology.org
Kernel-based learning (eg, Support Vector Machines) has been successfully applied to
many hard problems in Natural Language Processing (NLP). In NLP, although feature
combinations are crucial to improving performance, they are heuristically selected. Kernel
methods change this situation. The merit of the kernel methods is that effective feature
combination is implicitly expanded without loss of generality and increasing the
computational costs. Kernel-based text analysis shows an excellent performance in terms in …
Abstract
Kernel-based learning (eg, Support Vector Machines) has been successfully applied to many hard problems in Natural Language Processing (NLP). In NLP, although feature combinations are crucial to improving performance, they are heuristically selected. Kernel methods change this situation. The merit of the kernel methods is that effective feature combination is implicitly expanded without loss of generality and increasing the computational costs. Kernel-based text analysis shows an excellent performance in terms in accuracy; however, these methods are usually too slow to apply to large-scale text analysis. In this paper, we extend a Basket Mining algorithm to convert a kernel-based classifier into a simple and fast linear classifier. Experimental results on English BaseNP Chunking, Japanese Word Segmentation and Japanese Dependency Parsing show that our new classifiers are about 30 to 300 times faster than the standard kernel-based classifiers.
aclanthology.org
Showing the best result for this search. See all results