Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/7592
|
Title: | Tagging with Disambiguation Rules - A New Evolutionary Approach to the Part-of-Speech Tagging Problem |
Authors: | Silva, Ana Paula Silva, Arlindo Rodrigues, Irene Pimenta |
Editors: | Rosa, Agostinho C. Correia, António Dourado Madani, Kurosh Filipe, Joaquim Kacprzyk, Janusz |
Keywords: | natural language processing tools artificial intelligence local search |
Issue Date: | Oct-2012 |
Publisher: | SciTePress |
Citation: | Ana Paula Silva, Arlindo Silva, Irene Rodrigues: Tagging with Disambiguation Rules - A New Evolutionary Approach to the Part-of-Speech Tagging Problem. 5-14. IJCCI 2012 |
Abstract: | {In this paper we present an evolutionary approach to the part-of-speech tagging problem. The goal of part-of-speech tagging is to assign to each word of a text its part-of-speech. The task is not straightforward, because a large percentage of words has more than one possible part-of-speech, and the right choice is determined by the surrounding word's part-of-speeches. This means that to solve this problem we need a method to disambiguate a word's possible tags set. Traditionally there are two groups of methods used to tackle this task. The first group is based on statistical data concerning the different context's possibilities for a word, while the second group is based on rules that capture the language properties. Normally these rules are designed by human experts. In this work we present a solution that tries to incorporate both these approaches. The proposed system is divided in two components. First, we use an evolutionary algorithm that for each part-of-speech tag of the training corpus, evolves a set of disambiguation rules. We then use a second evolutionary algorithm to solve the tagging problem. In this component the evolution is guided by the rules found earlier. |
URI: | http://hdl.handle.net/10174/7592 |
Type: | bookPart |
Appears in Collections: | INF - Publicações - Capítulos de Livros
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|