Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/2439
|
| Title: | Using graph-kernels to represent semantic information in text classification |
| Authors: | Gonçalves, Teresa Quaresma, Paulo |
| Keywords: | graph-kernels text classification machine learning |
| Issue Date: | Jul-2009 |
| Publisher: | Springer-Verlag |
| Abstract: | Most text classification systems use bag-of-words represen- tation of documents to find the classification target function. Linguistic structures such as morphology, syntax and semantic are completely ne- glected in the learning process.
This paper proposes a new document representation that, while includ- ing its context independent sentence meaning, is able to be used by a structured kernel function, namely the direct product kernel. The proposal is evaluated using a dataset of articles from a Portuguese daily newspaper and classifiers are built using the SVM algorithm. The results show that this structured representation, while only partially de- scribing document’s significance has the same discriminative power over classes as the traditional bag-of-words approach. |
| URI: | http://hdl.handle.net/10174/2439 |
| Type: | article |
| Appears in Collections: | INF - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|