Please use this identifier to cite or link to this item: http://hdl.handle.net/10174/28061

Title: Linguistic and orthographical classic Portuguese variants. Challenges for NLP
Authors: Cameron, Helena
Gonçalves, Maria Filomena
Quaresma, Paulo
Editors: Finatto, Maria José
Vieira, Renata
Pollak, Senja
Luz, Saturnino
Keywords: Classical Portuguese
NLP
Issue Date: Mar-2020
Publisher: CEUR-WP org.
Citation: Cameron, Helena Freire; Gonçalves, Maria Filomena; Quaresma, Paulo (2020): "Linguistic and orthographical classic Portuguese variants. Challenges for NLP". In: Maria José Finatto, Renata Vieira, Senja Pollak and Saturnino Luz (ed.), Proceedings of the Workshop on Digital Humanities and Natural Language Processing, co-located with International Conference on the Computational Processing of Portuguese (PROPOR 2020), vol. 2607. Évora (Portugal): CEUR-WP org, 43-48.
Abstract: In recent times, it was made a great investment in transfer from physical ancient Portuguese texts to digital support. This support transfer allows not only the access to the texts, bringing them to the public in general, but also the possibility of texts to be readable and processed by machines. NLP tools are addressed, mainly, to contemporary Portuguese and the application of NLP to classic texts has several difficulties. The elaboration of big lexical corpora of forms previous to modern Portuguese is an opportunity for multidisciplinary field of studies allowing the enlargement of linguistic studies and also the possibility of obtaining, by NLP, validated corpora, collections and ontologies, that can be input in NLP tools for ancient Portuguese texts. In this work we will present, briefly, the problem of lexical variation of forms in processing classic Portuguese texts, the challenges that emerge from them and future perspectives of work.
URI: http://ceur-ws.org/Vol-2607/short1.pdf
http://hdl.handle.net/10174/28061
ISSN: 1613-0073
Type: article
Appears in Collections:LLT - Artigos em Livros de Actas/Proceedings
CIDEHUS - Artigos em Livros de Actas/Proceedings

Files in This Item:

File Description SizeFormat
Cameron et al. short1.pdf503 kBAdobe PDFView/Open
FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Dspace Dspace
DSpace Software, version 1.6.2 Copyright © 2002-2008 MIT and Hewlett-Packard - Feedback
UEvora B-On Curriculum DeGois