Please use this identifier to cite or link to this item: http://hdl.handle.net/10174/41453

Title: Performance Evaluation of NLP Models for European Portuguese: Multi-GPU/Multi-node Configurations and Optimization Techniques
Authors: Santos, Daniel
Miquelina, Nuno
Schmidt, Daniela
Quaresma, Paulo
Nogueira, Vítor Beires
Keywords: NLP
Model Evaluation
Distributed Training
Issue Date: 17-Feb-2025
Publisher: Springer
Citation: Santos, D., Miquelina, N., Schmidt, D., Quaresma, P., Nogueira, V.B. (2025). Performance Evaluation of NLP Models for European Portuguese: Multi-GPU/Multi-node Configurations and Optimization Techniques. In: Zhu, T., Li, J., Castiglione, A. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2024. Lecture Notes in Computer Science, vol 15256. Springer, Singapore. https://doi.org/10.1007/978-981-96-1551-3_20
Abstract: Natural Language Processing (NLP) research has predominantly focused on the English language, leading to a wealth of resources and advancements tailored to English. However, there is a growing need to extend these capabilities to other languages, such as European Portuguese, to ensure the inclusivity and accessibility of NLP technologies. In this study, we explore the evaluation of NLP models in the European Portuguese language using a multi-GPU/multi-node machine. We utilized various tools such as PyTorch, Accelerate, Transformers, and DeepSpeed with ZeRO Stage 3 to handle the computational demands of large-scale model training. We provide all the key aspects of our methodology to evaluate various models on translated GLUE tasks. Additionally, we introduce AiBERTa, a base model with 110 million parameters, developed and pre-trained on a corpus tailored for European Portuguese. This research highlights the effectiveness of advanced tools and distributed computing in scaling NLP model training, providing a foundation for future enhancements in European Portuguese language processing.
URI: https://link.springer.com/chapter/10.1007/978-981-96-1551-3_20
http://hdl.handle.net/10174/41453
Type: article
Appears in Collections: VISTALab - Artigos em Livros de Actas/Proceedings

Files in This Item:

File: ICA3PP_2024.pdf
Size: 409.43 kB
Format: Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 
