Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/41453
| Title: | Performance Evaluation of NLP Models for European Portuguese: Multi-GPU/Multi-node Configurations and Optimization Techniques |
| Authors: | Santos, Daniel; Miquelina, Nuno; Schmidt, Daniela; Quaresma, Paulo; Nogueira, Vítor Beires |
| Keywords: | NLP; Model Evaluation; Distributed Training |
| Issue Date: | 17-Feb-2025 |
| Publisher: | Springer |
| Citation: | Santos, D., Miquelina, N., Schmidt, D., Quaresma, P., Nogueira, V.B. (2025). Performance Evaluation of NLP Models for European Portuguese: Multi-GPU/Multi-node Configurations and Optimization Techniques. In: Zhu, T., Li, J., Castiglione, A. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2024. Lecture Notes in Computer Science, vol 15256. Springer, Singapore. https://doi.org/10.1007/978-981-96-1551-3_20 |
| Abstract: | Natural Language Processing (NLP) research has predominantly focused on the English language, leading to a wealth of resources and advancements tailored to English. However, there is a growing need to extend these capabilities to other languages, such as European Portuguese, to ensure the inclusivity and accessibility of NLP technologies. In this study, we explore the evaluation of NLP models in the European Portuguese language using a multi-GPU/multi-node machine. We utilized various tools such as PyTorch, Accelerate, Transformers, and DeepSpeed with ZeRO Stage 3 to handle the computational demands of large-scale model training. We provide all the key aspects of our methodology to evaluate various models on translated GLUE tasks. Additionally, we introduce AiBERTa, a base model with 110 million parameters, developed and pre-trained on a corpus tailored for European Portuguese. This research highlights the effectiveness of advanced tools and distributed computing in scaling NLP model training, providing a foundation for future enhancements in European Portuguese language processing. |
| URI: | https://link.springer.com/chapter/10.1007/978-981-96-1551-3_20 ; http://hdl.handle.net/10174/41453 |
| Type: | article |
| Appears in Collections: | VISTALab - Artigos em Livros de Actas/Proceedings |