Please use this identifier to cite or link to this item: http://hdl.handle.net/10174/32684
Title: | Characterization of Portuguese sown rainfed grasslands using remote sensing and machine learning |
Authors: | Morais, Tiago; Jongen, Marjan; Tufik, Camila; Rodrigues, Nuno; Gama, Ivo; Fangueiro, David; Serrano, João; Vieira, Susana; Domingos, Tiago; Teixeira, Ricardo |
Editors: | Stafford, John V.; Lowenberg-DeBoer, James M. |
Keywords: | Sentinel-2; Multiple linear regression; LASSO; Ridge; XGBoost; LightGBM; Random forests; Cross-validation |
Issue Date: | 27-Jul-2022 |
Publisher: | Springer |
Citation: | Morais, T.G.,...Serrano, J., et al. (2022). Characterization of Portuguese sown rainfed grasslands using remote sensing and machine learning. Precision Agriculture. https://doi.org/10.1007/s11119-022-09937-9 |
Abstract: | Grasslands are crucial ecosystems that support and provide a diverse range of ecosystem services. Sown biodiverse pastures rich in legumes (SBP) were developed with the main goal of increasing grassland production while minimizing fertilizer inputs. In this paper, the main properties of SBP in Portugal were estimated using remote sensing and machine learning on six farms over two production years (spring 2018 and 2019). Four pasture characteristics were considered: aboveground standing biomass, fraction of legumes, plant nitrogen (N) content and plant phosphorus (P) content. Remote sensing data were obtained from Sentinel-2. The spectral bands were combined with 5 vegetation indices and 9 covariates as predictors. Multiple linear regression, LASSO, Ridge, random forests, XGBoost and LightGBM regression models were used. Two cross-validation approaches were used: (1) a random approach with random selection of the folds (RN-CV), and (2) a structured approach where each fold is a unique combination of farm and year, which is subsequently used to assess the performance of the model obtained with the 8 other folds (LLYO-CV). Results showed that the random forest method had the best estimation accuracy for all pasture characteristics. Regarding cross-validation approaches, the algorithms with RN-CV had higher estimation accuracy for all pasture characteristics (on average about 10% lower RMSE and an R² 85% higher) than the algorithms with LLYO-CV. However, LLYO-CV should avoid overfitting and improve generalization of the models, because in each fold the model is tested on a farm and year that were not used for training. The RMSE was low for all variables, especially with RN-CV. Plant P is the variable where the choice of CV approach has the least influence (RMSE of test set with RN-CV: 0.71 g P kg⁻¹; LLYO-CV: 0.72 g P kg⁻¹). Standing biomass is the variable with the largest difference between CV approaches (RN-CV: 722 kg ha⁻¹; LLYO-CV: 825 kg ha⁻¹). The RMSE values of legume fraction and plant N were moderately affected by the CV approach (legumes, RN-CV: 0.11; LLYO-CV: 0.12; plant N, RN-CV: 3.96 g N kg⁻¹; LLYO-CV: 3.99 g N kg⁻¹). The algorithms developed here were applied to entire parcels in the two farms with the most different climate conditions, as a demonstration of their potential future use for precision farming. |
URI: | http://hdl.handle.net/10174/32684 |
Type: | article |
Appears in Collections: | ERU - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica; MED - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica |
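
The structured cross-validation described in the abstract (LLYO-CV), where each fold holds out one unique farm and year combination, can be illustrated with a minimal sketch. This is not the authors' code: the function name `leave_one_group_out`, the farm labels and the sample list are hypothetical, and only the Python standard library is used.

```python
from collections import defaultdict


def leave_one_group_out(groups):
    """Yield (held_out_group, train_idx, test_idx), one fold per unique group.

    Here a "group" is a (farm, year) pair, so every test fold contains
    only samples from a farm and year that are absent from training.
    """
    by_group = defaultdict(list)
    for i, g in enumerate(groups):
        by_group[g].append(i)
    for g in sorted(by_group):
        test_idx = by_group[g]
        held_out = set(test_idx)
        train_idx = [i for i in range(len(groups)) if i not in held_out]
        yield g, train_idx, test_idx


# Hypothetical sample labels (assumption): one (farm, year) tag per field sample.
samples = [
    ("farm_A", 2018), ("farm_A", 2018), ("farm_A", 2019),
    ("farm_B", 2018), ("farm_B", 2019), ("farm_B", 2019),
]

folds = list(leave_one_group_out(samples))
for group, train_idx, test_idx in folds:
    print(group, "train:", train_idx, "test:", test_idx)
```

A random split (RN-CV) would instead shuffle the sample indices before assigning folds, so samples from the same farm and year can appear in both training and test sets. That overlap may explain part of the lower RMSE reported for RN-CV.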
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.