Machine Learning-Based Prediction of Soybean Plant Height from Agronomic Traits Across Sequential Harvests
Abstrak
The accurate prediction of plant height is crucial for optimizing soybean cultivar selection and improving yield estimations. In this study, we investigate the potential of machine learning (ML) algorithms to predict soybean plant height (PH) based on a diverse set of agronomic parameters analyzed from forty soybean cultivars evaluated across sequential harvests. Using a comprehensive dataset, the models Elastic Net (EN), Extra Trees (ET), Gaussian Process Regressor (GPR), K-Nearest Neighbors, and XGBoost (XGB) were compared in terms of predictive accuracy, uncertainty, and robustness. Our results demonstrate that ET outperformed other models with an average correlation coefficient of 0.674, <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mi>R</mi><mn>2</mn></msup></semantics></math></inline-formula> of 0.426 and the lowest RMSE of 6.859 cm and MAE of 5.361 cm, while also showing the lowest uncertainty (5.07%). The proposed ML framework includes an extensive model evaluation pipeline that incorporates the Performance Index (PI), ANOVA, and feature importance analysis, providing a multidimensional perspective on model behavior. The most influential features for PH prediction were the number of stems (NS) and insertion of the first pod (IFP). This research highlights the viability of integrating explainable ML techniques into agricultural decision support systems, enabling data-driven strategies for cultivar evaluation and phenotypic trait forecasting.
Topik & Kata Kunci
Penulis (7)
Bruno Rodrigues de Oliveira
Renato Lustosa Sobrinho
Fernando Rodrigues Trindade Ferreira
Fernando Ferrari Putti
Matteo Bodini
Camila Martins Saporetti
Leonardo Goliatt
Akses Cepat
PDF tidak tersedia langsung
Cek di sumber asli →- Tahun Terbit
- 2025
- Sumber Database
- DOAJ
- DOI
- 10.3390/agriengineering7120408
- Akses
- Open Access ✓