Machine Learning-Driven Models to Predict the optimum Genotype and Planting Date on yield and phytochemical Traits in Roselle (Hibiscus sabdariffa L.)

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Accurate prediction and optimization of morphological traits in Roselle are essential for enhancing crop productivity and adaptability to diverse environments. In the present study, a machine learning framework was developed using Random Forest and Multi-layer Perceptron algorithms to model and predict key morphological traits, branch number, growth period, boll number, and seed number per plant, based on genotype and planting date. The dataset was generated from a field experiment involving ten Roselle genotypes and five planting dates. Both RF and MLP exhibited robust predictive capabilities; however, RF (R² = 0.84) demonstrated superior performance compared to MLP (R² = 0.80), underscoring its efficacy in capturing the nonlinear genotype-by-environment interactions. Permutation-based feature importance analysis further revealed that planting date had a more significant impact on trait variation than genotype. To identify optimal combinations of genotype and planting date for maximizing morphological traits, the RF model was integrated with the Non-dominated Sorting Genetic Algorithm II (NSGA-II). According to the RF–NSGA-II optimization results, the optimal values for branch number (26), growth period (176 days), boll number (116), and seed number per plant (1517) were achieved with the Qaleganj population planted on May 5. Collectively, these findings highlight the potential of integrating machine learning and evolutionary optimization algorithms as powerful computational tools for crop improvement and agronomic decision-making.

Article activity feed