Genomic and phenomic prediction for soybean seed yield, protein, and oil

Liza Van der Laan
Kyle Parmley
Mojdeh Saadati
Hernan Torres Pacin
Srikanth Panthulugiri
Soumik Sarkar
Baskar Ganapathysubramanian
Aaron Lorenz
Asheesh K. Singh


Abstract

Developments in genomics and phenomics have provided valuable tools for use in cultivar development. Genomic prediction (GP) has been used in commercial soybean [Glycine max L. (Merr.)] breeding programs to predict grain yield and seed composition traits. Phenomic prediction (PP) is a rapidly developing field that holds the potential to be used for the selection of genotypes early in the growing season. The objective of this study was to compare the performance of GP and PP for predicting soybean seed yield, protein, and oil. We additionally conducted genome‐wide association studies (GWAS) to identify significant single‐nucleotide polymorphisms (SNPs) associated with the traits of interest. The GWAS panel of 292 diverse accessions was grown in six environments in replicated trials. Spectral data were collected at two time points during the growing season. A genomic best linear unbiased prediction (GBLUP) model was trained on 269 accessions, while three separate machine learning (ML) models were trained on vegetation indices (VIs) and canopy traits. We observed that PP had a higher correlation coefficient than GP for seed yield, while GP had higher correlation coefficients for seed protein and oil contents. VIs with high feature importance were used as covariates in a new GBLUP model, and a new random forest model was trained with the inclusion of selected SNPs. These models did not outperform the original GP and PP models. These results show the capability of using ML for in‐season predictions for specific traits in soybean breeding and provide insight into incorporating PP and GP into breeding programs.
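For readers unfamiliar with GBLUP, the following is a minimal sketch of the general technique, not the authors' implementation. It builds a genomic relationship matrix with VanRaden's first method, predicts held-out accessions with a fixed variance ratio, and scores the prediction by Pearson correlation. Everything here is an assumption made for illustration: the marker data are simulated, the function names (`vanraden_grm`, `gblup_predict`) are hypothetical, the variance ratio `lam` is fixed rather than estimated (e.g. by REML), and the 269/23 split merely echoes the panel and training sizes quoted in the abstract.

```python
import numpy as np

def vanraden_grm(M):
    """Genomic relationship matrix (VanRaden method 1) from a 0/1/2 marker matrix."""
    p = M.mean(axis=0) / 2.0              # allele frequency of each SNP
    Z = M - 2.0 * p                       # centre every marker column
    denom = 2.0 * np.sum(p * (1.0 - p))   # scales G to the expected variance
    return Z @ Z.T / denom

def gblup_predict(G, y, train, test, lam=1.0):
    """Predict genotypic values of test lines from training phenotypes.

    lam is an ASSUMED ratio sigma_e^2 / sigma_g^2 (1.0 implies h^2 = 0.5).
    """
    mu = y[train].mean()
    G_tt = G[np.ix_(train, train)]        # training-by-training relationships
    G_st = G[np.ix_(test, train)]         # test-by-training relationships
    alpha = np.linalg.solve(G_tt + lam * np.eye(len(train)), y[train] - mu)
    return mu + G_st @ alpha

def pearson_r(a, b):
    """Pearson correlation between predicted and observed values."""
    return float(np.corrcoef(a, b)[0, 1])

# Simulated data only: 292 lines, 500 SNPs, heritability of about 0.5.
rng = np.random.default_rng(0)
n_lines, n_snps = 292, 500
M = rng.integers(0, 3, size=(n_lines, n_snps)).astype(float)
effects = rng.normal(0.0, 1.0, n_snps)
g = (M - M.mean(axis=0)) @ effects
y = g + rng.normal(0.0, g.std(), n_lines)

# Hypothetical split: 269 training lines, the remaining 23 held out.
idx = rng.permutation(n_lines)
train, test = idx[:269], idx[269:]

G = vanraden_grm(M)
y_hat = gblup_predict(G, y, train, test)
r = pearson_r(y_hat, y[test])
print(f"predictive correlation on held-out lines: {r:.2f}")
```

The same `pearson_r` score could be applied to predictions from a phenomic model trained on vegetation indices, which is the kind of GP versus PP comparison the abstract reports.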

Version published to 10.1002/tpg2.70002 (Feb 19, 2025)
Version published to 10.1101/2024.11.01.621550 on bioRxiv (Nov 2, 2024)

Related articles

SNP/InDel-Based GWAS Reveals QTNs and Candidate Genes for Seed Oil and Protein Content in Northern China Soybean Core Accessions

This article has 11 authors:
1. Dequan Liu
2. Jian Chen
3. Liantai Su
4. Mingwei Duan
5. Hao Li
6. Yunlong Hou
7. Zhengguo Cui
8. Liang Chen
9. Fuxin Li
10. Hongmei Qiu
11. Yueqiang Wang
This article has no evaluations. Latest version: Dec 19, 2025

Genetic Diversity and Population Structure of Soybean (Glycine max L.) through Integrated Morphological and SSR Analyses

This article has 15 authors:
1. Sabeena Riasat
2. Farah Rasheed
3. Raheela Rehman
4. Rida Kanwal
5. Muhammad Hassaan Khan
6. Haris Moaz
7. Muhammad Atif
8. Usman Irfan
9. Abdul Waheed
10. Ali Haider
11. Muhammad Moaz Latif
12. Fatima Tul Kubra
13. Hameed Alsamadany
14. Hind A.S. Alzahrani
15. Zaheer Ahmed
This article has no evaluations. Latest version: Dec 15, 2025

Realised genetic gain and predictive ability across breeding cycles for cooking time in common bean based on multivariate genomic selection

This article has 10 authors:
1. Renu Saradadevi
2. Winnyfred Amongi
3. Clare Mukankusi
4. Allan Male
5. Jean-Claude Rubyogo
6. Eric Huttner
7. Felipe Castro-Urrea
8. Li Li
9. Kadambot Siddique
10. Wallace Cowling
This article has no evaluations. Latest version: Dec 18, 2025
