Rapid and Non-destructive Evaluation of Soybean Seed Viability Using Transmission Hyperspectral Imaging and Ensemble Learning

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Traditional soybean seed viability assessment methods are destructive, time-intensive, and incapable of rapid, non-destructive single-seed grading. To overcome these limitations, this study proposes a novel approach integrating Transmission Hyperspectral Imaging (THSI) with ensemble learning for rapid, non-destructive evaluation. Naturally aged soybean seeds were analyzed using full-spectrum (400–2500 nm) transmittance data to capture deep physiological information. Multiple datasets were constructed by comparing preprocessing techniques—including Smoothing, First Derivative (FD), Hilbert Transform (HT), Savitzky–Golay, Multiplicative Scatter Correction (MSC), and Standard Normal Variate (SNV)—with dimensionality reduction algorithms such as Principal Component Analysis (PCA), Successive Projections Algorithm (SPA), and Competitive Adaptive Reweighted Sampling (CARS). The proposed Stacking ensemble model integrates predictions from Random Forest (RF), Support Vector Machine (SVM), Naïve Bayes (NB), k-Nearest Neighbors (k-NN), and Logistic Regression (LR), achieving 98.33% accuracy and an F1-score of 98.12% on the CARS-HT dataset—significantly outperforming individual classifiers in both accuracy and robustness. Furthermore, the model identified 17 critical wavelengths that reveal physiological mechanisms ranging from chlorophyll degradation and antioxidant balance in the visible spectrum to water migration and lipid peroxidation in the infrared region. The method's high precision and reliability were validated, providing robust technical support for intelligent and precise soybean seed quality management.

Article activity feed