Artificial Neural Network Reveals the Role of Transport Proteins in Rhodopseudomonas palustris CGA009 During Lignin Breakdown Product Catabolism
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Rhodopseudomonas palustris , a versatile bacterium with diverse biotechnological applications, can effectively breakdown lignin, a complex and abundant polymer in plant biomass. This study investigates the metabolic response of R. palustris when catabolizing various lignin breakdown products (LBPs), including the monolignols p -coumaryl alcohol, coniferyl alcohol, sinapyl alcohol, p -coumarate, sodium ferulate, and kraft lignin. Transcriptomics and proteomics data were generated for those specific LBP breakdown conditions and used as features to train machine learning models, with growth rates as the target. Three models—Artificial Neural Networks (ANN), Random Forest (RF), and Support Vector Machine (SV)—were compared, with ANN achieving the highest predictive accuracy for both transcriptomics (94%) and proteomics (96%) datasets. Permutation feature importance analysis of the ANN models identified the top twenty genes and proteins influencing growth rates. Combining results from both transcriptomics and proteomics, eight key transport proteins were found to significantly influence the growth of R. palustris on LBPs. Re-training the ANN using only these eight transport proteins achieved predictive accuracies of 86% and 76% for proteomics and transcriptomics, respectively. This work highlights the potential of ANN-based models to predict growth-associated genes and proteins, shedding light on the metabolic behavior of R. palustris in lignin degradation under aerobic and anaerobic conditions.
Importance
This study is significant as it addresses the biotechnological potential of Rhodopseudomonas palustris in lignin degradation, a key challenge in converting plant biomass into commercially important products. By training machine learning models with transcriptomics and proteomics data, particularly Artificial Neural Networks (ANN), the work achieves high predictive accuracy for growth rates on various lignin breakdown products (LBPs). Identifying top genes and proteins influencing growth, especially eight key transport proteins, offers insights into the metabolic niche of R. palustris . The ability to predict growth rates using just these few proteins highlights the efficiency of ANN models in distilling complex biological systems into manageable predictive frameworks. This approach not only enhances our understanding of lignin derivative catabolism but also paves the way for optimizing R. palustris for sustainable bioprocessing applications, such as bioplastic production, under varying environmental conditions.