Single-cell RNA-seq reveals trans-sialidase-like superfamily gene expression heterogeneity in Trypanosoma cruzi populations

Curation statements for this article:
  • Curated by eLife

    eLife logo

    eLife Assessment

    This important study utilizes single-cell RNA sequencing to reveal the heterogeneity of trans-sialidase-like superfamily gene expression in Trypanosoma cruzi populations. The approach is highly convincing, as it successfully assigns cells to specific developmental forms and highlights the variability in surface protein expression among trypomastigotes. However, while the findings are solid and contribute to the understanding of immune evasion mechanisms, the study would benefit from a more detailed exploration of the regulatory factors governing trans-sialidase expression. Strengthening this aspect would further enhance its impact on researchers studying T. cruzi pathogenesis and host-parasite interactions.

This article has been Reviewed by the following groups

Read the full article See related articles

Abstract

Trypanosoma cruzi , the causative agent of Chagas disease, presents a major public health challenge in Central and South America, affecting approximately 8 million people and placing millions more at risk. The T. cruzi life cycle includes transitions between epimastigote, metacyclic trypomastigote, amastigote, and blood trypomastigote stages, each marked by distinct morphological and molecular adaptations to different hosts and environments. Unlike other trypanosomatids, T. cruzi does not employ antigenic variation but instead relies on a diverse array of cell-surface-associated proteins encoded by large multi-copy gene families (multigene families), essential for infectivity and immune evasion.This study analyzes cell-specific transcriptomes using single-cell RNA sequencing of amastigote and trypomastigote cells to characterize stage-specific surface protein expression during mammalian infection. Through clustering and identification of cell-specific markers, we assigned cells to distinct parasite developmental forms. Analysis of individual cells revealed that surface protein-coding genes, especially members of the trans-sialidase TcS superfamily (TcS), are expressed with greater heterogeneity than single-copy genes. Additionally, no recurrent combinations of TcS genes were observed between individual cells in the population. Our findings thus reveal transcriptomic heterogeneity within trypomastigote populations where each cell displays unique TcS expression profiles. Focusing on the diversity of surface protein expression, this research aims to deepen our understanding of T. cruzi cellular biology and infection strategies.

Article activity feed

  1. eLife Assessment

    This important study utilizes single-cell RNA sequencing to reveal the heterogeneity of trans-sialidase-like superfamily gene expression in Trypanosoma cruzi populations. The approach is highly convincing, as it successfully assigns cells to specific developmental forms and highlights the variability in surface protein expression among trypomastigotes. However, while the findings are solid and contribute to the understanding of immune evasion mechanisms, the study would benefit from a more detailed exploration of the regulatory factors governing trans-sialidase expression. Strengthening this aspect would further enhance its impact on researchers studying T. cruzi pathogenesis and host-parasite interactions.

  2. Reviewer #1 (Public review):

    Summary:

    The authors aimed to assess the variability in the expression of surface protein multigene families between amastigote and trypomastigote Trypanosoma cruzi, as well as between individuals within each population. The analysis presented shows higher expression of multigene family transcripts in trypomastigotes compared to amastigotes and that there is variation in which copies are expressed between individual parasites. Notably, they find no clear subpopulations expressing previously characterised trans-sialidase groups. The mapping accuracy to these multicopy genes requires demonstration to confirm this, and the analysis could be extended further to probe the features of the top expressed genes and the other multigene families also identified as variable.

    Strengths:

    The authors successfully process methanol-fixed parasites with the 10x Genomics platform. This approach is valuable for other studies where using live parasites for these methods is logistically challenging.

    Weaknesses:

    The authors describe a single experiment, which lacks controls or complementation with other approaches and the investigation is limited to the trans-sialidase transcripts.

    It would be more convincing to show either bioinformatically or by carrying out a controlled experiment, that the sequencing generated has been mapped accurately to different members of multigene families to distinguish their expression. If mapping to the multigene families is inaccurate, this will impact the transcript counts and downstream analysis.

  3. Reviewer #2 (Public review):

    Summary:

    This manuscript presents a valuable single-cell RNA-seq study on Trypanosoma cruzi, an important human parasite. It investigates the expression heterogeneity of surface proteins, particularly those from the trans-sialidase-like (TcS) superfamily, within amastigote and trypomastigote populations. The findings suggest a previously underappreciated level of diversity in TcS expression, which could have implications for understanding parasite-host interactions and immune evasion strategies. The use of single-cell approaches to delve into population heterogeneity is strong. However, the study does have some limitations that need to be addressed.

    The focus on single-cell transcriptional heterogeneity in surface proteins, especially the TcS family, in T. cruzi is novel. Given the important role of these proteins in parasite biology and host interaction, the findings have potential significance.

    Strengths:

    The key finding of heterogeneous TcS expression in trypomastigotes is well-supported. The analysis comparing multigene families, single-copy genes, and ribosomal proteins highlights the unusual nature of the variation in surface protein-coding genes.

    Weaknesses:

    While the manuscript identifies TcS heterogeneity, the functional implications of the different expression profiles remain speculative. The authors state it may reflect differences in infectivity, but no direct experimental evidence supports this.

    The manuscript lacks any functional validation of the single-cell findings. For instance, do the trypomastigote subpopulations identified based on TcS expression exhibit differences in infectivity, host cell tropism, or immune evasion? Such experiments would greatly strengthen the study.

    The authors identify a subpopulation of TcS genes that are highly expressed in many cells. However, it is unclear if these correspond to previously characterized TcS members with specific functions.
    The authors hypothesize that observed heterogeneity may relate to chromatin regulation. However, the study does not directly address these mechanisms. There are interesting connections to be made with what they identify as the colocalization of genes within chromatin folding domains, but the authors do not fully explore this. It would be insightful to address these mechanisms in future work.

    The merging of technical replicates needs further justification and explanation as they were not processed through separate experimental conditions. While barcodes were retained, it would be informative to know how well each technical replicate corresponds with the other. If both datasets were sequenced on the same lane, the inclusion of technical replicates adds noise to the analysis.
    While the number of cells sequenced (3192) seems reasonable, it's not clear how much the conclusions are affected by the depth of sequencing. A more detailed description of the sequencing depth and its impact on gene detection would be valuable.

    While most of the methods are clear, the way in which the subsampled gene lists were generated could be more thoroughly described, as some details are not clear for the subsampling of single-copy genes.

    Some of the figures are difficult to interpret. For example, the color scaling in the heatmap of Supplementary Figure 3B is not self-explanatory and it is hard to extract meaningful conclusions from the graph.

  4. Reviewer #3 (Public review):

    The study aimed to address a fundamental question in T. cruzi and Chagas disease biology - how much variation is there in gene expression between individual parasites? This is particularly important with respect to the surface protein-encoding genes, which are mainly from massive repetitive gene families with 100s to 1000s of variant sequences in the genome. There is very little direct evidence for how the expression of these genes is controlled. The authors conducted a single-cell RNAseq experiment of in vitro cultured parasites with a mixture of amastigotes and trypomastigotes. Most of the analysis focused on the heterogeneity of gene expression patterns amongst trypomastigotes. They show that heterogeneity was very high for all gene classes, but surface-protein encoding genes were the most variable. In the case of the trans-sialidase gene family, many sequence variants were only detected in a small minority of parasites. The biology of the parasite (e.g. extensive post-transcriptional regulation) and potential technical caveats (e.g. high dropout rates across the genome) make it difficult to infer what this might mean for actual protein expression on the parasite surface.

    (1) Limit of detection and gene dropouts

    An average of ~1100 genes are detected per parasite which indicates a dropout rate of over 90%. It appears that RNA for the "average" single copy 'core' gene is only detected in around 3% of the parasites sampled (Figure 2c: ~100 / 3192). This may be comparable with some other trypanosome scRNAseq studies, but this still seems to be a major caveat to the interpretation that high cell-to-cell variability in gene expression is explained by biological rather than technical factors. The argument would be more convincing if the dropout rates and expression heterogeneity were minimal for well-known highly expressed genes e.g. tubulin, GAPDH, and ribosomal RNAs. Admittedly, in their Final Remarks, the authors are very cautious in their interpretation, but it would be good to see a more thorough discussion of technical factors that might explain the low detection rates and how these could be tested or overcome in future work.

    (2) Heterogeneity across the board

    The authors focus on the relative heterogeneity in RNA abundance for surface proteins from the multicopy gene families vs core genes. While multicopy gene sequences do show more cell-to-cell variability, the differences (Figure 2D) are roughly average Gini values of 0.99 vs 0.97 (single copy) or 0.95 (ribosomal). Other studies that have applied similar approaches in other systems describe Gini values of < 0.2-0.25 for evenly expressed "housekeeping" genes (PMIDs 29428416, 31784565). Values observed here of >0.9 indicate that the distribution for all gene classes is extremely skewed and so the biological relevance of the comparison is uncertain.

    Nevertheless, this study does provide some tantalising evidence that the expression of surface genes may vary substantially between individual parasites in a single clonal population. The study is also amongst the very first to apply scRNAseq to T. cruzi, so the broader data set will be an important resource for researchers in the field.

  5. Author response:

    We sincerely thank all three reviewers for their time, comments, and valuable suggestions, which will help improve our manuscript. Below, we provide preliminary remarks addressing some of the key issues that have been raised.

    Reviewer 1:

    We agree with the reviewer on the challenge of accurately mapping reads to multigene families. We carefully considered this issue and addressed it by evaluating the performance of multiple aligners using simulated RNA-seq reads. Our results indicate that kallisto performs particularly well in this context, outperforming widely used aligners such as Bowtie2 and STAR. This is likely due to kallisto’s expectation-maximization (EM) algorithm (described in the Materials and Methods section), which employs a probabilistic model to assign reads from similar transcripts. Previous studies have demonstrated the effectiveness of this approach in quantifying highly repetitive sequences, such as transposons (doi.org/10.1093/bioinformatics/btv422). In the revised manuscript, we are considering the inclusion of a supplementary figure to further support the selection of the mapping algorithm.

    Reviewer 2:

    We believe that obtaining experimental evidence on the influence of multiple multigene families would represent a significant advancement in the field. However, we would like to emphasize that this is a short communication centered on a specific and biologically relevant observation within a single multigene family. The manuscript is not intended to comprehensively address all aspects of the experiment but rather to highlight what we consider an important biological phenomenon with potential functional implications.

    The influence of phenotypic heterogeneity and its possible advantages under environmental pressures has been previously proposed for Trypanosoma cruzi, related trypanosomatids, and other biological systems, ranging from bacteria to tumors (Seco-Hidalgo 2015, doi: 10.1098/rsob.150190 and Luzak 2021, doi: 10.1146/annurev-micro-040821-012953, for a comprehensive review on this topic). While the reviewer is correct in noting that our model does not demonstrate a functional role for TcS heterogeneity, the experimental approaches required to address this question in a large multigene family are highly complex and beyond the scope of this study. However, we acknowledge the importance of clarifying that the proposed functional implications remain speculative, so we will revise the manuscript accordingly.

    As the reviewer suggests, in the revised version of the manuscript, we will include additional analyses on the characteristics of frequently expressed TcS genes to identify common features that may explain their expression patterns.

    We appreciate the reviewer’s comments and suggestions regarding the clarity of methodological choices and the explanation of key concepts. Accordingly, we will refine the description of our methodology and ensure that our figures are more intuitive and self-explanatory.

    Reviewer 3:

    We recognize the limitations imposed by gene dropout in our data, as highlighted by the reviewer. In the manuscript, we have aimed to be transparent about this issue and discussed its impact in two separate sections (lines 110–121 and 175–181). To enhance clarity, we will revise these paragraphs to provide a more comprehensive discussion of this limitation. Unfortunately, gene dropout is an inherent limitation of 10x genomics data. Trypanosomatids are not an exception in this regard, and the general metrics of the single-cell RNA-seq data in other reports are equivalent to those obtained in our experiment.

    Despite this important limitation, we believe that our comparative analyses (the contrast between TcS and ribosomal protein expression) provide valuable insights into a biological phenomenon with potential functional relevance for the parasite. Furthermore, we are actively working on generating single-cell RNA-seq data using alternative methodologies that improve gene dropout rates. We anticipate that these future studies will help clarify the extent of the phenomenon described in this work.