Functional characterization of a multi-cancer risk locus on chromosome band 2q33.1 near CASP8
Curation statements for this article:-
Curated by eLife
eLife Assessment
This important study by Kong et al. systematically and rigorously dissects the gene regulatory network underlying melanoma and breast cancer risk at the multi-cancer 2q33 locus. The authors provide compelling evidence that rs3769823 is a key functional variant that acts through allele-preferential binding of the transcription factors E4F1 and IRF2 to regulate CASP8 and FLACC1 in a cell-type-specific manner. The work makes a significant contribution to understanding the mechanisms operating at multi-cancer risk loci.
This article has been Reviewed by the following groups
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
- Evaluated articles (eLife)
Abstract
Genome-wide association studies (GWAS) of melanoma have identified numerous susceptibility loci. However, causal genes and variants underlying risk have yet to be established for most. It is becoming apparent that many functional variants underlying complex traits act via cis-regulation that may be context-specific, dependent on availability of specific transcription factors/complexes in specific cell types and cell-states. To characterize a risk locus on chromosome band 2q33.1 associated with melanoma, breast cancer, and keratinocyte cancers, we integrated fine-mapping, cell-type specific expression quantitative trait locus (eQTL) analysis, a massively parallel reporter assay, individual luciferase assays, and SNP-based proteomics. Integrated analysis implicates the presence of multiple functional variants lying primarily within a promoter for CASP8. A haplotype containing rs3769823 appeared have the largest effect on expression. Strikingly, both tumor/normal context and this risk-associated haplotype play critical roles in mediating allelic cis-regulatory activity. Quantitative mass spectrometry for rs3769823 identified both E4F1, a transcriptional repressor, and IRF2, a transcriptional activator, as binding preferentially to risk-associated rs3969823-A. The binding of these transcription factors was validated via EMSA, supershift, and chromatin immunoprecipitation (ChIP) assays. The relative levels of E4F1 and IRF2 differ by cell-type and play a role in mediating transcriptional activity in a cell-type specific manner. Our results indicate that the top credible causal set variant rs3769823 likely influences expression of CASP8 and FLACC1 in a cell-type specific manner and may be a relevant functional variant for multiple cancers associated with this locus.
Article activity feed
-
eLife Assessment
This important study by Kong et al. systematically and rigorously dissects the gene regulatory network underlying melanoma and breast cancer risk at the multi-cancer 2q33 locus. The authors provide compelling evidence that rs3769823 is a key functional variant that acts through allele-preferential binding of the transcription factors E4F1 and IRF2 to regulate CASP8 and FLACC1 in a cell-type-specific manner. The work makes a significant contribution to understanding the mechanisms operating at multi-cancer risk loci.
-
Reviewer #1 (Public review):
Summary:
In this manuscript, Kong et al. conduct a systematic analysis of the multi-cancer risk locus at 2q33. The authors start with a careful analysis of co-localization between the melanoma risk SNPs and several other cancers and conclude that a subset of credible causal SNPs is shared among different cancers, including breast cancer. Next, they define a starting list of 27 SNPs as potential credible causal SNPs and analysis of TADs (topologically associating domains) to zoom in on CASP8 and FLA CC1 as potential target genes. They then systematically rule out coding and splicing variants in the set and focus on a smaller set of three SNPs constituting a melanocyte enhancer element. Using a combination of mass spectrometry, reporter assays, and electrophoretic mobility shift assays, the authors define a …
Reviewer #1 (Public review):
Summary:
In this manuscript, Kong et al. conduct a systematic analysis of the multi-cancer risk locus at 2q33. The authors start with a careful analysis of co-localization between the melanoma risk SNPs and several other cancers and conclude that a subset of credible causal SNPs is shared among different cancers, including breast cancer. Next, they define a starting list of 27 SNPs as potential credible causal SNPs and analysis of TADs (topologically associating domains) to zoom in on CASP8 and FLA CC1 as potential target genes. They then systematically rule out coding and splicing variants in the set and focus on a smaller set of three SNPs constituting a melanocyte enhancer element. Using a combination of mass spectrometry, reporter assays, and electrophoretic mobility shift assays, the authors define a role for transcription factors IRF2 and E4F1 in the regulatory network driving risk at the locus.
This work represents a high-quality tour de force, using multiple tools, to zoom in on a gene expression regulatory network associated with risk for multiple cancers. It provides a detailed framework for analyses of other multi-cancer risk loci. Limitations of the work, which is rather a current limitation of the field, is the lack of a model to study how the identified network of regulatory elements, transcription factors, and target genes mechanistically drive risk at the organismal level. Advances such as those described in this manuscript contribute significantly to our knowledge of how common risk variants drive risk.
-
Reviewer #2 (Public review):
Summary:
Kong et al. investigate a well-validated risk locus at chromosome band 2q33.1 adjacent to CASP8, a ubiquitously expressed and central initiator caspase in the extrinsic apoptotic pathway. Importantly, this region is a multi-cancer risk locus harboring multiple highly correlated risk alleles that are confounded by linkage. In addition to protein coding and splicing variants, further evaluation of eQTL and TWAS results for the locus suggests a cis-regulatory effect is present for CASP8 and nearby FLACC1. The authors prioritize variants using orthogonal statistical fine-mapping approaches and triage top candidates for functional assays. Luciferase reporter assays demonstrated convincing allele-specific regulatory activity of rs3769823 variant as well as suggestive evidence for rs3769821 and rs59308963. …
Reviewer #2 (Public review):
Summary:
Kong et al. investigate a well-validated risk locus at chromosome band 2q33.1 adjacent to CASP8, a ubiquitously expressed and central initiator caspase in the extrinsic apoptotic pathway. Importantly, this region is a multi-cancer risk locus harboring multiple highly correlated risk alleles that are confounded by linkage. In addition to protein coding and splicing variants, further evaluation of eQTL and TWAS results for the locus suggests a cis-regulatory effect is present for CASP8 and nearby FLACC1. The authors prioritize variants using orthogonal statistical fine-mapping approaches and triage top candidates for functional assays. Luciferase reporter assays demonstrated convincing allele-specific regulatory activity of rs3769823 variant as well as suggestive evidence for rs3769821 and rs59308963. These three variants lie in close proximity within a melanocyte regulatory element marked by overlapping promoter and enhancer chromatin state signals. The authors employ a haplotype reporter assay, which shows that the combination of risk alleles in the forward direction has additive effects compared to the protective haplotype. These effects are also cell type specific among melanocytes, melanoma, and breast cancer cell states. Utilizing electron mobility shift assays, the authors convincingly show augmented nuclear protein binding of the rs3769823-A risk allele, and mass spectrometry of allele-specific rs3769823 binding proteins revealed specific activity of E4F1 and IRF2, whose motif score is strengthened by the risk allele. Correlation of these transcription factors' expression with CASP8 expression suggested repressive effects of E4F1 and activating effects of IRF2, which were confirmed in siRNA assays across multiple cell types. These data provide important evidence towards the molecular mechanisms governing disease susceptibility at the 2q33.1 risk locus and nominate s3769823 as a causal variant through cis-regulatory activity by E4F1 and IRF2.
Strengths:
Major strengths of the work include the authors' employment of orthogonal fine-mapping approaches and functional assays in multiple cell types. These help to fortify a novel molecular mechanism of rs3769823 and also work together to propose a complicated multi-variant and cell-type-specific effect at this locus, which is worth future investigation.
Weaknesses:
The rs3769823 variant is a protein-coding variant for CASP8. While the authors conclude that this is likely neutral to CASP8 function, their evidence is suggestive at best and does not close the door on a protein-coding function for this variant.
Similarly, another variant, rs10804111, is associated with alternative splicing of CASP8. The authors do well to include the potent rs10804111 sQTL effect on CASP8 and further confirm it by a minigene assay. However, its exclusion from the fine-mapping results may be due to a potent bias towards active chromatin marks. Therefore, rs10804111 still requires further investigation.
Some attention is given to FLACC1, whose promoter may be in contact with multiple variants. However, little is known about FLACC1 function, and the authors don't provide meaningful supporting data to illustrate whether FLACC1 is relevant in the context of melanocyte, melanoma, or other cancer types that share this risk locus (breast, prostate). Showing the absolute expression levels in the eQTL analysis would be helpful towards this.Phenotypic assays interrogating the rs3769823-E4F1-IRF2 relevance to melanocyte biology and melanoma pathogenesis are not included.
Finally, the segmented figure organization negatively impacts the readability of the paper.
-
-
-