A strategy for genome-wide seamless tagging of human protein-coding genes
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Comprehensive and systematic proteome-wide experiments require financial and technical resources unavailable to most researchers. Here we describe a scalable CRISPR/Cas9-non-homologous end joining (NHEJ) based method for P ooled R ecombinant I ntegration of S eamless M arkers (PRISM) into protein coding genes. We created two gRNA libraries for 5’- and 3’-tagging of 18,804 human protein-coding genes. Selection for in-frame integration of the donor cassette can be guaranteed by fusing it to an antibiotic resistance enzyme (ARE) and P2A self-cleaving peptide, resulting in tagged proteins and free ARE to select for clones with inserted donor cassettes. We achieved a library integration rate of 19.75% and tagging of ∼80% for genes expressed in Hek293T cells and notably, 89.7% of essential genes, with donor DNA. Our strategy is scalable, specific, and selective, paving the way for genome-scale construction of human cell lines tagged with different types of reporter genes for protein functional characterization.
Significance statement
Here we report the first practical method to achieve near complete 5’- or 3’-end integration of reporter protein-coding sequences to express seamless fusions of human protein and reporter proteins that we call P ooled R ecombinant I ntegration of S eamless M arkers ( PRISM ). PRISM is independent of homology templates, yet it works with high efficiency and precision. Using gRNA-directed CRISPR/Cas9 genome editing and mini-plasmid donor cassettes we were able to tag about 80% of the protein-coding sequences of human genes with a green florescent protein at an integration efficiency of about 20% and most notably, of 90% of essential genes in the human cell line HEK 293T. This is important because the characterization of gene functions in a particular cell type usually begins with essential genes.