A phased, chromosome-scale genome of ‘Honeycrisp’ apple ( Malus domestica )
Abstract
‘Honeycrisp’ is one of the most valuable apple cultivars grown in the United States and a popular breeding parent due to its superior fruit quality traits, high levels of cold hardiness, and disease resistance. However, it suffers from a number of physiological disorders and is susceptible to production and postharvest issues. Although several apple genomes have been sequenced in the last decade, there is still a substantial knowledge gap in understanding the genetic mechanisms underlying cultivar-specific traits. Here we present a fully phased, chromosome-level genome of ‘Honeycrisp’ apples, using PacBio HiFi, Omni-C, and Illumina sequencing platforms. Our genome assembly is by far the most contiguous among all the apple genomes. The sizes of the two assembled haplomes are 674 Mb and 660 Mb, with contig N50s of 32.8 Mb and 31.6 Mb, respectively. In total, 47,563 and 48,655 protein coding genes were annotated from each haplome, capturing 96.8-97.4% complete BUSCOs in the eudicot database, the most complete among all Malus annotations. A gene family analysis using seven Malus genomes shows that a vast majority of ‘Honeycrisp’ genes are assigned into orthogroups shared with other genomes, but it also reveals 121 ‘Honeycrisp’-specific orthogroups. We provide a valuable resource for understanding the genetic basis of horticulturally important traits in apples and other related tree fruit species, including at-harvest and postharvest fruit quality, abiotic stress tolerance, and disease resistance, all of which can enhance breeding efforts in Rosaceae.
Article activity feed
-
This work has been published in GigaByte Journal under a CC-BY 4.0 license (https://doi.org/10.46471/gigabyte.69), and has published the reviews under the same license. These are as follows.
Reviewer 1. Dr.Liyi Zhang
‘Honeycrisp’ is known for its exceptionally crisp and juicy texture as a source of interesting genetic diversity in apple breeding programs worldwide. In addition, high quality genomes are required for us to understanding the genetic characteristics of a core cultivar, This study presents a fully phased, chromosome-level high-quality apple genome with a higher contiguity and completeness than previously sequenced apple genomes and also reveals 121 ‘Honeycrisp’-specific orthogroups with a large data set, which provide a toolbox for apple genetic research and breeding.
The paper is well written and the data is convincing. So, …
This work has been published in GigaByte Journal under a CC-BY 4.0 license (https://doi.org/10.46471/gigabyte.69), and has published the reviews under the same license. These are as follows.
Reviewer 1. Dr.Liyi Zhang
‘Honeycrisp’ is known for its exceptionally crisp and juicy texture as a source of interesting genetic diversity in apple breeding programs worldwide. In addition, high quality genomes are required for us to understanding the genetic characteristics of a core cultivar, This study presents a fully phased, chromosome-level high-quality apple genome with a higher contiguity and completeness than previously sequenced apple genomes and also reveals 121 ‘Honeycrisp’-specific orthogroups with a large data set, which provide a toolbox for apple genetic research and breeding.
The paper is well written and the data is convincing. So, I recommend to publish this paper ASAP.
**Reviewer 2. Luca Bianco **
Are all data available and do they match the descriptions in the paper?
I could not access to the Bioproject data nor see the results files (i.e. fasta, gff,...) but I am confident they will be available once the paper is accepted.
Is there sufficient data validation and statistical analyses of data quality?
The only exception is what I mentioned regarding the haplotype separation (see general comments below).
General comments: This paper describes the genome sequence of Honeycrisp, an important apple cultivar, produced with the latest sequencing technologies and assembled into phased chromosomes. In my opinion, the manuscript is well written, very interesting and certainly worth publication. There are only a few points that I would like to see addressed:
How can you be sure that the two haplomes are a good representation of each chromosome and not a mix of the two haplotypes? In other words, have you checked that the whole sequence of each chromosome represents one phase only? It would be great if you could provide some data (e.g. SNPs,...) to support this and discuss the results obtained in this regard.
Some additional stats regarding the obtained sequence could be added to table 2 and/or table 5 (e.g. number of Ns in the genome, how many telomers were assembled in each chromosome -- if not all telomers were identified, )
The gene family analysis among the different apple genomes is quite interesting but rather superficial. It would be nice to dig deeper into the function of the orthogroups that are unique to Honeycrisp, describe what pathways they are involved in and so on...
-