Genome sequence of the cultivated cotton Gossypium arboreum.
Nat Genet, 2014/6;46(6):567-72.
Li F[1], Fan G[2], Wang K[1], Sun F[2], Yuan Y[1], Song G[1], Li Q[3], Ma Z[4], Lu C[5], Zou C[5], Chen W[6], Liang X[6], Shang H[5], Liu W[6], Shi C[6], Xiao G[7], Gou C[6], Ye W[5], Xu X[6], Zhang X[5], Wei H[5], Li Z[5], Zhang G[8], Wang J[6], Liu K[5], Kohel RJ[9], Percy RG[9], Yu JZ[9], Zhu YX[7], Wang J[10], Yu S[5]
Affiliations
PMID: 24836287DOI: 10.1038/ng.2987
Impact factor: 41.307
Abstract
The complex allotetraploid nature of the cotton genome (AADD; 2n = 52) makes genetic, genomic and functional analyses extremely challenging. Here we sequenced and assembled the Gossypium arboreum (AA; 2n = 26) genome, a putative contributor of the A subgenome. A total of 193.6 Gb of clean sequence covering the genome by 112.6-fold was obtained by paired-end sequencing. We further anchored and oriented 90.4% of the assembly on 13 pseudochromosomes and found that 68.5% of the genome is occupied by repetitive DNA sequences. We predicted 41,330 protein-coding genes in G. arboreum. Two whole-genome duplications were shared by G. arboreum and Gossypium raimondii before speciation. Insertions of long terminal repeats in the past 5 million years are responsible for the twofold difference in the sizes of these genomes. Comparative transcriptome studies showed the key role of the nucleotide binding site (NBS)-encoding gene family in resistance to Verticillium dahliae and the involvement of ethylene in the development of cotton fiber cells.
MeSH terms
Binding Sites; Chromosome Mapping; DNA, Plant; Disease Resistance; Ethylenes; Evolution, Molecular; Gene Library; Genome, Plant; Gossypium; Models, Genetic; Phylogeny; Plant Diseases; Polyploidy; Retroelements; Sequence Analysis, DNA; Species Specificity; Terminal Repeat Sequences; Transcriptome; Verticillium
More resources
EndNote: Download