A bacterial artificial chromosome (BAC) library for the A-genome of cotton has been constructed from the leaves of G. arboreum L cv. Jianglinzhongmian. It is used as elite A-genome germplasm resources in the present cotton breeding program and has been used to build a genetic reference map of cotton. The BAC library consists of 123,648 clones stored in 322 384-well plates. Statistical analysis of a set of 103 randomly selected BAC clones indicated that each clone has an average insert length of 100.2 kb per plasmid, with a range of 30 to 190 kb. Theoretically, this represents 7.2 haploid genome equivalents based on an A-genome size of 1697 Mb. The BAC library has been arranged in column pools and superpools allowing screening with various PCR-based markers. In the future, the A-genome cotton BAC library will serve as both a giant gene resource and a valuable tool for map-based gene isolation, physical mapping and comparative genome analysis.
1. Introduction
Cotton, one of the most important economical crops in the world, provides most natural textile fiber and some of oil resources for people. Cotton cultivars are diploids and tetraploids derived from natural hybridization of an African/Asian A-genome and an American D-genome species [1]. As one of the ancestors of tetraploids, the Asiatic cotton (G. arboreum L.) has been domesticated and cultivated for almost 2000 years in China since it was first introduced from India. It is still used as germplasm resources in the present cotton breeding program due to their early maturity, strong tolerance to stress, resistance to disease and insects, high fiber strength, and excellent plasticity. So decoding the A-genome will help for the diging of potential gene in Asian cotton, understanding the origin and evolution of allpolyploid, studying the genome structure of important agronomic trait and the relationship of gene structure and function, and elucidating the influence to the trait by the interaction between genes, which is important in crop improvement.
The bacterial artificial chromosome (BAC) cloning system has become an invaluable tool in many research areas of genome research because of its ability to stably maintain large DNA fragments and its ease of manipulation [2–4]. BAC libraries have been developed for a number of major crops including rice [5–7], soybean [8, 9], wheat [10–13], maize [14, 15], sorghum [16], and tomato [17]. For cotton, at least eight BAC/BIBAC libraries have been reported and made available to the public. These libraries were all made from different genotypes of AD-genome allotetraploid Gossypium species. Seven cotton libraries were made from upland cotton species, including Tamcot HQ95, Auburn 623 (http://hbz7.tamu.edu/homelinks/bac_est/bac.htm), TM-1 [18, 19], Maxxa [20], Suyun7235 [21], Zhongmiansuo12 [22] and 0-613-2R [23], and one from G. barbadense L. cv Pima90-53 [24]. Herein, we report the development of a deep-coverage BAC library for the diploid A-genome cotton species, which served as the maternal parent during polyploidization to generate the precursor of the commercially important allotetraploid species. G. arboreum L. cv. Jianglinzhongmian was selected for this aim. It is a diploid species of A-genome, widely used in breeding programs because of its high level of resistance to Fusarium wilt disease and its superior fiber strength. These BAC and BIBAC libraries will provide resources essential for advanced genomics and genetics research of cotton.
2. Materials and Methods2.1. BAC Library Construction
Cotton plants were grown in the dark for about 7 days. Etiolated cotyledons were used as the source of high-molecular-weight (HMW) DNA preparation. Nuclei were isolated, lysed and megabase DNA was purified as described by Yin et al. [25]. Agrose plugs preran to eliminate small fragments in the megabase genomic DNA. Then, megabase nuclear DNA was partially digested with HindIII to identify an appropriate partial digestion condition. Six plugs were prerun and used for large-scale digestion with optimal amount of HindIII. The DNA separation was performed in two stages. First, partially digested DNA was put into the center wells of a 1% agarose gel and separated by PFGE (6 V/cm, 50 s switch time, 18 hours run time, 12.5°C ). The region of the compression zone containing DNA fragments in the size range from 150 to 450 kb was excised from the unstained gel and divided into three equal sections. For the second step, the three excised gel slices were embedded into a second gel and compressed by PFGE (6 V/cm, 3–5 s switch time, 16 hours run time, 12.5°C ). The compressed DNA band was excised and recovered from the agarose gel slices using an Electroeluter model 422 (Biorad, CA, USA). Eluted DNA was ligated to vector pIndigoBAC-5 (HindIII-cloning Ready, Epicentre Technologies, Madison, WI, USA) and incubated under temperature-cycle conditions [26]. 2 μL of ligation mixture was used to transform 18 μL of ElectroMAX DH10B competent cells (Invitrogen, CA, USA) by electroporation at 17 kV/cm, 100 Ω, and 25 μF. Clones were picked by hand into 384-well plates containing LB freezing media. Plates were incubated overnight, replicated, and then frozen at -80°C.
2.2. BAC Clones Characterization
The BAC clones were picked from the library and inoculated into 3 mL 2 × TY medium containing chloramphenicol (12.5 μg/ml) and incubated at 37°C for 24 hours. BAC-DNA was miniprepared by the method of Sambrook [27] with some modifications. To estimate, insert size and determine distribution of clone size, a total of 103 BAC clones were selected at random throughout the library. The BAC-DNA was digested with 5U of NotI enzyme (3 hours at 37°C). The digestion products were separated by PFGE (6 V/cm, 5–15 s switch time, 14 hours run time, 12.5°C). The insert size was estimated by comparison with the midrange PFGE marker II (New England BioLabs, MA, USA).
Southern blots of size-separated BAC inserts were performed by standard protocols after UV nicking the DNA (Gene Linker, UVP, CA, USA). Total genomic cotton DNA digested by HindIII, and HaeIII used as probe and labeled with DIG by standard random priming techniques (High Prime DNA Labeling and Detection Starter Kit I, Roche, Mannheim, Germany).
Six BACs selected from the library at random for stability testing (with a volume of 5 μL) were cultured in 3 mL 2 × TY medium with antibiotic at 37°C for 24 hours. 5 μL of this culture was used to inoculate a subsequent 3 mL 2 × TY. This procedure was continued for five cycles. Every 24 hours period was considered to represent about 20 generations [28]. DNA samples isolated from the 1st and 5th day cultures (0-generation cells and 100-generation cells) were digested with HindIII and ran on 0.8% agarose gel for 16 hours with 2.0 V/cm. The gel was stained with ethidim bromide for 30 minutes, destained in water for 30 minutes, and then photographed.
2.3. BAC Library Pooling
Under the pooling strategy by Yin et al. [25], the BAC library was arranged in two levels of pools (column pools; Super pools) allowing screening with various PCR-based markers (Figure 3). The strategy consists of a two-step approach. Firstly, for every 384-well plate, the clones (5 μL cultures of each clone) in individual row A to P in the same column were combined into a pool containing 16 individual clones by the 12-channel transferpettor. Each column pool composed of 16 sequential 384-well plates in this way. In the second step, every column pool in the same raw was mixed by the 8-channel transferpettor. So the entire BAC library of 123,648 clones was organized into 322 super pools, each consisting of 384 unique clones.
3. Results
As cotton leaves are particularly rich in polyphenols and polysaccharides, we modified the general library construction method to meet the needs of cotton BAC library construction. Modifications included using etiolated cotyledon as the DNA preparation source, addition of PVP40, and increasing the β-mercaptoethanol concentration in the extraction-washing buffer. The prepared DNA, consisting of about 1 Mb nearly free of protein and organelle DNA, was suitable for BAC library construction. HMW-DNA embedded in LMP agarose plugs was partially digested with the enzyme HindIII. The optimal concentration range of HindIII found to produce the maximum number of 150–450 Kb DNA fragments was 8–12 U/plug. The partially digested DNA was gel separated and size selected twice. The DNA fragments from the second size selection were electroeluted from the gel, and DNA concentration was at least 3 ng/μL. Total twelve separate ligation reactions gave rise to the BAC library of 123,648 individual clones stored in 322 384-well plates.
To analyze the distribution of insert size and estimate the average insert size in the BAC library, DNA samples were isolated from 103 clones. BAC-DNA was digested by NotI to release the insert and fractionated by PFGE (Figure 1(a)). Statistical analysis indicated the insert size of clones ranged from 30 to 190 kb, with an average size of 100.2 kb (Figure 1(c)). No clone was found without an insert. Based on an A-genome size of 1697 Mb [29], the coverage of the library is approximately 7.2 haploid genome equivalents. This accounts for an over 99% probability of hitting a specific BAC clone containing any sequence in the genome. NotI is a GC-8 base cutter, while cotton genome is relatively AT-rich, so digestion with NotI should generate one or two insert bands plus a vector band (7.5 kb), which is consistent with our data. A Southern blot of the gel shown in Figure 1 probed with total cotton genomic DNA (Figure 1(b)) indicated that the source of cloned DNA originated from cotton.
Analysis of the cotton BAC clones. (a) Ethidium bromide-stained agarose gel showing 13 BAC clones digested with NotI and separated by PFGE. The size marker is the midrange PFGE marker II and its sizes are indicated in the margin. (b) An autoradiograph of the same gel hybridized with total cotton genomic DNA. (c) Distribution of insert sizes based on the analysis of 103 clones randomly selected from the BAC library.
To test the stability of BAC clones in E. coli, we analyzed the HindIII restriction patterns of six BAC clones in the 0 and 100 generations. No visible changes in fingerprints were seen between 0 and 100 generations (Figure 2), indicating the stability of the BACs.
Stability study of BAC clones. DNA samples were isolated from 6 single BAC clones derived from 1st and 5th cultures (0-generation cells and 100-generation cells) and digested with HindIII. M 1 Kb ladder DNA marker+λDNA/HindIII; a–f 6 single BAC clones; I 0-generation cells II, 100-generation cells.
The scheme representations of BAC pooling and PCR-based screening strategy. “→” indicated the direction of BAC pooling. Step I, one column pool was mixed with each 384-well column and 16 clones in the same column (gray dot) were combined into a column pool. Step II, One super pool was mixed with each column pool in the same raw and the column pools in same raw (black dot) were mixed. The entire BAC library of 123, 648 clones on 322 384-well plates was organized into 322 super pools. “⇢” indicated the direction of three rounds of PCR-based screening. Round I, screening against 322 Super pools. Round II, screening against the individual row of column pools. Round III, screening against the individual column of particular plate(s).
4. Discussion
Cotton has a relatively large genome with about 30%–60% of the genome consisting of highly repetitive sequences. Moreover, many homologous regions exist between the At and Dt genomes of the tetraploids. These characteristics hamper standard analysis of the cotton genome. Therefore, construction of a BAC library provides an important alternative tool for genomic research. Here, we described the development and characterization of a high-quality BAC library from the A-genome diploid species G. arboreum L. cv. Jianglinzhongmian, an elite cultivar. G. arboreum L. is an old cultivated cotton species that was planted extensively in China. To date, this species is still used in the tetraploid cotton-breeding program as an elite germplasm line, due to unique properties that include early maturity, environmental resistance, biotic resistance, high fiber strength, and excellent plasticity. Additionally, G. arboreum L. has historically been chosen as a model system to study fiber development. The A-genome produces spinnable fiber, whereas the D-genome alone is worthless in terms of fiber production [30]. Mei et al. [31] found among seven detected QTLs for six fiber-related traits, five were distributed among A-subgenome chromosomes, suggesting that the A-subgenome of the allotetraploid cotton contributes to the superior fiber production. Therefore, a BAC library established from A-genome species is useful in cotton genome studies.
Efficient library screening is crucial for all applications of the library. Screening can be performed either by hybridization on high-density filters or by PCR. Both methods are feasible, but the main advantage of hybridization is the ability to combine probes for screening the entire BAC library and identifying clones in a single experiment. PCR screening, however, is much more reliable, faster, and efficient with higher specificity owing to effective avoidance of false positive clones identified by repeat sequences in probes by hybridization. Here, we described a three-step PCR screening procedure based on the BAC library pool system. The BAC pool strategy was sensitive enough to identify single positive clones among superpools containing 384 BAC clones. BAC clones cultured overnight served as PCR template directly, rather than using prepared BAC-DNA. This modification considerably simplifies the procedure and shortens the time required for library screening. With this change, the BAC library can be screened using SSRs, RAPD, and other PCR-based molecular tagging and will facilitate the development of whole-genome integrated physical/genetic maps of cotton.
In additon, the G. arboreum L. cv. Jianglinzhongmian BAC library will provide a valuable tool for a diverse range of other studies including comparative genomics, physical mapping, map-based cloning of gene(s) of QTL(s), and marker development based on BAC-end sequencing and genoming sequencing. Recently, we developed a BAC library from TM-1 [19], which is a tetraploid genetic standard line for upland cotton. In the future, the availability of these two cotton BAC libraries will allow us to perform further comparative studies between A-genome and AD-genome species and these comparisons will further reveal the evolution of cotton genome.
Acknowledgments
This study is supported by grants from the Project of the Changjiang Scholars and Innovative Research Team in University (IRT0432) and the 111 Project (B08025), the Ministry of Education of China.
WendelJ. F.CronnR. C.Polyploidy and the evolutionary history of cotton2001781391862-s2.0-3614897576810.1016/S0065-2113(02)78004-8ShizuyaH.BirrenB.KimU.-J.MancinoV.SlepakT.TachiiriY.SimonM.Cloning and stable maintenance of 300-kilobase-pair fragments of human DNA in Escherichia coli using an F-factor-based vector19928918879487972-s2.0-0026703169WooS.-S.JiangJ.GillB. S.PatersonA. H.WingR. A.Construction and characterization of a bacterial artificial chromosome library of Sorghum bicolor19942223492249312-s2.0-0028567284PetersonD.TomkinsJ.FrischD.WingR.PatersonA.Construction of plant bacterial artificial chromosome (BAC) libraries: an illustrated guide200051100NishimuraM.NakamuraS.HayashiN.AsakawaS.ShimizuN.KakuH.HasebeA.KawasakiS.Construction of a BAC library of the rice blast fungus Magnaporthe grisea and finding specific genome regions in which its transposons tend to cluster1998628151515212-s2.0-0032135040JiangG.-H.WangW.-M.XieB.ZhaiW.-X.LuR.-L.ZhuL.-H.Construction of a Bacterial Artificial Chromosome (BAC) contig encompassing the bacterial blight resistance gene Xa4 locus in rice20012832362432-s2.0-0035236490TaoQ.ChangY.-L.WangJ.ChenH.Islam-FaridiM. N.ScheuringC.WangB.StellyD. M.ZhangH.-B.Bacterial artificial chromosome-based physical map of the rice genome constructed by restriction fingerprint analysis20011584171117242-s2.0-0034844163WuC.-C.NimmakayalaP.SantosF. A.SpringmanR.ScheuringC.MeksemK.LightfootD. A.ZhangH.-B.Construction and characterization of a soybean bacterial artificial chromosome library and use of multiple complementary libraries for genome physical mapping20041095104110502-s2.0-534426306210.1007/s00122-004-1712-yWuC.SunS.NimmakayalaP.SantosF. A.MeksemK.SpringmanR.DingK.LightfootD. A.ZhangH.-B.A BAC- and BIBAC-based physical map of the soybean genome20041423193262-s2.0-124234196810.1101/gr.1405004LijavetzkyD.MuzziG.WickerT.KellerB.WingR.DubcovskyJ.Construction and characterization of a bacterial artificial chromosome (BAC) library for the A genome of wheat1999426117611822-s2.0-0033493289CenciA.ChantretN.KongX.GuY.AndersonO. D.FahimaT.DistelfeldA.DubcovskyJ.Construction and characterization of a half million clone BAC library of durum wheat (Triticum turgidum ssp. durum)200310759319392-s2.0-014194205310.1007/s00122-003-1331-zJandaJ.BartošJ.ŠafářJ.KubalákováM.ValárikM.ČíhalíkováJ.ŠimkováH.CabocheM.SourdilleP.BernardM.ChalhoubB.DoleželJ.Construction of a subgenomic BAC library specific for chromosomes 1D, 4D and 6D of hexaploid wheat20041097133713452-s2.0-1994442842710.1007/s00122-004-1768-8AkhunovE. D.AkhunovaA. R.DvořákJ.BAC libraries of Triticum urartu, Aegilops speltoides and Ae. tauschii, the diploid ancestors of polyploid wheat20051118161716222-s2.0-2774445057410.1007/s00122-005-0093-1FuH.DoonerH. K.A gene-enriched BAC library for cloning large allele-specific fragments from maize: isolation of a 240-kb contig of the bronze region20001068668732-s2.0-003391716710.1101/gr.10.6.866YimY.-S.DavisG. L.DuruN. A.MusketT. A.LintonE. W.MessingJ. W.McMullenM. D.SoderlundC. A.PolaccoM. L.GardinerJ. M.CoeE. H.Jr.Characterization of three maize bacterial artificial chromosome libraries toward anchoring of the physical map to the genetic map using high-density bacterial artificial chromosome filter hybridization20021304168616962-s2.0-1224426079710.1104/pp.013474WooS.-S.JiangJ.GillB. S.PatersonA. H.WingR. A.Construction and characterization of a bacterial artificial chromosome library of Sorghum bicolor19942223492249312-s2.0-0028567284HamiltonC. M.FraryA.XuY.TanksleyS. D.ZhangH.-B.Construction of tomato genomic DNA libraries in a binary-BAC (BIBAC) vector19991822232292-s2.0-003295777110.1046/j.1365-313X.1999.00433.xDongJ. M.KohelR. J.ZhangH. B.YuJ.Bacterial artificial chromsome(BAC) libraries constructed from the genetic standard of upland cottons1999, http://algodon.tamu.edu/htdocs-cotton/tm1bac.htmlHuY.GuoW.-Z.ZhangT.-Z.Construction of a bacterial artificial chromosome library of TM-1, a standard line for genetics and genomics in upland cotton20095111071122-s2.0-5814935025410.1111/j.1744-7909.2008.00773.xTomkinsJ. P.PetersonD. G.YangT. J.MainD.WilkinsT. A.PatersonA. H.WingR. A.Development of genomic resources for cotton (Gossypium hirsutum L.): BAC library construction, preliminary STC analysis, and identification of clones associated with fiber development2001832552612-s2.0-003570642510.1023/A:1013798716098WangX. F.JunM. A.MaZ. Y.ZhangG. Y.ZhengY. M.BAC library construction and characterization of Suyuan7235, a cotton germplasm with high fiber strength200618200203ZhengY. M.WangX. F.ZhangG. Y.LiX. H.MaZ. Y.BAC library construction of Zhongmiansuo12 with highyield, highquality and disease resistance2004271720YinJ.-M.GuoW.-Z.ZhangT.-Z.Construction and identification of bacterial artificial chromosome library for 0-613-2R in upland cotton20064822192222-s2.0-3364556712710.1111/j.1744-7909.2006.00219.xWangX. F.MaJ.WangW. S.ZhengY. M.ZhangG. Y.LiuC. J.MaZ. Y.Construction and characterization of the first bacterial artificial chromosome library for the cotton species Gossypium barbadense L20064911139313982-s2.0-3424738020310.1139/G06-113YinJ.GuoW.YangL.LiuL.ZhangT.Physical mapping of the Rf1 fertility-restoring gene to a 100 kb region in cotton20061127131813252-s2.0-3364555825510.1007/s00122-006-0234-1LundA. H.DuchM.PedersenF. S.Increased cloning efficiency by temperature-cycle ligation19962448008012-s2.0-002996297710.1093/nar/24.4.800SambrookJ.FritschE. F.ManiatisT.19892ndNew York, NY, USACold Spring Harbor Laboratory PressKimU.-J.ShizuyaH.De JongP. J.BirrenB.SimonM. I.Stable propagation of cosmid sized human DNA inserts in an F factor based vector1992205108310852-s2.0-0026594245HendrixB.StewartJ. M.Estimation of the nuclear DNA content of Gossypium species20059557897972-s2.0-2324445595310.1093/aob/mci078ApplequistW. L.CronnR.WendelJ. F.Comparative development of fiber in wild and cultivated cotton2001313172-s2.0-003512507910.1046/j.1525-142X.2001.00079.xMeiM.SyedN. H.GaoW.ThaxtonP. M.SmithC. W.StellyD. M.ChenZ. J.Genetic mapping and QTL analysis of fiber-related traits in cotton (Gossypium)200410822802912-s2.0-084228759610.1007/s00122-003-1433-7