Geobacter species are involved in electricity production, bioremediations, and various environmental friendly activities. Whole genome comparative analyses of Geobacter sulfurreducens PCA, Geobacter bemidjiensis Bem, Geobacter sp. FRC-32, Geobacter lovleyi SZ, Geobacter sp. M21, Geobacter metallireducens GS-15, Geobacter uraniireducens Rf4 have been made to find out similarities and dissimilarities among them. For whole genome comparison of Geobacter species, an in-house tool, Geobacter Comparative Genomics Tool (GCGT) has been developed using BLASTALL program, and these whole genome analyses yielded conserved genes and they are used for functional prediction. The conserved genes identified are about 2184 genes, and these genes are classified into 14 groups based on the pathway information. Functions for 74 hypothetical proteins have been predicted based on the conserved genes. The predicted functions include pilus type proteins, flagellar proteins, ABC transporters, and other proteins which are involved in electron transfer. A phylogenetic tree from 16S rRNA of seven Geobacter species showed that G. sulfurreducens PCA is closely related to G. metallireducens GS-15 and G. lovleyi SZ. For evolutionary study, acetate kinase protein is used, which showed closeness to Pelobacter propionicus, Pelobacter carbinolicus, and Deferribacteraceae family bacterial species. These results will be useful to enhance electricity production by using biotechnological approaches.
1. Introduction
Geobacter species have been placed in the Geobacteraceae family which falls under the δ-Proteobacteria class. They are found in abundance where Fe(III) reduction is important, and these reductions play an important role in bioremediation [1]. These species have the ability to transfer electrons directly to electrodes or metals without any electron mediators and as a result they produce electricity or precipitate soluble metals [2, 3]. Geobacter species are involved in a variety of environmental friendly actions like bioremediation, electricity production, and so forth [2, 4]. Geobacter species till now have shown higher current production compared to other organisms, and these species are found to be having nanowires, which allows it to reach distant electron acceptors, and in turn helps in making a microbial fuel cell easier [5]. Geobacter species which are a strictly anaerobic organism also proved to be having the ability to grow in low oxygen concentrations, and this has effects on this organism’s growth in the subsurface environment [6].
As there is an increase in genome sequencing and thus the availability of whole genome data of many microorganisms, comparative genome analysis provides valuable information and these comparative analyses help in annotating the protein function and understanding evolutionary relationships. Comparative genome analyses have been achieved through various ways like pairwise or multiple genome comparisons or through local or global alignment methods. Computational application of these algorithms plays an important role in comparative analysis, and comparison of genomes provides information about conserved and unique genes across these genomes [7]. By identifying homologous genes through pairwise comparison with the closely related species, the hypothetical proteins can be functionally annotated [8, 9]. Phylogenetic analysis between the organisms provides a profile through which the relationship between these organisms can be identified. Since these organisms are in a way evolved by adapting to their environmental conditions, by analyzing these phylogenetic profiles, the evolution of these organisms can be inferred [10]. An evolutionary based comparative study has been carried out by Butler et al. which showed the relationship based on evolution between Geobacteraceae species [11].
The whole genome sequence data of Geobacter spp. are provided in public databases, and comparative genome analysis provides insights into their metabolism. This analysis provides information about the conserved gene across the Geobacter species and the metabolism involved and their evolution, and we also predicted the function of hypothetical proteins in G. sulfurreducens PCA. Here, we compared the genomes of 7 Geobacter species specific to G. sulfurreducens PCA for which the whole genome data are available in the NCBI database [12]. A pairwise comparison tool was built specifically for these 7 species, where the output interface provides information about common and unique genes. The 7 Geobacter genomes compared in this work are G. sulfurreducens PCA, G. bemidjiensis Bem, G. sp. FRC-32, G. lovleyi SZ, G. sp. M21, G. metallireducens GS-15, and G. uraniireducens Rf4. To know the relationship between these bacteria, a phylogenetic tree was built from the 16S rRNA of these genomes. And to analyze the evolutionary relationship, a tree was constructed for acetate kinase protein, a protein involved in acetate metabolism which is also conserved in all these species. Acetate kinase enzyme is involved in phosphorylation of acetate and ATP with the formation of acetyl phosphate and ADP. This process is linked to central carbon metabolism which is very important in energy production of an organism.
2. Materials and Methods2.1. Genome Data
The genome data of the 7 Geobacter species were obtained from the NCBI genome database in 2012 [12]. The genome information like genome length, gene content, GC%, and coding genes for each species was also obtained from the NCBI.
2.2. Geobacter Comparative Genomics Tool
Geobacter Comparative Genomics Tool (GCGT) has been built for pairwise comparative analysis of Geobacter genomes. Currently, this tool consists of all the 7 Geobacter species. This tool is part of an in-house tool, MCGT tool, which has 1205 organisms in total. GCGT is written in Perl script and runs on an Apache web server. GCGT takes total protein sequences, in fasta format, from two different organisms and aligns all sequences based on a local alignment program. The FORMATDB and BLASTALL programs are used to automate the task of sequence comparison for a given E-value. The corresponding BLAST [13] output is parsed by a Perl program that extracts information like accession number, protein name, score, E-value, identity, similarity, and individual alignment patterns for each homologous gene. The output is given in the form of tables showing homologous and nonhomologous sequences between the compared genomes. http://mcgt.bioinfo.au-kbc.org.in/GCGT/.
2.3. Functional Genomics and Gene Reannotation
Annotation of hypothetical proteins of G. sulfurreducens was carried out based on the results obtained. Each Geobacter species was run against G. sulfurreducens at an E-value of 0.0001 and, based on the results obtained, the function of hypothetical protein was predicted. The core genome across the Geobacter species was identified by running a Macro program which identified the common reference sequence IDs of G. sulfurreducens species among the homolog’s obtained from comparing with the other 6 Geobacter species in the GCGT tool. For the identified core genome genes, the pathway information was obtained using UniProt [14]. And also comparisons of protein coding genes, GC content, and various aspects of genome of the Geobacter were carried out.
2.4. Phylogenetic Analysis
A Phylogenetic analysis was done for 16S rRNA of the 7 Geobacter species to identify the relationship between these bacteria. The 16S rRNA protein sequence of 7 Geobacter species was obtained from the genome data downloaded from NCBI, and multiple sequence alignment of these proteins was carried out using ClustalX [15]. Phylogenetic tree was constructed using PHYLIP [16] package. Neighbor-joining method was used to construct phylogenetic trees, and a bootstrap analysis (1000 data resamplings) was used to determine levels of branch points obtained in neighbor-joining analysis. A consensus tree was developed using CONSENSE program, and the tree was viewed using Archaeopteryx [17] visualization software. To study the evolutionary relationship, a tree was constructed for acetate kinase protein, which is involved in acetate metabolism in Geobacter and also conserved across the 7 Geobacter genomes.
3. Results and Discussion3.1. Pairwise Comparison
The pairwise comparison is one of the important methods of comparing microbial genomes. This method was used in identifying the list of genes which are homologous between the compared organisms. The interpairwise comparison between the seven Geobacter species genomes carried out at an E-value of 1×10-5 provided the homologous and specific genes between these genomes (Table 1). A high similarity between Geobacter species of G. sulfurreducens PCA and G. metallireducens GS-15 was found based on pairwise comparison. The similarity was in the range of 80%, and the least similarity was found between G. sulfurreducens PCA and G. lovleyi SZ which was in the range of ~60%. The other Geobacter genomes showed similarity with G. sulfurreducens PCA in the range of 65–70%. These data were used for the functional annotation of hypothetical genes in G. sulfurreducens PCA which is used as a reference organism in the comparative analysis.
Comparative analysis of Geobacter species with Geobacter sulfurreducens PCA. This comparative analysis shows the homologous and specific genes with respect to Geobacter sulfurreducens PCA against other Geobacter species, and this comparison is carried out at an E-value of 1×10-5.
Comparison organism
Reference organism (Geobacter sulfurreducens PCA)
Homologous to comparison organism
Specific to reference organism
Specific to comparison organism
Geobacter metallireducens GS-15
2785
642
670
Geobacter uraniireducens Rf4
2757
670
1176
Geobacter bemidjiensis Bem
2709
718
1058
Geobacter sp. FRC-32
2675
752
927
Geobacter lovleyi SZ
2565
862
970
Geobacter sp. M21
2727
700
1088
The protein sequences which descend from a common ancestry will have the same function [18]. Based on the conserved genes across the Geobacter genomes, the functions of 74 hypothetical genes have been predicted (Table S1A in Supplementary Material found online at http://dx.doi.org/10.1155/2013/850179). The predicted functions include NAD dependent dehydratase (NP_951137.1) enzyme which is involved in the cellular metabolic process, membrane protein which plays an important role in the maintenance of concentrations of ions, type-IV pilus assembly PilZ that helps in transport (NP_952245.1), which transports with the help of a transporter or pore within a cell or between cells, and FAD dependent oxidoreductase (NP_952174.1), an enzyme involved in oxidation-reduction process.
3.2. Core Genome
The genes identified from the genomes are analyzed for their conservation across the seven Geobacter genomes. This concept is used for studying the genomic relationship among the bacteria [19]. Here, we have identified the core genome of seven Geobacter genomes which was about 2184 genes (Table S1B). These core genome proteins are classified according to their functional information (Table 2). This core gene contains essential genes which code for transcription, translation, and various metabolic proteins. The core genes contain high number of membrane proteins, electron transport proteins, and efflux proteins which are involved in the electron transfer mechanism indicating the high rate of electron transfer in these organisms.
Core genome of Geobacter spp. classified based on their functional classes. This classification shows a high number of electron transport proteins, ATPase, and signaling and membrane proteins to be conserved across Geobacter species. These electron transfer proteins and membrane proteins play a significant role in electricity production.
S. no.
Functional classes
No. of proteins
1
Cell membrane proteins
26
2
ATPase proteins
26
3
Efflux proteins
19
4
Lipid metabolism
23
5
Signaling pathway
44
6
ABC transporter substrate binding proteins
32
7
Flagellar proteins
24
8
Virulence, detoxification
4
9
Phage proteins
3
10
Transcriptional regulator proteins
69
11
RNA and ribosomal proteins
48
12
DNA binding proteins
34
13
Electron transport/cytochrome type
55
14
Cell growth proteins
1503
15
Hypothetical proteins
275
3.3. Genome Length and GC Content
The 7 Geobacter genomes show a considerable size variation. These variations in the size of bacterial genomes have been studied by Ussery and Hallin [20]. The genome of G. uraniireducens Rf4 is the largest with a size of 5,136,364nt and with 4358 protein coding genes. This G. uraniireducens Rf4 exhibits a slow growth rate and it also has long flagella [21]. The smallest is G. sulfurreducens PCA which has a genome length of 3,814,139nt and 3428 protein coding genes. The GC content is very important for a small organism and even small change in a base composition affects the coding regions [22]. The GC content of Geobacter organisms is in the range of 25%–75% (Table 3).
Genome data comparison of different Geobacter species. Comparing genome characteristics like protein coding genes, GC content, and coding percentage shows the relationship between these bacteria.
Name
RefSeq
Length
No. of genes
Protein coding
Structural RNAs
GC content
% coding
Geobacter sulfurreducens PCA
NC_002939
3,814,139 nt
3505
3428
57
60%
89%
Geobacter metallireducens GS-15
NC_007517
3,997,420 nt
3620
3520
59
59%
90%
Geobacter uraniireducens Rf4
NC_009483
5,136,364 nt
4490
4358
58
54%
86%
Geobacter sp. FRC-32
NC_011979
4,304,501 nt
3901
3798
62
53%
89%
Geobacter bemidjiensis Bem
NC_011146
4,615,150 nt
4129
4057
71
60%
87%
Geobacter lovleyi SZ
NC_010814
3,917,761 nt
3697
3606
54
54%
91%
Geobacter sp. M21
NC_012918
4,745,806 nt
4227
4080
75
60%
86%
3.4. Phylogenetic Analysis
Phylogenetic tree was constructed using the 16S rRNA of Geobacter genomes to study the relationship among these bacteria. The 16S rRNA is used for taxonomic purposes and identifying new classes of bacteria [23]. Butler et al. have constructed phylogeny tree based on 697 ortholog proteins among six Geobacter and a Pelobacter species [11]. Here, the tree based on 16S rRNA provided the relationship between these 7 organisms (Figure 1). We found that G. sulfurreducens PCA and G. metallireducens GS-15 are very closely related. The species G. uraniireducens Rf4, G. bemidjiensis Bem, and G. lovleyi SZ are separated by a clade.
Phylogenetic tree constructed using 16S rRNA of Geobacter species. This shows the relationship between Geobacter species. G. sulfurreducens PCA is found to be closely related to G. metallireducens.
A tree was also constructed based on the acetate kinase protein NP_953752.1 which is involved in acetate metabolism (Figure 2). Acetate is a primary electron donor for G. sulfurreducens PCA, and the anode electrode acts as an electron acceptor [24]. When G. sulfurreducens is grown with acetate as a substrate, the enzyme acetate kinase phosphorylates acetate to acetyl phosphate which is then acted upon by phosphate acetyltransferase and converts it into acetyl-CoA. This acetyl-CoA enters TCA cycle where it is oxidized through the TCA cycle producing NADH, NADPH, ATP, and reduced ferredoxin. Acetate kinase phosphorylation is an important step in energy production. Thus acetate kinase first enzyme which acts on acetate is taken for evolutionary studies. Apart from its closeness to Geobacter species, acetate kinase protein is found to be closely related to Pelobacter propionicus, Pelobacter carbinolicus, and Deferribacteraceae family bacteria like Calditerrivibrio nitroreducens, Denitrovibrio acetiphilus, Flexistipes sinusarabici, and Desulfovibrio desulfuricans. The closeness to these bacteria indicates its evolvability, and these bacteria have developed different strategies to oxidize organic compounds. Pelobacter is a nonmotile bacterium indicating its loss in its motility towards evolution or in other ways. Geobacter and Pelobacter evolved differently from their ancestors. Apart from Pelobacter, these organisms use organic substance for anaerobic respiration. Desulfovibrio desulfuricans, a known bacterium found to be involved in bioremediation, is found to form a close linkage with Spirochaetaceae family bacteria Treponema primitia ZAS-2. This bacterium is found to possess a large flagellar motor [25], and these findings will be helpful in studying the evolution of Geobacter sulfurreducens PCA.
A neighbor-joining tree for acetate kinase protein conserved across Geobacter species. This tree shows the relationship between various families of bacteria and Geobacteraceae family. The Deferribacteraceae and Spirochaetaceae families are found to be closely related to Geobacteraceae family.
4. Conclusion
As the continuous increase in genomic data of Geobacter bacteria is available, comparative and functional analysis becomes essential. As these Geobacter spp. play a very important role in bioremediation and various environmentally important activities, they need to be analyzed carefully to identify the various aspects of their genome. Comparative analysis is one of the important methods in analyzing the sequence data obtained from databases. A pairwise comparison of the whole genome of six Geobacter species with G. sulfurreducens yielded homologous sequences between them and also genes specific to particular organism. This comparative analysis also helps in systems biology to fill the holes in pathways. The core genome of Geobacter was also identified and classified based on the pathway information. Based on the conservancy of genes, the functions of 74 hypothetical genes were predicted. The relationship between these Geobacter species was studied using 16S rRNA. G. sulfurreducens PCA is found to be very closely related to G. metallireducens GS-15 and distantly related to G. lovleyi SZ among the Geobacter species. And its evolutionary relationship is studied by constructing a tree for acetate kinase protein. And it is found to be closely related to Deferribacteraceae and Spirochaetaceae families which have adopted different strategies to oxidize organic compounds. This evolutionary study indicates that these bacteria ancestors had organic compounds as substrate. These results will be useful in finding similarities and differences in the seven Geobacter spp. taken for the study and it helps in better understanding the genes involved in electricity production and bioremediation process.
Conflict of Interests
The authors declare that they do not have any conflict of interests.
Acknowledgment
The authors would like to thank Dr. Venil N. Sumantran, Adjunct Faculty IITM, for giving valuable suggestions in writing this paper.
LeangC.AdamsL. A.ChinK.-J.NevinK. P.MethéB. A.WebsterJ.SharmaM. L.LovleyD. R.Adaptation to disruption of the electron transfer pathway for Fe(III) reduction in Geobacter sulfurreducens200518717591859262-s2.0-2434444325110.1128/JB.187.17.5918-5926.2005BondD. R.LovleyD. R.Electricity production by Geobacter sulfurreducens attached to electrodes2003693154815552-s2.0-003733760610.1128/AEM.69.3.1548-1555.2003MethéB. A.NelsonK. E.EisenJ. A.PaulsenI. T.NelsonW.HeidelbergJ. F.WuD.WuM.WardN.BeananM. J.DodsonR. J.MadupuR.BrinkacL. M.DaughertyS. C.DeBoyR. T.DurkinA. S.GwinnM.KolonayJ. F.SullivanS. A.HaftD. H.SelengutJ.DavidsenT. M.ZafarN.WhiteO.TranB.RomeroC.ForbergerH. A.WeidmanJ.KhouriH.KhouriH.FeldblyumT. V.UtterbackT. R.Van AkenS. E.LovleyD. R.FraserC. M.Genome of Geobacter sulfurreducens: metal reduction in subsurface environments20033025652196719692-s2.0-034624380310.1126/science.1088727AndersonR. T.VrionisH. A.Ortiz-BernadI.ReschC. T.LongP. E.DayvaultR.KarpK.MarutzkyS.MetzlerD. R.PeacockA.WhiteD. C.LoweM.LovleyD. R.Stimulating the in situ activity of Geobacter species to remove uranium from the groundwater of a uranium-contaminated aquifer20036910588458892-s2.0-1074423169410.1128/AEM.69.10.5884-5891.2003SchoenA. R.Carbon fiber electrode as an electron acceptor for a microbial fuel cell using geobacter2007152426LinW. C.CoppiM. V.LovleyD. R.Geobacter sulfurreducens can grow with oxygen as a terminal electron acceptor2004704252525282-s2.0-384307001910.1128/AEM.70.4.2525-2528.2004FitchW. M.Homologya personal view on some of the problems20001652272312-s2.0-003417825710.1016/S0168-9525(00)02005-9RichterM.KubeM.BazylinskiD. A.LombardotT.GlöcknerF. O.ReinhardtR.SchülerD.Comparative genome analysis of four magnetotactic bacteria reveals a complex set of group-specific genes implicated in magnetosome biomineralization and function200718913489949102-s2.0-3434739211110.1128/JB.00119-07WeiL.LiuY.DubchakI.ShonJ.ParkJ.Comparative genomics approaches to study organism similarities and differences20023521421502-s2.0-003643580210.1016/S1532-0464(02)00506-3SawaG.DicksJ.RobertsI. N.Current approaches to whole genome phylogenetic analysis.20034163742-s2.0-0038206901ButlerJ. E.YoungN. D.LovleyD. R.Evolution from a respiratory ancestor to fill syntrophic and fermentative niches: comparative fenomics of six Geobacteraceae species200910, article 1032-s2.0-6524919128910.1186/1471-2164-10-103SayersE. W.BarrettT.BensonD. A.Database resources of the National Center for Biotechnology Information2011393851AltschulS. F.GishW.MillerW.MyersE. W.LipmanD. J.Basic local alignment search tool199021534034102-s2.0-002518370810.1006/jmbi.1990.9999UniProt ConsortiumOngoing and future developments at the Universal Protein Resource201139D214D219LarkinM. A.BlackshieldsG.BrownN. P.ChennaR.McgettiganP. A.McWilliamH.ValentinF.WallaceI. M.WilmA.LopezR.ThompsonJ. D.GibsonT. J.HigginsD. G.Clustal W and clustal X version 2.020072321294729482-s2.0-3644899150010.1093/bioinformatics/btm404FelsensteinJ.PHYLIP—phylogeny inference package (version 3. 2)19895164166HanM. V.ZmasekC. M.PhyloXML: XML for evolutionary biology and comparative genomics200910, article 3562-s2.0-7174909986210.1186/1471-2105-10-356LeeD.RedfernO.OrengoC.Predicting protein function from sequence and structure200781299510052-s2.0-3644898825410.1038/nrm2281CallisterS. J.McCueL. A.TurseJ. E.MonroeM. E.AuberryK. J.SmithR. D.AdkinsJ. N.LiptonM. S.Comparative bacterial proteomics: analysis of the core genome concept2008322-s2.0-4424908620210.1371/journal.pone.0001542e1542UsseryD. W.HallinP. F.Genome update: length distributions of sequenced prokaryotic genomes200415035135162-s2.0-1642277724ShelobolinaE. S.VrionisH. A.FindlayR. H.LovleyD. R.Geobacter uraniireducens sp. nov., isolated from subsurface sediment undergoing uranium bioremediation2008585107510782-s2.0-4704909487410.1099/ijs.0.65377-0HildebrandF.MeyerA.Eyre-WalkerA.Evidence of selection upon genomic GC-content in bacteria2010692-s2.0-7804942546310.1371/journal.pgen.1001107e1001107ClarridgeJ. E.IIIImpact of 16S rRNA gene sequence analysis for identification of bacteria on clinical microbiology and infectious diseases20041748408622-s2.0-564422731910.1128/CMR.17.4.840-862.2004TrinhN. T.ParkJ. H.KimB.Increased generation of electricity in a microbial fuel cell using Geobacter sulfurreducens20092637487532-s2.0-6594912447810.1007/s11814-009-0125-7MurphyG. E.LeadbetterJ. R.JensenG. J.In situ structure of the complete Treponema primitia flagellar motor20064427106106210642-s2.0-3374829567710.1038/nature05015