Answering the Call: Coping with DNA Damage at the Most Inopportune Time

DNA damage incurred during the process of chromosomal replication has a particularly high possibility of resulting in mutagenesis or lethality for the cell. The SOS response of Escherichia coli appears to be well adapted for this particular situation and involves the coordinated up-regulation of genes whose products center upon the tasks of maintaining the integrity of the replication fork when it encounters DNA damage, delaying the replication process (a DNA damage checkpoint), repairing the DNA lesions or allowing replication to occur over these DNA lesions, and then restoring processive replication before the SOS response itself is turned off. Recent advances in the fields of genomics and biochemistry has given a much more comprehensive picture of the timing and coordination of events which allow cells to deal with potentially lethal or mutagenic DNA lesions at the time of chromosomal replication.


MECHANISM OF INDUCTION
Irradiation of Escherichia coli with near ultraviolet light (254 nm UV) produces DNA lesions that block DNA replication and pose a dangerous threat to the integrity of the genetic information. DNA damage that blocks replication can result in genomic rearrangements when it resumes from the wrong place, mutagenesis when the incorrect base is incorporated opposite to the lesion, or even cell death when the block to replication cannot be overcome. A large body of work has demonstrated that E coli responds to this challenge by upregulating the expression of several genes which function to repair the DNA lesions, maintain the integrity of the DNA replication fork, and prevent premature cell division. This cellular response to DNA damage produced by UV and other agents has been collectively termed the SOS response, after the international distress signal (see [1]; reviewed in [2,3]).
The sensor for SOS induction in E coli is a two component repressor/activator system of LexA and RecA that "senses" when the replication fork is impeded from progressing normally. The observation that nonreplicating E coli do not induce a strong SOS response following UV irradiation supports the notion that replication is necessary for signaling the response [4]. Many of the genes induced following DNA damage are regulated by the LexA repressor protein which binds to a 20 base pair consensus sequence in the operator region of the genes, suppressing their expression [5,6]. Derepression of these genes occurs when RecA binds to single stranded regions of DNA created at replication forks disrupted by DNA damage. RecA bound to single strand DNA becomes conformationally active, serving as a coprotease that promotes the autocatalytic cleavage of the LexA repressor. As the cellular concentration of LexA diminishes, the genes normally suppressed by LexA are more frequently transcribed (see [2,4] and the references therein).
Kenyon and Walker carried out the first systematic search to identify genes that are up-regulated in a recA/lexAdependent fashion [7]. Through random insertion of a lac reporter gene into the E coli chromosome, they were able to identify promoters which were up-regulated following DNA damage. Subsequent analysis of the up-regulated genes revealed a 20 basepair consensus LexA binding motif, or "SOS box" shared by these genes in their promoter/operator regions [8]. Since these initial studies, several groups, using a variety of strategies, have identified many more LexAdependent damage inducible genes, as well as some genes that are induced in a LexA-independent manner. [9,10,11,12,13,14,15,16,17,18,19,20,21]. In one recent approach, DNA microarrays containing amplified DNA fragments from 95.5% of all open reading frames identified in the E coli genome were used to examine the changes in gene expression during the first hour following UV exposure in both wild-type cells and lexA1 mutants, which are unable to induce genes under LexA control [21]. These DNA microarrays contain PCR amplified DNA fragments of known and predicted genetic sequences printed on the surface of a glass slide. Through the comparative hybridization of two cellular RNA preparations, the relative difference between transcript levels of any gene in these preparations can be determined ( Figure 1). These techniques, in total, identified 43 locations  recD  yebA  manZ  manX  manY  gatA  gatB  gatC  gatZ  gatY  aceK  gatD  arp  aceA  ycgE  yjeB  ycgC  aroF  modF  nrdE   nrdA  nrdB  yfaE  sbp  dedA  yaiD  corA  suhB  fadL  gyrA  srmB  rpoD  yegQ  yhdG  dniR  dnaA  rpsU  b3814  b1436  yjiW  glvB  lexA  dinD  b1741  yigN  dinI  yebG  recA  oraA  dinB  b1728  yafP  smpA  umuD  umuC  dsbB  uvrB  ybhK  ruvA  recN  sulA  yafO  yebF  b2619  holE  ruvB  yfjB Figure 1. Clustered display of gene expression profiles following UV exposure in wild-type and lexA1 (SOS deficient) Escherichia coli [21]. Increased transcript levels are shown in red, decreased transcript levels are shown in green. The timepoints for irradiated samples (left to right) were 5, 10, 20, 40, and 60 minutes following UV irradiation. The timepoints for unirradiated samples (left to right) were 20 and 60 minutes following UV irradiation. All measurements are relative to time 0. Genes were selected for this analysis if their expression level deviated from time 0 by at least a factor of 2 in at least 3 time points. The colored image was produced using cluster analysis and is publicly available with this data at http://genome-www.stanford.edu/UVirradiation. The color scale ranges from saturated green for log ratios 2.0 and below to saturated red for log ratios 2.0 and above.
on the E coli chromosome that were up-regulated in a lexAdependent manner after UV irradiation in actively replicating cells. In addition, several transcripts were either downregulated or degraded following UV irradiation.
The biological function of many of these UV-responsive genes has been studied extensively and, in some cases, their role in the recovery process is well characterized. In this review, we outline some of the recent advances in our 2:2 (2002) understanding of the events that occur after the E coli genome is damaged, focusing particularly on the induction of cellular responses after UV exposure.

MAINTAINING THE REPLICATION FORK
One of the earliest recognized, and most heavily induced, DNA damage inducible gene products is RecA [22]. The act of binding to the single strand DNA regions at blocked replication forks serves not only to "sense" that replication is blocked, but also to maintain the structural integrity of the replication fork itself when progression is impeded. It is well established that recA mutants are extremely hypersensitive to DNA damage. This hypersensitivity correlates with a striking phenomenon called rec-less degradation in which the genomic DNA is rapidly degraded in the presence of DNA damage [22,23]. The degradation is much more severe when replication is active in the cells at the time of damage and it has been shown to initiate at the replication forks and then degrade progressively back from these points. These observations led Horii and Susuki to propose that RecA plays a role in protecting DNA at replication forks when they are blocked by DNA damage [23].
recA was originally identified and characterized as a gene essential for recombination in E coli [24]. In vitro, purified RecA binds to single strand DNA progressively and then pairs that single stranded DNA with homologous duplex DNA. The product of this reaction creates a RecA protein filament bound to a three strand DNA structure (for reviews see [25,26,27]). During recombinational processes, this activity is thought to be critical for bringing together homologous strands from different DNA molecules. During DNA replication, this same biochemical activity of RecA may play a nonrecombinational role in maintaining the DNA replication fork [28,29,30]. Semiconservative replication copies both strands of the DNA template concurrently in a 5 → 3 direction. While the leading strand can be synthesized continuously, synthesis on the lagging strand template occurs discontinuously, periodically reinitiating as the replication machinery moves processively along the template. The coordination of this process implies that at any given time, the region immediately behind the replication machinery will contain a single stranded region. In the event that replication becomes blocked or contains single strand gaps, the region left behind the replication fork should be an ideal substrate for RecA to bind and protect.
Several aspects of RecA function at replication forks arrested by DNA damage remain to be characterized. For instance, the structure of the blocked replication fork is not well understood. Does it end with the leading strand facing a lesion? Can the lagging strand continue on without the leading strand? It is also not known whether RecA promotes recombination at blocked replication forks, whether it is primarily nonrecombinational at these sites, or whether it is some combination of these two scenarios that depends upon the lesion presented. Finally, during recombinational processes, the act of RecA pairing single stranded DNA with homologous duplex DNA can also be associated with strand exchange and branch migration. These processes have also been proposed to function at damage blocked replication forks, perhaps to allow nucleotide excision repair or translesion DNA polymerases to gain access to the blocking DNA lesion [28,29,30,31]. In vitro experiments suggest that several branch migration proteins are capable of acting on these substrates [32,33,34,35]. However, whether branch migration occurs and which proteins, if any, act on replication structures in vivo has yet to be examined.

INDUCING REPAIR TO CLEAR THE DAMAGE
The production of DNA damage in the genome must be countered with a repair response that clears the DNA prior to the resumption of DNA replication and cell division. In E coli, UV-induced lesions are subject to nucleotide excision repair (NER), catalyzed by the UvrABCD proteins (reviewed in [2]). Three of these genes, uvrA, uvrB, and uvrD, are induced after DNA damage as part of the SOS response and have thus been implicated in promoting damage inducible excision repair. Recent studies have shed new light on the SOS-regulated NER response and have shown that inducible repair of UV lesions is dependent on the up-regulation of UvrA and UvrB but not UvrD.
Crowley and Hanawalt showed that cells treated with the transcription inhibitor rifampicin were deficient in their ability to remove the major UV-induced lesion, the cyclobutane pyrimidine dimer (CPD), from their genome following exposure to 40 J/m 2 UV [36]. A similar reduction in repair was observed in lexA3(Ind-) mutants that are unable to induce the SOS response after UV. However, lexA51(Def) mutants that constitutively express the SOS response were not deficient in NER even when treated with rifampicin after UV. Direct measurements of UvrA and UvrB levels in all three cell types correlated with the cell's ability to efficiently remove CPDs from the genome. In wild-type cells, 2-fold induction of both proteins occurred within the first 10 minutes after UV and maximal induction (as compared with the lexA(Def) strain) was reached within the first 40 minutes after UV, a time at which more than 80% of all CPDs are removed. Wild-type cells treated with rifampicin and the lexA3(Ind-) cells maintained low constitutive levels of UvrA and UvrB an hour after UV exposure, consistent with their slower rate of removal of CPDs. The pyrimidine(6-4)pyrimidone photoproduct (6-4 photoproduct), a less prevalent but more distorting UV-induced lesion, was repaired equally well in the presence or absence of SOS induction, indicating that inducible NER is necessary for efficient genomic repair of CPDs, but not 6-4 photoproducts in wild-type cells [36].
To determine the role of UvrD, also known as Helicase II, in promoting SOS-dependent inducible NER, Crowley and Hanawalt studied repair and survival in a uvrD deletion strain that carried a plasmid encoding a wild-type copy of uvrD under control of the T7 promoter. This plasmid construct yielded approximately wild-type levels of UvrD in an undamaged transformant and promoted almost complete complementation of the UV sensitivity and repair deficiencies observed in the nontransformed deletion mutant.
Since the levels of uvrD transcript could not be increased from the plasmid promoter after UV, the authors concluded that constitutive levels of UvrD are sufficient for promoting SOS-dependent NER in vivo [37]. It is interesting, therefore, to speculate on a role for UvrD in promoting cellular recovery from UV that is distinct from its role in NER. The observation that uvrD mutations are incompatible with polA (DNA pol I) or rep (Rep helicase) mutations suggests that Helicase II may function in replication in vivo [38,39]. Recent genetic and biochemical investigations have provided further support for such a role [40,41,42,43]. Its slow rate of up-regulation and modest 2.5-fold induction after UV [21] supports a role for UvrD that occurs later in the damage recovery process, perhaps after the majority of repair has been completed but before replication resumes.
An exciting new finding regarding the inducibility of E coli NER has come from the work of Moolenaar et al who have discovered a second endonuclease, dubbed Cho (UvrC homolog), that catalyzes a UvrB-dependent 3'-incision at a variety of lesions in the presence or absence of UvrC (43a). Cho, the gene product of the ydjQ gene, was discovered on the basis of its homology with the N-terminal region of UvrC, which is required for its 3'-incision activity. The ydjQ gene is a member of the SOS regulon and its gene product is up-regulated with kinetics similar to that of UvrA and UvrB [21]. Given that UvrC is not induced by DNA damage, this discovery suggests that E coli may possess an SOS-dependent subpathway of NER involving the novel Cho endonuclease.

GETTING BY THE LESIONS
The SOS response also upregulates the expression of three DNA polymerases, Pol II (polB), Pol IV (dinB), and Pol V (umuC) in addition to the constitutively expressed Pol I and Pol III. While Pol I, involved in repair and Okazaki fragment removal, and Pol III, the replicative DNA polymerase, can polymerize and incorporate the standard nucleotides opposite to their complementary bases, these inducible polymerases have the ability to incorporate and pair nucleotides opposite to specific forms of damaged DNA bases (ie, they promote translesion DNA synthesis). In vitro studies on single strand templates have shown that Pol III, the replicational polymerase of E coli, is arrested at DNA lesions such as those produced by UV irradiation [44,45]. Several studies identified three DNA damage-inducible polymerases in E coli (as well as multiple homologs in eukaryotic cells, including XP-V) capable of incorporating nucleotides opposite DNA lesions with relatively high efficiency when Pol III had extended the template up to the blocking DNA lesion [9,16,44,46,47,48,49,50]. Presumably, this allows lesions that would otherwise be blocks to DNA replication, a lethal event, to be bypassed and for replication to recover, a mutagenic event at worst. In fact, the first E coli translesion DNA polymerase, Pol V, encoded by umuC, was originally identified by its ability to produce mutations following UV irradiation, long before it was known to be a DNA polymerase [9]. In the absence of umuC (or umuD, see section: DNA damage replication checkpoint), the mutation rate drops dramatically following UV irradiation. Perhaps as a trade off for not mutating, these mutants are slightly more sensitive to UV irradiation than wild-type cells [9].
Following up on sequence similarities between the bypass DNA polymerase, REV1, identified by Nelson and colleagues [51] in yeast, both umuC and dinB were shown to be polymerases as well [47,48]. Pol II, the product of the polB gene, was first identified as a polymerase and later shown to be inducible following damage, but its function has remained elusive since polB mutants are not sensitive to UV, which is sometimes interpreted to indicate a role in recovery of replication from DNA damage [13,52,53,54]. Furthermore, no obvious replication defects are observed in polB mutants [55]. More recently, work from Rangarajan and colleagues [56] has shown that both the recovery of replication and the mutation spectra following UV irradiation is altered in dinB mutants, strongly suggesting that Pol II functions during the recovery process in vivo.
When and how these polymerases act at a DNA lesion in vivo remain unclear. Recent work from Napolitano and colleagues suggests that sometimes a polymerase might be specific for a specific DNA lesion but other times multiple polymerases may compete for the same DNA lesion, depending upon the nature of the DNA lesion and its sequence context [57]. Using a mutation assay based upon a plasmid construct that contain either an N-2-acetylaminofluorene (AAF) adduct, a benzo-a-pyrene adduct (BaP), or a 6-4 photoproduct, they demonstrated that the mutation frequency and spectra were altered in the absence of a specific bypass polymerase for each lesion. For instance, by measuring AAF induced −2 frameshifts, they found that mutagenesis was significantly reduced in polB mutants, but not in dinB nor umuC mutants. Conversely, when they measured BaP induced −1 frameshifts, they found that mutagenesis was significantly reduced in dinB and umuC mutants but not in polB mutants. This implies that, in vivo, different polymerases must be acting on different DNA lesions [57]. If and how the cell knows which polymerase to use at different DNA lesions remains an interesting but unknown question.

DNA DAMAGE REPLICATION CHECKPOINT
One of the bypass polymerases, Pol V (umuC) has an accomplice, umuD, which comprise an operon that is upregulated rapidly following SOS induction. Since the initial discovery of these genes in screens for mutants that were not mutable by UV [9], extensive genetic and biochemical work has shown that they are required for errorprone translesion synthesis in E coli [48]. Following SOS induction, UmuD is proteolytically cleaved to form UmuD' by a mechanism similar to the LexA cleavage reaction [58,59,60,61]. The proteins form a UmuD' 2 C complex that is believed to be the active form promoting lesion bypass during replication recovery after UV damage. The uncleaved form of UmuD can also form an UmuD 2 C complex that has been postulated to aid in the cessation of DNA 2:2 (2002) replication following the detection of UV-induced lesions in the genome [61,62]. The UV dose-dependent cleavage of UmuD, the checkpoint protein, to UmuD' , the bypass protein, occurs with kinetics that parallel the rate at which nucleotide excision repair proteins remove the bulk of UV-induced lesions following irradiation [36,63]. This processing time is proposed to act as a checkpoint that contributes to cell survival by allowing ample time to repair the genome prior to recovery of replication, avoiding the production of more deleterious forms of DNA damage which would result from further attempts at replicating a damaged DNA template (reviewed in Smith, 1998 and. The checkpoint role of UmuD comes from the initial observation that constitutive over-expression of the uncleaved form of UmuD, but not the cleaved form UmuD' , results in a cold sensitive phenotype and a modest inhibition of DNA synthesis [64]. Opperman and colleagues have since shown that expression of noncleavable Umu(S60A)D and UmuC from a plasmid partially complemented the UV sensitivity of a ∆umuDC strain and delayed DNA synthesis and cell growth following UV. As predicted, a ∆umuDC strain resumed DNA synthesis and cell growth within the first 10 minutes after UV, whereas deletion mutants complemented with umuC+umuD+ on a plasmid delayed resumption of replication for more than 45 minutes following 25 J/m 2 UV. Complementation of a ∆umuDC strain with a umuD(S60A)C+ construct delayed DNA synthesis and cell growth over 60 minutes. Neither umuD'C+ nor umuD+C125 complemented the rapid resumption of DNA synthesis, despite the ability of both strains to perform translesion synthesis. It is important to note that resumption of replication was not observed in the umuD(S60A)C+ construct, suggesting that UmuD'-dependent translesion synthesis is required for proper resumption of replication after UV [63,65,66].
To perform their roles in replication inhibition and translesion synthesis in vivo, the UmuD 2 C and UmuD' 2 C complexes have been shown to interact with DNA polymerase III directly, but each in distinctive ways. Using affinity chromatography, Sutton et al demonstrated that UmuD 2 has a greater affinity for the beta (processivity) subunit whereas UmuD' 2 interacts most strongly with the alpha (catalytic) subunit of Pol III. Both the UmuD and UmuD' homodimers bound to the epsilon (proofreading) subunit of Pol III [67]. Consistent with these interactions are data showing that only over-expression of the epsilon and beta subunits, and not any of the other eight Pol III subunits, had an effect on the cold sensitive phenotype of a umuD+C+ over-expressing strain at the restrictive temperature [68]. Together, these data suggest that UmuD 2 C associates with the DNA replication complex directly and that these interactions serve to slow or block DNA synthesis, perhaps through sequestration of these subunits from the Pol III holoenzyme. It is interesting to note the similarity between this potential prokaryotic DNA damage checkpoint and the p21-dependent DNA damage inducible cell cycle checkpoint in eukaryotes. In eukaryotes, p21 targets PCNA, the eukaryotic analog of the beta subunit, to promote replication arrest after DNA damage [69].

DNA DAMAGE CELL DIVISION CHECKPOINT
E coli cells exposed to UV or other SOS-inducing treatments continue to elongate but fail to septate and thus grow as filaments. This inhibition of septation is an SOSdependent process that involves the lexA+-regulated gene, sulA (sfiA). This gene was originally isolated as a suppressor of lon protease mutants, which are particularly prone to damage-induced filamentation [70]. The lon gene product is an ATP-dependent protease that degrades SulA in vivo. This degradation is rapid and, combined with LexA repression, serves to maintain SulA at low concentrations in the cell. Upon damage to the genome, sulA transcription is upregulated dramatically, achieving 10-fold induction in the first five minutes after 25 J/m 2 UV [21]. Therefore, SulA accumulates to inhibitory levels only when the SOS response is induced. The sulA gene product inhibits septation by interacting with FtsZ, a key cell division protein that forms the septation ring early in cell division [71]. Evidence exists that a second gene, sfiC, also works to inhibit cell septation after DNA damage through interaction with FtsZ. This protein is DNA damage inducible but its regulation is not dependent on lexA+recA+ [72,73,74].
Recently, two papers have suggested that a third mode of SOS-dependent septation inhibition exists in E coli. This mode of inhibition, is independent of sulA and sfiC, yet requires umuDC [65,75]. In their studies of cold sensitivity induced by UmuDC overproduction, Opperman and colleagues noted the production of cell filaments at 30 • C (the restrictive temperature) when umuDC were expressed constitutively from a single chromosomal copy of the operon. Over-expression of umuDC led to longer filaments and the production of filaments at 40 • C. This role of UmuDC is apparently distinct from its role in cold sensitivity as physiological levels of the proteins induced cell division inhibition but not cold sensitivity. Unlike the cold sensitive phenotype, umuDC-dependent septation inhibition requires SOS induction, demonstrating the need for at least one other LexAregulated gene in this process [65]. It is tempting to speculate that this gene might be ftsK, an essential cell division protein transcribed from dinH, an SOS-regulated promoter [76,77]. It is unclear, however, how up-regulation of a gene required for septation would operate to inhibit this process after DNA damage. Perhaps high levels of FtsK prevent proper formation of the septation ring, perhaps by disrupting the stoichiometry of septation factors. Interestingly, over-expression of ftsQAZ from a plasmid suppressed umuDC-dependent filamentation [65]. This finding, combined with the recent demonstration that FtsK is required for recruitment of FtsQ and other factors to the septation ring [78], suggests that proper stoichiometry of septation proteins may indeed play a role in promoting the regulation of cell division in vivo. It is also possible that the UmuDC complex plays a direct inhibitory role in this process. Since it is the target of SulA and SfiC, FtsZ would be a logical target for inhibition by the UmuDC complex, but this interaction has yet to be demonstrated. There may also be a role for minCDE in regulating cell division and septation after DNA damage. These genes, which induce and regulate septum formation, are repressed following UV in a lexA-independent manner [21]. Clearly, further investigation of this division inhibition is warranted, especially in conjunction with the role of UmuDC in promoting the arrest of DNA replication after DNA damage.

RELIEVING STRESS: TURNING THE SOS RESPONSE OFF
Thus, the SOS response appears to upregulate genes that hold replication and cell division in place until the offending lesions are repaired (or polymerases that allow replication to occur over the lesions are expressed). Once this has occurred, the stress response must be turned off. Two mechanisms that appear to play a role in returning the cell to normalcy are the LexA repressor and the product of the dinI gene.
LexA represses its own gene, and following SOS induction, LexA repressor is heavily induced [79]. In the current model, as long as replication remains blocked and single stranded DNA is present at the replication fork to activate and bind RecA, the LexA produced will be cleaved and SOS genes will continue to be expressed (reviewed in [2]). Once replication has resumed and the single stranded regions have been filled in, RecA no longer remains in the activated state and the LexA protein accumulates, binds, and represses genes that contain LexA binding sites at their promoters. Eventually, enough LexA is produced to repress all genes under its control and the cell resumes replicating and dividing normally.
Once replication has resumed and repair has been completed, there still remains an excessive amount of RecA in the cell due to its large up-regulation following SOS induction. One additional gene induced during the SOS response, dinI, has recently been shown to help eliminate excess RecA and help terminate the SOS response through an unusual mechanism. DinI was discovered as an SOS inducible gene through the original screen by Kenyon and Walker [7]. The first hint that DinI may play a role in suppressing SOS response came from work by Ohmori and coworkers who demonstrated that it could suppress the cold-sensitive phenotype of an unusual dinD68 mutation. The dinD68 allele causes SOS induction without any DNA damaging agent at temperatures less than 20 • C through a mechanism that is not yet understood [80]. However, subsequent work from Yasuda's group [81,82] additionally showed that DinI, when over-expressed, conferred a UV sensitive phenotype to cells, inhibited induction of SulA, and impaired LexA and UmuD cleavage following DNA damage. These are all activities that are consistent with repression, rather than activation of SOS functions. Recently, Camerini-Otero and coworkers have shown that the down-regulation occurs directly through DinI binding to the RecA protein itself [83,84]. This group demonstrated that the C-terminal alpha helix region of DinI has an unusually strong negatively charged surface that appears to mimic the structure and charge of single-stranded DNA, the substrate for RecA activation. However, unlike RecA bound to single stranded DNA, RecA bound to DinI is not an acti-vated form and cannot induce the SOS response. Thus, these data strongly suggest that DinI functions to down-regulate the SOS response by titrating away the inducing "sensor" for SOS induction itself, the RecA protein.

SUMMARY
The picture emerging from these studies of E coli is one in which multiple events are choreographed spatially and temporally such that the entire process can restore and complete the duplication of the genome following a moderate dose of DNA damage without sacrificing viability or the integrity of the genetic information.
There are, however, some critical pieces to this picture which remain to be filled in. It will be interesting and important to now identify the specific structure of the replication fork when it arrests. We believe that single stranded DNA is produced (and activates RecA) to initiate the cascade, however, where it is formed and how the arrested replication fork is structured remain to be characterized. Furthermore, the timing and contribution that lesion repair, lesion bypass, or even recombination has on the recovery of replication is not at all clear and remains a critical question since each strategy has very different consequences for the integrity and fidelity of the genetic information as it is duplicated. Lastly and most importantly, when we look at the genes that are induced following DNA damage, fully half of these genes have not been well characterized as to their functions. While new information is constantly coming, it is apparent from recent studies reviewed here that our view to date is extremely DNA-centric and is limited by a small number of assays. It will be interesting to see which discoveries remain to be elucidated from the characterization of these unknown genes.