Prebiotic Chemistry and the Origin of the RNA World
The demonstration that ribosomal peptide synthesis is a ribozyme- catalyzed reaction makes it almost certain that there was once an RNA World. The central problem for origin-of-life studies, therefore, is to understand how a protein-free RNA World became established on the primitive Earth. We first review the literature on the prebiotic synthesis of the nucleotides, the nonenzymatic synthesis and copying of polynucleotides, and the selection of ribozyme catalysts of a kind that might have facilitated polynucleotide replication. This leads to a brief outline of the Molecular Biologists’ Dream, an optimistic scenario for the origin of the RNA World. In the second part of the review we point out the many unresolved problems presented by the Molecular Biologists’ Dream. This in turn leads to a discussion of genetic systems simpler than RNA that might have “invented” RNA. Finally, we review studies of prebiotic membrane formation.
Keywords nucleotide synthesis, polynucleotide formation, RNA replication, prebiotic membranes
INTRODUCTION
The ideas behind the hypothesis of an RNA World originated in the late 1960s in response to a profound puzzle. The basic principles of molecular biology were well understood, and it was clear that the replication of nucleic acids was dependent on protein enzymes and the synthesis of protein enzymes was dependent on nucleic acids. Even if one allowed for every possible simplification of the system, for example, by postulating a nucleic acid with only two bases and proteins assembled from a very limited suite of amino acids, what remained was too complicated to have arisen de novo from an assembly of abiotic organic molecules. The only way out was to regard the dilemma as a “chicken and egg problem” and to ask which came first, proteins or nucleic acids? At the time, it was well recognized that natural selection through replication and mutation was the only mechanism for evolving complex biochemical systems from simpler ones. Trying to solve the “chicken and egg” problem, therefore, was equivalent to asking whether proteins or nucleic acids were more plausible as the components of a self-contained replicating system. The answer seemed obvious: nucleic acids. WatsonCrick base-pairing provided a very plausible mechanism by which a polynucleotide could direct the synthesis of its complement from mononucleotides or short oligonucleotides, while no equivalent mechanism was known for the replication of a polypeptide. These arguments were developed in some detail in three overlapping papers (Woese, 1967; Crick, 1968; Orgel, 1968), and a program to explore nonenzymatic copying of nucleic acid sequences was initiated (Sulston et al., 1968a, 1968b).
The authors of the three early papers clearly recognized that an autonomous RNA “organism” would be possible only if RNA could take on several of the functions presently performed by proteins, for example, the functions of RNA polymerases and nucleases. They speculated that coenzymes incorporating nucleotides in their structure were fossils from a time when RNA functioned without the help of proteins (Woese, 1967; Orgel, 1968; Orgel & Sulston, 1971), an idea that was subsequently developed in some detail (White, 1976). In one instance it was speculated that the original ribosome was composed entirely of RNA (Crick, 1968). However, in none of the papers was it suggested that RNA catalysis was still important in contemporary biology. It was taken for granted that protein enzymes could always outperform RNA catalysts and had, therefore, completely replaced them in contemporary organisms.
The unanticipated discovery of catalytic RNA molecules, ribozymes, that perform enzyme-like reactions (Kruger et al., 1982; Guerrier-Takada et al., 1983) marked the beginning of the present interest in a proteinless biological world. In the few years following Cech and Altman’s discoveries, ribozymes were shown to be able to catalyze a significant number of diverse chemical reactions. This led to an increased interest in the hypothesis that an RNA World, a term introduced by Gilbert (Gilbert, 1986), preceded the DNA/RNA/Protein world (Gesteland et al., 1999). The determination of the structure of the ribosome, showing that it is a ribozyme (Steitz & Moore, 2003), seems to clinch the case for an RNA World, although it leaves open almost all questions about the origin and “biochemistry” of the RNA World. The RNA World hypothesis does not deny that peptides may have been involved in the origin of life. It does, however, exclude the possibility that any peptides that were involved were formed by ribosomal protein synthesis or a closely related mechanism.
The knowledge that an RNA World preceded our familiar biochemical world has profound implications for those interested in the origin of life. It may be claimed, without too much exaggeration, that the problem of the origin of life is the problem of the origin of the RNA World, and that everything that followed is in the domain of natural selection. If this is accepted, studies of the chemistry of the origin of life are, in principle, greatly simplified because they need only be concerned with the origin of RNA and do not need to deal with the origins of most other features of biochemistry. Of course, the origin of protein synthesis and of DNA are also of the greatest interest, but their appearance can be regarded as the consequences of selection acting on populations of autonomous RNA organisms. The focus of this review, therefore, will be the origin of the RNA World and its evolution prior to the development of protein synthesis. We will have little to say about the prebiotic synthesis of amino acids, peptides, cofactors, etc.
While acceptance of an RNA World greatly simplifies the problem of the origin ol’lile, it also has a negative aspect (Orgel, 2003). If the origin of the RNA World preceded the origin of protein synthesis, little can be learned about the chemistry of the origin of life from the study of protein enzyme mechanisms. The justification of prebiotic syntheses by appealing to their similarity to enzymatic mechanisms has been routine in the literature of prebiotic chemistry. Acceptance of the RNA World hypothesis invalidates this type of argument. If the RNA World originated de novo on the primitive Earth, it erects an almost opaque barrier between biochemistry and prebiotic chemistry.
It is possible that the RNA World was the first organized biochemical world on the primitive Earth. If we suppose that this is the case, the problem of the origin of life can conveniently be divided into a number of subproblems:
1. The nonenzymatic synthesis of nucleotides.
2. The nonenzymatic polymerization of nucleotides to give random- sequence RNA.
3. The nonenzymatic copying or replication or both, of RNA.
4. The emergence through natural selection of a set of functional RNA catalysts that together could sustain exponential growth in the prebiotic environment.
The first three topics are part of the traditional field of prebiotic chemistry, while the fourth is the subject matter of the newer field of RNA evolution. We begin this review by covering the first three topics in some detail. Since the fourth topic falls outside the scope of traditional prebiotic chemistry, only a very brief overview will be given. From our discussion of prebiotic chemistry we will conclude that the abiotic synthesis of RNA is so difficult that it is unclear that the RNA World could have evolved de novo on the primitive Earth, a conclusion that was first emphasized by Cairns-Smith (Cairns-Smith & Davies, 1977; Cairns- Smith, 1982). Consequently, we will have to consider different routes to the RNA World. We will explore the possibility that a simpler replicating molecule could have formed on the primitive Earth and that organisms with a genetic system based on that simpler polymer could have “invented” RNA.
PREBIOTIC SYNTHESIS OF NUCLEOTIDES
Prebiotic Synthesis
Prebiotic chemistry is concerned with molecules that are interesting to students of the origin of life which, they believe, could have been formed on the primitive Earth. Since we know very little about the availability of starting materials on the primitive Earth or about the physical conditions at the site where life began, it is often difficult to decide whether or not a synthesis is plausibly prebiotic. Not surprisingly, claims of the type, “My synthesis is more prebiotic than yours” are common. Nonetheless, there is fairly general agreement about the following restrictions on organic synthesis imposed by the requirement for prebioticity:
It must be plausible, at least to the proposers of a prebiotic synthesis, that the starting materials for a synthesis could have been present in adequate amounts at the site of synthesis.
Reactions must occur in water or in the absence of a solvent.
The yield of the product must be “significant,” at least in the view of the proposers of the synthesis.
Clearly “prebiotic” is a very elastic term, and it would not be wise to try to define it too closely.
Just as many people have been speaking prose all their lives without realizing it, many organic chemists of the 19th and the first half of the 20th century were prebiotic chemists without realizing it. If it were discovered for the first time today, Wohler’s synthesis of urea from ammonium cyanate (Wohler, 1828) would certainly merit publication in \Science or Nature as an important contribution to prebiotic chemistry. Butlerow’s synthesis of sugars from formaldehyde (Butlerow, 1861) is still one of the cornerstones of the subject. However, these and other early experiments on the synthesis of biochemicals from simple starting materials were never motivated by an interest in the origin of life. Stanley Miller’s classic experiment demonstrating the synthesis of amino acids in an electric discharge (Miller, 1953) marks the beginning of prebiotic chemistry as an enterprise directed to understanding the chemistry of the origin of life.
Miller and Urey believed that the atmosphere of the primitive Earth was strongly reducing, containing large amounts of methane and ammonia. Miller showed that formaldehyde and hydrogen cyanide (HCN) were key intermediates in the synthesis of glycine from such a mixture (Miller, 1957). Although not directly relevant to the origin of the RNA World, these observations led Juan Oro and his coworkers to study the products formed when ammonium cyanide is refluxed in aqueous solution. His remarkable discovery that adenine is a product of cyanide polymerization (Oro & Kimball, 1960), together with the earlier results reported by Butlerow and Miller, determined the direction of research on prebiotic chemistry for many years. The relevance of all of this early work to the origin of life has been questioned because it now seems very unlikely that the Earth’s atmosphere was ever as strongly reducing as Miller and Urey assumed. However, it still seems possible that that the Earth’s atmosphere was once sufficiently reducing to have supported Miller/Urey chemistry to some extent (Kasting & Brown, 1998).
The Butlerow (Formose) Synthesis of Sugars from Formaldehyde
The polymerization of formaldehyde in the presence of simple mineral catalysts to form a mixture of sugarsthe formose reaction- originally discovered by Butlerow in the 19th century (Butlerow, 1861), has been investigated in considerable detail (Mizuno & Weiss, 1974). The reaction is of great interest as a unique, cyclic autocatalytic process that takes place in aqueous solution and converts a very simple substrate, formaldehyde, to a mixture of complex molecules, many of which are important biochemicals.
It is fortunate that Butlerow did not completely purify the formaldehyde he used before initiating the reaction, because the formation of sugars from formaldehyde is dependent on the presence of trace amounts of one of a number of common impurities (Socha et al., 1980; Kieboom & VanBekkum, 1984). Glycolaldehyde, the first product of the polymerization reaction, is an efficient initiator and is often used in this role. In the absence of an initiator, formaldehyde in alkaline solution undergoes the Cannizaro reaction, yielding methanol and formic acid. The Butlerow synthesis of sugars is usually carried out in alkaline solution in the presence of a catalyst. Most studies have employed heterogeneous catalysts, particularly suspensions of calcium hydroxide, but some homogeneous catalysts are known, for example, Pb^sup ++^ and T1^sup +^ ions. A few investigations of the reaction under near-neutral conditions in the presence of minerals have been reported (Gabel & Ponnamperuma, 1967; Reid & Orgel, 1967).
The most intriguing feature of the formose reaction is the long induction period that precedes the formation of detectable products. Under many conditions the polymerization, once started, is completed in a time shorter than the induction period, but the induction period can be reduced progressively by adding increasing amounts of an initiator. The first product of the polymerization is glycolaldehyde, which is later converted to glyceraldehyde and a variety of tetrose, pentose, and hexose sugars. Under the conditions usually used to bring about the reaction, the sugars decompose to hydroxy-acids and related compounds on a timescale similar to that of their appearance.
Cycles of the type shown in Figure 1 best explain most of the experimental findings (Breslow, 1959). Two types of reaction are involved, forward and reverse aldol reactions and tautomerizations that interconvert aldehydes and ketones. The scheme in the figure is no doubt a gross oversimplification. Many related cycles involving reverse aldol reactions of different representatives of the tetrose, pentose, and hexose sugars must contribute to the total reaction. Furthermore, the addition of formaldehyde to glyceraldehyde and similar molecules leads to the formation of branched chain sugars, and the Cannizaro reduction of sugars to polyols is also an important side reaction (Mizuno & Weiss, 1974). Despite these complications, the major conclusion to be drawn from the scheme in Figure 1 is correct; as the consequence of traversing a cycle of the type shown, a single input glycolaldehyde molecule leads, in principle, to the production of two output glycolaldehyde molecules. This ensures an exponentially growing rate of product formation until the concentration of formaldehyde begins to decline.
FIG. 1. The simplest hypothetical autocatalytic formose reaction cycle. In each turn of the cycle, a glycolaldehyde molecule facilitates the synthesis of a second glycolaldehyde molecule from two formaldehyde molecules. The stereochemistry at the asymmetric carbon atoms (marked with asterisks in the diagram) is not specified.
The Butlerow reaction, if it could be directed to the synthesis of ribose, would provide an ideal route to the sugar component of the nucleotides. However, until recently it had not been possible to channel the Butlerow reaction to the synthesis of any particular sugar, and ribose usually has been a notoriously minor product (Decker et al., 1982). More recently, Zubay has studied in detail the progress of the Pb^sup ++^ catalyzed formose reaction (Zubay, 1998; Zubay & Mui, 2001). He has shown that more than 30% of the input formaldehyde can be converted to a mixture of the aldopentoses and provides evidence suggesting that ribose is the first pentose sugar formed, and that the other pentoses are formed from it by Pb^sup ++^-catalyzed isomerization. These studies suggest that a satisfactory prebiotic synthesis of ribose may be possible. In a very recent report it has been claimed that the four pentose sugars are stabilized by the presence of calcium borate minerals (Ricardo et al., 2004).
The production of ribose in the formose reaction depends, at least in part, on the aldol reaction of glycolaldehyde with glyceraldehyde. Eschenmoser and his coworkers showed that the pattern of products could be greatly simplified if glycolaldehyde and glyceraldehyde were replaced by their monophosphates (Mueller et al., 1990). Under alkaline conditions glycolaldehyde phosphate alone yields a relatively simple mixture of tetrose-2-4diphosphates and hexose-2-4-6-triphosphates. Most interestingly, ribose-2-4- diphosphate was the major sugar product from the reaction of glycolaldehyde phosphate with glyceraldehyde-2-phosphate. Ribose-2- 4diphosphate was also a major product of an equivalent reaction involving formaldehyde and two molecules of glycolaldehyde- phosphate. In these reactions the phosphate groups prevent the rearrangements that are characteristic of triose, tetrose, and pentose sugars under alkaline conditions and that lead directly or indirectly to much of the complexity of the formose product mixture. Eschenmoser’s synthesis would provide a first step in a plausible route to the nucleotides if ribose-2-4-diphosphate could be converted to a 5-phosphate or a 1-5-diphosphate.
The reactions described above occur in solution only at high pHs and with high concentrations of the reactants. However, certain layer hydroxides such as magnesium aluminium hydroxide are powerful catalysts for the reaction. Negatively charged organic phosphates are absorbed so strongly between the positively charged metal- hydroxide layers that they can be concentrated from very dilute solution. Furthermore, once in the environment between the metal- hydroxide layers, they react rapidly to form sugar phosphates even if the pH of the external solution is close to 7 (Pitsch et al., 1995). Since layer hydroxides are abundant minerals, this version of Eschenmoser’s synthesis may be considered as a promising prebiotic reaction. However, it is less specific than the solution reaction for the production of ribose-2-4-diphosphate.
We conclude that some progress has been made in the search for an efficient and specific prebiotic synthesis of ribose and its phosphates. However, in every scenario, there are still a number of obstacles to the completion of a synthesis that yields significant amounts of sufficiently pure ribose in a form that could readily be incorporated into nucleotides.
Purine Synthesis
In a series of seminal papers published in the 1950s, Juan Oro and his coworkers showed that adenine is produced in appreciable yield by refluxing a solution of ammonium cyanide, and that 4-amino- 5-cyanoimidazole (II) is an intermediate in the synthesis (Oro & Kimball, 1960, 1961, 1962; Oro, 1961a). This and closely related reactions have been investigated repeatedly under different reaction conditions, and the products have been analyzed using improved analytical techniques. In addition to adenine, small amounts of guanine have been detected among the products of HCN polymerization (Miyakawa et al., 2002a, 2002b). In a particularly striking experiment, adenine has been obtained in 20% yield by heating HCN with liquid ammonia in a sealed tube (Wakamatsu et al., 1966). Here we can only review the literature on HCN polymerization that is most relevant to prebiotic chemistry.
The first reasonably stable product of the polymerization of HCN in aqueous solution is the HCN tetramer, diaminomaleodinitrile (I). Subsequent steps in the polymerization are complex and are not well understood. The tetramer, once formed, initiates a further \polymerization reaction that leads to the precipitation of a dark intractable solid from which adenine, guanine, and numerous other mostly uncharacterized compounds can be released by hydrolysis with acids or bases. In some experiments a small quantity of adenine is also present in the solution phase (Miyakawa et al., 2002a, 2002b). Very little is known about the structure of the insoluble polymer or about the way in which adenine is incorporated into it. While some adenine may be released directly from the solid on hydrolysis by acid, much of it is released initially as adenine-8carboxamide and related compounds (Voet & Schwartz, 1983).
Several reactions that might contribute to the synthesis of adenine from HCN via the HCN tetramer (Figure 2) have been studied (Ferris & Orgel, 1965, 1966a; Sanchez et al., 1967, 1968). It has been shown that formamidine can be formed by the addition of ammonia to HCN, and that formamidine reacts with the HCN tetramer to give 4- amino-5-cyano-imidazole (AICN) (Figure 2b), which, in turn, reacts with a second molecule of formamidine to yield adenine. In other experiments it has been shown that HCN adds to AICN in aqueous solution to give adenine directly (Figure 2c). Heating HCN tetramer or AICN with ammonium formate-the hydrolysis product of HCN-in the solid state is another way of obtaining adenine (Zubay & Mui, 2001; Hill & Orgel, 2002). Adenine has also been obtained directly by heating formamide, a synthesis that may involve HCN as an intermediate (Saladino et al., 2001). However, none of the reactions has been proven to contribute directly to adenine synthesis under the conditions employed by Oro and his coworkers.
FIG. 2. Steps in possible prebiotic syntheses of adenine from HCN. (a) The formation of the HCN tetramer. (b) The conversion of HCN tetramer to AICN. (c) The formation of purines from AICN or from its hydrolysis product 4-amino-imidazole-5-carboxamide (III).
AICN and its hydrolysis product 4-amino-imidazole-5carboxamide (III) are readily converted to hypoxanthine and a variety of 2, 6- disubstituted purines in aqueous solution by reaction with simple one-carbon molecules (Figure 2c) (Sanchez et al., 1968). Thus AICN and the related carboxamide, if they could be obtained under prebiotic conditions, would offer very plausible routes to the purines that are important in biochemistry. We must therefore review attempts to obtain HCN tetramer and to convert it to AICN under plausibly prebiotic conditions.
Detailed kinetic studies show that hydrolysis of HCN to formamide and ultimately ammonium formate competes very effectively with tetramer synthesis if the HCN concentration falls below 10^sup -1^ to 10^sup -2^ M (Sanchez et al., 1967). It would have been impossible to reach such a high concentration of HCN in the bulk oceans, while evaporation of lakes or tide pools could not adequately concentrate a molecule as volatile as HCN. The most plausible prebiotic method for concentrating HCN is by “eutectic freezing” (Sanchez et al., 1966a). If a dilute aqueous solution of HCN is cooled below 0C, pure ice crystallizes out and the solution becomes more concentrated until a eutectic is obtained at -23.4C that contains 74.5 (moles)% of HCN. This very concentrated solution slowly deposits a typical dark HCN polymer. Schwartz and his coworkers showed that adenine could be obtained in 0.004% yield by hydrolysis of this polymer or in 0.02% yield if glycolonitrile was added to the reaction mixture before freezing (Schwartz et al., 1982). Miller and his coworkers have obtained very similar results (Miyakawa et al, 2002a, 2002b). At present, the most plausible routes to adenine from HCN involve an initial synthesis of HCN tetramer in solutions concentrated by freezing.
The route from HCN tetramer to AICN by reaction with formamidine (Figure 2b) is somewhat problematical, since the formation of formamidine would require a high concentration of ammonia, and it is questionable that ammonia was ever present on the primitive Earth in significant amounts. The photochemical isomerization of the HCN tetramer (Figure 2b) provides an alternative route to AICN (Ferris & Orgel, 1966b). The reaction occurs readily in sunlight and gives almost quantitative yields of AICN. The combination of HCN tetramer formation in eutectic solution (Figure 2a), photoisomerization of tetramer to AICN (Figure 2b), and reaction of AICN with HCN to give adenine (Figure 2c) constitutes a potentially prebiotic synthesis of adenine that is independent of ammonia.
Completely different schemes for the accumulation of adenine and other purines on the primitive Earth have been discussed. Miyakama and his coworkers suggested that purines were formed in the atmosphere but by mechanisms that are independent of HCN (Miyakawa et al., 2000). Substantial amounts of adenine have been found in carbonaceous chondrites, so it has been suggested that purines were formed elsewhere in the solar system, perhaps by Oro-type chemistry, and brought to the Earth in meteorites (Oro, 1961b; Chyba & Sagan, 1992).
Pyrimidine Synthesis
Most of the published work on prebiotic pyrimidine synthesis is concerned with a series of closely related reactions between cyanoacetylene (IV) or its hydrolysis product, cyanoacetaldehyde (V), and cyanate ions, cyanogen or urea (Figure 3) (Ferris et al., 1968, 1974; Robertson & Miller, 1995a, 1995b; Nelson et al., 2001). The product, cytosine, is obtained in good yield in several of these reactions. Since cyanoacetylene is a major product formed when an electric discharge is passed through a mixture of nitrogen and methane (Sanchez et al., 1966b) and hydrolyses readily to cyanoacetaldehyde (Ferris et al., 1968), these two molecules have been claimed to be potentially prebiotic (Orgel, 2002). Uracil is formed from cytosine by hydrolysis, and this has been proposed as a prebiotic synthesis.
The highest yields of cytosine, up to 50%, are obtained when cyanoacetaldehyde is incubated with a saturated solution of urea (Robertson & Miller, 1995a, 1995b). However, the so-called drying lagoon model, which postulates that lagoons of saturated urea existed on the primitive Earth does not seem plausible (Shapiro, 1999, 2002). More modest yields of cytosine (about 5%) are obtained when cyanoacetylene reacts with 1.0 M cyanate or when either cyanoacetylene or cyanoacetaldehyde reacts with 1.0 M urea (Ferris et al., 1968; Robertson & Miller, 1995a, 1995b). The reactions of cyanoacetylene or cyanoacetaldehyde with urea or of cyanoacetylene with cyanate in eutectic solution seem the most plausible prebiotic routes to cytosine. Syntheses of this kind are particularly attractive because they could proceed in parallel with the synthesis of adenine from HCN (Orgel, 2004).
Nucleoside Synthesis
The synthesis of nucleosides from ribose and the nucleoside bases is the weakest link in the chain of prebiotic reactions leading to oligonucleotides. If D-ribose is heated directly with hypoxanthine in the presence either of magnesium chloride or of the mixture of inorganic salts present in seawater, up to 8% of authentic [beta]-D- inosine is formed, along with a somewhat smaller amount of the [alpha]-isomer (Fuller et al., 1972). The product mixture formed from adenine under the same conditions is more complex, since the major reaction occurs at the amino-group of the base, and nucleoside formation is a relatively minor side reaction. However, hydrolysis of the reaction products under relatively mild conditions leaves behind a small yield of a mixture of adenosine isomers including up to 3% of [beta]-Dadenosine.
FIG. 3. Steps in proposed prebiotic syntheses of cytosine. (a) The hydrolysis of cyanoacetylene to cyanoacetaldehyde. (b) The reaction between cyanoacetylene and two molecules of cyanic acid. (c) The condensation of cyanoacetaldehyde with urea.
FIG. 4. (a) The synthesis of cytosine arabinoside from arabinose, cyanamide, and cyanoacetylene. (b) Modification of the mode of hydrolysis of cyclocytidine caused by the presence of a neighboring phosphate group.
No direct synthesis of pyrimidine nucleosides from ribose and uracil or cytosine has been reported. An indirect synthesis in which [alpha]-cytidine is first obtained from ribose, cyanamide, and cyanoacetylene in aqueous solution has been described (Sanchez & Orgel, 1970). The corresponding reaction of arabinose yields [beta]- cytosine arabinoside (VI) (Figure 4a). If ribose is replaced by ribose-5-phosphate in this reaction sequence [alpha]-cytidine5′- phosphate is obtained in up to 40% yield. It is possible to photo- anomerize [alpha]-cytidine to [beta]-cytidine but only in 5% yield (Sanchez & Orgel, 1970). A similar reaction occurs with a [beta]- cytidine-5′-phosphate.
Nagivary prepared the 3′-phosphate of cyclo-cytidine (VII) by standard laboratory synthesis and showed that it hydrolyzes in water, via a cyclic phosphate, to give a good yield of cytidine 2′(3′)-phosphate (Figure 4b) along with a smaller amount of cytosine arabinoside-3′-phosphate. He speculated that these two reactions together might provide a prebiotic route to cytidine-3′-phosphate (Tapiero & Nagyvary, 1971). More recently, Sutherland and his coworkers have succeeded in combining the two reaction sequences, thus obtaining cytidine-3′-phosphate directly, in aqueous solution, from arabinose-3-phosphate, cyanamide and cyanoacetylene (Ingar et al., 2003). The source of arabinose 3-phosphate on the primitive Earth is unclear, but this work nonetheless suggests that the prebiotic synthesis of pyrimidine nucleotides may be possible.
One mode of biosynthesis of nucleosides or nucleotides involves the displacement of phosphate from [alpha]-ribose1-phosphate or of pyrophosphate from [alpha]-ribose-5phosphate-1-pyrophosphate by a nucleoside base. This seems to provide a promising approach to the corresponding prebiotic syntheses. A few preliminary and unpublished experiments attempted inour laboratory using ribose 1-phosphate were unsuccessful, but preliminary experiments reported by Zubay and his coworkers suggest that a synthesis from ribose-5-phosphate-1- pyrophosphate may be possible (Zubay & Mui, 2001).
Phosphorylation of Nucleosides
Inorganic phosphates or polyphosphates are the most plausible source of phosphate for prebiotic synthesis, although reduced forms of phosphorus have occasionally been considered in this context (Schwartz, 1997; Peyser & Ferris, 2001). Only orthophosphates are abundant in rocks and minerals, principally insoluble calcium phosphates, but there is evidence that condensed phosphates are products of volcanism (Yamagata et al, 1991). A number of approaches to prebiotic synthesis using inorganic orthophosphates or polyphosphates have been explored.
Many of the earliest attempts to phosphorylate nucleosides utilized organic condensing agents such as cyanamide, cyanamide dimer, or cyanate. These reactions are prebiotic equivalents of the much-used phosphorylation protocols that employ carbodiimides as activating agents in organic solvents. Unfortunately, such reactions are usually inefficient in aqueous solution because of the competition of water for the activated phosphate intermediate (Lohrmann & Orgel, 1968). Appreciable yields of cyclic phosphates can occasionally be obtained, for example, from monophosphates of cis glycols, because the cis hydroxyl group can compete efficiently with water for the activated phosphate moiety. Nucleoside-2′- or 3′- phosphates sometimes give nucleoside-2′- or 3′-cyclic phosphates in good yield in this way. More recently it has been shown that AMP can be converted to ADP and ATP by cyanate in the presence of insoluble calcium phosphates (Yamagata, 1999).
Nucleosides, like other alcohols, can be phosphorylated by heating in the solid state with acidic phosphates such as NaH^sub 2^PO^sub 4^. These reactions require fairly high temperatures and are not very efficient (Beck et al., 1967). However, urea, and to a lesser extent amides such as acetamide, catalyze the reaction, particularly if ammonium phosphate is used as the inorganic component. Presumably, ammonium phosphate is particularly effective in this reaction because it loses ammonia on heating and thus generates a very acidic environment. In the absence of an organic component, heating ammonium phosphate with urea yields a mixture of high molecular-weight linear polyphosphates (Lundstrom & Whittaker, 1937; Osterberg & Orgel, 1972).
Nucleosides can be converted to a complex mixture of products containing one or more phosphate groups in excellent yield by heating at moderate temperatures with ammonium phosphate and urea (Lohrmann & Orgel, 1971). When uridine, for example, is heated with excess urea and ammonium phosphate at 100C, about 70% of the input is converted to a complex mixture of phosphorylated products (Figure 5). Attempts to direct this reaction to the synthesis of a particular phosphate or polyphosphate, for example, a nucleoside-5′- phosphate or 5′-triphosphate, have met with some success (Handschuh et al., 1973; Osterberg et al., 1973; Reimann & Zubay, 1999). These solid-state reactions have been studied in considerable detail, but the reaction mechanism is not known. It is unlikely to involve carbamoyl phosphate or a phosphoramidate intermediate, so acid-base catalysis seems most probable (Osterberg & Orgel, 1972).
Phosphorus is present on the Earth today almost entirely as insoluble calcium phosphates, and this is likely to have been true on the primitive Earth. The urea-catalyzed phosphorylation reaction discussed above proceeds much more slowly when ammonium phosphate is replaced by calcium phosphate. Nonetheless, yields of nucleotides as high as 20% were obtained when hydroxylapatite was heated with urea, ammonium chloride, and a nucleoside at 100C (Lohrmann & Orgel, 1971).
FIG. 5. Phosphorylation of uridine with a mixture of inorganic phosphate, ammonium chloride, ammonium bicarbonate, and urea at 100C. U, __; Up!, __ _ _ _ __; pU, __ _ _ __; Up, __ __ __ __; pUp!, __ _ __; pUp _ _ _ _ _ _; total incorporation of inorganic phosphate, __. Redrawn from Figure 6 in Lohrmann and Orgel (1971).
We have seen that the heating of ammonium phosphates with a catalyst such as urea leads to the formation of a mixture of linear polyphosphates. These compounds can also be formed by heating other acidic phosphates such as NaH^sub 2^PO^sub 4^ strongly and, perhaps for this reason, are produced in volcanoes (Yamagata et al., 1991). While linear polyphosphates have never been shown to be good phosphorylating agents in aqueous solution, they are converted readily to cyclic triphosphates, the trimetaphosphates, under alkaline conditions in the presence of divalent metal ions. The cyclic trimetaphosphates are important as potentially prebiotic phosphorylating agents.
Cis glycols react with trimetaphosphate in strongly alkaline solution to yield cyclic phosphates that subsequently hydrolyze readily to a mixture of monophosphates. Nucleosides yield 2′,3′- cyclic phosphates that hydroyze to a mixture of 2′- and 3′- phosphates, often in excellent yield (Schwartz, 1969; Saffhill, 1970; Tsuhako et al., 1984). The same reaction occurs less efficiently under less alkaline conditions in the presence of Mg^sup ++^ (Yamagata et al., 1995) or in a cycle of wetting and drying reactions (Cheng et al., 2002). Monohydric alcohols react only under highly alkaline conditions to yield triphosphates, for example, a mixture of thymidine-3′-triphosphate, thymidine-5′-triphosphate, and a small amount of thymidine 3′-5′-bis triphosphate from thymidine (Etaix & Orgel, 1978).
FIG. 6. The phosphorylation of [alpha]-hydroxyaldehydes by the amidotriphosphate anion.
Amines react very readily with trimetaphosphate to give N- triphosphates. An ingenious modification of this reaction provides a simple procedure for phosphorylating 2-hydroxyaldehydes (Krishnamurthy et al., 2000). The reaction of ammonia with the trimetaphosphate anion yields an amidotriphosphate that can form a Schiff’s base with the carbonyl group of the hydroxyaldehyde. Next the 2-hydroxyl group attacks the [alpha]-phosphate, expelling pyrophosphate and forming a cyclic phosphoramidate. Finally the cyclic phosphoramidate undergoes hydrolysis to yield the 2- phosphate of the aldehyde (Figure 6). This reaction sequence is of considerable interest for prebiotic chemistry because glycolaldehyde phosphate and glyceraldehydes-2-phosphate are important, potentially prebiotic molecules.
While most effort has been devoted to the phosphorylation of preformed nucleosides, the direct phosphorylation of ribose is also interesting because the reaction of a-ribofuranose-1-phosphate with nucleoside bases is a plausible route to the nucleosides. Ribose can be phosphorylated directly with inorganic phosphate in aqueous solution using cyanogen or cyanamide as a condensing agent (Halmann et al., 1969). The product, however, is relatively pure [beta]- ribofuranose-1 phosphate, not the desired [alpha]-isomer.
Conclusion
The inevitable conclusion of this survey of nucleotide synthesis is that there is at present no convincing, prebiotic total synthesis of any of the nucleotides. Many individual steps that might have contributed to the formation of nucleotides on the primitive Earth have been demonstrated, but few of the reactions give high yields of products, and those that do tend to produce complex mixtures of products. It should also be realized that any prebiotic synthesis of a nucleotide would yield a racemic product, not the biologically important D-nucleotide. Recent publications, particularly those of Zubay and his coworkers (cited above), suggest that the search for a convincing prebiotic synthesis of the nucleotides is not hopeless. However, the difficulties remain so severe that alternatives to the de novo appearance of RNA on the primitive Earth deserve serious consideration. The succeeding sections of this review, in addition to discussing possible routes to RNA from a hypothetical source of prebiotic nucleotides, will also consider other ways in which the RNA World could have appeared.
INPUT IN METEORITES OR COMETS
A substantial proportion of the meteorites that presently fall on the Earth belong to a class known as carbonaceous chondrites. These interesting stones may contain more than 3% of carbon, much of it organic. Numerous detailed chemical analyses of carbonaceous chondrites have led to the identification of a very large number of hydrocarbons, carboxylic acids, amino acids, hydroxy acids, sulfonic acids, phosphonic acids, poly-hydroxy compounds (Cooper et al., 2001), etc. Very little is known in detail about the mechanisms that produced the various classes of molecules. It is thought that primary processes in the interstellar or stellar medium produced organic precursors that were incorporated into the parent bodies of the meteorites. Then further processing on the parent bodies modified and extended the inventory of organics. The details of this complex subject are beyond the scope of the present review. Comprehensive accounts of all aspects of this complicated chemistry are available (Anders, 1989; Chyba & Sagan, 1992; Cronin & Chang, 1993; Botta & Bada, 2002). The presence of adenine and the other nucleoside bases in the carbonaceous chondrites at a level of about one part per million, however they may have formed, is obviously relevant to discussions of the origin of the RNA World.
There can be no doubt that material analogous to that making up the carbonaceous chondrites contribute substantially to the formation of the Earth, and that the influx of meteoritic material was much greater during the early history of the Earth than it is today. The extent to which the organic material present in meteorites could have survived passage through the atmosphere and impact on the surface of the Earth is unclear. It i\s also difficult to estimate the importance of micrometeorites and comets (Goo, 1961), which are thought by some to have brought in the bulk of the organic material that was present on the early Earth. Despite the uncertainties, many scientists believe that meteorites, comets, and interplanetary dust particles were the major source of organic material for the origin of life. If so, life must have originated in a mixture of organic molecules different from those obtained by Miller/Urey chemistry but at least as complicated.
SYNTHESIS IN THE DEEP SEA VENTS
Wachtershauser initiated a novel approach to the problem of the origin of life (Wachtershauser, 1988). He proposed that the original source of organic material for the origin of life was provided by the reduction of carbon dioxide using hydrogen sulfide (H^sub 2^S) over ferrous sulfide (FeS) as the reducing agent. He further proposed that the products of this reaction never entered free aqueous solution but set up a complex “metabolism” while confined to the surface on which they were synthesized. In more detail, he noted that the conversion of FeS to FeS^sub 2^, pyrite, by H^sub 2^S could provide the reducing power needed to produce organic material from CO2. The self-organization of the reductive citric acid cycle on an iron sulfide surface without the help of enzymes or other informational molecules was also a central part of his scheme. Chemistry of this kind could most plausibly occur in the deep-sea vents, where superheated water containing dissolved H^sub 2^S and transition-metal Sulfides is mixed suddenly with a large excess of cold seawater, causing sulfides, including large amounts of FeS, to precipitate. Scenarios for the origin of life that involve this or related hydrothermal chemistry are now popular. As far as I am aware, these scenarios have not been shown to be directly relevant to the origin of the RNA World, but they may be relevant to the origin of membrane-forming organic material.
Wachtershauser’s imaginative suggestion that the formation of FeS^sub 2^ from H^sub 2^S and FeS could drive reduction has been amply confirmed in his collaborations with Stetter and his coworkers. They have, for example, demonstrated the reduction of acetylene and mercaptans by H^sub 2^S over FeS (Blochl et al., 1992). Most impressively, it has recently been reported that freshly precipitated ferrous sulfide in the presence of H^sub 2^S is able to reduce molecular nitrogen to ammonia (Dorr et al., 2003). Thus, the reaction of FeS with H^sub 2^S to form pyrite provides the driving force for many reductions, including the very difficult reduction of N^sub 2^ to ammonia in just the way predicted by Wachtershauser.
The reduction of CO2 has not been achieved, but an interesting alternative synthesis, the reaction of carbon monoxide with methyl mercaptan over a mixed iron/nickel sulfide to give an activated derivative of acetic acid, probably the methyl thioester, has been described (Huber & Wachtershauser, 1997). In another paper, Huber and Wachtershauser report that peptides are formed from amino acids by activation with carbon monoxide in the presence of Fe/Ni S and H^sub 2^S or methyl mercaptan (Huber & Wachtershauser, 1998). They suggest that carbonyl sulfide may be an intermediate in this reaction, its hydrolysis to carbon dioxide and hydrogen sulfide in the presence of transition-metal sulfides providing the free energy needed to drive peptide synthesis. The N-carboxy anhydrides of the amino acids are probably intermediates in the reaction sequence.
Cody and his coworkers have reported another potentially important synthesis. They were able to obtain pyruvic acid from formic acid, a source of carbon monoxide, in the presence of nonylmercaptan and FeS (Cody et al., 2000). The reaction was carried out at 250C and a high pressure, without the addition of water in excess of that generated by the decomposition of the formic acid. The synthesis of a significant yield of pyruvic acid, a relatively unstable molecule at such a high temperature, is unexpected. It is not clear whether nonylmercaptan has an essential role in the reaction or if it could be omitted or replaced by a lower molecular weight thiol such as methyl mercaptan, nor is it known whether the synthesis occurs entirely on the surface of FeS or if a volatile metal carbonyl is involved.
In addition to these recent studies, earlier work on the Fischer- Tropsch reaction, which leads to the production of long straight- chain hydrocarbons and their oxygenated derivatives from carbon monoxide and hydrogen in the presence of a suitable catalyst, may well be relevant to the chemistry of the deep-sea vents. The conditions and catalysts typically employed in the chemical industry are unlikely to have been present on the primitive Earth, but there is some evidence that similar results can be obtained with plausible prebiotic catalysts (Zolotov et al., 2001). The production of straight-chain fatty acids might provide a source of membrane- forming organic material (see below).
In the light of all of the experimental evidence, the deep-sea vents must now be added to the list of plausible sources of some or all of the organic material needed for the origin of life. However, it seems unlikely that nucleotides could have formed directly in the vents, so Wachtershauser’s theory is best suited to a scenario in which some other genetic polymer preceded RNA (see below).
In the above discussion I have treated reactions that occur on metal sulfides as prebiotic syntheses that support a novel scenario for the formation of the organic substrates that contributed to the origin of life. Wachtershauser insists that they are more than that and that they support his theory of a complex surface metabolism in which the reductive citric acid cycle or something like it self- organized on the surface of a metal sulfide. I am unaware of any experimental evidence for such self-organization, and I have argued elsewhere that on theoretical grounds it is exceedingly unlikely that such self-organization could occur (Orgel, 2000). If life originated in the deep-sea vents, I think it more likely that the vents provided the components of a relatively simple informational polymer. Unfortunately, the experimental study of hydrothermal reactions under anaerobic conditions requires the use of specialized equipment that is available in only a limited number of laboratories, so it may be a substantial time before the scope of hydrothermal synthesis on the surfaces of transition metal sulfides is adequately explored.
POLYMERIZATION OF ACTIVATED NUCLEOTIDES
The polymerization of nucleotides in aqueous solution is an uphill reaction and does not occur spontaneously to a significant extent. Evaporation of acidic solutions of nucleotides and subsequent heating leads to the formation of complex mixtures of very short oligonucleotides, in which 2′-5′-, or 3′-5′- phosphodiester linkages occur more or less at random (Moravek, 1967). Consequently, attempts to polymerize nucleotides from aqueous solution must necessarily make use of external activating agents. Attempts along these lines using cyanamide and similar activating agents or water-soluble carbodiimides have been disappointing, at best leading to poor yields of dinucleotides and very short oligonucleotides.
Polymerization of preactivated nucleotides has met with greater success. Unfortunately, nucleoside-5′-polyphosphates react so slowly in aqueous solution at moderate temperatures and pHs that their polymerization cannot easily be studied in the laboratory. Instead, nucleotides activated as phosphoramidates, usually phosphorimidazolides, have been used as substrates in most experiments (Figure 7a). They can be obtained in fairly good yield from nucleoside-5′-polyphosphates and amines or imidazoles and have, therefore, been claimed to be prebiotic (Lohrmann, 1977). However, it is unclear that phosphorimidazolides could have occurred in large amounts on the primitive Earth, so these experiments form only a rough guide to the classes of reaction that might have been relevant to chemical evolution. The source of the free energy needed to drive the uphill polymerization of nucleotides is unclear. One plausible suggestion involves the initial formation of nucleoside polyphosphates from nucleosides and inorganic phosphate, as described above. Then the nucleoside polyphosphates could have polymerized slowly or they could have been converted to phosphoramidates that polymerized more rapidly.
FIG. 7. (a) The structures of some activated nucleotides used in the prebiotic synthesis of oligonucleotides. (b) Oversimplified diagram showing activated mononucleotides aligned on a d(CCCGCCCGCCCGCC) template, (c) Electrophoresis of the products formed from activated monomers aligned on the d(CCCGCCCGCCCGCC) template shown in (b) above. The composition of the oligomers corresponding to major products is indicated. Degradation with specific RNAases showed that each major product is complementary to a subsequence of the template.
Nucleoside 5′-phosphorimidazolides oligomerize in aqueous solution, but in the absence of a catalyst yield only a complex mixture of short linear and cyclic products. A number of metal ions are effective catalysts for this class of polymerization (Sawai & Orgel, 1975; Sawai, 1976), in particular Pb^sup ++^ (Sawai, 1976; Sleeper & Orgel, 1979). Long homo- and hetero-oligomers are obtained using this metal ion when the reaction is carried out in eutectic solution (Kanavarioti et al., 2001; Monnard et al., 2003). The uranyl ion is another very efficient catalyst in aqueous solution, leading to the synthesis of oligomers up to at least lomers. These oligonucleotide products are predominantly 2′-5′-linked (Sawai et al., 1989, 1992).
James Ferris and his coworkers have investigated in considerable detail a remarkable series of reactions in which an abundant clay mineral,montmorillonite, catalyzes the synthesis of long oligonucleotides from relatively dilute solutions of nucleoside phosphoramidates including the 5′ phosphorimidazolides (Ferris et al., 2003). In some of their more recent experiments using a phosphoramidate based on 1-methyladenine (X; Figure 7a), they were able to identify oligomers up to 40 residues long (Huang & Ferris, 2003) with about 80% of 3′-5′- phosphodiester linkages (Prabahar & Ferris, 1997). This is one of the most impressive examples of the catalysis of a prebiotic reaction by a mineral that has been reported.
Montmorillonite is a clay with a layered aluminosilicate structure that readily expands to permit large molecules to enter the interlayers, and it is probable that new phosphodiester bonds are formed when the substrates are absorbed in the interlayers (Ertem & Ferris, 1998). An extensive body of experimental work suggests that there are specific catalytic sites somewhere on or in the mineral that can be blocked by adsorbed but unreactive analogs of the substrates, for example dimethyladenine (Wang & Ferris, 2001).
The regiospecificity of the polymerization is very dependent on the nature of the nucleoside base and the activating group. In some cases it can favor the formation of 3′-5′- over 2′-5′- phosphodiester linkages by as much as a factor of four (Prabahar & Ferris, 1997), but in other cases the regiospecificity is modest or favors the formation of 2′-5′-phosphodiester bonds (Ertem & Ferris, 1997). The enantiospecificty of the montmorillonite-catalyzed reaction is limited (Joshi et al., 2000). Clearly, this impressive example of mineral-based catalysis is efficient but not particularly specific with respect to the distribution of isomeric products. Nonetheless, products of the montmorillonite-catalyzed oligomerization of ImpC (VIII) can act as templates for the oligomerization of ImpG (VIII) (Ertem & Ferris, 1997).
In summary, two procedures for the efficient oligomerization of nucleoside-5′-phosphorimidazolides and related phosphoramidates have been reported, montmorillonite catalysis and metal ion catalysis by Pb^sup ++^ or uranyl ions. These are only indirect models of prebiotic synthesis, since the phosphoramidate substrates of the polymerizations are not likely to be prebiotic molecules.
TEMPLATE-DIRECTED SYNTHESIS
The replication of RNA without the help of protein enzymes is an essential feature of the RNA World hypothesis. It seems probable, but by no means certain, that the synthesis of complementary RNA on a preformed RNA template, without the help of any informational catalyst, played a part in the origin of the RNA World (Orgel, 2003). We will review experiments on nonenzymatic template-directed synthesis here. Later we will discuss briefly ribozymecatalyzed template-directed synthesis, which is likely to have evolved in the RNA World before the “invention” of protein synthesis.
The basic principle of template-directed synthesis is a simple one (Figure 7b). If a polynucleotide is incubated with an appropriate mixture of complementary mononucleotides or short oligonucleotides at a sufficiently low temperature, in many cases a double- or triple-helical complex is formed (Howard et al., 1966). These complexes are structurally similar to double- or triple- stranded nucleic acids, but one chain is interrupted. Thus the template brings complementary activated monomers or short oligomers together and facilitates their ligation. Whenever a complementary complex is formed, the template influences the ligation reaction, but the devil is in the details. The ligation reaction may be more or less efficient and may lead to a greater or lesser excess of 3′- 5′-phosphodiester linkages over 2′-5′-linkages. The objective of most research programs in this field has been to find conditions that lead to the efficient synthesis of predominantly 3′-5′-linked oligonucleotides for as broad a range of template sequences as possible.
The first experiments on template-directed synthesis of oligonucleotides were reported by Naylor and Gilham who showed that a dA^sub 12^ template catalyzed the synthesis of T^sub 12^ from two T^sub 6^ molecules when a water-soluble carbodiimide was used as the condensing agent (Naylor & Gilham, 1966). Zoe Shabarova and her coworkers have reported a large number of related experiments using different deoxy-templates, deoxy-substrates, and activating agents (Shabarova, 1988). We will not discuss this interesting and extensive literature because it is unlikely to be relevant to the origin of the RNA World. Experiments on RNA synthesis necessarily used activated ribonucleotide monomers or short oligomers as substrates, but often used DNA templates when RNA heteropolymers were not easily available. Water-soluble carbodiimides were initially used as activating agents (Sulston et al., 1968a, 1968b, 1969). In these experiments complex mixtures of isomeric short oligonucleotide products were obtained.
More recent work in the area of template-directed synthesis from mononucleotides has employed preactivated substrates, phosphorimidazolides, or closely related phosphoramidates (Figure 7a). The extensive and detailed literature on this topic has been reviewed (Joyce, 1987; Orgel, 1992; Kozlov & Orgel, 2000), so only the main conclusions will be presented here. The cited reviews should be consulted for experimental procedures and quantitative results. Nucleoside-5′-triphosphates, which might seem an obvious choice of substrates in experiments with nucleotide monomers, cannot easily be used in laboratory experiments because they react too slowly at temperatures below the melting points of the helical complexes that they form. Studies of the spontaneous template- directed ligation of oligonucleotides terminated by a 5′- triphosphate group, however, have led to interesting conclusions, as we will see.
The first efficient and regiospecific polymerization reactions to be reported were the syntheses of long oligoguanylic acids (oligoGs) on poly(C) templates using guanosine-5′-phosphorimidazolide (VIII; ImpG) as the activated monomer. The reaction showed a remarkably specific dependence on divalent metal ions. In the presence of Mg^sup ++^ and Pb^sup ++^, the products were almost exclusively 2′- 5′-linked (Lohrmann & Orgel, 1980), while in the presence of Mg^sup ++^ and Zn^sup ++^ ions virtually pure 3′-5′-linked oligomers were formed (Bridson & Orgel, 1980). If neither Pb^sup ++^ nor Zn^sup ++^ was present, no long oligomers were obtained. This reaction was restricted to the synthesis of oligo(G)s on a poly(C) template and could not be extended to incorporate bases other than G on homo- or hetero-oligonucleotide templates.
It was found, surprisingly, that when 2-methyl imidazole replaced imidazole in the activated nucleotide (IX; to give 2-MeImpG) the synthesis of long oligo(G)s on a poly(C) template no longer required the presence of any metal ion other than Mg^sup ++^. In this context, 2-Meimidazole seems unique since neither imidazole nor its 2-ethyl derivative can substitute for it. Furthermore, the product oligomers formed from 2-MeImpG are almost exclusively 3′-5′-linked (Inoue & Orgel, 1981). One further advantage of using the 2-methyl imidazolides is that it allows the copying of heteropolymers containing all four bases, but only if the template contains at least 60% of C residues (Joyce, 1987). This latter restriction rules out the possibility of repeated rounds of replication, since the product of a successful template-directed oligomerization contains at most 40% of C residues and cannot, therefore, act efficiently as a template.
There is now substantial evidence that the template-directed oligomerization of 2-Me-imidazole derivatives proceeds best in double helices that adopt the A-form nucleic acid structure (Kurz et al., 1997, 1998; Kozlov et al., 1999b, 2000b). RNA sequences are superior to DNA sequences as templates, but the general features of the reactions are similar for RNA and DNA templates (Zielinski et al., 2000). Nucleic acid analogs that tend to preorganize in the A- form DNA structure are usually excellent templates (Kozlov et al., 1999b, 2000b).
The sequence dependence of the reactions has been studied extensively using hairpin oligodeoxynucleotides as substrates (Wu & Orgel, 1992a, 1992b, 1992c; Hill et al., 1993). Incorporation of G opposite C in the template is most efficient, while incorporation of U opposite A is least efficient. Incorporation of A opposite U or of C opposite G is of intermediate efficiency. A pair of adjacent A residues in the template is an almost complete barrier to further synthesis. The fidelity of these reactions is usually very good, but with one notable exception: wobble pairing of G opposite U leads to extensive misincorporation of G, particularly on some RNA templates. The results of a long series of detailed studies show, therefore, that a wide variety of DNA or RNA sequences can be copied, but replication is not possible in this system (Joyce, 1987). The results of copying an especially favorable sequence, deoxy(CCGCCCGCCCGCCC), are shown in Figure 7c (Acevedo & Orgel, 1987).
Any prebiotic synthesis that yields ribonucleotides would produce racemates. Unfortunately, the L-enantiomers of activated nucleotides are efficient inhibitors of template-directed synthesis using the naturally occuring D-enantiomers (Joyce et al., 1984). This difficulty, often described as enantiomeric cross-inhibition, is not easily overcome without making substantial changes to the nature of the backbone of the template (Kozlov et al., 1999a). This is a major obstacle to any scheme for polynucleotide replication from plausibly prebiotic, monomeric substrates.
The ligation of short oligonucleotides activated as phosphorimidazolides has been studied less extensively. When very short oligomers activated as phosphorimidazolides are used as substrates, the efficiency and regiosp\ecificity of the reaction depends very strongly and unpredictably on the sequences of the substrates (Ninio & Orgel, 1978). However, with somewhat longer 3′,- 5′-linked sequences ligation is efficient and yields predominantly 3′-5′-linked products (Rohatgi et al., 1996b). The higher melting points of helices formed by oligomers, coupled with the possibility of using phosphorus-labeled substrates of high specific activity, has made possible a detailed study of the ligation of substrates activated as 5′-triphosphates (Rohatgi et al., 1996a, 1996b). Encouragingly, the ligation of 5′-triphosphates, a close analog of enzymatic ligation, yields almost exclusively 3′-5′-linked products. In the context of prebiotic chemistry, 3′-5′-linked oligonucleotides are superior to mononucleotides as substrates with respect to regiospecificity, and they permit ligation over wider temperature ranges. However, it is not obvious that homochiral, exclusively 3′- 5′-linked oligomers are plausible prebiotic molecules (see previous section), and the fidelity of template-directed ligation of oligomers is lower than the fidelity of oligomerization of monomers.
RIBOZYMES
If one believes that the RNA World was the first organized biological world, one must postulate that a library of RNA strands with different sequences formed spontaneously on the primitive Earth and that this family of sequences included catalysts able to support self-replication of RNA. The idea of an RNA that performed some of the functions of an RNA polymerase is, therefore, an essential feature of the de novo RNA World hypothesis. It is much harder to decide which other functions would have needed to evolve in the RNA World once replication got started. Presumably catalysts for some metabolic reactions were required for the RNA World to become self- sustaining, but it is hard to guess which metabolic reactions. Instead of discussing ribozymes in general, we will review very briefly what is known about the ribozyme-catalyzed reactions that seems most relevant to the origin of the RNA World. The whole range of ribozyme-catalyzed reactions and the experimental protocols used in directed RNA-evolution experiments fall outside the scope of this article. Excellent accounts of these topics are available in two general reviews (Wilson & Szostak, 1999; Joyce, 2004).
Ribozymes that occur in living organisms, although important, are rather limited in the range of reactions that they catalyze. The initial discoveries of Cech and Altman identified ribozymes that cut and ligate naturally occurring RNA (Kruger et al., 1982; Guerrier- Takada et al., 1983). With one important exception, the ribozymes that have been discovered subsequently in nature perform the same kind of functions. There is no evidence that transcription or RNA replication involve ribozyme catalysis. The most important ribozyme in nature is the ribosome. It is now clear that the peptide-bond- forming step of protein synthesis is catalyzed by the RNA of the ribosome without direct involvement of the proteins (Steitz & Moore, 2003). This is the smoking gun that has led to the more general acceptance of the RNA World hypothesis. Unfortunately, our rapidly improving understanding of protein synthesis is unlikely to throw much light on the origin of the RNA World, since the invention of protein synthesis marked the beginning of its decline.
In vitro methods have been used to isolate RNA molecules that bind with high specificity to selected small organic molecules or to peptides and proteins (Puglisi & Williamson, 1999). In principle, it should be possible to extend this type of procedure to obtain RNAs that bind to almost any water-soluble polymer or to any structured solid surface, including the surfaces of specific minerals or rocks. Selection of RNAs that perform catalytic functions required the invention of more complicated experimental protocols but is now a routine laboratory procedure (Wilson & Szostak, 1999; Joyce, 2004). It is clear that a great variety of reactions can be catalyzed by RNA. On the one hand, an RNA that catalyzes the racemization of a sterically hindered biphenyl is a typical and interesting example of a ribozyme that brings about a reaction that has no obvious relation to biochemistry (Prudent et al., 1994). On the other, the RNA- catalyzed synthesis of 4-thiouridine-5′-phosphate from 4-thiouracil and 5-phosphoribosyl-1-pyrophosphate is typical of the type of riboenzymatic reaction that, according to the RNA-first scenario, might have evolved when the availability of prebiotic nucleotides declined (Unrau & Bartel, 1998). However, at the very beginning of the evolution of the RNA World only one function was essential, namely catalysis of RNA replication. Remarkable progress has been made in this area.
Bartel and his coworkers have been able to evolve a catalytic RNA that has many of the essential properties of an RNA polymerase (Johnston et al., 2001). This rather complicated molecule (Figure 8), when presented with an arbitrary, single-stranded RNA template, an RNA primer, and a mixture of the four nucleoside triphosphates, will synthesize the complement of the template. At present, templates that contain more than 14 residues cannot be copied effectively, but it seems likely that further in vitro evolution will lead to the isolation of a ribozyme capable of copying much longer RNAs. The isolation of such a ribozyme would constitute a major advance in attempts to understand the origin of life. However, the formidable problem of separating the double-stranded product of the copying reaction so as to permit a second round of copying would remain to be solved.
FIG. 8. A ribozyme that functions as an RNA-dependent RNA polymerase. Redrawn with modification from Robertson et al. (2001).
THE MOLECULAR BIOLOGISTS’ DREAM
The RNA-first scenar
