BSC 1010C
General Biology I
Dr. Graeme Lindbeck
glindbeck@valenciacollege.edu


From Gene to Protein

Outline

  1. The study of metabolic defects provided evidence that genes specify proteins
    1. How Genes Control Metabolism
    2. One Gene-One Polypeptide
  2. Transcription and translation are the two main steps from gene to protein
  3. In the genetic code, a particular triplet of nucleotides specifies a certain amino acid
    1. Cracking the Genetic Code
    2. The Evolutionary Significance of a Common Genetic Language
  4. Transcription is the DNA-directed synthesis of RNA
    1. RNA Polymerase Binding and Initiation of Transcription
    2. Elongation of the RNA Strand
    3. Termination of Transcription
  5. Translation is the RNA-directed synthesis of a polypeptide
    1. The Structure and Function of Transfer RNA
    2. Aminoacyl-tRNA Synthetases
    3. Ribosomes
    4. Building a Polypeptide
    5. Polyribosomes
    6. From Polypeptide to Functional Protein
  6. Some polypeptides have signal sequences that target them to specific destinations in the cell
  7. Comparing protein synthesis in prokaryotes and eukaryotes
  8. Eukaryotic cells modify RNA after transcription
    1. Alteration of MRNA Ends
    2. Split Genes and RNA Splicing
    3. Ribozymes
    4. The Functional and Evolutionary Importance of Introns
  9. A point mutation can affect the function of a protein
    1. Types of Point Mutations
    2. Mutagens

Inherited instructions in DNA direct protein synthesis. Thus, proteins are the links between genotype and phenotype, since proteins are directly involved in the expression of specific phenotypic traits.

Back to top

I. The study of metabolic defects provided evidence that genes specify proteins

Archibald Garrod was the first to propose the relationship between genes and proteins (1909).

  1. How Genes Control Metabolism
  2. Garrod's hypothesis was confirmed several decades later by research which deten-nined that specific genes direct production of specific enzymes.

    George Beadle and Edward Tatum were later able to demonstrate the relationship between genes and enzymes by studying mutants of a bread mold, Neurospora crassa.

    Auxotroph = (Auxo = to augment; troph = nourishment) Nutritional mutants that can only be grown on minimal medium augmented with nutrients not required by the wild type.

    Minimal medium = Support medium that is mixed only with molecules required for the growth of wild-type organisms.

    Complete growth medium = Minimal medium supplemented with all 20 amino acids and some other nutrients.

    Beadle and Tatum then identified specific metabolic defects (from mutations) by transferring fragments of auxotrophic mutants growing on complete growth medium to vials containing minimal medium each supplemented with only one additional nutrient.

    Experiment:

    Beadle and Tatum experimented further to more specifically describe the defect in the multistep pathway that synthesizes the amino acid arginine.

    Arginine synthesis requires three steps each catalyzed by a specific enzyme:


    Gene A
    Gene B
    Gene C

    ¯
    ¯
    ¯

    Enzyme A
    Enzyme B
    Enzyme C
    Precursor ® Ornithine ® Citrulline ® Arginine

    They distinguished between three classes of arginine auxotrophs by adding either arginine, citruiline, or ornithine to the medium and seeing if growth occurred.

    Results:

    Some mutants required arginine, some either arginine or citrulline, and others could grow when any of the three were added.


    Minimal
    Medium
    (MM)
    mm
    plus
    Ornithine
    mm
    plus
    Citrulline
    mm
    plus
    Arginine
    Wild Type
    +
    +
    +
    +
    Class I Mutants
    -
    +
    +
    +
    Class II Mutants
    -
    -
    +
    +
    Class III Mutants
    -
    -
    -
    +
    + = growth, - = no growth

    Conclusions:

    Beadle and Tatum deduced from their data that the three classes of mutants each lacked a different enzyme and were thus blocked at different steps in the arginine synthesis pathway.

  3. One Gene-One Polypeptide
  4. Beadle and Tatum's one gene-one enzyme hypothesis has been slightly modified:

    As a result of this new information, Beadle and Tatum's hypothesis has been restated as one gene-one polypeptide.

Back to top

II. Transcription and translation are the two main steps from gene to protein

Ribonucleic acid (RNA) links DNA's genetic instructions for making proteins to the process of protein synthesis. It copies or transcribes the message from DNA and then translates that message into a protein.

The linear sequence of nucleotides in DNA ultimately determines the linear sequence of amino acids in a protein.

Transcription = The synthesis of RNA using DNA as a template.

Translation = Synthesis of a polypeptide, which occurs under the direction of messenger RNA (mRNA).

Prokaryotes and eukaryotes differ in how protein synthesis is organized within their cells.

Back to top

III. In the genetic code, a particular triplet of nucleotides specifies a certain amino acid

There is not a one-to-one correspondence between the nitrogenous bases and the amino acids they specify, since there are only 4 nucleotides and 20 amino acids.

Researchers have verified that the flow of information from a gene to a protein is based on a triplet code.

Codon = A three-nucleotide sequence in mRNA that specifies which amino acid will be added to a growing polypeptide or that signals termination; the basic unit of the genetic code.

Genes are not directly translated into amino acids, but are first transcribed as codons into mRNA.

An mRNA is complementary to the DNA template from which it is transcribed.

During translation, the linear sequence of codons along mRNA is translated into the linear sequence of amino acids in a polypeptide.

  1. Cracking the Genetic Code
  2. The first codon was deciphered in 1961 by Marshall Nirenberg of the National Institutes of Health.

    More elaborate techniques allowed investigators to determine all 64 codons by the mid1960's.

    There is redundancy in the genetic code, but no ambiguity.

    The correct ordering and grouping of nucleotides is important in the molecular language of cells. This ordering is called the reading frame.

    Reading frame = The correct grouping of adjacent nucleotide triplets into codons that are in the correct sequence on mRNA.

  3. The Evolutionary Significance of a Common Genetic Language
  4. The genetic code is shared nearly universally among living organisms.

    There are some exceptions to this universality:

    The fact that the genetic code is shared nearly universally by all organisms indicates that this code was established very early in life's history.

Back to top

IV. Transcription is the DNA-directed synthesis of RNA

Transcription of messenger RNA (mRNA) from template DNA is catalyzed by RNA polymerases, which:

There are several types of RNA polymerase.

Specific DNA nucleotide sequences mark where transcription of a gene begins (initiation) and ends (termination). Initiation and termination sequences plus the nucleotides in between are called a transcription unit.

Transcription unit = Nucleotide sequence on the template strand of DNA that is transcribed into a single RNA molecule by RNA polymerase; it includes the initiation and termination sequences, as well as the nucleotides in between.

Transcription occurs in three key steps: a) polymerase binding and initiation; b) elongation; and c) termination.

  1. RNA Polymerase Binding and Initiation of Transcription
  2. RNA polymerases bind to DNA at regions called promoters.

    Promoter = Region of DNA that includes the site where RNA polymerase binds and where transcription begins (initiation site). In eukaryotes, the promoter is about 100 nucleotides long and consists of:

    1. The initiation site, where transcription begins.
    2. A few nucleotide sequences recognized by specific DNA-binding proteins (transcription factors) that help initiate transcription.

    In eukaryotes, RNA polymerases cannot recognize the promoter without the help of transcription factors.

    Transcription factors = DNA-binding proteins that bind to specific DNA nucleotide sequences at the promoter and that help RNA polymerase recognize and bind to the promoter region, so that transcription can begin.

    TATA box = A short nucleotide sequence at the promoter that is rich in thymine (T) and adenine (A) and that is located about 25 nucleotides upstream from the initiation site.

  3. Elongation of the RNA Strand
  4. Once transcription begins, RNA polymerase II moves along DNA and performs two primary functions:

    1. It untwists and opens a short segment of DNA exposing about ten nucleotide bases; one of the exposed DNA strands is the template for base-pairing with RNA nucleotides.
    2. It links incoming RNA nucleotides to the 3' end of the elongating strand; thus, RNA grows one nucleotide at a time in the 5' ® 3' direction.

    During transcription, mRNA grows about 30 - 60 nucleotides per second. As the mRNA strand elongates:

    Following in series, several molecules of RNA polymerase II can simultaneously transcribe the same gene.

  5. Termination of Transcription
  6. Transcription proceeds until RNA polymerase reaches a termination site on the DNA.

    Terminator sequence = DNA sequence that signals RNA polymerase to stop transcription and to release the RNA molecule and DNA template.

    Prokaryotic mRNA is ready for translation as soon as it leaves the DNA template. Eukaryotic mRNA, however, must be processed before it leaves the nucleus and becomes functional.

Back to top

V. Translation is the RNA-directed synthesis of a polypeptide

During translation, proteins are synthesized according to a genetic message of sequential codons along mRNA.

Molecules of tRNA are specific for only one particular amino acid. Each type of tRNA associates a distinct mRNA codon with one of the 20 amino acids used to make proteins.

Anticodon = A nucleotide triplet in tRNA that base pairs with a complementary nucleotide triplet (codon) in mRNA.

tRNAs decode the genetic message, codon by codon. For example,

  1. The Structure and Function of Transfer RNA
  2. All types of eukaryotic RNA, including tRNA, are transcribed from template DNA located within the nucleus.

    The ability of tRNA to carry specific amino acids and to recognize the correct codons depends upon its structure; its form fits function.

    The three-dimensional structure is roughly L-shaped.

    There are only about 45 distinct types of tRNA. However, this is enough to translate the 64 codons, since some tRNAs recognize two or three mRNA codons specifying the same amino acid.

    Wobble = The ability of one tRNA to recognize two or three different mRNA codons; occurs when the third base (5' end) of the tRNA anticodon has some play or wobble, so that it can hydrogen bond with more than one kind of base in the third position (3' end) of the codon.

  3. Aminoacyl-tRNA Synthetases
  4. The correct linkage between tRNA and its designated amino acid must occur before the anticodon pairs with its complementary mRNA codon. This process of correctly pairing a tRNA with its appropriate amino acid is catalyzed by an aminoacyl-tRNA synthetase.

    Aminoacyl-tRNA synthetase = A type of enzyme that catalyzes the attachment of an amino acid to its tRNA.

    1. Activation of the amino acid with AMP. The synthetase's active site binds the amino acid and ATP; the ATP loses two phosphate groups and attaches to the amino acid as AMP (adenosine monophosphate).

    2. Attachment of the amino acid to tRNA. The appropriate tRNA covalently bonds to the amino acid, displacing AMP from the enzyme's active site.

    The aminoacyl-tRNA complex releases from the enzyme and transfers its amino acid to a growing polypeptide on the ribosome.

  5. Ribosomes
  6. Ribosomes coordinate the pairing of tRNA anticodons to mRNA codons.

    The large and small subunits of eukaryotic ribosomes are:

    Compared to eukaryotic ribosomes, prokaryotic ribosomes are smaller and have a different molecular composition.

    In addition to an mRNA binding site, each ribosome has three tRNA binding (P and A).

    As the ribosome holds the tRNA and mRNA molecules together, enzymes transfer the new amino acid from its tRNA to the carboxyl end of the growing polypeptide.

  7. Building a Polypeptide
  8. The building of a polypeptide, or translation, occurs in three stages: 1) initiation, 2) elongation, and 3) termination.

    1. Initiation

    Initiation must bring together the mRNA, the first amino acid attached to its tRNA, and the two ribosomal subunits.

    An initiation complex is assembled as mRNA and a special initiator tRNA bind to a small ribosomal subunit.

    In the second step, a large ribosomal subunit binds to the small one to form a functional ribosome.

    2. Elongation

    Several proteins called elongation factors take part in this three-step cycle which adds amino acids one by one to the initial amino acid.

    1. Codon recognition. The mRNA codon in the A site of the ribosome forms hydrogen bonds with the anticodon of an entering tRNA carrying the next amino acid in the chain.
      • An elongation factor directs tRNA into the A site.
      • Hydrolysis of GTP provides energy for this step.
    2. Peptide bond formation. An enzyme, peptidyl transferase, catalyzes the formation of a peptide bond between the polypeptide in the P site and the new amino acid in the A site.
      • Peptidyl transferase is part of the large ribosomal subunit and consists of ribosomal proteins and rRNA.
      • The polypeptide separates from its tRNA and is transferred to the new amino acid carried by the tRNA in the A site.
    3. Translocation. The tRNA in the A site, which is now attached to the growing polypeptide, is translocated to the P site. Simultaneously, the tRNA that was in the P site is translocated to the E site and from there exits the ribosome.
      • During this process, the codon and anticodon remain bonded, so the mRNA and the tRNA move as a unit, bringing the next codon to be translated into the A site.
      • The mRNA is moved through the ribosome only in the 5' to 3' direction. GTP hydrolysis provides energy for each translocation step.
    4. Termination

      Each iteration of the elongation cycle takes about 60 milliseconds and is repeated until synthesis is complete and a termination codon reaches the ribosome's A site.

      Termination codon (stop codon) = Base triplet (codon) on mRNA that signals the end of translation.

      • Stop codons are UAA, UAG and UGA.
      • Stop codons do not code for amino acids.

      When a stop codon reaches the ribosome's A site, a protein release factor binds to the codon and initiates the following sequence of events:

      • Peptidyl transferase hydrolyzes the bond between the completed polypeptide and the tRNA in the P site.
      • This frees the polypeptide and tRNA, so they can both release from the ribosome.
      • The two ribosomal subunits dissociate from mRNA and separate back into a small and a large subunit.

  9. Polyribosomes
  10. Single ribosomes can make average-sized polypeptides in less than a minute; usually, however, clusters of ribosomes simultaneously translate an mRNA.

    Polyribosome = A cluster of ribosomes simultaneously translating an mRNA molecule.

  11. From Polypeptide to Functional Protein
  12. The biological activity of proteins depends upon a precise folding of the polypeptide chain into a native three-dimensional conformation.

    Some proteins must undergo post-translational modification before they become fully functional in the cell.

Back to top

VI. Some polypeptides have signal sequences that target them to specific destinations in the cell

Eukaryotic ribosomes function either free in the cytosol or bound to endomembranes.

There is only one type of ribosome, and synthesis of all proteins begins in the cytosol. What determines whether a ribosome will be free in the cytosol or attached to rough ER?

The ribosome continues protein synthesis and the leading end of the new polypeptide (N-terminus) threads into the cisternal space.

Different signal sequences may also dispatch proteins to specific sites other than the ER. For example, newly formed proteins may be targeted for mitochondria or chloroplasts.

Back to top

VII. Comparing protein synthesis in prokaryotes and eukaryotes

Transcription and translation are similar in prokaryotes and eukaryotes, but protein synthesis is organized differently within their cells.

Back to top

VIII. Eukaryotic cells modify RNA after transcription

Before eukaryotic mRNA is exported from the nucleus, it is processed in two ways: a) both ends are covalently altered and b) intervening sequences are removed and the remainder spliced together.

  1. Alteration of mRNA
  2. During mRNA processing, both the 5' and 3' ends are covalently modified.

    5' cap = Modified guanine nucleotide (guanosine triphosphate) that is added to the 5' end of mRNA shortly after transcription begins; has two important functions:

    Leader sequence = Noncoding (untranslated) sequence of mRNA from the 5' end to the start codon.

    The 3' end, which is transcribed last, is modified by enzymatic addition of a poly-A tail, before the mRNA exits the nucleus.

    Poly-A tail = Sequence of about 200 adenine nucleotides added to the 3' end of mRNA before it exits the nucleus.

    Trailer sequence = Noncoding (untranslated) sequence of mRNA from the stop codon to the poly-A tail.

  3. Split Genes and RNA Splicing
  4. The original RNA transcript accurately reflects the complementary base sequence of the gene in template DNA; however, it is much longer than the mRNA that functions in the cytoplasm.

    heteogeous nuclear RNA (hnRNA) = Pool of RNA in the nucleus that contains molecules of widely varied sizes; includes primary mRNA transcripts or precursor mRNA (pre-mRNA).

    Genes that code for proteins in eukaryotes are not continuous sequences.

    Introns = Noncoding sequences in DNA that intervene between coding sequences (exons); are initially transcribed, but not translated, because they are excised from the transcript before mature RNA leaves the nucleus.

    Exons = Coding sequences of a gene that are transcribed and expressed.

    Introns and exons are both transcribed to form hnRNA, but the introns are subsequently removed and the remaining exons linked together during the process of RNA splicing.

    RNA splicing = RNA processing that removes introns and joins exons from eukaryotic hnRNA; produces mature mRNA that will move into the cytoplasm from the nucleus.

    Though there is much left to be discovered, some details of RNA splicing are now known.

    Small nuclear ribonucleoproteins (snRNPs) = Complexes of proteins and small nuclear RNAs that are found only in the nucleus; some participate in RNA splicing. (SnRNPs is pronounced "snurps".)

    These small nuclear particles are composed of-.

    1. Small nuclear RNA (snRNA). This small RNA molecule has less than 300 nucleotides - much shorter than mRNA.
    2. Protein. Each snRNP has seven or more proteins.
      • There are various types of snRNPs with different functions; those involved in RNA splicing are part of a larger, more complex assembly called a spliceosome.

    Spliceosome = A large molecular complex that catalyzes RNA splicing reactions; composed of small nuclear ribonucleoproteins (snRNPs) and other proteins.

  5. Ribozymes
  6. Other kinds of RNA transcripts, such as tRNA and rRNA, are spliced differently; however, as with mRNA splicing, RNA is often involved in catalyzing the reactions.

    Ribozymes = RNA molecules that can catalyze reactions by breaking and forming covalent bonds; are called ribozymes to emphasize their enzymelike catalytic activity.

    Three-dimensional conformations vary among the types of RNA. These differences in shape give RNA its ability to perform a variety of functions, such as:

    1. Information carrier. Messenger RNA (mRNA) carries genetic information from DNA to ribosomes; this genetic message specifies a protein's primary structure.
    2. Adaptor molecule. Transfer RNA (tRNA) acts as an adaptor in protein synthesis by translating information from one form (mRNA nucleotide sequence) into another (protein amino acid sequence).
    3. Catalyst and structural molecule. During translation, ribosomal RNA (rRNA) plays structural and probably enzymatic roles in ribosomes. Small nuclear RNA (snRNA) in snRNP particles also plays structural and enzymatic roles within spliceosomes that catalyze RNA splicing reactions.
    4. Viral genomes. Some viruses use RNA as their genetic material.

  7. The Functional and Evolutionary Importance of Introns
  8. What are the iologicalfunctions of introns and gene splicing?

    Introns may play a regulatory role in the cell.

    Introns may allow a single gene to direct the synthesis of different proteins.

    Introns play an important role in the evolution of protein diversity; they increase the probability that recombination of exons will occur between alleles.

    Protein domains = Continuous polypeptide sequences that are structural and functional units in proteins with a modular architecture.

Back to top

IX. A point mutation can affect the function of a protein

Knowing how genes are translated into proteins, scientists can give a molecular description of heritable changes that occur in organisms.

Mutation = A permanent change in DNA that can involve large chromosomal regions or a single nucleotide pair.

Point mutation = A mutation limited to about one or two nucleotides in a single gene.

  1. Types of Point Mutations
  2. There are two categories of point mutations: 1) base-pair substitutions and 2) base-pair insertions or deletions.

    1. Substitutions

      Base-pair substitution = The replacement of one base pair with another; occurs when a nucleotide and its partner from the complementary DNA strand are replaced with another pair of nucleotides according to base-pairing rules.

      Depending on how base-pair substitutions are translated, they can result in little or no change in the protein encoded by the mutated gene.

      • Redundancy in the genetic code is why some substitution mutations have no effect. A base pair change may simply transform one codon into another that codes for the same amino acid.
      • Even if the substitution alters an amino acid, the new amino acid may have similar properties to the one it replaces, or it may be in a part of the protein where the exact amino acid sequence is not essential to its activity.

      Some base-pair substitutions result in readily detectable changes in proteins.

      • Alteration of a single amino acid in a crucial area of a protein will significantly alter protein activity.
      • On rare occasions, such a mutation will produce a protein that is improved or has capabilities that enhance success of the mutant organism and its descendants.
      • More often, such mutations produce a less active or inactive protein that impairs cell function.

      Base-pair substitutions are usually missense mutations or nonsense mutations.

      Missense mutation = Base-pair substitution that alters an amino acid codon (sense codon) to a new codon that codes for a different amino acid.

      • Altered codons make sense (are translated), but not necessarily that originally intended.
      • Base-pair substitutions are usually missense mutations.

      Nonsense mutation = Base-pair substitution that changes an amino acid codon (sense codon) to a chain termination codon, or vice versa.

      • Nonsense mutations can result in premature termination of translation and the production of a shorter than normal polypeptide.
      • Nearly all nonsense mutations lead to nonfunctional proteins.
    2. Insertions or Deletions

      Base-pair insertions or deletions usually have a greater negative effect on proteins than substitutions.

      Base-pair insertion = The insertion of one or more nucleotide pairs into a gene.

      Base-pair deletion = The deletion of one or more nucleotide pairs from a gene.

      Because mRNA is read as a series of triplets during translation, insertion or deletion of nucleotides may alter the reading frame (triplet grouping) of the genetic message. This type of frameshift mutation will occur whenever the number of nucle otides inserted or deleted is not 3 or a multiple of 3.

      Frameshift mutation = A base-pair insertion or deletion that causes a shift in the reading frame, so that codons beyond the mutation will be the wrong grouping of triplets and will specify the wrong amino acids.

      • A frameshift mutation causes the nucleotides following the insertion or deletion to be improperly grouped into codons.
      • This results in extensive missense, which will sooner or later end in nonsense (premature termination).
      • Frameshift will produce a nonfunctional protein unless the insertion or deletion is very near the end of the gene.

  3. Mutagens
  4. Mutagenesis = The creation of mutations.

    Mutagen = Physical or chemical agents that interact with DNA to cause mutations.

Course Pages maintained by
Dr. Graeme Lindbeck .