Cell Biology
Information Flow in Cells
 

How do cells store, intraconvert, and interpretpret information?  Organisms grow and develop while their bodies are regulated and maintained so that they can successfully reproduce in order to form a new generation.  How do all these processes function?  Nucleic acids and proteins play the roles of information molecules in cells.  Here are a few more questions worth pondering before we delve into the information flow of cells.


Introduction

What is the basic nature of information representated by these molecules? (We will see the importance of linear sequences and three-dimensional structures)
How is information converted from one form to another?
How is the information selectively converted? (In other words, how is the linear nucleotide gene sequence expressed into animo acids?)
What kind of vocabulary can we develop to enable us to describe and discuss information molecules?
How can we exploit experimentally what goes on during information flow?


Central Dogma

The occurrence of membrane bound organelles for eukaryotes (more specifically, the nucleus) creates a spatial and temporal difference.  In other words, transcription occurs in the nucleus while translation occurs in cytoplasm, and transcription/translation may proceed at different times since they are UNCOUPLED.  This uncoupling allows for eukaryotes to increase the quality of the process by enforcing more controls (i.e. cutting out intons) to reduce the amount of errors.


Transcription

RNA polymerase II and general transcription factors (GTFs) contribute to transcription.  The primary transcript is the initial UNPROCESSED single strand molecule.  Transcription begins at the 5' end where a methyl guanosine cap is created.  This cap site will 1)stop degradation of the 5' RNA strand, 2)make nuclear export more efficient, and 3)make initiation of translation of mRNA more efficient as well.

Now on the 3' end, processing has 2 steps that will "snip off" this edge of the strand at specific points by endonucleases.  1)Cleavage removal of a stretch of nucleotides at the 3' end is performed by endonucleases and other helpers.
2)20 - 250 adenines are added by polyA polymerases to create the "poly A tail."

The poly A tail will increase RNA stability and translatability.

Now the mRNA, consisting of only exons, must leave the nucleus through NUCLEAR PORES. 


Gene Expression

Remember the uncoupling of transcription and translation mentioned earlier?  What else can these extra steps do to increase efficiency of gene expression?  Well, the many steps involved allow the cell to control  various mechanisms and products at different points along the way.  Proofreading, switches, delays, and other maneuvers enhance the specificity of the whole process.  The numerous steps are responsible for a large amount of proteins so that MORE CAN BE DONE to improve gene expression.

Gene expression can be controlled at various stages.

1)Control at the level of transcription
        There are 2 important regions of a gene, upstream promoter sequences and coding sequences (where information for protein structure and function lie).  In the promotor sequences, RNA polymerase II binds to the TATA box (initiator of transcription) by the aid of TATA-binding proteins (TBP).  DNA-bound GTFs interact with subunits bound to TBP so that DNA will loop around.  Protein-protein interactions increase transcription performance so enhancers, which can be tens of thousands of base pairs away (and work upstream, downstream, inverted), will promote the assembly of the basal transcription machinery at the promoter and interact with activators and coactivators.  The CAAT and GC boxes bind transcription factors which aid in polymerase efficiency.  Transcription factors can be stimulated (by glucocorticoids) or inhibited (by negative regulatory proteins that bind to promoter elements) and affect the regulation of the basal level of transcription (which is initiated by promoters).  Methylation can inactivate gene expression, and removal of acetyl groups can repress transcription.  Another way to alter proteins is to cut pieces of promoters.  For instance, blue cotton could be created.

2)Control at the level of RNA processing
        Alternative splicing allows for a gene to encode for two or more related proteins.  This is the reason parts of an intron in one cell can be part of an exon in another cell.  An example of control at the level of RNA processing is the shifting of the poly A site to control the length of the mRNA strand.

3)Control at the level of RNA editing
        The nucleotide base, cytosine (C), can be converted into uracil (U).  ACG codons can create start AUG codons.  A strand can be shortened when editing CAG, CAA, and CGA codons to produce stop codons.

4)Control at the level of RNA transport
        If mRNA can't get out of the nucleus, it can't be translated into a protein.  As we talked about earlier, the methyl guanosine cap and poly A tail help in RNA nuclear export and stability, respectively.  Adenoviruses will use this control to prevent host transport, but still allow for viral genes to continue onto translation.  HIV and influenza inhibit host mRNA as well.

5)Control at the level of translation
        This occurs in the cytosol.  The untranslated region on the 3' side contains information for the localization of mRNA.  For example, the bicoid gene is localized at the anterior end of the oocyte because it helps in head and thorax development.  The oskar gene is at the posterior end for germ cell formation.  Also, an unfertilized eggs will have inhibitory proteins to mask mRNAs for developmental use.  Ferritin is a protein that grabs iron atoms.  At low iron concentrations, the repressor of ferritin's mRNA is active at the 5' end.  When there is more iron around, repressor is modified and it loses affinity for the sequence (found in the untranslated region) for it usually binds.  The control on ferritin is lifted now so it can detoxify the cells.

6)Control of RNA stability
        Half life is the time for half of the population to degrade.  If proteins live longer, there will be a larger population even though production of each strand is accomplished at the same rate.  More mRNAs will be available to act as templates.  In prokaryotes, the 5' end begins degredation before the 3' end is even comopleted.  mRNAs that are used for short periods (i.e., cell division) will be have short lives and those used for the production of dominant proteins (i.e., hemoglobin and ovalbumin) live longer.  Nucleotide sequences in the 3' untranslated region have CCUCC repeats that allow for proteins to bind and stabilize the poly A tail while poly A nucleases eat away at it.  AU-rich sequences will destabilize the tail.  Note that the loss of the 3' tail will allow for degradation to begin at the 5' end which will eventually move to the 3' end.


Techniques for Studying Genes

Differences in gene expression can be studied by looking at the proteins created.

    Proteins can act as an antigen for an antibody.  The use of indicator dyes and possibly primary and secondary assays would allow for detection of specific proteins.

An easier way would be to focus on the mRNA that make the proteins because of various "tricks" that can be implemented.

    Two dimensional gels can use molecular weight and isoelectricity to identify the exact gene in question.  A metabolic label can be placed on the specimen by radioactivity.  The gene can be cut out of the gel so that one may work backwards to work on finding the amino acid sequence.
    Single stranded probes are used to in hybridization.  The known identity of the probe allows for isolation of preferred mRNA strands.
    To find a specific gene (or all of them), a cDNA library will come in handy.  Reverse transcriptase will convert mRNA into complimentary DNA (cDNA).  The double stranded version of this can be incorporated into a cloning vected (i.e. plasmid) and inserted into bacteria.  Dilution plating will allow for each separate colony to consist of different mRNA.  This library can be probed.  The cDNA library will only contain the coding sequences and not the upstream regulatory sequences.  Another problem is that since promoters direct the timing and extent of gene expression, their absence allows for bias.  If the cDNA in the library is not representational of the actual amount, expression could range from "cowabunga" to just a little bit.
    To obtain the regulatory sequences, one must start with genomic DNA.  Enzyme restriction digests will cut DNA into pieces.  Single stranded versions can be inserted into plasmids.
    One genome project, total expression monitoring, involves the representation of known nucleotide sequences on an array created by dual hybridization of cDNA.  Recently Dr. William F. Loomis from the Center for Molecular Genetics, University of California-San Diego, spoke at Texas Tech University .  He talked about silicon chips that contained entire cDNA libraries.  Techniques in the field of genetics and cell biology continue to facilitate research as scientists delve into the new millennium.


 Back to main page