Transcription: Difference between revisions

From The School of Biomedical Sciences Wiki
Jump to navigation Jump to search
No edit summary
Line 1: Line 1:
Transcription is the process by which double-stranded DNA is made into single-stranded mRNA. This process is highly regulated and controlled to ensure the right amount of a specific [[Gene|gene]] is coded for at a specific time and place. It is said to be the first initial step in the process of gene expression in living organisms. Which is then followed by the process of translation.
Transcription is the process by which double-stranded DNA is made into single-stranded mRNA. This process is highly regulated and controlled to ensure the right amount of a specific [[Gene|gene]] is coded for at a specific time and place. It is said to be the first initial step in the process of gene expression in living organisms. Which is then followed by the process of [https://teaching.ncl.ac.uk/bms/wiki/index.php/Translation translation].  


== mRNA  ==
== mRNA  ==
Line 5: Line 5:
[[Proteins|Proteins]] are synthesised in the [[Cytosol|cytosol]], however, [[DNA|DNA]] does not leave the [[Nucleus|nucleus]], therefore a copy of the [[Gene|gene]] coding for the desired [[Protein|protein]] is sent as a messenger to the [[Cytosol|cytosol]] from the [[Nucleus|nucleus]]. This is called messenger RNA '[[MRNA|mRNA]]', which is a single stranded molecule that is a complementary copy of the [[DNA|DNA]] strand it was synthesised from. [[RNA|RNA]] is made from ribose nucleotides<ref>HGS Biology A-Level notes, Dr Millar, 2006</ref> that are free in the nucleus ([[RATP|rATP]], [[RGTP|rGTP]], [[RCTP|rCTP]] and [[RUTP|rUTP]]).<br>Pre mRNA is first transcribed which contains non coding Introns these introns are spliced out to leave mRNA that contains only the coding exons.  
[[Proteins|Proteins]] are synthesised in the [[Cytosol|cytosol]], however, [[DNA|DNA]] does not leave the [[Nucleus|nucleus]], therefore a copy of the [[Gene|gene]] coding for the desired [[Protein|protein]] is sent as a messenger to the [[Cytosol|cytosol]] from the [[Nucleus|nucleus]]. This is called messenger RNA '[[MRNA|mRNA]]', which is a single stranded molecule that is a complementary copy of the [[DNA|DNA]] strand it was synthesised from. [[RNA|RNA]] is made from ribose nucleotides<ref>HGS Biology A-Level notes, Dr Millar, 2006</ref> that are free in the nucleus ([[RATP|rATP]], [[RGTP|rGTP]], [[RCTP|rCTP]] and [[RUTP|rUTP]]).<br>Pre mRNA is first transcribed which contains non coding Introns these introns are spliced out to leave mRNA that contains only the coding exons.  


mRNA contains specific sequences which code for different amino acids. In transcription of [[Proteins|proteins]] there are certain sequences which code for start and stop signals for [[Protein|protien]] synthesis. Generally, [[Methionine|Methionine]] is a start codon. Methionine only has one series of bases coding for it (AUG) compared to [[Serine|Serine]] and [[Arginine|Arginine]] which have 6 sets of coding mRNA each. There are three [[Stop codon|stop codons]] which code for the end of a protein molecule (UAA, UAG, UGG).&nbsp;<br>
mRNA contains specific sequences which code for different amino acids. In transcription of [[Proteins|proteins]] there are certain sequences which code for start and stop signals for [[Protein|protien]] synthesis. Generally, [[Methionine|Methionine]] is a start codon. Methionine only has one series of bases coding for it (AUG) compared to [[Serine|Serine]] and [[Arginine|Arginine]] which have 6 sets of coding mRNA each. There are three [[Stop codon|stop codons]] which code for the end of a protein molecule (UAA, UAG, UGG).&nbsp;<br>  


Code for all 20 amino acids and stop codons<ref>Berg JM, Tymoczko JL, Stryer L, Biochemistry, (2007) 6th Editiion, Pages 125, 126, W.H. Freeman and Company, New York</ref>:  
Code for all 20 amino acids and stop codons<ref>Berg JM, Tymoczko JL, Stryer L, Biochemistry, (2007) 6th Editiion, Pages 125, 126, W.H. Freeman and Company, New York</ref>:  
Line 30: Line 30:
Leu  
Leu  


Leu
Leu  


|  
|  
Line 39: Line 39:
Ser  
Ser  


Ser
Ser  


|  
|  
Line 48: Line 48:
Stop  
Stop  


Stop
Stop  


|  
|  
Line 57: Line 57:
Stop  
Stop  


Trp
Trp  


|  
|  
Line 66: Line 66:
A  
A  


G
G  


|-
|-
Line 77: Line 77:
Leu  
Leu  


Leu
Leu  


|  
|  
Line 86: Line 86:
Pro  
Pro  


Pro
Pro  


|  
|  
Line 95: Line 95:
Gln  
Gln  


Gln
Gln  


|  
|  
Line 104: Line 104:
Arg  
Arg  


Arg
Arg  


|  
|  
Line 113: Line 113:
A  
A  


G
G  


|-
|-
Line 124: Line 124:
Ile&nbsp;  
Ile&nbsp;  


Met
Met  


|  
|  
Line 133: Line 133:
Thr  
Thr  


Thr
Thr  


|  
|  
Line 142: Line 142:
Lys  
Lys  


Lys
Lys  


|  
|  
Ser  
Ser  


Ser<br>
Ser<br>  


Arg  
Arg  


Arg
Arg  


|  
|  
Line 160: Line 160:
A  
A  


G
G  


|-
|-
Line 171: Line 171:
Val  
Val  


Val
Val  


|  
|  
Line 180: Line 180:
Ala  
Ala  


Ala
Ala  


|  
|  
Line 189: Line 189:
Glu  
Glu  


Glu
Glu  


|  
|  
Line 198: Line 198:
Gly&nbsp;  
Gly&nbsp;  


Gly
Gly  


|  
|  
Line 207: Line 207:
A  
A  


G
G  


|}
|}


<br>
<br>  


== Double stranded DNA  ==
== Double stranded DNA  ==
Line 229: Line 229:
In ''[[E.Coli|E. coli]]'' the [[Holoenzyme|holoenzyme]] recognises specific [[Consensus sequence]]&nbsp;at -35 and -10 within the promotor region. At the&nbsp;-35 sequence the DNA remains double stranded, and a closed complex is formed, however at the -10 sequence (or Pribnow box) about 14 bases are melted, and the closed complex becomes a&nbsp;[[Transcription Bubbles|Transcription_Bubble]] with exposed bases.  
In ''[[E.Coli|E. coli]]'' the [[Holoenzyme|holoenzyme]] recognises specific [[Consensus sequence]]&nbsp;at -35 and -10 within the promotor region. At the&nbsp;-35 sequence the DNA remains double stranded, and a closed complex is formed, however at the -10 sequence (or Pribnow box) about 14 bases are melted, and the closed complex becomes a&nbsp;[[Transcription Bubbles|Transcription_Bubble]] with exposed bases.  


Unlike bacteria, Eukaryotic cells has mor ethan one transcription factor, these transcripsion factors are: TF2D, TF2A, TF2B, TF2F, TF2E, and TF2H, binding in that order onto the TATA box. They have diffenrt fucntions during the intiation process.
Unlike bacteria, Eukaryotic cells has mor ethan one transcription factor, these transcripsion factors are: TF2D, TF2A, TF2B, TF2F, TF2E, and TF2H, binding in that order onto the TATA box. They have diffenrt fucntions during the intiation process.  


== Initiation  ==
== Initiation  ==
Line 249: Line 249:
In prokaryotes, such as ''E.coli,'' however, there are two types of terminator. The first is called ‘Factor independent termination’. The sequence of bases at the end of the gene have a region rich and G+C bases with a sequence in-between, followed by 4 to 10&nbsp;A+T bases. Once this area of the [[Gene|gene]] has been transcribed, the section of rich G+C on the [[MRNA|mRNA]] molecule bind together by complimentary base pairing. This forms a hairpin structure at the end of the [[MRNA|mRNA]] molecule. This hairpin structure has properties that cause the [[RNA polymerase|RNA polymerase]] to pause in transcribing the [[Gene|gene]]. Once paused, the [[RNA polymerase|RNA polymerase]] unbinds from the [[DNA|DNA]] molecule and releases the complete [[MRNA|mRNA]] molecule, thus terminating transcription. There is also a second method of termination. This is called Rho dependant termination. This involves a helicase enzyme called a [[Rho|Rho]] factor, which unwinds the [[MRNA|mRNA]] from the [[DNA|DNA]] molecule faster than it does naturally. The Rho factor unwinds the [[MRNA|mRNA]] until it reaches the [[RNA polymerase|RNA polymerase]]. This causes the [[RNA polymerase|RNA polymerase]] to pause and stop transcribing proteins, causing the [[RNA polymerase|RNA polymerase]] to unbind from the [[DNA|DNA]] and the complete [[MRNA|mRNA]] molecule to be released.<ref>Hartl et al. (2012) Genetics: Analysis of genes and genomes, 8th Edition - Chapter 10: Pages 354 -355</ref>  
In prokaryotes, such as ''E.coli,'' however, there are two types of terminator. The first is called ‘Factor independent termination’. The sequence of bases at the end of the gene have a region rich and G+C bases with a sequence in-between, followed by 4 to 10&nbsp;A+T bases. Once this area of the [[Gene|gene]] has been transcribed, the section of rich G+C on the [[MRNA|mRNA]] molecule bind together by complimentary base pairing. This forms a hairpin structure at the end of the [[MRNA|mRNA]] molecule. This hairpin structure has properties that cause the [[RNA polymerase|RNA polymerase]] to pause in transcribing the [[Gene|gene]]. Once paused, the [[RNA polymerase|RNA polymerase]] unbinds from the [[DNA|DNA]] molecule and releases the complete [[MRNA|mRNA]] molecule, thus terminating transcription. There is also a second method of termination. This is called Rho dependant termination. This involves a helicase enzyme called a [[Rho|Rho]] factor, which unwinds the [[MRNA|mRNA]] from the [[DNA|DNA]] molecule faster than it does naturally. The Rho factor unwinds the [[MRNA|mRNA]] until it reaches the [[RNA polymerase|RNA polymerase]]. This causes the [[RNA polymerase|RNA polymerase]] to pause and stop transcribing proteins, causing the [[RNA polymerase|RNA polymerase]] to unbind from the [[DNA|DNA]] and the complete [[MRNA|mRNA]] molecule to be released.<ref>Hartl et al. (2012) Genetics: Analysis of genes and genomes, 8th Edition - Chapter 10: Pages 354 -355</ref>  


The [[MRNA|mRNA]] molecule then exits the nucleus into the [[Cytosol|cytosol]], where it will be translated into proteins that&nbsp;the cell requires&nbsp;for the&nbsp;second step&nbsp;of gene expression known as ‘translation’.<br>
The [[MRNA|mRNA]] molecule then exits the nucleus into the [[Cytosol|cytosol]], where it will be translated into proteins that&nbsp;the cell requires&nbsp;for the&nbsp;second step&nbsp;of gene expression known as ‘translation’.<br>  


<br>
<br>  


[[Image:17.7 2 the stages of transcription- initiation, elongation, and termination.jpg]]<ref>http://abenagh.pbworks.com/w/page/35459046/4%20Group%20Free%20Response</ref>  
[[Image:17.7 2 the stages of transcription- initiation, elongation, and termination.jpg]]<ref>http://abenagh.pbworks.com/w/page/35459046/4%20Group%20Free%20Response</ref>  
Line 257: Line 257:
== References  ==
== References  ==


<references />
<references />  


<references />
<references />

Revision as of 14:34, 14 November 2015

Transcription is the process by which double-stranded DNA is made into single-stranded mRNA. This process is highly regulated and controlled to ensure the right amount of a specific gene is coded for at a specific time and place. It is said to be the first initial step in the process of gene expression in living organisms. Which is then followed by the process of translation.

mRNA

Proteins are synthesised in the cytosol, however, DNA does not leave the nucleus, therefore a copy of the gene coding for the desired protein is sent as a messenger to the cytosol from the nucleus. This is called messenger RNA 'mRNA', which is a single stranded molecule that is a complementary copy of the DNA strand it was synthesised from. RNA is made from ribose nucleotides[1] that are free in the nucleus (rATP, rGTP, rCTP and rUTP).
Pre mRNA is first transcribed which contains non coding Introns these introns are spliced out to leave mRNA that contains only the coding exons.

mRNA contains specific sequences which code for different amino acids. In transcription of proteins there are certain sequences which code for start and stop signals for protien synthesis. Generally, Methionine is a start codon. Methionine only has one series of bases coding for it (AUG) compared to Serine and Arginine which have 6 sets of coding mRNA each. There are three stop codons which code for the end of a protein molecule (UAA, UAG, UGG). 

Code for all 20 amino acids and stop codons[2]:

Second Position
First Position (5' End) U C A G Third Position (3' End)
U

Phe

Phe

Leu

Leu

Ser

Ser

Ser

Ser

Tyr

Tyr

Stop

Stop

Cys

Cys

Stop

Trp

U

C

A

G

C

Leu

Leu

Leu

Leu

Pro

Pro

Pro

Pro

His

His

Gln

Gln

Arg

Arg

Arg

Arg

U

C

A

G

A

Ile

Ile

Ile 

Met

Thr

Thr

Thr

Thr

Asn

Asn

Lys

Lys

Ser

Ser

Arg

Arg

U

C

A

G

G

Val

Val

Val

Val

Ala

Ala

Ala

Ala

Asp

Asp

Glu

Glu

Gly

Gly

Gly 

Gly

U

C

A

G


Double stranded DNA

DNA is double stranded as opposed to mRNA which is single stranded, therefore only one strand of DNA is copied. The copied strand is called the 'template strand' (also known as the 'sense strand'), the other strand is called the 'non-template strand' or the 'coding strand' (also known as the 'anti-sense strand') and will have the same base sequence as the RNA strand produced. mRNA is synthesised by the enzyme 'RNA polymerase', however, in order for the RNA to synthesise mRNA it must bind to a single strand of DNA. The DNA must be unwound and unzipped, which is done via an enzyme called 'DNA helicase',[3] which unwinds and unzipps the double stranded DNA at the loci of the gene to be transcribed, causing an area of single stranded DNA to be accessible to the RNA polymerase. This unwound section is known as the transcription bubble.

Unlike DNA polymerase which requires an RNA primer to iniate replication, RNA polymerase does not.

Promoter regions

RNA polymerase must recognise and bind to a region upstream of the gene being transcribed called the 'promoter region'. This region is a sequnce of bases that determines the strength of the binding of RNA polymerase, to the DNA strand and therefore determining the efficiency of trancription of the gene it is accossiated with. If the promoter is a strong promoter, then RNA polymerase binds strongly to the DNA strand. If the promoter is a weak promoter, then the RNA polymerase can become hindered and can even unbind from the DNA strand. The promoter region strength is determined by how promoter sequence compares to other promoters on separate genes. When different promoters are compared, a sequence of bases can be determined that are most common in all the promoter sequences of that type, this is called a 'consensus sequence'. The closer the promoter sequence is to the consensus sequence, the stronger the promoter and the stronger the binding of the RNA polymerase.

Sigma factors and Eukaryotic Transciption factors

RNA polymerase cannot bind to the promoter region unless a sigma factor is present. Sigma factors ensure that the RNA polymerase binds to the correct promoter region, this is another method in which transcription is regulated. The sigma factor binds to the RNA polymerase via specific binding sites on its structure and forms a ‘holoenzyme’, i.e. sigma factor + RNA polymerase = holoenzyme.

In E. coli the holoenzyme recognises specific Consensus sequence at -35 and -10 within the promotor region. At the -35 sequence the DNA remains double stranded, and a closed complex is formed, however at the -10 sequence (or Pribnow box) about 14 bases are melted, and the closed complex becomes a Transcription_Bubble with exposed bases.

Unlike bacteria, Eukaryotic cells has mor ethan one transcription factor, these transcripsion factors are: TF2D, TF2A, TF2B, TF2F, TF2E, and TF2H, binding in that order onto the TATA box. They have diffenrt fucntions during the intiation process.

Initiation

Once the sigma factor has bound to the RNA polymerase, the RNA can bind to the promoter region upstream of the gene on the single stranded DNA. The RNA is then free to transcribe the gene. Free ribose nucleotides bind to the DNA sequence via complementary base pairing. Instead of the base Thymine found in DNA, the base uracil is used in RNA. The RNA polymerase joins the nucleotides together via strong covalent phosphodiester bonds, this forms the single strand of mRNA. This process is called initiation.

In Eukaryotic cells the initatition starts with TFIIH binding to the RNA pol II and phosphorylating the N terminus of the polymerease. When the double stand is broken the RNA starts to bind to DNA at teh start site.

Elongation

When 10 nucleotides of mRNA have been synthesised, the sigma factor is released from the RNA polymerase. The RNA polymerase continues to transcribe the gene. This is called elongation, where the RNA polymerase moves along the DNA strand and creates a single strand of mRNA that is complimentary to the DNA sequence. Only 8 nucleotides of mRNA remain attached to the DNA sequence at a time[4]. The mRNA peels of the DNA sequence but still remains attached to the rest of the mRNA molecule. Once the mRNA has been synthesised from specific nucleotides, an enzyme recombines the two DNA strands and rewinds it into its helix structure. This occurs while the RNA polymerase is still transcribing during elongation.

Termination

Once the gene has been synthesised, the RNA polymerase must stop transcribing or it would continue to transcribe uncontrollably. The sequence present at the end of a gene sequence that stops transcription is called the ‘terminator sequence’.

In eukaryotes, the termination occurs when the RNA polymerase complex reaches a chain termination sequence, usually consisting of bases TTATTT (or AAUAAA when transcibed onto the mRNA). Termination of the transciption occurs abouut 10-35 bases downstream. The RNA is cut from the DNA by endonuclease

In prokaryotes, such as E.coli, however, there are two types of terminator. The first is called ‘Factor independent termination’. The sequence of bases at the end of the gene have a region rich and G+C bases with a sequence in-between, followed by 4 to 10 A+T bases. Once this area of the gene has been transcribed, the section of rich G+C on the mRNA molecule bind together by complimentary base pairing. This forms a hairpin structure at the end of the mRNA molecule. This hairpin structure has properties that cause the RNA polymerase to pause in transcribing the gene. Once paused, the RNA polymerase unbinds from the DNA molecule and releases the complete mRNA molecule, thus terminating transcription. There is also a second method of termination. This is called Rho dependant termination. This involves a helicase enzyme called a Rho factor, which unwinds the mRNA from the DNA molecule faster than it does naturally. The Rho factor unwinds the mRNA until it reaches the RNA polymerase. This causes the RNA polymerase to pause and stop transcribing proteins, causing the RNA polymerase to unbind from the DNA and the complete mRNA molecule to be released.[5]

The mRNA molecule then exits the nucleus into the cytosol, where it will be translated into proteins that the cell requires for the second step of gene expression known as ‘translation’.


[6]

References

  1. HGS Biology A-Level notes, Dr Millar, 2006
  2. Berg JM, Tymoczko JL, Stryer L, Biochemistry, (2007) 6th Editiion, Pages 125, 126, W.H. Freeman and Company, New York
  3. HGS Biology A-level notes, Dr Millar, 2006
  4. HGS Biology A-Level notes, Dr Millar, 2006
  5. Hartl et al. (2012) Genetics: Analysis of genes and genomes, 8th Edition - Chapter 10: Pages 354 -355
  6. http://abenagh.pbworks.com/w/page/35459046/4%20Group%20Free%20Response