DNA Transcription: Where do genes begin and end?

  • Thread starter mtanti
  • Start date
  • Tags
    Dna Genes
In summary, the RNA polymerase knows where to start and stop transcription on a strand of DNA by determining the start codon and stop codon markers. It is possible to have overlapping genes, and the simplest form of transcription assumes that all start codons are starts of an ORF. Different methods are more computationally intensive, but can also detect genes that are already known.
  • #1
mtanti
172
0
Dear all,

I'm studying DNA-Protein relationships for a computer simulation I have to make about it and I'm finding it difficult to understand how the RNA polymerase knows where to start and stop transcription on a strand of DNA. Could someone explain this for me?

Let's assume that '[' indicates the start of a gene (start codon? tata-box? promoter?) and ']' indicates the end of a gene (stop codon? transcription terminator?). Are all '[' and ']' markers well-formed brackets? Can you have over lapping genes as in [..[..]..]? Can you have missing brackets as in [.. or ..]? If not, what happens if this is the case?

Thanks a lot!
 
Biology news on Phys.org
  • #2
Simply stated translation starts at a start codon (ATG) and stops at a stop codon (TGA, TAG, or TAA).

The promotor of a gene and regulatory elements determine whether the transcription machinery can bind near the ATG, so not all start codons are starts of an ORF.

You can definitely have overlapping genes.

I guess you are also interested in 5'UTRs and 3'UTRs?

This is all textbook material, so it might be better to pick up one of those and read up on what is known about the transcription machinery.
 
Last edited:
  • #3
Thanks
 
  • #4
Simply stated transcription starts at a start codon (ATG) and stops at a stop codon (TGA, TAG, or TAA).

That would be translation, not transcription. Translation is somewhat straightforward, and the above definition should be good enough (but do remember ATG is not the only start codon).

If you're asking about transcription, that's a little less clear. Even less clear is what defines a gene. Everything you learn in a standard textbook is very general and if you're looking into predicting genes from a genome sequence, they won't even look for ribosomal binding sites, because it's quite useless to do so quite frankly.

Also if we're just talking very standard general biology stuff, than you CANNOT have overlapping genes as you indicated [..[..]..]. However, [...[]...] is quite common (where there's very little overlap between the end of one gene and start of another - quite common in operons)
 
  • #5
My simulation is about the synthesis of mRNA from DNA. I need to work with random DNA sequences so I need to know if there are any "illegal" sequences. What would happen if you have multiple start codons before the next stop codon (overlapping)? You can assume the simplest form of transcription.

How can I know where genes begin and end without knowing where RNA polymerase bind to DNA and start producing mRNA? Are you saying it's still unknown where genes begin and end?
 
  • #6
What kind of organism(s) are you working with: eukaryote, prokaryote, other?
Within a genome (or large DNA fragment) are you looking for: gene, mRNA, CDS, other?

A start codon encountered after a start codon, is simply the amino acid. So if you encounter ATG somewhere downstream, it simply encodes for a methionine.

Example of how gene-finding in bacterial genome works:
The naive approach to gene-finding (I'm warning you right now, that when I usually say gene, I really mean ORFs - different people use different definitions for "gene") within a genome is to simply find all regions that contain more than x codons (let's say x=30) without a stop codon. After all those are marked for all 6 reading frames, you remove any that are overlapping, choosing the longest regions, and you say those are your genes. Whilst this method is very simple, it does decently. More computationally expensive methods include hidden markov models, tBLASTX, etc. but also realize they can only detect genes that are already known. Prediction of novel genes (without biological experiments) are definitely possible, but that requires an even lengthier explanation.
 

1. What is DNA transcription?

DNA transcription is the process by which the genetic information encoded in DNA is copied into a complementary strand of RNA. This RNA can then be used to synthesize proteins, which are responsible for carrying out a variety of functions in the cell.

2. How does DNA transcription begin?

DNA transcription begins with the binding of an enzyme called RNA polymerase to a specific region of DNA called the promoter. This signals the start of transcription and allows the RNA polymerase to start copying the DNA sequence into RNA.

3. Where do genes begin and end in the DNA sequence?

Genes are specific sequences of DNA that contain the instructions for making a particular protein. They begin with a start codon and end with a stop codon, which are specific sequences of nucleotides that signal the start and end of a gene.

4. What is the role of transcription factors in DNA transcription?

Transcription factors are proteins that bind to specific DNA sequences and help regulate the transcription process. They can either enhance or inhibit transcription, depending on the specific factors present and the needs of the cell.

5. How does DNA transcription differ from DNA replication?

DNA transcription only copies a specific region of DNA into RNA, while DNA replication copies the entire DNA sequence. Additionally, transcription produces single-stranded RNA, while replication produces double-stranded DNA. Transcription also uses RNA polymerase, whereas replication uses DNA polymerase.

Similar threads

Replies
2
Views
3K
Replies
6
Views
2K
Replies
4
Views
2K
  • Biology and Medical
Replies
8
Views
1K
  • Biology and Medical
Replies
8
Views
2K
  • Biology and Medical
Replies
2
Views
2K
Replies
1
Views
1K
Replies
1
Views
1K
Replies
4
Views
3K
  • Biology and Medical
Replies
14
Views
3K
Back
Top