How have scientists derived the gene sequences known today?

  • Thread starter Thread starter TytoAlba95
  • Start date Start date
  • Tags Tags
    Gene Sequences
Click For Summary
SUMMARY

Scientists have derived gene sequences through a combination of techniques including the Maxam-Gilbert method, Sanger sequencing, and Next Generation Sequencing (NGS). Initially, mRNA was purified and reverse transcribed to cDNA, which was then cloned for sequencing. Gene sequences are refined by comparing them with genomic DNA and utilizing computational predictions based on gene features such as promoters and splice sites. This multi-faceted approach has led to the comprehensive gene sequences available in databases like NCBI.

PREREQUISITES
  • Understanding of Sanger sequencing techniques
  • Familiarity with Next Generation Sequencing (NGS)
  • Knowledge of mRNA purification and cDNA synthesis
  • Basic concepts of gene prediction algorithms
NEXT STEPS
  • Research the Maxam-Gilbert sequencing technique
  • Explore advanced methods in Next Generation Sequencing (NGS)
  • Learn about gene prediction algorithms and their applications
  • Investigate the role of genomic DNA in gene sequence refinement
USEFUL FOR

Geneticists, bioinformaticians, and researchers involved in genomics and gene sequencing will benefit from this discussion.

TytoAlba95
Messages
132
Reaction score
19
I know that human genome sequencing was done by 2001. Inintially there were Maxam-Gilberth technique, then Sanger's technique and then finally NGS techniques have made sequening faster and efficient. My question is, though we are able to sequence the whole genome, how have scientists arrived at the gene sequences available in different databases link NCBI?

Did they compare the mRNA sequences with the known genomic sequence, to arrive at the gene sequences known today?
 
Biology news on Phys.org
There are a number of different methods that have changed over time and depend on the data available for the organism. Initially, scientists would purify mRNA from an organism, reverse transcribe to cDNA then clone the cDNA into plasmids or BACs for Sanger sequencing (older methods) or generate libraries for next generation sequencing (modern methods). The gene sequence can later be refined by methods that better identify the ends of the transcripts, and the gene sequence can be compared to genomic DNA sequences from the same organism.

For many organisms, only the genomic DNA sequence is available, so gene sequence are predicted computationally by using information about what gene features look like (e. g. promoters, splice sites, terminators), making use of the fact that protein coding genes have long, statistically detectable stretched lacking stop codons, and comparing the sequence to other organisms with better known genes.

For more information see:
https://www.nature.com/articles/nature24286https://en.m.wikipedia.org/wiki/Gene_prediction
 
  • Like
Likes   Reactions: TytoAlba95
Thank you
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 2 ·
Replies
2
Views
5K
  • · Replies 2 ·
Replies
2
Views
5K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 11 ·
Replies
11
Views
6K
  • · Replies 1 ·
Replies
1
Views
3K
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
4K
Replies
2
Views
3K
  • · Replies 11 ·
Replies
11
Views
6K