Gene Identification: How Are Genes Identified?

In summary, DNA contains genes, which code for proteins. DNA is sparsely distributed within strands of DNA, and genes are identified by their start and stop codons, as well as by the proteins they produce. Once you sequence and assemble a certain area, you can run programs that predict potential gene locations. Once you think your prediction is good, you can look at the sequence before and after the gene to see if there any marker. Non-coding sequence before and after the gene that are required for the gene to be transcribed into RNA. There are also important when you are trying to id a gene. The last step is to do experimental work to confirm that the gene produces the predicted protein and RNA.
  • #1
nesp
60
0
The way I understand it, genes are rather sparse within a strand of DNA. The human genome, for example, has a few thousand genes in a strand of 3.5 million A,C,G,T bases. I also understand that genes code for proteins through RNA but, again, most of the DNA is noncoding. My question is, how do we identify genes from non-genes? I've searched through google and found some useful sites, such as this one http://www.genomenewsnetwork.org/articles/06_00/sequence_primer.shtml

but they are overly complex for my question. I'm impressed that the entire genome can be decoded, but is there a simple explanation for how a geneticist identifies that this particulare section of DNA is a gene and that one is not?
 
Biology news on Phys.org
  • #2
Here a few basic concept, every gene will have 3 base pair (i.e. a codon) that signal the start of protein. There several different start signal and those depend on the organism. There's also stop signal at the end of the gene that signal the last codon

http://en.wikipedia.org/wiki/Start_codon
http://en.wikipedia.org/wiki/Stop_codon#Start.2Fstop_codonsOnce you sequence and assemble a certain area, you can run programs that will predict potential gene and their location. These potential genes are called open reading frame (ORF).

ORFinder is an example
http://www.ncbi.nlm.nih.gov/gorf/gorf.html

Once you id a potential ORF, you can compare, using a computer program, the predicted protein sequence to a database of other predicted and/or sequenced proteins.

BLAST is such a program
http://www.ncbi.nlm.nih.gov/BLAST/

Once you think your prediction is good, you can then look at the sequence before and after the gene to see if there any marker. Non-coding sequence before and after the gene that are required for the gene to be transcribed into RNA. There are also important when you are trying to id a gene.

The last step you be to do experiemental work to confirm that the gene produce the predicted protein and the predicted RNA.
 
  • #3
Thanks for that explanation and those links, they are very helpful. My field is mathematics, not biology, and if I didn't know what you were referring to, I would think you were describing a computer program, especially the start and stop signals. In essence, DNA is a biological computer. It's amazing the kind of order and logical structures that can be produced through randomness and chaos.
 

What is gene identification?

Gene identification is the process of identifying and characterizing specific genes within an organism's genome. This involves identifying the sequence of DNA that makes up the gene and determining its function.

Why is gene identification important?

Gene identification is important because it allows us to understand the genetic basis of various traits and diseases. By identifying and studying genes, we can gain insights into how they function and how they may contribute to certain health conditions.

How are genes identified?

Genes are identified using a variety of methods, including genetic mapping, DNA sequencing, and gene expression analysis. These techniques allow researchers to locate specific genes within the genome and determine their function.

What are the challenges of gene identification?

One of the main challenges of gene identification is the vast amount of genetic information that must be analyzed. Additionally, some genes may be difficult to identify due to their location within the genome or their complex function.

How is gene identification used in research and medicine?

Gene identification plays a crucial role in both research and medicine. In research, it allows scientists to study the function of genes and their role in different biological processes. In medicine, gene identification can aid in the diagnosis and treatment of genetic disorders and diseases.

Similar threads

  • Biology and Medical
Replies
11
Views
2K
Replies
12
Views
1K
  • Biology and Medical
Replies
8
Views
1K
  • Biology and Medical
Replies
8
Views
1K
Replies
2
Views
3K
Replies
4
Views
2K
Replies
2
Views
1K
  • Biology and Medical
Replies
11
Views
5K
Replies
2
Views
3K
  • Biology and Medical
Replies
3
Views
3K
Back
Top