>genescaffold:dipOrd1:GeneScaffold_6031:21480:57878:-1

  • Thread starter hivesaeed4
In summary, the thread asks how to read location-style FASTA headers on gene sequences. Such headers list the coordinate system, the genome assembly, the name of the sequence region, the start and end positions, and the strand. The header in the title names a gene scaffold (GeneScaffold_6031) in an assembly called dipOrd1 instead of a chromosome, and the question is what those two fields mean and where the gene actually lies.
  • #1
hivesaeed4
Normally, when there is a FASTA header on top of a gene sequence in NCBI, you expect something like

>chromosome:NCBI36:7:100155759:100159857:1

This clearly indicates that the gene in question lies on chromosome 7, between bases 100155759 and 100159857. It further tells us that it's the human genome (NCBI36) and that the gene is transcribed in the direction of the forward strand.
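That field layout can be split mechanically. Here is a minimal Python sketch, assuming the six colon-separated fields are coordinate system, assembly version, region name, start, end and strand. The names `Location` and `parse_location_header` are made up for illustration and do not come from any library.

```python
from typing import NamedTuple


class Location(NamedTuple):
    """Fields of a location-style FASTA header (hypothetical helper type)."""
    coord_system: str  # e.g. "chromosome"
    version: str       # assembly name, e.g. "NCBI36"
    name: str          # region name, e.g. "7"
    start: int
    end: int
    strand: int        # 1 = forward strand, -1 = reverse strand


def parse_location_header(header: str) -> Location:
    """Split a header such as '>chromosome:NCBI36:7:100155759:100159857:1'."""
    fields = header.strip().lstrip(">").split(":")
    if len(fields) != 6:
        raise ValueError(f"expected 6 colon-separated fields, got {len(fields)}")
    coord_system, version, name, start, end, strand = fields
    return Location(coord_system, version, name, int(start), int(end), int(strand))


loc = parse_location_header(">chromosome:NCBI36:7:100155759:100159857:1")
print(loc)
```

The parser only checks the field count and converts the three numeric fields, so a header with a different layout raises an error instead of being silently misread.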

But let's come back to the title, which is also a FASTA header found on top of a gene sequence in NCBI. Now it's the mouse genome (I found that out separately, and I know the gene lies between 21480 and 57878 and that it's transcribed along the reverse strand). But where does the gene lie (as in, on which mouse chromosome)? What is this gene scaffold? And what is dipOrd1?

Lots of thanks in advance.
 

FAQ: >genescaffold:dipOrd1:GeneScaffold_6031:21480:57878:-1

1. What is a genescaffold?

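A gene scaffold is a sequence region used in place of a chromosome when a genome assembly is too fragmented to be placed on chromosomes. In Ensembl's low-coverage assemblies, gene scaffolds are built by joining smaller assembly scaffolds that each carry part of the same gene, so that the gene can be annotated on one continuous sequence. For that reason, the header by itself does not say which chromosome the gene lies on.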

2. What does dipOrd1 refer to in the genescaffold name?

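dipOrd1 is the name of the genome assembly, not a taxonomic code, and it has nothing to do with ploidy. It follows the common convention of abbreviating the species name and appending a version number: Dipodomys ordii (Ord's kangaroo rat), assembly 1. It occupies the same position in the header that NCBI36 occupies in a human header, so the sequence comes from the kangaroo rat genome and not the mouse genome.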

3. What is the significance of GeneScaffold_6031?

GeneScaffold_6031 is a unique identifier given to this specific genescaffold. This allows scientists to easily reference and locate this particular sequence of DNA.

4. What do the numbers 21480:57878:-1 represent in the genescaffold name?

There are three fields here. 21480 and 57878 are the start and end positions of the gene on GeneScaffold_6031 (not on a chromosome), and -1 means the gene is transcribed along the reverse strand; a value of 1 would mean the forward strand.
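As a quick check on those fields, here is a small Python sketch that turns the header into a region name, a length and a strand label. It assumes 1-based, inclusive coordinates (the Ensembl convention), and `describe_region` is a hypothetical helper name. Whitespace is removed first in case the header was copied with a stray space or line wrap.

```python
def describe_region(header: str) -> dict:
    """Summarise a location-style FASTA header (hypothetical helper)."""
    # Remove all whitespace, then the leading ">" marker.
    cleaned = "".join(header.split()).lstrip(">")
    coord_system, assembly, name, start, end, strand = cleaned.split(":")
    start, end, strand = int(start), int(end), int(strand)
    return {
        "coord_system": coord_system,
        "assembly": assembly,
        "region": name,
        # Assumes 1-based inclusive coordinates, hence the +1.
        "length": end - start + 1,
        "strand": "forward" if strand == 1 else "reverse",
    }


info = describe_region(">genescaffold:dipOrd1:GeneScaffold_6031:21480:57878:-1")
print(info)
```

Under that assumption the region spans 36399 bases on the reverse strand of GeneScaffold_6031.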

5. How is a genescaffold different from a gene?

A gene is a specific segment of DNA that contains the instructions for producing a protein or a functional RNA. A gene scaffold, on the other hand, is an assembled sequence region on which one or more genes are located. It is a coordinate system produced by the assembly and annotation process, not a biological unit, and the positions in the header are measured along it.
