Aligning 2 Sequences using Needleman-Wunsch

AI Thread Summary
The discussion centers around the application of the Needleman-Wunsch algorithm for aligning two sequences, CCATTGGAGTA and CATAGA. The user expected to obtain four alignment results due to the permutability of the first 'C' in the second sequence but only received two. Suggestions were made to adjust software parameters, particularly the gap penalty, to potentially generate additional alignments. It was noted that some alignment software struggles with first base pair permutations. Users recommended ClustalW and other bioinformatics tools for sequence alignment, emphasizing the importance of understanding the underlying principles of bioinformatics, including mathematical models and scoring methods. The conversation also highlighted the challenges of bioinformatics education and the need for a solid grasp of theoretical concepts to make sense of practical applications.
Wunsch
Messages
9
Reaction score
0
I tryed to align two sequences
CCATTGGAGTA
and CATAGA
using Needleman-Wunsch algorithm but it only gives me 2 alignment resutls
CCATTGGAGTA
C-A-T--AG-A
and
CCATTGGAGTA
C-AT---AG-A

while I thought it should be 4 results because the first C in the second sequence is also permutable, right ?

So can you tell me how to take out the other two alignments ?

Thank you very much inadvance
 
Biology news on Phys.org
You should be able to change some parameter in the software option. You can change the gap penalty. Some alignment software also have problem when it comes to the first base pair permutation.
 
Thank iansmith, I solve that problem with a pen and a piece of paper only..
But i am still really wanting to know how a software parameter change would be able to handle that problem, what does such a software do to get 4 alignments ?
This is what I took from my biology class, talking with me about software thingies I think is like pouring water on a duck's head because I actually don't understand a thing at all.

Thank you so very much in advance and a "Happy New Year" from me to you too...:wink:
 
I never get any help from anyone except iansmith !
 
During class, we were pointed to ClustalW (http://www.ebi.ac.uk/clustalw/) to do sequence alignments. Click the FAQ, and walla! It gives you help on exactly what all those insane options do...

And don't dispair, bioinformatics is a tough subject - and gets worse. Once I waltzed into my uni's computer labs to check my email and saw my friends (who were doing biomedical science) crying during their bioinformatics prac
 
Last edited:
For 2 sequence aligment you can use Blast 2 sequence and you easily change so parameter.
http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.cgi

For Clustal, I prefer to use the version available at the Biology Workbeanch of UCSD
http://workbench.sdsc.edu/

You need to register and it is free. Also you get a bunch of other bioinformatic tools.

I also took a grad level class. You need to understand the principle and theory behind bioinformatics. It make more sense once you understand the basics. My class was mostly about sequence alignment, BLAST and making threes.
 
Last edited by a moderator:
Thank you jikx
iansmith, I know that was my mistake, I am really sorry. Can you tell me how to do now ? I don't know what i should do. I am really stuck!
By the way, would you also tell me what lies behind bioinformatics ?
 
Just paste this in the big white box (from the site i mentioned above):


>FOSB_Wunsch1 Protein fosA
CCATTGGAGTA

>FOSB_Wunsch2 Protein fosB
CATAGA


and you get an output like this:
http://www.ebi.ac.uk/cgi-bin/clustalw/result?tool=clustalw&jobid=clustalw-20050106-11352860&poll=yes

which is an output before you muck around with all the settings like window length, gap extensions etc (the FAQ can help you there) .. however, these programs are more for sequences which are hundreds of of basepairs long!
 
Last edited by a moderator:
Wunsch, do you have any theoritical part for class.

If you don't:
First, you need to understand the mathematical models that are behind bioinformatics. You should look the Wright-Fisher model, Hardy-Weinberg Equilibrium, and Neutral model.
Second, you need to understand probability and how it applies to bioinformatics.
Third, try to understand what is behind the scoring methods such as the blossom scoring matrix.
 
Back
Top