Discussion Overview
The discussion revolves around the challenge of performing multiple sequence alignment (MSA) with a large dataset of 4000 sequences. Participants explore software options and algorithms suitable for handling such a volume, including ClustalW and alternatives.
Discussion Character
- Exploratory, Technical explanation, Debate/contested
Main Points Raised
- One participant inquires about the maximum number of sequences that can be processed by ClustalW, noting issues with both offline and online versions of the software.
- Another participant suggests using BLAST, although they acknowledge it may not be suitable for MSA, and mention the possibility of writing a custom program for alignment.
- A different participant clarifies that BLAST is primarily for aligning sequences to a database rather than performing MSA.
- One participant recommends checking if the university has the GCG software package, which includes multiple programs that might facilitate MSA.
Areas of Agreement / Disagreement
Participants express uncertainty about the best approach to take for MSA with 4000 sequences, with no consensus on a specific software solution or method. Multiple competing views on software options remain unresolved.
Contextual Notes
Limitations include the lack of information on the specific capabilities of ClustalW regarding sequence limits and the potential need for custom programming solutions. The discussion does not resolve the effectiveness of suggested alternatives.
Who May Find This Useful
Researchers or practitioners in bioinformatics, particularly those dealing with large-scale sequence alignment challenges.