Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

clustalw(1) [debian man page]

CLUSTALW(1)							  Clustal Manual						       CLUSTALW(1)

NAME
clustalw - Multiple alignment of nucleic acid and protein sequences SYNOPSIS
clustalw [-infile] file.ext [OPTIONS] clustalw [-help | -fullhelp] DESCRIPTION
Clustal W is a general purpose multiple alignment program for DNA or proteins. The program performs simultaneous alignment of many nucleotide or amino acid sequences. It is typically run interactively, providing a menu and an online help. If you prefer to use it in command-line (batch) mode, you will have to give several options, the minimum being -infile. OPTIONS
DATA (sequences) -infile=file.ext Input sequences. -profile1=file.ext and -profile2=file.ext Profiles (old alignment) VERBS (do things) -options List the command line parameters. -help or -check Outline the command line params. -fullhelp Output full help content. -align Do full multiple alignment. -tree Calculate NJ tree. -pim Output percent identity matrix (while calculating the tree). -bootstrap=n Bootstrap a NJ tree (n= number of bootstraps; def. = 1000). -convert Output the input sequences in a different file format. PARAMETERS (set things) General settings: -interactive Read command line, then enter normal interactive menus. -quicktree Use FAST algorithm for the alignment guide tree. -type= PROTEIN or DNA sequences. -negative Protein alignment with negative values in matrix. -outfile= Sequence alignment file name. -output= GCG, GDE, PHYLIP, PIR or NEXUS. -outputorder= INPUT or ALIGNED -case LOWER or UPPER (for GDE output only). -seqnos= OFF or ON (for Clustal output only). -seqnos_range= OFF or ON (NEW: for all output formats). -range=m,n Sequence range to write starting m to m+n. -maxseqlen=n Maximum allowed input sequence length. -quiet Reduce console output to minimum. -stats=file Log some alignments statistics to file. Fast Pairwise Alignments: -ktuple=n Word size. -topdiags=n Number of best diags. -window=n Window around best diags. -pairgap=n Gap penalty. -score PERCENT or ABSOLUTE. Slow Pairwise Alignments: -pwmatrix= :Protein weight matrix=BLOSUM, PAM, GONNET, ID or filename -pwdnamatrix= DNA weight matrix=BLOSUMIUB, BLOSUMCLUSTALW or BLOSUMfilename. -pwgapopen=f Gap opening penalty. -pwgapext=f Gap extension penalty. Multiple Alignments: -newtree= File for new guide tree. -usetree= File for old guide tree. -matrix= Protein weight matrix=BLOSUM, PAM, GONNET, ID or filename. -dnamatrix= DNA weight matrix=IUB, CLUSTALW or filename. -gapopen=f Gap opening penalty. -gapext=f Gap extension penalty. -engaps No end gap separation pen. -gapdist=n Gap separation pen. range. -nogap Residue-specific gaps off. -nohgap Hydrophilic gaps off. -hgapresidues= List hydrophilic res. -maxdiv=n Percent identity for delay. -type= PROTEIN or DNA -transweight=f Transitions weighting. -iteration= NONE or TREE or ALIGNMENT. -numiter=n Maximum number of iterations to perform. Profile Alignments: -profile Merge two alignments by profile alignment. -newtree1= File for new guide tree for profile1. -newtree2= File for new guide tree for profile2. -usetree1= File for old guide tree for profile1. -usetree2= File for old guide tree for profile2. Sequence to Profile Alignments: -sequences Sequentially add profile2 sequences to profile1 alignment. -newtree= File for new guide tree. -usetree= File for old guide tree. Structure Alignments: -nosecstr1 Do not use secondary structure-gap penalty mask for profile 1. -nosecstr2 Do not use secondary structure-gap penalty mask for profile 2. -secstrout=STRUCTURE or MASK or BOTH or NONE Output in alignment file. -helixgap=n Gap penalty for helix core residues. -strandgap=n Gap penalty for strand core residues. loopgap=n Gap penalty for loop regions. -terminalgap=n Gap penalty for structure termini. -helixendin=n Number of residues inside helix to be treated as terminal. -helixendout=n Number of residues outside helix to be treated as terminal. -strandendin=n Number of residues inside strand to be treated as terminal. -strandendout=n Number of residues outside strand to be treated as terminal. Trees: -outputtree=nj OR phylip OR dist OR nexus -seed=n Seed number for bootstraps. -kimura Use Kimura's correction. -tossgaps Ignore positions with gaps. -bootlabels=node Position of bootstrap values in tree display. -clustering= NJ or UPGMA. BUGS
The Clustal bug tracking system can be found at http://bioinf.ucd.ie/bugzilla/buglist.cgi?quicksearch=clustal. SEE ALSO
clustalx(1). REFERENCES
o Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG. (2007). Clustal W and Clustal X version 2.0.[1] Bioinformatics, 23, 2947-2948. o Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, Higgins DG, Thompson JD. (2003). Multiple sequence alignment with the Clustal series of programs.[2] Nucleic Acids Res., 31, 3497-3500. o Jeanmougin F, Thompson JD, Gouy M, Higgins DG, Gibson TJ. (1998). Multiple sequence alignment with Clustal X[3]. Trends Biochem Sci., 23, 403-405. o Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. (1997). The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools.[4] Nucleic Acids Res., 25, 4876-4882. o Higgins DG, Thompson JD, Gibson TJ. (1996). Using CLUSTAL for multiple sequence alignments.[5] Methods Enzymol., 266, 383-402. o Thompson JD, Higgins DG, Gibson TJ. (1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.[6] Nucleic Acids Res., 22, 4673-4680. o Higgins DG. (1994). CLUSTAL V: multiple alignment of DNA and protein sequences.[7] Methods Mol Biol., 25, 307-318 o Higgins DG, Bleasby AJ, Fuchs R. (1992). CLUSTAL V: improved software for multiple sequence alignment.[8] Comput. Appl. Biosci., 8, 189-191. o Higgins,D.G. and Sharp,P.M. (1989). Fast and sensitive multiple sequence alignments on a microcomputer.[9] Comput. Appl. Biosci., 5, 151-153. o Higgins,D.G. and Sharp,P.M. (1988). CLUSTAL: a package for performing multiple sequence alignment on a microcomputer.[10] Gene, 73, 237-244. AUTHORS
Des Higgins Copyright holder for Clustal. Julie Thompson Copyright holder for Clustal. Toby Gibson Copyright holder for Clustal. Charles Plessy <plessy@debian.org> Prepared this manpage in DocBook XML for the Debian distribution. COPYRIGHT
Copyright (C) 1988-2010 Des Higgins, Julie Thompson & Toby Giboson (Clustal) Copyright (C) 2008-2010 Charles Plessy (This manpage) This program is free software: you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details. You should have received a copy of the GNU Lesser General Public License along with this program. If not, see http://www.gnu.org/licenses/, or on Debian systems, /usr/share/common-licenses/LGPL-3. This manual page and its XML source can be used, modified, and redistributed as if it were in public domain. NOTES
1. Clustal W and Clustal X version 2.0. http://www.ncbi.nlm.nih.gov/pubmed/17846036 2. Multiple sequence alignment with the Clustal series of programs. http://www.ncbi.nlm.nih.gov/pubmed/12824352 3. Multiple sequence alignment with Clustal X http://www.ncbi.nlm.nih.gov/pubmed/9810230 4. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. http://www.ncbi.nlm.nih.gov/pubmed/9396791 5. Using CLUSTAL for multiple sequence alignments. http://www.ncbi.nlm.nih.gov/pubmed/8743695 6. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. http://www.ncbi.nlm.nih.gov/pubmed/7984417 7. CLUSTAL V: multiple alignment of DNA and protein sequences. http://www.ncbi.nlm.nih.gov/pubmed/8004173 8. CLUSTAL V: improved software for multiple sequence alignment. http://www.ncbi.nlm.nih.gov/pubmed/1591615 9. Fast and sensitive multiple sequence alignments on a microcomputer. http://www.ncbi.nlm.nih.gov/pubmed/2720464 10. CLUSTAL: a package for performing multiple sequence alignment on a microcomputer. http://www.ncbi.nlm.nih.gov/pubmed/3243435 Clustal 2.1 12/28/2010 CLUSTALW(1)
Man Page