Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

hhfilter(1) [debian man page]

HHFILTER(1)							   User Commands						       HHFILTER(1)

NAME
hhfilter - filter an alignment by maximum sequence identity of match states and minimum coverage SYNOPSIS
hhfilter -i infile -o outfile [options] DESCRIPTION
HHfilter version 2.0.15 (June 2012) Filter an alignment by maximum sequence identity of match states and minimum coverage (C) Johannes Soeding, Michael Remmert, Andreas Biegert, Andreas Hauser Remmert M, Biegert A, Hauser A, and Soding J. HHblits: Lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat. Methods 9:173-175 (2011). -i <file> read input file in A3M/A2M or FASTA format -o <file> write to output file in A3M format -a <file> append to output file in A3M format OPTIONS
-v <int> verbose mode: 0:no screen output 1:only warings 2: verbose -id [0,100] maximum pairwise sequence identity (%) (def=90) -diff [0,inf[ filter MSA by selecting most diverse set of sequences, keeping at least this many seqs in each MSA block of length 50 (def=0) -cov [0,100] minimum coverage with query (%) (def=0) -qid [0,100] minimum sequence identity with query (%) (def=0) -qsc [0,100] minimum score per column with query (def=-20.0) -neff [1,inf] target diversity of alignment (default=off) Input alignment format: -M a2m use A2M/A3M (default): upper case = Match; lower case = Insert; '-' = Delete; '.' = gaps aligned to inserts (may be omitted) -M first use FASTA: columns with residue in 1st sequence are match states -M [0,100] use FASTA: columns with fewer than X% gaps are match states Example: hhfilter -id 50 -i d1mvfd_.a2m -o d1mvfd_.fil.a2m hhfilter 2.0.15 June 2012 HHFILTER(1)

Check Out this Related Man Page

HHMAKE(1)							   User Commands							 HHMAKE(1)

NAME
hhmake - build an HMM from an input alignment or convert between HMMER format and HHsearch format SYNOPSIS
hhmake -i file [options] DESCRIPTION
HHmake version 2.0.15 (June 2012) Build an HMM from an input alignment in A2M, A3M, or FASTA format, or convert between HMMER format (.hmm) and HHsearch format (.hhm). Remmert M, Biegert A, Hauser A, and Soding J. HHblits: Lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat. Methods 9:173-175 (2011). (C) Johannes Soeding, Michael Remmert, Andreas Biegert, Andreas Hauser -i <file> query alignment (A2M, A3M, or FASTA), or query HMM Output options: -o <file> HMM file to be written to (default=<infile.hhm>) -a <file> HMM file to be appended to -v <int> verbose mode: 0:no screen output 1:only warings 2: verbose -seq <int> max. number of query/template sequences displayed (def=10) Beware of overflows! All these sequences are stored in memory. -cons make consensus sequence master sequence of query MSA -name <name> use this name for HMM (default: use name of first sequence) Filter query multiple sequence alignment -id [0,100] maximum pairwise sequence identity (%) (def=90) -diff [0,inf[ filter MSA by selecting most diverse set of sequences, keeping at least this many seqs in each MSA block of length 50 (def=100) -cov [0,100] minimum coverage with query (%) (def=0) -qid [0,100] minimum sequence identity with query (%) (def=0) -qsc [0,100] minimum score per column with query (def=-20.0) -neff [1,inf] target diversity of alignment (default=off) Input alignment format: -M a2m use A2M/A3M (default): upper case = Match; lower case = Insert; '-' = Delete; '.' = gaps aligned to inserts (may be omitted) -M first use FASTA: columns with residue in 1st sequence are match states -M [0,100] use FASTA: columns with fewer than X% gaps are match states Pseudocount (pc) options: -pcm 0-2 position dependence of pc admixture 'tau' (pc mode, default=0) 0: no pseudo counts: tau = 0 1: constant tau = a 2: diversity-dependent: tau = a/(1 + ((Neff[i]-1)/b)^c) (Neff[i]: number of effective seqs in local MSA around column i) 3: constant diversity pseudocounts -pca [0,1] overall pseudocount admixture (def=1.0) -pcb [1,inf[ Neff threshold value for -pcm 2 (def=1.5) -pcc [0,3] extinction exponent c for -pcm 2 (def=1.0) -pre_pca [0,1] PREFILTER pseudocount admixture (def=0.8) -pre_pcb [1,inf[ PREFILTER threshold for Neff (def=1.8) Context-specific pseudo-counts: -nocontxt use substitution-matrix instead of context-specific pseudocounts -contxt <file> context file for computing context-specific pseudocounts (default=debian/tmp/usr/share/hhsuite/data/context_data.lib) -cslib <file> column state file for fast database prefiltering (default=debian/tmp/usr/share/hhsuite/data/cs219.lib) Example: hhmake -i test.a3m hhmake 2.0.15 June 2012 HHMAKE(1)
Man Page