News and Announcements » |
Description:
This script takes a sequence file and performs all processing steps through building the OTU table.
These are the values that you would typically pass as –template_fp to align_seqs.py and lane_mask_fp to filter_alignment.py, respectively.
Usage: pick_otus_through_otu_table.py [options]
Input Arguments:
Note
[REQUIRED]
[OPTIONAL]
Output:
This script will produce a set of cluster centroids (as a FASTA file) and a cluster mapping file (from denoise.py if sff.txt and mapping file were provided), an OTU mapping file (pick_otus.py), a representative set of sequences (FASTA file from pick_rep_set.py), a sequence alignment file (FASTA file from align_seqs.py), taxonomy assignment file (from assign_taxonomy.py), a filtered sequence alignment (from filter_alignment.py), a phylogenetic tree (Newick file from make_phylogeny.py) and an OTU table (from make_otu_table.py).
Simple example:
The following command will start an analysis on inseq1.fasta (-i), which is a post-split_libraries fasta file. The sequence identifiers in this file should be of the form <sample_id>_<unique_seq_id>. The following steps, corresponding to the preliminary data preparation, are applied.
All output files will be written to the directory specified by -o, and subdirectories as appropriate.
pick_otus_through_otu_table.py -i inseqs1.fasta -o wf1/ -p custom_parameters.txt
Simple example with denoising:
This command will do the same steps as the previous example and additionally denoise the data set prior to OTU picking. Only flowgrams in the input sff.txt file that have a matching identifier in inseqs1.fasta are considered here, the rest is discarded.
All output files will be written to the directory specified by -o, and subdirectories as appropriate.
pick_otus_through_otu_table.py -s inseqs1.sff.txt -m metadata_mapping.txt -i inseqs1.fasta -o wf2/ -p custom_parameters.txt