News and Announcements » |
Description:
Usage: filter_fasta.py [options]
Input Arguments:
Note
[REQUIRED]
[OPTIONAL]
Output:
OTU map-based filtering:
Keep all sequences that show up in an OTU map.
filter_fasta.py -f inseqs.fasta -o otu_map_filtered_seqs.fasta -m otu_map.txt
Chimeric sequence filtering:
Discard all sequences that show up in chimera checking output. NOTE: It is very important to pass -n here as this tells the script to negate the request, or discard all sequences that are listed via -s. This is necessary to remove the identified chimeras from inseqs.fasta.
filter_fasta.py -f inseqs.fasta -o non_chimeric_seqs.fasta -s chimeric_seqs.txt -n
Sequence list filtering:
Keep all sequences from as fasta file that are listed in a text file.
filter_fasta.py -f inseqs.fasta -o list_filtered_seqs.fasta -s seqs_to_keep.txt
biom-based filtering:
Keep all sequences that are listed as observations in a biom file.
filter_fasta.py -f inseqs.fastq -o biom_filtered_seqs.fastq -b otu_table.biom
fastq filtering:
Keep all sequences from a fastq file that are listed in a text file (note: file name must end with .fastq to support fastq filtering).
filter_fasta.py -f inseqs.fastq -o list_filtered_seqs.fastq -s seqs_to_keep.txt
sample id list filtering:
Keep all sequences from a fasta file where the sample id portion of the sequence identifier is listed in a text file (sequence identifiers in fasta file must be in post-split libraries format: sampleID_seqID).
filter_fasta.py -f sl_inseqs.fasta -o sample_id_list_filtered_seqs.fasta --sample_id_fp map.txt