SNP IDENTIFICATION PIPELINE:

 

Script name: unix-snp-trace

 

PURPOSE: Script reads trace data from a folder, process the data

by ‘PHRED’, generates fasta sequences, PHD files, QUAL files.

PHRAP process the fasta sequences and produces ‘.ACE’ file which is read by Polybayes program.

 

Last part of the script (CONSED) calls 'phredPhrap' script and 'consed'

from 'edit_dir' which is sub folder of 'phrap_out' folder.

 

'phredPhrap' script and CONSED programs will work if you have installed

and configured 'PHRED/PHRAP/CONSED' under a separate folder as a single

package.

 

Please comment the lines that invoke 'CONSED' at the bottom of this script if you do not wish to run CONSED.

 

STEPS INVOLVED

 --------------

This script assumes that you have installed and configured PHRED, PHRAP, and PolyBayes separately

 

Script can be modified to accept a  multiple fasta file as input (skipping PHRED analysis)

 

TRACE FILE PROCESSING using PHRED and PHRAP that are installed under

individual folders.

 

1. Reads trace data, process and writes output files under

'$HOME/phred_out' directory.

 

2. Subdirectories under 'phred_out' contains sequences, chromatogram files and PHD files.

 

3. Sequence files are processed by PHRAP.

 

4. PHRAP produces contigs, alignment and .ace files under 'phrap_out'

folder.

 

5. Running POLYBAYES: .ACE file is read by PolyBayes and results of Polybayes are written into 'poly_out'

 

6. phredPhrap and consed viewer are executed from phred_out/edit_dir using chromatogram files as input;for processing trace data and viewing with CONSED-VIEWER using PHRED/PHRAP/CONSED set of programs

 

     

 

# -------------- OUTPUT FOLDERS ------------

OUTPUTS 1. phred_out, results of PHRED analysis

OUTPUTS 2. phrap_out, results of PHRAP analysis

OUTPUTS 2. poly_out,  results of POLYBayes analysis