Skip to content

SOAPdenovo-Trans #
Find similar titles

SOAPdenovo-Trans is a De novo transcriptome assembly program basing on the SOAPdenovo framework, adapt to alternative splicing and different expression level among transcripts.

Installation #

  • Download pre-compiled binary
  • Download source code, "sh"

Usage #

Configuration 파일을 만들고, 프로그램 구동

$ SOAPdenovo-Trans all -s config_file -o outputGraph

NOTE: SOAPdenovo-Trans has two versions: SOAPdenovo-Trans-31mer and SOAPdenovo-Trans-127mer.

Configuration 파일 항목

  1. avg_ins: the average insert size of this library
  2. reverse_seq: if the sequences need to be complementarily reversed (0 or 1)
  3. asm_flags: which part the reads are used. (1: only contig assembly, 2: only scaffold, 3: both contig and scaffold)
  4. rd_len_cutof: cut the reads to this length
  5. map_len: the min alignment length between a read and a contig for reliable read location

Configuration 파일 예제

#maximal read length
#maximal read length in this lib
#average insert size
#if sequence needs to be reversed 
#in which part(s) the reads are used
#minimum aligned length to contigs for a reliable read location (at     least 32 for short insert size)
#fastq file for read 1 
#fastq file for read 2 always follows fastq file for read 1
#fasta file for read 1 
#fastq file for read 2 always follows fastq file for read 1
#fastq file for single reads
#fasta file for single reads
#a single fasta file for paired reads


-s  <string>        configFile: the config file of reads
-o  <string>        outputGraph: prefix of output graph file name
-g  <string>        inputGraph: prefix of input graph file names
-R  (optional)      output assembly RPKM statistics, [NO]
-f  (optional)      output gap related reads for SRkgf to fill gap, [NO]
-S  (optional)      scaffold structure exists, [NO]
-F  (optional)      fill gaps in scaffolds, [NO]
-K  <int>           kmer (min 13, max 31/127): kmer size, [23]
-p  <int>           n_cpu: number of cpu for use, [8]
-d  <int>           kmerFreqCutoff: kmers with frequency no larger than KmerFreqCutoff will be deleted, [0]
-e  <int>           EdgeCovCutoff: edges with coverage no larger than EdgeCovCutoff will be deleted, [2]
-M  <int>           mergeLevel (min 0, max 3): the strength of merging similar sequences during contiging, [1]
-L  <int>           minContigLen: shortest contig for scaffolding, [100]
-t  <int>           locusMaxOutput: output the number of transcripts no more than locusMaxOutput in one locus, [5]
-G  <int>           gapLenDiff: allowed length difference between estimated and filled gap, [50]

Incoming Links #

Related Articles #

Suggested Pages #