NAME
abyss-pe - assemble reads into contigsSYNOPSIS
abyss-pe [ OPTION]... [PARAMETER=VALUE]... [MAKE_TARGET]...DESCRIPTION
Assemble the reads of the input files into contigs. The reads may be in FASTA, FASTQ, qseq, export, SRA, SAM or BAM format and may be compressed with gz, bz2 or xz and may be tarred.Parameters of abyss-pe
- name, JOB_NAME
- The name of this assembly. The resulting scaffolds will be stored in ${name}-scaffolds.fa.
- in
- input files. Use this variable when assembling data from a single library.
- lib
- a quoted list of whitespace-separated paired-end library names. Use this varible when assembling data from multiple paired-end libraries. For each library name in lib, the user must define a variable on the command line with the same name, which indicates the read files for that library. See EXAMPLES below for a concrete example of usage.
- pe
- list of paired-end libraries that will be used only for merging unitigs into contigs and will not contribute toward the consensus sequence.
- mp
- list of mate-pair libraries that will be used for scaffolding. Mate-pair libraries do not contribute toward the consensus sequence.
- long
- list of long sequence libraries that will be used for rescaffolding. long sequence libraries do not contribute toward the consensus sequence.
- se
- files containing single-end reads
- a
- maximum number of branches of a bubble [2]
- b
- maximum length of a bubble (bp) [10000]
- c
- minimum mean k-mer coverage of a unitig [sqrt(median)]
- d
- allowable error of a distance estimate (bp) [6]
- e
- minimum erosion k-mer coverage [sqrt(median)]
- E
- minimum erosion k-mer coverage per strand [1]
- j
- number of threads [2]
- k
- size of k-mer (bp)
- l
- minimum alignment length of a read (bp) [k]
- m
- minimum overlap of two unitigs (bp) [30]
- n
- minimum number of pairs required for building contigs [10]
- N
- minimum number of pairs required for building scaffolds [n]
- p
- minimum sequence identity of a bubble [0.9]
- q
- minimum base quality when trimming [3]
- Q
- minimum base quality [0]
- s
- minimum unitig size required for building contigs (bp)
[200]
- S
- minimum contig size required for building scaffolds (bp) [s]
- SS
- SS=--SS to assemble in strand-specific mode
- t
- minimum tip size (bp) [2k]
- v
- v=-v to enable verbose logging
- np, NSLOTS
- the number of processes of an MPI assembly
- mpirun
- the path to mpirun
- aligner
- the program to use to align the reads to the contigs [map]
- cs
- convert colour-space contigs to nucleotide contigs following assembly
Options of make
- -n, --dry-run
- Print the commands that would be executed, but do not execute them.
Make targets for abyss-pe
- default
- Equivalent to `scaffolds scaffolds-dot stats'.
- unitigs
- Assemble unitigs.
- unitigs-dot
- Output the unitig overlap graph.
- pe-sam
- Map paired-end reads to the unitigs and output a SAM file. SAM file will only contain reads mapping to different contigs, and the read ID, sequence and quality strings will be replaced with '*' characters.
- pe-bam
- Map paired-end reads to the unitigs and output a BAM file. BAM file will only contain reads mapping to different contigs, and the read ID, sequence and quality strings will be replaced with '*' characters.
- pe-index
- Generate an index of the unitigs used by abyss-map.
- contigs
- Assemble contigs.
- contigs-dot
- Output the contig overlap graph.
- mp-sam
- Map mate-pair reads to the contigs and output a SAM file. SAM file will only contain reads mapping to different contigs, and the read ID, sequence and quality strings will be replaced with '*' characters.
- mp-bam
- Map mate-pair reads to the contigs and output a BAM file. BAM file will only contain reads mapping to different contigs, and the read ID, sequence and quality strings will be replaced with '*' characters.
- mp-index
- Generate an index of the contigs used by abyss-map.
- scaffolds
- Assemble scaffolds.
- scaffolds-dot
- Output the scaffold overlap graph.
- scaftigs
- Break scaffolds and generate AGP file.
- long-scaffs
- Rescaffold using RNA-Seq assembled contigs.
- long-scaffs-dot
- Output the RNA scaffold overlap graph.
- stats
- Display assembly contiguity statistics.
- clean
- Remove intermediate files.
- version
- Display the version of abyss-pe.
- versions
- Display the versions of all programs used by abyss-pe.
- help
- Display a helpful message.
ENVIRONMENT VARIABLES
Any parameter that may be specified on the command line may also be specified in an environment variable.- PATH
- must contain the directory where the ABySS executables are installed. Use `abyss-pe versions` to check that PATH is configured correctly.
- TMPDIR
- specifies a directory to use for temporary files
Scheduler integration
ABySS integrates well with cluster job schedulers, such as:* SGE (Sun Grid Engine)
* Portable Batch System (PBS)
* Load Sharing Facility (LSF)
* IBM LoadLeveler
EXAMPLES
One paired-end library
abyss-pe k=64 name=ecoli in='reads1.fa reads2.fa'
Multiple paired-end libraries
abyss-pe k=64 name=ecoli lib='lib1 lib2' \
Paired-end and mate-pair libraries
abyss-pe k=64 name=ecoli lib='pe1 pe2' mp='mp1 mp2' \
Including RNA-Seq assemblies
abyss-pe k=64 name=ecoli lib=pe1 mp=mp1 long=long1 \
MPI
abyss-pe np=8 k=64 name=ecoli in='reads1.fa reads2.fa'
SGE
qsub -N ecoli -t 64 -pe openmpi 8 \
SEE ALSO
make(1), ABYSS(1)AUTHOR
Written by Shaun Jackman.REPORTING BUGS
Report bugs to <[email protected]>.COPYRIGHT
Copyright 2014 Canada's Michael Smith Genome Sciences Centre2014-May | abyss-pe (ABySS) 1.5.2 |