BioWDL: RNA-seq

A BioWDL pipeline for processing RNA-seq data, starting with FASTQ files to produce expression measures and VCFs.

Please be aware that the page you are currently viewing is not for the latest available version!

Inputs for pipeline

The following is an overview of all available inputs in pipeline.

Required inputs

pipeline.dockerImagesFile
File
A YAML file describing the docker image used for the tasks. The dockerImages.yml provided with the pipeline is recommended.
pipeline.hisat2Index
Array[File]+?
The hisat2 index files. Defining this will cause the hisat2 aligner to run. Note that is starIndex is also defined the star results will be used for downstream analyses. May be omitted in starIndex is defined.
pipeline.outputDir
String — Default: "."
The output directory.
pipeline.referenceFasta
File
The reference fasta file
pipeline.referenceFastaDict
File
Sequence dictionary (.dict) file of the reference
pipeline.referenceFastaFai
File
Fasta index (.fai) file of the reference
pipeline.sampleConfigFile
File
The samplesheet, including sample ids, library ids, readgroup ids and fastq file locations.
pipeline.starIndex
Array[File]+?
The star index files. Defining this will cause the star aligner to run and be used for downstream analyses. May be ommited if hisat2Index is defined.
pipeline.strandedness
String
The strandedness of the RNA sequencing library preparation. One of "None" (unstranded), "FR" (forward-reverse: first read equal transcript) or "RF" (reverse-forward: second read equals transcript).

Other common inputs

pipeline.cpatHex
File?
A hexamer frequency table for CPAT. Required if lncRNAdetection is `true`.
pipeline.cpatLogitModel
File?
A logit model for CPAT. Required if lncRNAdetection is `true`.
pipeline.dbsnpVCF
File?
dbsnp VCF file used for checking known sites
pipeline.dbsnpVCFIndex
File?
Index (.tbi) file for the dbsnp VCF
pipeline.detectNovelTranscripts
Boolean — Default: false
Whether or not a transcripts assembly should be used. If set to true Stringtie will be used to create a new GTF file based on the BAM files. This generated GTF file will be used for expression quantification. If `referenceGtfFile` is also provided this reference GTF will be used to guide the assembly.
pipeline.expression.stringtieAssembly.geneAbundanceFile
String?
Where the abundance file should be written.
pipeline.lncRNAdatabases
Array[File] — Default: []
A set of GTF files the assembled GTF file should be compared with. Only used if lncRNAdetection is set to `true`.
pipeline.lncRNAdetection
Boolean — Default: false
Whether or not lncRNA detection should be run. This will enable detectNovelTranscript (this cannot be disabled by setting detectNovelTranscript to false). This will require cpatLogitModel and cpatHex to be defined.
pipeline.preprocessing.regions
File?
A bed file describing the regions to operate on.
pipeline.referenceGtfFile
File?
A reference GTF file. Used for expression quantification or to guide the transcriptome assembly if detectNovelTranscripts is set to `true` (this GTF won't be be used directly for the expression quantification in that case.
pipeline.refflatFile
File?
A refflat files describing the genes. If this is defined RNAseq metrics will be collected.
pipeline.sampleJobs.bamMetrics.ampliconIntervals
File?
An interval list describinig the coordinates of the amplicons sequenced. This should only be used for targeted sequencing or WES. Required if `ampliconIntervals` is defined.
pipeline.sampleJobs.bamMetrics.targetIntervals
Array[File]+?
An interval list describing the coordinates of the targets sequenced. This should only be used for targeted sequencing or WES. If defined targeted PCR metrics will be collected. Requires `ampliconIntervals` to also be defined.
pipeline.sampleJobs.indexStarBam.outputBamPath
String?
The location where the BAM file should be written to. The index will appear alongside this link to the BAM file.
pipeline.sampleJobs.qc.adapterForward
String? — Default: "AGATCGGAAGAG"
The adapter to be removed from the reads first or single end reads.
pipeline.sampleJobs.qc.adapterReverse
String? — Default: "AGATCGGAAGAG"
The adapter to be removed from the reads second end reads.
pipeline.sampleJobs.qc.contaminations
Array[String]+?
Contaminants/adapters to be removed from the reads.
pipeline.sampleJobs.qc.readgroupName
String — Default: sub(basename(read1),"(\.fq)?(\.fastq)?(\.gz)?","")
The name of the readgroup.
pipeline.umiDeduplication
Boolean — Default: false
Whether or not UMI based deduplication should be performed.
pipeline.variantCalling
Boolean — Default: false
Whether or not variantcalling should be performed.
pipeline.variantcalling.callAutosomal.haplotypeCaller.excludeIntervalList
Array[File]+?
Bed files or interval lists describing the regions to NOT operate on.
pipeline.variantcalling.callAutosomal.haplotypeCaller.pedigree
File?
Pedigree file for determining the population "founders"
pipeline.variantcalling.callAutosomal.haplotypeCaller.ploidy
Int?
The ploidy with which the variants should be called.
pipeline.variantcalling.callX.excludeIntervalList
Array[File]+?
Bed files or interval lists describing the regions to NOT operate on.
pipeline.variantcalling.callX.pedigree
File?
Pedigree file for determining the population "founders"
pipeline.variantcalling.callY.excludeIntervalList
Array[File]+?
Bed files or interval lists describing the regions to NOT operate on.
pipeline.variantcalling.callY.pedigree
File?
Pedigree file for determining the population "founders"
pipeline.variantcalling.genotypeGvcfs.pedigree
File?
Pedigree file for determining the population "founders"
pipeline.variantcalling.regions
File?
A bed file describing the regions to operate on.
pipeline.variantcalling.singleSampleGvcf
Boolean — Default: false
Whether to output single-sample gvcfs
pipeline.variantcalling.vcfBasename
String — Default: "multisample"
The basename of the VCF and GVCF files that are outputted by the workflow
pipeline.variantcalling.XNonParRegions
File?
Bed file with the non-PAR regions of X
pipeline.variantcalling.YNonParRegions
File?
Bed file with the non-PAR regions of Y

Advanced inputs

Show/Hide
pipeline.ConvertDockerTagsFile.dockerImage
String — Default: "quay.io/biocontainers/biowdl-input-converter:0.2.1--py_0"
The docker image used for this task. Changing this may result in errors which the developers may choose not to address.
pipeline.ConvertSampleConfig.checkFileMd5sums
Boolean — Default: false
Whether or not the MD5 sums of the files mentioned in the samplesheet should be checked.
pipeline.ConvertSampleConfig.dockerImage
String — Default: "quay.io/biocontainers/biowdl-input-converter:0.2.1--py_0"
The docker image used for this task. Changing this may result in errors which the developers may choose not to address.
pipeline.ConvertSampleConfig.old
Boolean — Default: false
Whether or not the old samplesheet format should be used.
pipeline.ConvertSampleConfig.skipFileCheck
Boolean — Default: true
Whether or not the existance of the files mentioned in the samplesheet should be checked.
pipeline.CPAT.startCodons
Array[String]?
Equivalent to CPAT's `--start` option.
pipeline.CPAT.stopCodons
Array[String]?
Equivalent to CPAT's `--stop` option.
pipeline.expression.additionalAttributes
Array[String]+?
Additional attributes which should be taken from the GTF used for quantification and added to the merged expression value tables.
pipeline.expression.htSeqCount.additionalAttributes
Array[String] — Default: []
Equivalent to the --additional-attr option of htseq-count.
pipeline.expression.htSeqCount.featureType
String?
Equivalent to the --type option of htseq-count.
pipeline.expression.htSeqCount.format
String — Default: "bam"
Equivalent to the -f option of htseq-count.
pipeline.expression.htSeqCount.idattr
String?
Equivalent to the --idattr option of htseq-count.
pipeline.expression.htSeqCount.memory
String — Default: "40G"
The amount of memory the job requires in GB.
pipeline.expression.htSeqCount.order
String — Default: "pos"
Equivalent to the -r option of htseq-count.
pipeline.expression.mergedHTSeqFragmentsPerGenes.featureAttribute
String?
Equivalent to the -F option of collect-columns.
pipeline.expression.mergedHTSeqFragmentsPerGenes.featureColumn
Int?
Equivalent to the -f option of collect-columns.
pipeline.expression.mergedHTSeqFragmentsPerGenes.header
Boolean — Default: false
Equivalent to the -H flag of collect-columns.
pipeline.expression.mergedHTSeqFragmentsPerGenes.separator
Int?
Equivalent to the -s option of collect-columns.
pipeline.expression.mergedHTSeqFragmentsPerGenes.valueColumn
Int?
Equivalent to the -c option of collect-columns.
pipeline.expression.mergedStringtieFPKMs.featureAttribute
String?
Equivalent to the -F option of collect-columns.
pipeline.expression.mergedStringtieFPKMs.featureColumn
Int?
Equivalent to the -f option of collect-columns.
pipeline.expression.mergedStringtieFPKMs.separator
Int?
Equivalent to the -s option of collect-columns.
pipeline.expression.mergedStringtieTPMs.featureAttribute
String?
Equivalent to the -F option of collect-columns.
pipeline.expression.mergedStringtieTPMs.featureColumn
Int?
Equivalent to the -f option of collect-columns.
pipeline.expression.mergedStringtieTPMs.separator
Int?
Equivalent to the -s option of collect-columns.
pipeline.expression.mergeStringtieGtf.keepMergedTranscriptsWithRetainedIntrons
Boolean — Default: false
Equivalent to the -i flag of 'stringtie --merge'.
pipeline.expression.mergeStringtieGtf.label
String?
Equivalent to the -l option of 'stringtie --merge'.
pipeline.expression.mergeStringtieGtf.memory
String — Default: "10G"
The amount of memory needed for this task in GB.
pipeline.expression.mergeStringtieGtf.minimumCoverage
Float?
Equivalent to the -c option of 'stringtie --merge'.
pipeline.expression.mergeStringtieGtf.minimumFPKM
Float?
Equivalent to the -F option of 'stringtie --merge'.
pipeline.expression.mergeStringtieGtf.minimumIsoformFraction
Float?
Equivalent to the -f option of 'stringtie --merge'.
pipeline.expression.mergeStringtieGtf.minimumLength
Int?
Equivalent to the -m option of 'stringtie --merge'.
pipeline.expression.mergeStringtieGtf.minimumTPM
Float?
Equivalent to the -T option of 'stringtie --merge'.
pipeline.expression.stringtie.memory
String — Default: "10G"
The amount of memory needed for this task in GB.
pipeline.expression.stringtie.threads
Int — Default: 1
The number of threads to use.
pipeline.expression.stringtieAssembly.memory
String — Default: "10G"
The amount of memory needed for this task in GB.
pipeline.expression.stringtieAssembly.threads
Int — Default: 1
The number of threads to use.
pipeline.GffCompare.A
Boolean — Default: false
Equivalent to gffcompare's `-A` flag.
pipeline.GffCompare.C
Boolean — Default: false
Equivalent to gffcompare's `-C` flag.
pipeline.GffCompare.debugMode
Boolean — Default: false
Equivalent to gffcompare's `-D` flag.
pipeline.GffCompare.discardSingleExonReferenceTranscripts
Boolean — Default: false
Equivalent to gffcompare's `-N` flag.
pipeline.GffCompare.discardSingleExonTransfragsAndReferenceTranscripts
Boolean — Default: false
Equivalent to gffcompare's `-M` flag.
pipeline.GffCompare.genomeSequences
File?
Equivalent to gffcompare's `-s` option.
pipeline.GffCompare.inputGtfList
File?
Equivalent to gffcompare's `-i` option.
pipeline.GffCompare.K
Boolean — Default: false
Equivalent to gffcompare's `-K` flag.
pipeline.GffCompare.maxDistanceFreeEndsTerminalExons
Int?
Equivalent to gffcompare's `-e` option.
pipeline.GffCompare.maxDistanceGroupingTranscriptStartSites
Int?
Equivalent to gffcompare's `-d` option.
pipeline.GffCompare.namePrefix
String?
Equivalent to gffcompare's `-p` option.
pipeline.GffCompare.noTmap
Boolean — Default: false
Equivalent to gffcompare's `-T` flag.
pipeline.GffCompare.outPrefix
String — Default: "gffcmp"
The prefix for the output.
pipeline.GffCompare.precisionCorrection
Boolean — Default: false
Equivalent to gffcompare's `-Q` flag.
pipeline.GffCompare.snCorrection
Boolean — Default: false
Equivalent to gffcompare's `-R` flag.
pipeline.GffCompare.verbose
Boolean — Default: false
Equivalent to gffcompare's `-V` flag.
pipeline.GffCompare.X
Boolean — Default: false
Equivalent to gffcompare's `-X` flag.
pipeline.gffread.CDSFastaPath
String?
The location the CDS fasta should be written to.
pipeline.gffread.filteredGffPath
String?
The location the filtered GFF should be written to.
pipeline.gffread.outputGtfFormat
Boolean — Default: false
Equivalent to gffread's `-T` flag.
pipeline.gffread.proteinFastaPath
String?
The location the protein fasta should be written to.
pipeline.jointgenotyping
Boolean — Default: false
Whether joint genotyping should be performed when Variant Calling. Default: false. Warning: joint genotyping is not part of GATK best practices
pipeline.multiqcTask.clConfig
String?
Equivalent to MultiQC's `--cl-config` option.
pipeline.multiqcTask.comment
String?
Equivalent to MultiQC's `--comment` option.
pipeline.multiqcTask.config
File?
Equivalent to MultiQC's `--config` option.
pipeline.multiqcTask.dataDir
Boolean — Default: false
Equivalent to MultiQC's `--data-dir` flag.
pipeline.multiqcTask.dataFormat
String?
Equivalent to MultiQC's `--data-format` option.
pipeline.multiqcTask.dirs
Boolean — Default: false
Equivalent to MultiQC's `--dirs` flag.
pipeline.multiqcTask.dirsDepth
Int?
Equivalent to MultiQC's `--dirs-depth` option.
pipeline.multiqcTask.exclude
Array[String]+?
Equivalent to MultiQC's `--exclude` option.
pipeline.multiqcTask.export
Boolean — Default: false
Equivalent to MultiQC's `--export` flag.
pipeline.multiqcTask.fileList
File?
Equivalent to MultiQC's `--file-list` option.
pipeline.multiqcTask.fileName
String?
Equivalent to MultiQC's `--filename` option.
pipeline.multiqcTask.flat
Boolean — Default: false
Equivalent to MultiQC's `--flat` flag.
pipeline.multiqcTask.force
Boolean — Default: false
Equivalent to MultiQC's `--force` flag.
pipeline.multiqcTask.fullNames
Boolean — Default: false
Equivalent to MultiQC's `--fullnames` flag.
pipeline.multiqcTask.ignore
String?
Equivalent to MultiQC's `--ignore` option.
pipeline.multiqcTask.ignoreSamples
String?
Equivalent to MultiQC's `--ignore-samples` option.
Boolean — Default: false
Equivalent to MultiQC's `--ignore-symlinks` flag.
pipeline.multiqcTask.interactive
Boolean — Default: true
Equivalent to MultiQC's `--interactive` flag.
pipeline.multiqcTask.lint
Boolean — Default: false
Equivalent to MultiQC's `--lint` flag.
pipeline.multiqcTask.megaQCUpload
Boolean — Default: false
Opposite to MultiQC's `--no-megaqc-upload` flag.
pipeline.multiqcTask.memory
String — Default: "4G"
The amount of memory this job will use.
pipeline.multiqcTask.module
Array[String]+?
Equivalent to MultiQC's `--module` option.
pipeline.multiqcTask.noDataDir
Boolean — Default: false
Equivalent to MultiQC's `--no-data-dir` flag.
pipeline.multiqcTask.pdf
Boolean — Default: false
Equivalent to MultiQC's `--pdf` flag.
pipeline.multiqcTask.sampleNames
File?
Equivalent to MultiQC's `--sample-names` option.
pipeline.multiqcTask.tag
String?
Equivalent to MultiQC's `--tag` option.
pipeline.multiqcTask.template
String?
Equivalent to MultiQC's `--template` option.
pipeline.multiqcTask.title
String?
Equivalent to MultiQC's `--title` option.
pipeline.multiqcTask.zipDataDir
Boolean — Default: false
Equivalent to MultiQC's `--zip-data-dir` flag.
pipeline.preprocessing.applyBqsr.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.preprocessing.applyBqsr.memory
String — Default: "12G"
The amount of memory this job will use.
pipeline.preprocessing.baseRecalibrator.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.preprocessing.baseRecalibrator.knownIndelsSitesVCFIndexes
Array[File] — Default: []
The indexed for the known variant VCFs.
pipeline.preprocessing.baseRecalibrator.knownIndelsSitesVCFs
Array[File] — Default: []
VCF files with known indels.
pipeline.preprocessing.baseRecalibrator.memory
String — Default: "12G"
The amount of memory this job will use.
pipeline.preprocessing.gatherBamFiles.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.preprocessing.gatherBamFiles.memory
String — Default: "12G"
The amount of memory this job will use.
pipeline.preprocessing.gatherBqsr.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.preprocessing.gatherBqsr.memory
String — Default: "12G"
The amount of memory this job will use.
pipeline.preprocessing.scatterList.bamFile
File?
Equivalent to biopet scatterregions' `--bamfile` option.
pipeline.preprocessing.scatterList.bamIndex
File?
The index for the bamfile given through bamFile.
pipeline.preprocessing.scatterList.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.preprocessing.scatterList.memory
String — Default: "24G"
The amount of memory this job will use.
pipeline.preprocessing.scatterSizeMillions
Int — Default: 1000
Same as scatterSize, but is multiplied by 1000000 to get scatterSize. This allows for setting larger values more easily
pipeline.preprocessing.splitNCigarReads.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.preprocessing.splitNCigarReads.memory
String — Default: "16G"
The amount of memory this job will use.
pipeline.runMultiQC
Boolean — Default: outputDir != "."
Whether or not MultiQC should be run.
pipeline.sampleJobs.bamMetrics.ampliconIntervalsLists.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.sampleJobs.bamMetrics.ampliconIntervalsLists.memory
String — Default: "12G"
The amount of memory this job will use.
pipeline.sampleJobs.bamMetrics.picardMetrics.collectAlignmentSummaryMetrics
Boolean — Default: true
Equivalent to the `PROGRAM=CollectAlignmentSummaryMetrics` argument.
pipeline.sampleJobs.bamMetrics.picardMetrics.collectBaseDistributionByCycle
Boolean — Default: true
Equivalent to the `PROGRAM=CollectBaseDistributionByCycle` argument.
pipeline.sampleJobs.bamMetrics.picardMetrics.collectGcBiasMetrics
Boolean — Default: true
Equivalent to the `PROGRAM=CollectGcBiasMetrics` argument.
pipeline.sampleJobs.bamMetrics.picardMetrics.collectInsertSizeMetrics
Boolean — Default: true
Equivalent to the `PROGRAM=CollectInsertSizeMetrics` argument.
pipeline.sampleJobs.bamMetrics.picardMetrics.collectQualityYieldMetrics
Boolean — Default: true
Equivalent to the `PROGRAM=CollectQualityYieldMetrics` argument.
pipeline.sampleJobs.bamMetrics.picardMetrics.collectSequencingArtifactMetrics
Boolean — Default: true
Equivalent to the `PROGRAM=CollectSequencingArtifactMetrics` argument.
pipeline.sampleJobs.bamMetrics.picardMetrics.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.sampleJobs.bamMetrics.picardMetrics.meanQualityByCycle
Boolean — Default: true
Equivalent to the `PROGRAM=MeanQualityByCycle` argument.
pipeline.sampleJobs.bamMetrics.picardMetrics.memory
String — Default: "32G"
The amount of memory this job will use.
pipeline.sampleJobs.bamMetrics.picardMetrics.qualityScoreDistribution
Boolean — Default: true
Equivalent to the `PROGRAM=QualityScoreDistribution` argument.
pipeline.sampleJobs.bamMetrics.rnaSeqMetrics.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.sampleJobs.bamMetrics.rnaSeqMetrics.memory
String — Default: "32G"
The amount of memory this job will use.
pipeline.sampleJobs.bamMetrics.targetIntervalsLists.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.sampleJobs.bamMetrics.targetIntervalsLists.memory
String — Default: "12G"
The amount of memory this job will use.
pipeline.sampleJobs.bamMetrics.targetMetrics.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.sampleJobs.bamMetrics.targetMetrics.memory
String — Default: "12G"
The amount of memory this job will use.
pipeline.sampleJobs.collectUmiStats
Boolean — Default: false
Whether or not UMI deduplication stats should be collected. This will potentially cause a massive increase in memory usage of the deduplication step.
pipeline.sampleJobs.hisat2.downstreamTranscriptomeAssembly
Boolean — Default: true
Equivalent to hisat2's `--dta` flag.
pipeline.sampleJobs.hisat2.memory
String — Default: "48G"
The amount of memory this job will use.
pipeline.sampleJobs.hisat2.threads
Int — Default: 1
The number of threads to use.
pipeline.sampleJobs.indexStarBam.dockerImage
String — Default: "quay.io/biocontainers/samtools:1.8--h46bd0b3_5"
The docker image used for this task. Changing this may result in errors which the developers may choose not to address.
pipeline.sampleJobs.markDuplicates.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.sampleJobs.markDuplicates.memory
String — Default: "24G"
The amount of memory this job will use.
pipeline.sampleJobs.markDuplicates.read_name_regex
String?
Equivalent to the `READ_NAME_REGEX` option of MarkDuplicates.
pipeline.sampleJobs.platform
String — Default: "illumina"
The platform used for sequencing.
pipeline.sampleJobs.qc.Cutadapt.bwa
Boolean?
Equivalent to cutadapt's --bwa flag.
pipeline.sampleJobs.qc.Cutadapt.colorspace
Boolean?
Equivalent to cutadapt's --colorspace flag.
pipeline.sampleJobs.qc.Cutadapt.cores
Int — Default: 1
The number of cores to use.
pipeline.sampleJobs.qc.Cutadapt.cut
Int?
Equivalent to cutadapt's --cut option.
pipeline.sampleJobs.qc.Cutadapt.discardTrimmed
Boolean?
Equivalent to cutadapt's --quality-cutoff option.
pipeline.sampleJobs.qc.Cutadapt.discardUntrimmed
Boolean?
Equivalent to cutadapt's --discard-untrimmed option.
pipeline.sampleJobs.qc.Cutadapt.doubleEncode
Boolean?
Equivalent to cutadapt's --double-encode flag.
pipeline.sampleJobs.qc.Cutadapt.errorRate
Float?
Equivalent to cutadapt's --error-rate option.
pipeline.sampleJobs.qc.Cutadapt.front
Array[String] — Default: []
A list of 5' ligated adapter sequences to be cut from the given first or single end fastq file.
pipeline.sampleJobs.qc.Cutadapt.frontRead2
Array[String] — Default: []
A list of 5' ligated adapter sequences to be cut from the given second end fastq file.
pipeline.sampleJobs.qc.Cutadapt.infoFilePath
String?
Equivalent to cutadapt's --info-file option.
pipeline.sampleJobs.qc.Cutadapt.interleaved
Boolean?
Equivalent to cutadapt's --interleaved flag.
pipeline.sampleJobs.qc.Cutadapt.length
Int?
Equivalent to cutadapt's --length option.
pipeline.sampleJobs.qc.Cutadapt.lengthTag
String?
Equivalent to cutadapt's --length-tag option.
pipeline.sampleJobs.qc.Cutadapt.maq
Boolean?
Equivalent to cutadapt's --maq flag.
pipeline.sampleJobs.qc.Cutadapt.maskAdapter
Boolean?
Equivalent to cutadapt's --mask-adapter flag.
pipeline.sampleJobs.qc.Cutadapt.matchReadWildcards
Boolean?
Equivalent to cutadapt's --match-read-wildcards flag.
pipeline.sampleJobs.qc.Cutadapt.maximumLength
Int?
Equivalent to cutadapt's --maximum-length option.
pipeline.sampleJobs.qc.Cutadapt.maxN
Int?
Equivalent to cutadapt's --max-n option.
pipeline.sampleJobs.qc.Cutadapt.memory
String — Default: "4G"
The amount of memory this job will use.
pipeline.sampleJobs.qc.Cutadapt.minimumLength
Int? — Default: 2
Equivalent to cutadapt's --minimum-length option.
pipeline.sampleJobs.qc.Cutadapt.nextseqTrim
String?
Equivalent to cutadapt's --nextseq-trim option.
pipeline.sampleJobs.qc.Cutadapt.noIndels
Boolean?
Equivalent to cutadapt's --no-indels flag.
pipeline.sampleJobs.qc.Cutadapt.noMatchAdapterWildcards
Boolean?
Equivalent to cutadapt's --no-match-adapter-wildcards flag.
pipeline.sampleJobs.qc.Cutadapt.noTrim
Boolean?
Equivalent to cutadapt's --no-trim flag.
pipeline.sampleJobs.qc.Cutadapt.noZeroCap
Boolean?
Equivalent to cutadapt's --no-zero-cap flag.
pipeline.sampleJobs.qc.Cutadapt.overlap
Int?
Equivalent to cutadapt's --overlap option.
pipeline.sampleJobs.qc.Cutadapt.pairFilter
String?
Equivalent to cutadapt's --pair-filter option.
pipeline.sampleJobs.qc.Cutadapt.prefix
String?
Equivalent to cutadapt's --prefix option.
pipeline.sampleJobs.qc.Cutadapt.qualityBase
Int?
Equivalent to cutadapt's --quality-base option.
pipeline.sampleJobs.qc.Cutadapt.qualityCutoff
String?
Equivalent to cutadapt's --quality-cutoff option.
pipeline.sampleJobs.qc.Cutadapt.restFilePath
String?
Equivalent to cutadapt's --rest-file option.
pipeline.sampleJobs.qc.Cutadapt.stripF3
Boolean?
Equivalent to cutadapt's --strip-f3 flag.
pipeline.sampleJobs.qc.Cutadapt.stripSuffix
String?
Equivalent to cutadapt's --strip-suffix option.
pipeline.sampleJobs.qc.Cutadapt.suffix
String?
Equivalent to cutadapt's --suffix option.
pipeline.sampleJobs.qc.Cutadapt.times
Int?
Equivalent to cutadapt's --times option.
pipeline.sampleJobs.qc.Cutadapt.tooLongOutputPath
String?
Equivalent to cutadapt's --too-long-output option.
pipeline.sampleJobs.qc.Cutadapt.tooLongPairedOutputPath
String?
Equivalent to cutadapt's --too-long-paired-output option.
pipeline.sampleJobs.qc.Cutadapt.tooShortOutputPath
String?
Equivalent to cutadapt's --too-short-output option.
pipeline.sampleJobs.qc.Cutadapt.tooShortPairedOutputPath
String?
Equivalent to cutadapt's --too-short-paired-output option.
pipeline.sampleJobs.qc.Cutadapt.trimN
Boolean?
Equivalent to cutadapt's --trim-n flag.
pipeline.sampleJobs.qc.Cutadapt.untrimmedOutputPath
String?
Equivalent to cutadapt's --untrimmed-output option.
pipeline.sampleJobs.qc.Cutadapt.untrimmedPairedOutputPath
String?
Equivalent to cutadapt's --untrimmed-paired-output option.
pipeline.sampleJobs.qc.Cutadapt.wildcardFilePath
String?
Equivalent to cutadapt's --wildcard-file option.
pipeline.sampleJobs.qc.Cutadapt.Z
Boolean — Default: true
Equivalent to cutadapt's -Z flag.
pipeline.sampleJobs.qc.Cutadapt.zeroCap
Boolean?
Equivalent to cutadapt's --zero-cap flag.
pipeline.sampleJobs.qc.FastqcRead1.adapters
File?
Equivalent to fastqc's --adapters option.
pipeline.sampleJobs.qc.FastqcRead1.casava
Boolean — Default: false
Equivalent to fastqc's --casava flag.
pipeline.sampleJobs.qc.FastqcRead1.contaminants
File?
Equivalent to fastqc's --contaminants option.
pipeline.sampleJobs.qc.FastqcRead1.dir
String?
Equivalent to fastqc's --dir option.
pipeline.sampleJobs.qc.FastqcRead1.extract
Boolean — Default: false
Equivalent to fastqc's --extract flag.
pipeline.sampleJobs.qc.FastqcRead1.format
String?
Equivalent to fastqc's --format option.
pipeline.sampleJobs.qc.FastqcRead1.kmers
Int?
Equivalent to fastqc's --kmers option.
pipeline.sampleJobs.qc.FastqcRead1.limits
File?
Equivalent to fastqc's --limits option.
pipeline.sampleJobs.qc.FastqcRead1.memory
String — Default: "4G"
The amount of memory this job will use.
pipeline.sampleJobs.qc.FastqcRead1.minLength
Int?
Equivalent to fastqc's --min_length option.
pipeline.sampleJobs.qc.FastqcRead1.nano
Boolean — Default: false
Equivalent to fastqc's --nano flag.
pipeline.sampleJobs.qc.FastqcRead1.noFilter
Boolean — Default: false
Equivalent to fastqc's --nofilter flag.
pipeline.sampleJobs.qc.FastqcRead1.nogroup
Boolean — Default: false
Equivalent to fastqc's --nogroup flag.
pipeline.sampleJobs.qc.FastqcRead1.threads
Int — Default: 1
The number of cores to use.
pipeline.sampleJobs.qc.FastqcRead1After.adapters
File?
Equivalent to fastqc's --adapters option.
pipeline.sampleJobs.qc.FastqcRead1After.casava
Boolean — Default: false
Equivalent to fastqc's --casava flag.
pipeline.sampleJobs.qc.FastqcRead1After.contaminants
File?
Equivalent to fastqc's --contaminants option.
pipeline.sampleJobs.qc.FastqcRead1After.dir
String?
Equivalent to fastqc's --dir option.
pipeline.sampleJobs.qc.FastqcRead1After.extract
Boolean — Default: false
Equivalent to fastqc's --extract flag.
pipeline.sampleJobs.qc.FastqcRead1After.format
String?
Equivalent to fastqc's --format option.
pipeline.sampleJobs.qc.FastqcRead1After.kmers
Int?
Equivalent to fastqc's --kmers option.
pipeline.sampleJobs.qc.FastqcRead1After.limits
File?
Equivalent to fastqc's --limits option.
pipeline.sampleJobs.qc.FastqcRead1After.memory
String — Default: "4G"
The amount of memory this job will use.
pipeline.sampleJobs.qc.FastqcRead1After.minLength
Int?
Equivalent to fastqc's --min_length option.
pipeline.sampleJobs.qc.FastqcRead1After.nano
Boolean — Default: false
Equivalent to fastqc's --nano flag.
pipeline.sampleJobs.qc.FastqcRead1After.noFilter
Boolean — Default: false
Equivalent to fastqc's --nofilter flag.
pipeline.sampleJobs.qc.FastqcRead1After.nogroup
Boolean — Default: false
Equivalent to fastqc's --nogroup flag.
pipeline.sampleJobs.qc.FastqcRead1After.threads
Int — Default: 1
The number of cores to use.
pipeline.sampleJobs.qc.FastqcRead2.adapters
File?
Equivalent to fastqc's --adapters option.
pipeline.sampleJobs.qc.FastqcRead2.casava
Boolean — Default: false
Equivalent to fastqc's --casava flag.
pipeline.sampleJobs.qc.FastqcRead2.contaminants
File?
Equivalent to fastqc's --contaminants option.
pipeline.sampleJobs.qc.FastqcRead2.dir
String?
Equivalent to fastqc's --dir option.
pipeline.sampleJobs.qc.FastqcRead2.extract
Boolean — Default: false
Equivalent to fastqc's --extract flag.
pipeline.sampleJobs.qc.FastqcRead2.format
String?
Equivalent to fastqc's --format option.
pipeline.sampleJobs.qc.FastqcRead2.kmers
Int?
Equivalent to fastqc's --kmers option.
pipeline.sampleJobs.qc.FastqcRead2.limits
File?
Equivalent to fastqc's --limits option.
pipeline.sampleJobs.qc.FastqcRead2.memory
String — Default: "4G"
The amount of memory this job will use.
pipeline.sampleJobs.qc.FastqcRead2.minLength
Int?
Equivalent to fastqc's --min_length option.
pipeline.sampleJobs.qc.FastqcRead2.nano
Boolean — Default: false
Equivalent to fastqc's --nano flag.
pipeline.sampleJobs.qc.FastqcRead2.noFilter
Boolean — Default: false
Equivalent to fastqc's --nofilter flag.
pipeline.sampleJobs.qc.FastqcRead2.nogroup
Boolean — Default: false
Equivalent to fastqc's --nogroup flag.
pipeline.sampleJobs.qc.FastqcRead2.threads
Int — Default: 1
The number of cores to use.
pipeline.sampleJobs.qc.FastqcRead2After.adapters
File?
Equivalent to fastqc's --adapters option.
pipeline.sampleJobs.qc.FastqcRead2After.casava
Boolean — Default: false
Equivalent to fastqc's --casava flag.
pipeline.sampleJobs.qc.FastqcRead2After.contaminants
File?
Equivalent to fastqc's --contaminants option.
pipeline.sampleJobs.qc.FastqcRead2After.dir
String?
Equivalent to fastqc's --dir option.
pipeline.sampleJobs.qc.FastqcRead2After.extract
Boolean — Default: false
Equivalent to fastqc's --extract flag.
pipeline.sampleJobs.qc.FastqcRead2After.format
String?
Equivalent to fastqc's --format option.
pipeline.sampleJobs.qc.FastqcRead2After.kmers
Int?
Equivalent to fastqc's --kmers option.
pipeline.sampleJobs.qc.FastqcRead2After.limits
File?
Equivalent to fastqc's --limits option.
pipeline.sampleJobs.qc.FastqcRead2After.memory
String — Default: "4G"
The amount of memory this job will use.
pipeline.sampleJobs.qc.FastqcRead2After.minLength
Int?
Equivalent to fastqc's --min_length option.
pipeline.sampleJobs.qc.FastqcRead2After.nano
Boolean — Default: false
Equivalent to fastqc's --nano flag.
pipeline.sampleJobs.qc.FastqcRead2After.noFilter
Boolean — Default: false
Equivalent to fastqc's --nofilter flag.
pipeline.sampleJobs.qc.FastqcRead2After.nogroup
Boolean — Default: false
Equivalent to fastqc's --nogroup flag.
pipeline.sampleJobs.qc.FastqcRead2After.threads
Int — Default: 1
The number of cores to use.
pipeline.sampleJobs.qc.runAdapterClipping
Boolean — Default: defined(adapterForward) || defined(adapterReverse) || length(select_first([contaminations, []])) > 0
Whether or not adapters should be removed from the reads.
pipeline.sampleJobs.star.limitBAMsortRAM
Int?
Equivalent to star's `--limitBAMsortRAM` option.
pipeline.sampleJobs.star.memory
String — Default: "48G"
The amount of memory this job will use.
pipeline.sampleJobs.star.outSAMunmapped
String? — Default: "Within KeepPairs"
Equivalent to star's `--outSAMunmapped` option.
pipeline.sampleJobs.star.outStd
String?
Equivalent to star's `--outStd` option.
pipeline.sampleJobs.star.runThreadN
Int — Default: 4
The number of threads to use.
pipeline.sampleJobs.star.twopassMode
String? — Default: "Basic"
Equivalent to star's `--twopassMode` option.
pipeline.sampleJobs.umiDedup.memory
String — Default: "5G"
The amount of memory required for the task.
pipeline.scatterSize
Int — Default: scatterSizeMillions * 1000000
The size of the scattered regions in bases for the GATK subworkflows. Scattering is used to speed up certain processes. The genome will be seperated into multiple chunks (scatters) which will be processed in their own job, allowing for parallel processing. Higher values will result in a lower number of jobs. The optimal value here will depend on the available resources.
pipeline.scatterSizeMillions
Int — Default: 1000
Same as scatterSize, but is multiplied by 1000000 to get scatterSize. This allows for setting larger values more easily.
pipeline.variantcalling.calculateRegions.mergeBeds.outputBed
String — Default: "merged.bed"
The path to write the output to
pipeline.variantcalling.callAutosomal.haplotypeCaller.contamination
Float?
Equivalent to HaplotypeCaller's `-contamination` option.
pipeline.variantcalling.callAutosomal.haplotypeCaller.emitRefConfidence
String — Default: if gvcf then "GVCF" else "NONE"
Whether to include reference calls. Three modes: 'NONE', 'BP_RESOLUTION' and 'GVCF'
pipeline.variantcalling.callAutosomal.haplotypeCaller.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.variantcalling.callAutosomal.haplotypeCaller.memory
String — Default: "12G"
The amount of memory this job will use.
pipeline.variantcalling.callAutosomal.haplotypeCaller.outputMode
String?
Specifies which type of calls we should output. Same as HaplotypeCaller's `--output-mode` option.
pipeline.variantcalling.callX.contamination
Float?
Equivalent to HaplotypeCaller's `-contamination` option.
pipeline.variantcalling.callX.emitRefConfidence
String — Default: if gvcf then "GVCF" else "NONE"
Whether to include reference calls. Three modes: 'NONE', 'BP_RESOLUTION' and 'GVCF'
pipeline.variantcalling.callX.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.variantcalling.callX.memory
String — Default: "12G"
The amount of memory this job will use.
pipeline.variantcalling.callX.outputMode
String?
Specifies which type of calls we should output. Same as HaplotypeCaller's `--output-mode` option.
pipeline.variantcalling.callY.contamination
Float?
Equivalent to HaplotypeCaller's `-contamination` option.
pipeline.variantcalling.callY.emitRefConfidence
String — Default: if gvcf then "GVCF" else "NONE"
Whether to include reference calls. Three modes: 'NONE', 'BP_RESOLUTION' and 'GVCF'
pipeline.variantcalling.callY.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.variantcalling.callY.memory
String — Default: "12G"
The amount of memory this job will use.
pipeline.variantcalling.callY.outputMode
String?
Specifies which type of calls we should output. Same as HaplotypeCaller's `--output-mode` option.
pipeline.variantcalling.gatherGvcfs.intervals
Array[File] — Default: []
Bed files or interval lists describing the regions to operate on.
pipeline.variantcalling.gatherGvcfs.javaXmx
String — Default: "12G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.variantcalling.gatherGvcfs.memory
String — Default: "24G"
The amount of memory this job will use.
pipeline.variantcalling.gatherVcfs.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.variantcalling.gatherVcfs.memory
String — Default: "24G"
The amount of memory this job will use.
pipeline.variantcalling.genotypeGvcfs.annotationGroups
Array[String] — Default: ["StandardAnnotation"]
Which annotation groups will be used for the annotation
pipeline.variantcalling.genotypeGvcfs.javaXmx
String — Default: "6G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.variantcalling.genotypeGvcfs.memory
String — Default: "18G"
The amount of memory this job will use.
pipeline.variantcalling.mergeSingleSampleGvcf.intervals
Array[File] — Default: []
Bed files or interval lists describing the regions to operate on.
pipeline.variantcalling.mergeSingleSampleGvcf.javaXmx
String — Default: "12G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.variantcalling.mergeSingleSampleGvcf.memory
String — Default: "24G"
The amount of memory this job will use.
pipeline.variantcalling.mergeSingleSampleVcf.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.variantcalling.mergeSingleSampleVcf.memory
String — Default: "24G"
The amount of memory this job will use.
pipeline.variantcalling.scatterAllRegions.bamFile
File?
Equivalent to biopet scatterregions' `--bamfile` option.
pipeline.variantcalling.scatterAllRegions.bamIndex
File?
The index for the bamfile given through bamFile.
pipeline.variantcalling.scatterAllRegions.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.variantcalling.scatterAllRegions.memory
String — Default: "24G"
The amount of memory this job will use.
pipeline.variantcalling.scatterAllRegions.notSplitContigs
Boolean — Default: false
Equivalent to biopet scatterregions' `--notSplitContigs` flag.
pipeline.variantcalling.scatterAutosomalRegions.bamFile
File?
Equivalent to biopet scatterregions' `--bamfile` option.
pipeline.variantcalling.scatterAutosomalRegions.bamIndex
File?
The index for the bamfile given through bamFile.
pipeline.variantcalling.scatterAutosomalRegions.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
pipeline.variantcalling.scatterAutosomalRegions.memory
String — Default: "24G"
The amount of memory this job will use.
pipeline.variantcalling.scatterAutosomalRegions.notSplitContigs
Boolean — Default: false
Equivalent to biopet scatterregions' `--notSplitContigs` flag.
pipeline.variantcalling.scatterSizeMillions
Int — Default: 1000
Same as scatterSize, but is multiplied by 1000000 to get scatterSize. This allows for setting larger values more easily

Do not set these inputs!

The following inputs should not be set, even though womtool may show them as being available inputs.