BioWDL: gatk-variantcalling

A pipeline that calls variants on a group of BAM files.

Please be aware that the page you are currently viewing is not for the latest available version!

Inputs for GatkVariantCalling

The following is an overview of all available inputs in GatkVariantCalling.

Required inputs

GatkVariantCalling.bamFilesAndGenders
Array[struct(file : File, gender : String?, index : File)]
List of structs containing,BAM file, BAM index and gender. The BAM should be recalibrated beforehand if required. The gender string is optional. Actionable values are 'female','f','F','male','m' and 'M'.
GatkVariantCalling.referenceFasta
File
The reference fasta file
GatkVariantCalling.referenceFastaDict
File
Sequence dictionary (.dict) file of the reference
GatkVariantCalling.referenceFastaFai
File
Fasta index (.fai) file of the reference

Other common inputs

GatkVariantCalling.callAutosomal.haplotypeCaller.excludeIntervalList
Array[File]+?
Bed files or interval lists describing the regions to NOT operate on.
GatkVariantCalling.callAutosomal.haplotypeCaller.pedigree
File?
Pedigree file for determining the population "founders"
GatkVariantCalling.callAutosomal.haplotypeCaller.ploidy
Int?
The ploidy with which the variants should be called.
GatkVariantCalling.callX.excludeIntervalList
Array[File]+?
Bed files or interval lists describing the regions to NOT operate on.
GatkVariantCalling.callX.pedigree
File?
Pedigree file for determining the population "founders"
GatkVariantCalling.callY.excludeIntervalList
Array[File]+?
Bed files or interval lists describing the regions to NOT operate on.
GatkVariantCalling.callY.pedigree
File?
Pedigree file for determining the population "founders"
GatkVariantCalling.dbsnpVCF
File?
dbsnp VCF file used for checking known sites
GatkVariantCalling.dbsnpVCFIndex
File?
Index (.tbi) file for the dbsnp VCF
GatkVariantCalling.genotypeGvcfs.pedigree
File?
Pedigree file for determining the population "founders"
GatkVariantCalling.jointgenotyping
Boolean — Default: true
Whether to perform jointgenotyping (using HaplotypeCaller to call GVCFs and merge them with GenotypeGVCFs) or not
GatkVariantCalling.outputDir
String — Default: "."
The directory where the output files should be located
GatkVariantCalling.regions
File?
A bed file describing the regions to operate on.
GatkVariantCalling.singleSampleGvcf
Boolean — Default: false
Whether to output single-sample gvcfs
GatkVariantCalling.vcfBasename
String — Default: "multisample"
The basename of the VCF and GVCF files that are outputted by the workflow
GatkVariantCalling.XNonParRegions
File?
Bed file with the non-PAR regions of X
GatkVariantCalling.YNonParRegions
File?
Bed file with the non-PAR regions of Y

Advanced inputs

Show/Hide
GatkVariantCalling.calculateRegions.mergeBeds.outputBed
String — Default: "merged.bed"
The path to write the output to
GatkVariantCalling.callAutosomal.haplotypeCaller.contamination
Float?
Equivalent to HaplotypeCaller's `-contamination` option.
GatkVariantCalling.callAutosomal.haplotypeCaller.emitRefConfidence
String — Default: if gvcf then "GVCF" else "NONE"
Whether to include reference calls. Three modes: 'NONE', 'BP_RESOLUTION' and 'GVCF'
GatkVariantCalling.callAutosomal.haplotypeCaller.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
GatkVariantCalling.callAutosomal.haplotypeCaller.memory
String — Default: "12G"
The amount of memory this job will use.
GatkVariantCalling.callAutosomal.haplotypeCaller.outputMode
String?
Specifies which type of calls we should output. Same as HaplotypeCaller's `--output-mode` option.
GatkVariantCalling.callX.contamination
Float?
Equivalent to HaplotypeCaller's `-contamination` option.
GatkVariantCalling.callX.emitRefConfidence
String — Default: if gvcf then "GVCF" else "NONE"
Whether to include reference calls. Three modes: 'NONE', 'BP_RESOLUTION' and 'GVCF'
GatkVariantCalling.callX.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
GatkVariantCalling.callX.memory
String — Default: "12G"
The amount of memory this job will use.
GatkVariantCalling.callX.outputMode
String?
Specifies which type of calls we should output. Same as HaplotypeCaller's `--output-mode` option.
GatkVariantCalling.callY.contamination
Float?
Equivalent to HaplotypeCaller's `-contamination` option.
GatkVariantCalling.callY.emitRefConfidence
String — Default: if gvcf then "GVCF" else "NONE"
Whether to include reference calls. Three modes: 'NONE', 'BP_RESOLUTION' and 'GVCF'
GatkVariantCalling.callY.javaXmx
String — Default: "4G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
GatkVariantCalling.callY.memory
String — Default: "12G"
The amount of memory this job will use.
GatkVariantCalling.callY.outputMode
String?
Specifies which type of calls we should output. Same as HaplotypeCaller's `--output-mode` option.
GatkVariantCalling.dockerImages
Map[String,String] — Default: {"bedtools": "quay.io/biocontainers/bedtools:2.23.0--hdbcaa40_3", "picard": "quay.io/biocontainers/picard:2.20.5--0", "gatk4": "quay.io/biocontainers/gatk4:4.1.0.0--0", "biopet-scatterregions": "quay.io/biocontainers/biopet-scatterregions:0.2--0"}
specify which docker images should be used for running this pipeline
GatkVariantCalling.gatherGvcfs.intervals
Array[File] — Default: []
Bed files or interval lists describing the regions to operate on.
GatkVariantCalling.gatherGvcfs.javaXmx
String — Default: "12G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
GatkVariantCalling.gatherGvcfs.memory
String — Default: "24G"
The amount of memory this job will use.
GatkVariantCalling.gatherVcfs.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
GatkVariantCalling.gatherVcfs.memory
String — Default: "24G"
The amount of memory this job will use.
GatkVariantCalling.genotypeGvcfs.annotationGroups
Array[String] — Default: ["StandardAnnotation"]
Which annotation groups will be used for the annotation
GatkVariantCalling.genotypeGvcfs.javaXmx
String — Default: "6G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
GatkVariantCalling.genotypeGvcfs.memory
String — Default: "18G"
The amount of memory this job will use.
GatkVariantCalling.mergeSingleSampleGvcf.intervals
Array[File] — Default: []
Bed files or interval lists describing the regions to operate on.
GatkVariantCalling.mergeSingleSampleGvcf.javaXmx
String — Default: "12G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
GatkVariantCalling.mergeSingleSampleGvcf.memory
String — Default: "24G"
The amount of memory this job will use.
GatkVariantCalling.mergeSingleSampleVcf.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
GatkVariantCalling.mergeSingleSampleVcf.memory
String — Default: "24G"
The amount of memory this job will use.
GatkVariantCalling.scatterAllRegions.bamFile
File?
Equivalent to biopet scatterregions' `--bamfile` option.
GatkVariantCalling.scatterAllRegions.bamIndex
File?
The index for the bamfile given through bamFile.
GatkVariantCalling.scatterAllRegions.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
GatkVariantCalling.scatterAllRegions.memory
String — Default: "24G"
The amount of memory this job will use.
GatkVariantCalling.scatterAllRegions.notSplitContigs
Boolean — Default: false
Equivalent to biopet scatterregions' `--notSplitContigs` flag.
GatkVariantCalling.scatterAutosomalRegions.bamFile
File?
Equivalent to biopet scatterregions' `--bamfile` option.
GatkVariantCalling.scatterAutosomalRegions.bamIndex
File?
The index for the bamfile given through bamFile.
GatkVariantCalling.scatterAutosomalRegions.javaXmx
String — Default: "8G"
The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.
GatkVariantCalling.scatterAutosomalRegions.memory
String — Default: "24G"
The amount of memory this job will use.
GatkVariantCalling.scatterAutosomalRegions.notSplitContigs
Boolean — Default: false
Equivalent to biopet scatterregions' `--notSplitContigs` flag.
GatkVariantCalling.scatterSize
Int — Default: scatterSizeMillions * 1000000
The size of the scattered regions in bases. Scattering is used to speed up certain processes. The genome will be seperated into multiple chunks (scatters) which will be processed in their own job, allowing for parallel processing. Higher values will result in a lower number of jobs. The optimal value here will depend on the available resources.
GatkVariantCalling.scatterSizeMillions
Int — Default: 1000
Same as scatterSize, but is multiplied by 1000000 to get scatterSize. This allows for setting larger values more easily