A BioWDL pipeline for virus assembly. Category:Multi-Sample Experimental
Currently viewing version: develop
View the Project on GitHub biowdl/virus-assembly
There are no releases available for this repository. The content is, therefore, likely still under development and not production ready. Use at own risk!
This pipeline can be used to assemble a viral genome.
In order to run the complete multisample pipeline, you can
run pipeline.wdl
using
Cromwell:
java -jar cromwell-<version>.jar run -i inputs.json pipeline.wdl
The inputs JSON can be generated using WOMtools as described in the WOMtools documentation.
The primary inputs are described below, additional inputs (such as precommands and JAR paths) are available. Please use the above mentioned WOMtools command to see all available inputs.
field | type | default | |
---|---|---|---|
outputDir | String |
The output directory. | |
sampleConfigFiles | Array[File] |
The sample configuration files. |
All inputs have to be preceded by
pipeline.
. Type is indicated according to the WDL data types:File
should be indicators of file location (a string in JSON). Types ending in?
indicate the input is optional, types ending in+
indicate they require at least one element.
The sample configuration should be a YML file which adheres to the following structure:
samples:
<sample>:
libraries:
<library>:
readgroups:
<readgroup>:
R1: <Path to first-end FastQ file.>
R1_md5: <MD5 checksum of first-end FastQ file.>
R2: <Path to second-end FastQ file.>
R2_md5: <MD5 checksum of second-end FastQ file.>
Replace the text between < >
with appropriate values. MD5 values may be
omitted and R2 values may be omitted in the case of single-end data.
Multiple readgroups can be added per library and multiple libraries may be
given per sample.
This pipeline will produce an assembled viral genome.
For any question related to this pipeline, please use the github issue tracker or contact the SASC team directly at: sasc@lumc.nl.