New NCBI-AMRFinderPlus workflow and integration in TheiaProk_Illumina_PE wf (#65)

kapsakcj · sage-wright · kevinlibuit · web-flow · commit 43f3c883b51c · 2022-05-12T11:10:11.000-07:00
Fixes #41, Fixes #46 * added new task file for NCBI-amrfinderplus. passes miniwdl check but is NOT incorporated or tested in any workflows yet * adds single task workflow NCBI-AMRFinderPlus. update dockstore.yml with new wf. added lines in task to output 4 TSV outputs from amrfinderplus * removed comment from amrfinderplus task since it was breaking the cmd. fixed import path in wf_amrfinderplus.wdl * changed math to make integers instead of floats so bash doesn't get upset * update to ncbi-amrfinderplus docker image 3.10.24 and deleted some comment lines * rename amfrfinder tas to amrfinder_nuc for nculeotide. also added maxRetries to runtime block * added amrfinderplus outputs to export_taxon_tables inputs and output TSV header and outputs * added amrfinderplus to wf_theiaprok_illumina_pe.wdl. Have not tested locally or in Terra yet * added code to take in user-defined inputs or gambit_predicted_taxon to use with amrfinder --organism option * amrfinder --organism is always used and wil feed in bash variable * updated single task wf_amrfinderplus.wdl with new task name "amrfinderplus_nuc" * task_amrfinderplus.wdl: added double quotes so bash conditional evaluates properly * updated/fixed PHBG_Version to v0.4.0-dev. Also quoted echo statement so VSCode stops complaining * added if/else for amrfinder --organism flag. if organism input is supplied by user or gambit_predicted taxon, use --organism flag. otherwise do not use --organism flag * last grep command exits gracefully even if it finds nothing * added new outputs for amrfinderplus task for strings with all stress, virulence, and amr genes. floated to table in addition to TSVs * added new amrfinderplus gene names string output to single-task workflow * removed abricate from wf_theiaprok_illumina_pe and export taxon tables task. still need to test functionality. also getting warnings in VSCode that may need addressing * added new amrfinderplus output strings to export_taxon_tables task as well as theiaprok_illumina_pe workflow * attempting to make new amrfinderplus string outputs optional for export_taxon_tables task * fixed outputs for export_taxon_table * Make amrfinderplus docker image modifiable by user * amrfinderplus_nuc task: updated string matching for all organisms. Much easier to match genus and species with 2 words. Now redirecting STDOUT/ERR to a txt file for grepping amrfinder db version. Added new output string "amrfinder_db_version". export_taxon_table task: added new amrfinderplus_db_version to inputs & resulting TSV for export taxon table. wf_amrfinderplus.wdl: added new output amrfinderplus_db_version. wf_theiaprok_illumina_pe.wdl: updated with new amrfinderplus_db_version output * update gambit task and output * capture trimmomatic out * added a missing \t in export_taxon_tables task * round est_coverage to 2 decimal places * update default docker * fix syntax * Update task_versioning.wdl * removing redundant date capture * remove redundant date capture * avoid workflow/task name conflict Co-authored-by: Sage Wright <sage.wright@theiagen.com> Co-authored-by: kevinlibuit <kevin@libuit.com> Co-authored-by: kevinlibuit <kevinlibuit@users.noreply.github.com>
diff --git a/.dockstore.yml b/.dockstore.yml
@@ -40,6 +40,11 @@ workflows:
    primaryDescriptorPath: /workflows/wf_mashtree_fasta.wdl
    testParameterFiles:
     - empty.json
+ - name: NCBI-AMRFinderPlus
+   subclass: WDL
+   primaryDescriptorPath: /workflows/wf_amrfinderplus.wdl
+   testParameterFiles:
+    - empty.json
  - name: Kraken2_PE
    subclass: WDL
    primaryDescriptorPath: /workflows/wf_kraken2_pe.wdl
diff --git a/tasks/gene_typing/task_amrfinderplus.wdl b/tasks/gene_typing/task_amrfinderplus.wdl
@@ -0,0 +1,144 @@
+version 1.0
+
+task amrfinderplus_nuc {
+  input {
+    File assembly
+    String samplename
+    # Parameters 
+    # --indent_min Minimum DNA %identity [0-1]; default is 0.9 (90%) or curated threshold if it exists
+    # --mincov Minimum DNA %coverage [0-1]; default is 0.5 (50%)
+    String? organism # make optional?
+    Int? minid
+    Int? mincov
+    Int cpu = 4
+    String docker = "quay.io/staphb/ncbi-amrfinderplus:3.10.24"
+  }
+  command <<<
+    # logging info
+    date | tee DATE
+    amrfinder --version | tee AMRFINDER_VERSION
+    
+    ### set $amrfinder_organism BASH variable based on gambit_predicted_taxon or user-defined input string
+    ### final variable has strict syntax/spelling based on list from amrfinder --list_organisms
+    # there may be other Acinetobacter species to add later, like those in the A. baumannii-calcoaceticus species complex
+    if [[ "~{organism}" == *"Acinetobacter"*"baumannii"* ]]; then
+      amrfinder_organism="Acinetobacter_baumannii"
+    elif [[ "~{organism}" == *"Campylobacter"*"coli"* ]] || [[ "~{organism}" == *"Campylobacter"*"jejuni"* ]]; then
+      amrfinder_organism="Campylobacter"
+    elif [[ "~{organism}" == *"Clostridioides"*"difficile"* ]]; then
+      amrfinder_organism="Clostridioides_difficile"
+    elif [[ "~{organism}" == *"Enterococcus"*"faecalis"* ]]; then 
+      amrfinder_organism="Enterococcus_faecalis"
+    elif [[ "~{organism}" == *"Enterococcus"*"faecium"* ]] || [[ "~{organism}" == *"Enterococcus"*"hirae"* ]]; then 
+      amrfinder_organism="Enterococcus_faecium"
+    # should capture all Shigella and Escherichia species
+    elif [[ "~{organism}" == *"Escherichia"* ]] || [[ "~{organism}" == *"Shigella"* ]]; then 
+      amrfinder_organism="Escherichia"
+    # add other Klebsiella species later? Cannot use K. oxytoca as per amrfinderplus wiki
+    elif [[ "~{organism}" == *"Klebsiella"*"aerogenes"* ]] || [[ "~{organism}" == *"Klebsiella"*"pnemoniae"* ]]; then 
+      amrfinder_organism="Klebsiella"
+    # because some people spell the species 'gonorrhea' differently
+    elif [[ "~{organism}" == *"Neisseria"*"gonorrhea"* ]] || [[ "~{organism}" == *"Neisseria"*"gonorrhoeae"* ]] || [[ "~{organism}" == *"Neisseria"*"meningitidis"* ]]; then 
+      amrfinder_organism="Neisseria"
+    elif [[ "~{organism}" == *"Pseudomonas"*"aeruginosa"* ]]; then 
+      amrfinder_organism="Pseudomonas_aeruginosa"
+    # pretty broad, could work on Salmonella bongori and other species
+    elif [[ "~{organism}" == *"Salmonella"* ]]; then 
+      amrfinder_organism="Salmonella"
+    elif [[ "~{organism}" == *"Staphylococcus"*"aureus"* ]]; then 
+      amrfinder_organism="Staphylococcus_aureus"
+    elif [[ "~{organism}" == *"Staphylococcus"*"pseudintermedius"* ]]; then 
+      amrfinder_organism="Staphylococcus_pseudintermedius"
+    elif [[ "~{organism}" == *"Streptococcus"*"agalactiae"* ]]; then 
+      amrfinder_organism="Streptococcus_agalactiae"
+    elif [[ "~{organism}" == *"Streptococcus"*"pneumoniae"* ]] || [[ "~{organism}" == *"Streptococcus"*"mitis"* ]]; then 
+      amrfinder_organism="Streptococcus_pneumoniae"
+    elif [[ "~{organism}" == *"Streptococcus"*"pyogenes"* ]]; then 
+      amrfinder_organism="Streptococcus_pyogenes"
+    elif [[ "~{organism}" == *"Vibrio"*"cholerae"* ]]; then 
+      amrfinder_organism="Vibrio_cholerae"
+    else 
+      echo "Either Gambit predicted taxon is not supported by NCBI-AMRFinderPlus or the user did not supply an organism as input."
+      echo "Skipping the use of amrfinder --organism optional parameter."
+    fi
+
+    # checking bash variable
+    echo "amrfinder_organism is set to:" ${amrfinder_organism}
+    
+    # if amrfinder_organism variable is set, use --organism flag, otherwise do not use --organism flag
+    if [[ -v amrfinder_organism ]] ; then
+      # always use --plus flag, others may be left out if param is optional and not supplied 
+      # send STDOUT/ERR to log file for capturing database version
+      amrfinder --plus \
+        --organism ${amrfinder_organism} \
+        ~{'--name ' + samplename} \
+        ~{'--nucleotide ' + assembly} \
+        ~{'-o ' + samplename + '_amrfinder_all.tsv'} \
+        ~{'--threads ' + cpu} \
+        ~{'--coverage_min ' + mincov} \
+        ~{'--ident_min ' + minid} 2>&1 | tee amrfinder.STDOUT-and-STDERR.log
+    else 
+      # always use --plus flag, others may be left out if param is optional and not supplied 
+      # send STDOUT/ERR to log file for capturing database version
+      amrfinder --plus \
+        ~{'--name ' + samplename} \
+        ~{'--nucleotide ' + assembly} \
+        ~{'-o ' + samplename + '_amrfinder_all.tsv'} \
+        ~{'--threads ' + cpu} \
+        ~{'--coverage_min ' + mincov} \
+        ~{'--ident_min ' + minid} 2>&1 | tee amrfinder.STDOUT-and-STDERR.log
+    fi
+
+    # capture the database version from the stdout and stderr file that was just created
+    grep "Database version:" amrfinder.STDOUT-and-STDERR.log | sed 's|Database version: ||' >AMRFINDER_DB_VERSION
+
+    # Element Type possibilities: AMR, STRESS, and VIRULENCE 
+    # create headers for 3 output files; tee to 3 files and redirect STDOUT to dev null so it doesn't print to log file
+    head -n 1 ~{samplename}_amrfinder_all.tsv | tee ~{samplename}_amrfinder_stress.tsv ~{samplename}_amrfinder_virulence.tsv ~{samplename}_amrfinder_amr.tsv >/dev/null
+    # looks for all rows with STRESS, AMR, or VIRULENCE and append to TSVs
+    grep 'STRESS' ~{samplename}_amrfinder_all.tsv >> ~{samplename}_amrfinder_stress.tsv
+    grep 'VIRULENCE' ~{samplename}_amrfinder_all.tsv >> ~{samplename}_amrfinder_virulence.tsv
+    # || true is so that the final grep exits with code 0, preventing failures
+    grep 'AMR' ~{samplename}_amrfinder_all.tsv >> ~{samplename}_amrfinder_amr.tsv || true
+
+    # create string outputs for all genes identified in AMR, STRESS, VIRULENCE
+    amr_genes=$(awk -F '\t' '{ print $7 }' ~{samplename}_amrfinder_amr.tsv | tail -n+2 | tr '\n' ', ' | sed 's/.$//')
+    stress_genes=$(awk -F '\t' '{ print $7 }' ~{samplename}_amrfinder_stress.tsv | tail -n+2 | tr '\n' ', ' | sed 's/.$//')
+    virulence_genes=$(awk -F '\t' '{ print $7 }' ~{samplename}_amrfinder_virulence.tsv | tail -n+2 | tr '\n' ', ' | sed 's/.$//')
+
+    # if variable for list of genes is EMPTY, write string saying it is empty to float to Terra table
+    if [ -z "${amr_genes}" ]; then
+       amr_genes="No AMR genes detected by NCBI-AMRFinderPlus"
+    fi 
+    if [ -z "${stress_genes}" ]; then
+       stress_genes="No STRESS genes detected by NCBI-AMRFinderPlus"
+    fi 
+    if [ -z "${virulence_genes}" ]; then
+       virulence_genes="No VIRULENCE genes detected by NCBI-AMRFinderPlus"
+    fi 
+
+    # create final output strings
+    echo "${amr_genes}" > AMR_GENES
+    echo "${stress_genes}" > STRESS_GENES
+    echo "${virulence_genes}" > VIRULENCE_GENES
+  >>>
+  output {
+    File amrfinderplus_all_report = "~{samplename}_amrfinder_all.tsv"
+    File amrfinderplus_amr_report = "~{samplename}_amrfinder_amr.tsv"
+    File amrfinderplus_stress_report = "~{samplename}_amrfinder_stress.tsv"
+    File amrfinderplus_virulence_report = "~{samplename}_amrfinder_virulence.tsv"
+    String amrfinderplus_amr_genes = read_string("AMR_GENES")
+    String amrfinderplus_stress_genes = read_string("STRESS_GENES")
+    String amrfinderplus_virulence_genes = read_string("VIRULENCE_GENES")
+    String amrfinderplus_version = read_string("AMRFINDER_VERSION")
+    String amrfinderplus_db_version = read_string("AMRFINDER_DB_VERSION")
+  }
+  runtime {
+    memory: "8 GB"
+    cpu: cpu
+    docker: docker
+    disks: "local-disk 100 SSD"
+    preemptible: 0
+    maxRetries: 3
+  }
+}
diff --git a/tasks/quality_control/task_cg_pipeline.wdl b/tasks/quality_control/task_cg_pipeline.wdl
@@ -32,6 +32,7 @@ task cg_pipeline {
           with open("R2_MEAN_Q", 'wt') as r2_mean_q:
             r2_mean_q.write(line["avgQuality"])
           coverage += float(line["coverage"])
+          coverage="{:.2f}".format(coverage)
           with open("EST_COVERAGE", 'wt') as est_coverage:
             est_coverage.write(str(coverage))
     CODE
diff --git a/tasks/quality_control/task_screen.wdl b/tasks/quality_control/task_screen.wdl
@@ -49,8 +49,9 @@ task check_reads {
         # wc -c counts characters
 
         # set proportion variables for easy comparison
-        percent_read1=$(python3 -c "print(round(($read1_bp / $read2_bp)*100,2))")
-        percent_read2=$(python3 -c "print(round(($read2_bp / $read1_bp)*100,2))")
+        # removing the , 2) to make these integers instead of floats
+        percent_read1=$(python3 -c "print(round(($read1_bp / $read2_bp)*100))")
+        percent_read2=$(python3 -c "print(round(($read2_bp / $read1_bp)*100))")
 
         if [ "$percent_read1" -lt "~{min_proportion}" ] ; then
           flag="FAIL; more than 50 percent of the total sequence is found in R2 (BP: $read2_bp; PERCENT: $percent_read2) compared to R1 (BP: $read1_bp; PERCENT: $percent_read1)"
diff --git a/tasks/quality_control/task_trimmomatic.wdl b/tasks/quality_control/task_trimmomatic.wdl
@@ -21,7 +21,7 @@ task trimmomatic_pe {
     ~{read1} ~{read2} \
     -baseout ~{samplename}.fastq.gz \
     SLIDINGWINDOW:~{trimmomatic_window_size}:~{trimmomatic_quality_trim_score} \
-    MINLEN:~{trimmomatic_minlen} > ~{samplename}.trim.stats.txt
+    MINLEN:~{trimmomatic_minlen} &> ~{samplename}.trim.stats.txt
   >>>
   output {
     File read1_trimmed = "~{samplename}_1P.fastq.gz"
diff --git a/tasks/species_typing/task_ts_mlst.wdl b/tasks/species_typing/task_ts_mlst.wdl
@@ -7,7 +7,7 @@ task ts_mlst {
   input {
     File assembly
     String samplename
-    String docker = "staphb/mlst:2.19.0"
+    String docker = "staphb/mlst:2.22.0"
     Int? cpu = 4
     # Parameters
     # --nopath          Strip filename paths from FILE column (default OFF)
diff --git a/tasks/task_versioning.wdl b/tasks/task_versioning.wdl
@@ -8,10 +8,10 @@ task version_capture {
     volatile: true
   }
   command {
-    PHBG_Version="PHBG v.0.4-dev"
+    PHBG_Version="PHBG v0.5.0"
     ~{default='' 'export TZ=' + timezone}
     date +"%Y-%m-%d" > TODAY
-    echo $PHBG_Version > PHBG_VERSION
+    echo "$PHBG_Version" > PHBG_VERSION
   }
   output {
     String date = read_string("TODAY")
diff --git a/tasks/utilities/task_broad_terra_tools.wdl b/tasks/utilities/task_broad_terra_tools.wdl
@@ -48,9 +48,15 @@ task export_taxon_tables {
     String gambit_version
     String gambit_db_version
     String gambit_docker
-    File abricate_amr_results
-    String abricate_amr_database
-    String abricate_amr_version
+    File amrfinderplus_all_report
+    File amrfinderplus_amr_report
+    File amrfinderplus_stress_report
+    File amrfinderplus_virulence_report
+    String amrfinderplus_amr_genes
+    String amrfinderplus_stress_genes
+    String amrfinderplus_virulence_genes
+    String amrfinderplus_version
+    String amrfinderplus_db_version
     String ts_mlst_results
     String ts_mlst_predicted_st
     String ts_mlst_pubmlst_scheme
@@ -118,9 +124,9 @@ task export_taxon_tables {
     if [ ! -z ${sample_table} ]; then
        # create single-entity sample data table
        ## header
-      echo -e "entity:${sample_table}_id\treads\tread1\tread2\trun_id\tcollection_date\toriginating_lab\tcity\tcounty\tzip\ttheiaprok_illumina_pe_version\ttheiaprok_illumina_pe_analysis_date\tseq_platform\tnum_reads_raw1\tnum_reads_raw2\tnum_reads_raw_pairs\tfastq_scan_version\tnum_reads_clean1\tnum_reads_clean2\tnum_reads_clean_pairs\ttrimmomatic_version\tbbduk_docker\tr1_mean_q\tr2_mean_q\tassembly_fasta\tcontigs_gfa\tshovill_pe_version\tquast_report\tquast_version\tgenome_length\tnumber_contigs\tn50_value\tcg_pipeline_report\tcg_pipeline_docker\test_coverage\tgambit_report\tgambit_predicted_taxon\tgambit_predicted_taxon_rank\tgambit_version\tgambit_db_version\tgambit_docker\tabricate_amr_results\tabricate_amr_database\tabricate_amr_version\tts_mlst_results\tts_mlst_predicted_st\tts_mlst_pubmlst_scheme\tts_mlst_version\tserotypefinder_report\tserotypefinder_docker\tserotypefinder_serotype\tectyper_results\tectyper_version\tectyper_predicted_serotype\tlissero_results\tlissero_version\tsistr_results\tsistr_allele_json\tsister_allele_fasta\tsistr_cgmlst\tsistr_version\tsistr_predicted_serotype\tseqsero2_report\tseqsero2_version\tseqsero2_predicted_antigenic_profile\tseqsero2_predicted_serotype\tseqsero2_predicted_contamination\tkleborate_output_file\tkleborate_version\tkleborate_key_resistance_genes\tkleborate_genomic_resistance_mutations\tkleborate_mlst_sequence_type\ttbprofiler_output_file\ttbprofiler_output_bam\ttbprofiler_output_bai\ttbprofiler_version\ttbprofiler_main_lineage\ttbprofiler_sub_lineage\ttbprofiler_dr_type\ttbprofiler_resistance_genes" > ~{samplename}_terra_table.tsv
+      echo -e "entity:${sample_table}_id\treads\tread1\tread2\trun_id\tcollection_date\toriginating_lab\tcity\tcounty\tzip\ttheiaprok_illumina_pe_version\ttheiaprok_illumina_pe_analysis_date\tseq_platform\tnum_reads_raw1\tnum_reads_raw2\tnum_reads_raw_pairs\tfastq_scan_version\tnum_reads_clean1\tnum_reads_clean2\tnum_reads_clean_pairs\ttrimmomatic_version\tbbduk_docker\tr1_mean_q\tr2_mean_q\tassembly_fasta\tcontigs_gfa\tshovill_pe_version\tquast_report\tquast_version\tgenome_length\tnumber_contigs\tn50_value\tcg_pipeline_report\tcg_pipeline_docker\test_coverage\tgambit_report\tgambit_predicted_taxon\tgambit_predicted_taxon_rank\tgambit_version\tgambit_db_version\tgambit_docker\tts_mlst_results\tts_mlst_predicted_st\tts_mlst_pubmlst_scheme\tts_mlst_version\tserotypefinder_report\tserotypefinder_docker\tserotypefinder_serotype\tectyper_results\tectyper_version\tectyper_predicted_serotype\tlissero_results\tlissero_version\tsistr_results\tsistr_allele_json\tsister_allele_fasta\tsistr_cgmlst\tsistr_version\tsistr_predicted_serotype\tseqsero2_report\tseqsero2_version\tseqsero2_predicted_antigenic_profile\tseqsero2_predicted_serotype\tseqsero2_predicted_contamination\tkleborate_output_file\tkleborate_version\tkleborate_key_resistance_genes\tkleborate_genomic_resistance_mutations\tkleborate_mlst_sequence_type\ttbprofiler_output_file\ttbprofiler_output_bam\ttbprofiler_output_bai\ttbprofiler_version\ttbprofiler_main_lineage\ttbprofiler_sub_lineage\ttbprofiler_dr_type\ttbprofiler_resistance_genes\tamrfinderplus_all_report\tamrfinderplus_amr_report\tamrfinderplus_stress_report\tamrfinderplus_virulence_report\tamrfinderplus_version\tamrfinderplus_db_version\tamrfinderplus_amr_genes\tamrfinderplus_stress_genes\tamrfinderplus_virulence_genes" > ~{samplename}_terra_table.tsv
       ## TheiaProk Outs
-      echo -e "~{samplename}\t~{reads}\t~{read1}\t~{read2}\t~{run_id}\t~{collection_date}\t~{originating_lab}\t~{city}\t~{county}\t~{zip}\t~{theiaprok_illumina_pe_version}\t~{theiaprok_illumina_pe_analysis_date}\t~{seq_platform}\t~{num_reads_raw1}\t~{num_reads_raw2}\t~{num_reads_raw_pairs}\t~{fastq_scan_version}\t~{num_reads_clean1}\t~{num_reads_clean2}\t~{num_reads_clean_pairs}\t~{trimmomatic_version}\t~{bbduk_docker}\t~{r1_mean_q}\t~{r2_mean_q}\t~{assembly_fasta}\t~{contigs_gfa}\t~{shovill_pe_version}\t~{quast_report}\t~{quast_version}\t~{genome_length}\t~{number_contigs}\t~{n50_value}\t~{cg_pipeline_report}\t~{cg_pipeline_docker}\t~{est_coverage}\t~{gambit_report}\t~{gambit_predicted_taxon}\t~{gambit_predicted_taxon_rank}\t~{gambit_version}\t~{gambit_db_version}\t~{gambit_docker}\t~{abricate_amr_results}\t~{abricate_amr_database}\t~{abricate_amr_version}\t~{ts_mlst_results}\t~{ts_mlst_predicted_st}\t~{ts_mlst_pubmlst_scheme}\t~{ts_mlst_version}\t~{serotypefinder_report}\t~{serotypefinder_docker}\t~{serotypefinder_serotype}\t~{ectyper_results}\t~{ectyper_version}\t~{ectyper_predicted_serotype}\t~{lissero_results}\t~{lissero_version}\t~{sistr_results}\t~{sistr_allele_json}\t~{sister_allele_fasta}\t~{sistr_cgmlst}\t~{sistr_version}\t~{sistr_predicted_serotype}\t~{seqsero2_report}\t~{seqsero2_version}\t~{seqsero2_predicted_antigenic_profile}\t~{seqsero2_predicted_serotype}\t~{seqsero2_predicted_contamination}\t~{kleborate_output_file}\t~{kleborate_version}\t~{kleborate_key_resistance_genes}\t~{kleborate_genomic_resistance_mutations}\t~{kleborate_mlst_sequence_type}\t~{tbprofiler_output_file}\t~{tbprofiler_output_bam}\t~{tbprofiler_output_bai}\t~{tbprofiler_version}\t~{tbprofiler_main_lineage}\t~{tbprofiler_sub_lineage}\t~{tbprofiler_dr_type}\t~{tbprofiler_resistance_genes}"  >> ~{samplename}_terra_table.tsv
+      echo -e "~{samplename}\t~{reads}\t~{read1}\t~{read2}\t~{run_id}\t~{collection_date}\t~{originating_lab}\t~{city}\t~{county}\t~{zip}\t~{theiaprok_illumina_pe_version}\t~{theiaprok_illumina_pe_analysis_date}\t~{seq_platform}\t~{num_reads_raw1}\t~{num_reads_raw2}\t~{num_reads_raw_pairs}\t~{fastq_scan_version}\t~{num_reads_clean1}\t~{num_reads_clean2}\t~{num_reads_clean_pairs}\t~{trimmomatic_version}\t~{bbduk_docker}\t~{r1_mean_q}\t~{r2_mean_q}\t~{assembly_fasta}\t~{contigs_gfa}\t~{shovill_pe_version}\t~{quast_report}\t~{quast_version}\t~{genome_length}\t~{number_contigs}\t~{n50_value}\t~{cg_pipeline_report}\t~{cg_pipeline_docker}\t~{est_coverage}\t~{gambit_report}\t~{gambit_predicted_taxon}\t~{gambit_predicted_taxon_rank}\t~{gambit_version}\t~{gambit_db_version}\t~{gambit_docker}\t~{ts_mlst_results}\t~{ts_mlst_predicted_st}\t~{ts_mlst_pubmlst_scheme}\t~{ts_mlst_version}\t~{serotypefinder_report}\t~{serotypefinder_docker}\t~{serotypefinder_serotype}\t~{ectyper_results}\t~{ectyper_version}\t~{ectyper_predicted_serotype}\t~{lissero_results}\t~{lissero_version}\t~{sistr_results}\t~{sistr_allele_json}\t~{sister_allele_fasta}\t~{sistr_cgmlst}\t~{sistr_version}\t~{sistr_predicted_serotype}\t~{seqsero2_report}\t~{seqsero2_version}\t~{seqsero2_predicted_antigenic_profile}\t~{seqsero2_predicted_serotype}\t~{seqsero2_predicted_contamination}\t~{kleborate_output_file}\t~{kleborate_version}\t~{kleborate_key_resistance_genes}\t~{kleborate_genomic_resistance_mutations}\t~{kleborate_mlst_sequence_type}\t~{tbprofiler_output_file}\t~{tbprofiler_output_bam}\t~{tbprofiler_output_bai}\t~{tbprofiler_version}\t~{tbprofiler_main_lineage}\t~{tbprofiler_sub_lineage}\t~{tbprofiler_dr_type}\t~{tbprofiler_resistance_genes}\t~{amrfinderplus_all_report}\t~{amrfinderplus_amr_report}\t~{amrfinderplus_stress_report}\t~{amrfinderplus_virulence_report}\t~{amrfinderplus_version}\t~{amrfinderplus_db_version}\t~{amrfinderplus_amr_genes}\t~{amrfinderplus_stress_genes}\t~{amrfinderplus_virulence_genes}"  >> ~{samplename}_terra_table.tsv
       # modify file paths to GCP URIs
       sed -i 's/\/cromwell_root\//gs:\/\//g' ~{samplename}_terra_table.tsv
       # export table 
diff --git a/workflows/wf_amrfinderplus.wdl b/workflows/wf_amrfinderplus.wdl
@@ -0,0 +1,32 @@
+version 1.0
+
+import "../tasks/gene_typing/task_amrfinderplus.wdl" as amrfindertask
+import "../tasks/task_versioning.wdl" as versioning
+
+workflow amrfinderplus_wf {
+  input {
+      File assembly
+      String samplename
+    }
+  call amrfindertask.amrfinderplus_nuc {
+    input:
+      assembly = assembly,
+      samplename = samplename
+    }
+  call versioning.version_capture{
+    input:
+  }
+  output {
+    String amrfinderplus_version = amrfinderplus_nuc.amrfinderplus_version
+    String amrfinderplus_db_version = amrfinderplus_nuc.amrfinderplus_db_version
+    String amrfinderplus_wf_version = version_capture.phbg_version
+    String amrfinderplus_wf_analysis_date = version_capture.date
+    File amrfinderplus_all_report = amrfinderplus_nuc.amrfinderplus_all_report
+    File amrfinderplus_amr_report = amrfinderplus_nuc.amrfinderplus_amr_report
+    File amrfinderplus_stress_report = amrfinderplus_nuc.amrfinderplus_stress_report
+    File amrfinderplus_virulence_report = amrfinderplus_nuc.amrfinderplus_virulence_report
+    String amrfinderplus_amr_genes = amrfinderplus_nuc.amrfinderplus_amr_genes
+    String amrfinderplus_stress_genes = amrfinderplus_nuc.amrfinderplus_stress_genes
+    String amrfinderplus_virulence_genes = amrfinderplus_nuc.amrfinderplus_virulence_genes
+    }
+ }
diff --git a/workflows/wf_gambit_query.wdl b/workflows/wf_gambit_query.wdl
diff --git a/workflows/wf_kraken2_pe.wdl b/workflows/wf_kraken2_pe.wdl
diff --git a/workflows/wf_kraken2_se.wdl b/workflows/wf_kraken2_se.wdl
diff --git a/workflows/wf_theiaprok_illumina_pe.wdl b/workflows/wf_theiaprok_illumina_pe.wdl

Original file line number	Diff line number	Diff line change
`@@ -8,10 +8,10 @@ task version_capture {`
`8`	`8`	`volatile: true`
`9`	`9`	`}`
`10`	`10`	`command {`
`11`		`- PHBG_Version="PHBG v.0.4-dev"`
	`11`	`+ PHBG_Version="PHBG v0.5.0"`
`12`	`12`	`~{default='' 'export TZ=' + timezone}`
`13`	`13`	`date +"%Y-%m-%d" > TODAY`
`14`		`- echo $PHBG_Version > PHBG_VERSION`
	`14`	`+ echo "$PHBG_Version" > PHBG_VERSION`
`15`	`15`	`}`
`16`	`16`	`output {`
`17`	`17`	`String date = read_string("TODAY")`