Codeplot
WGS
Introduction

Whole Genome Sequencing (WGS) is to sequence the species with known Genome sequence. Through sequence alignment and mutation detection, a large number of variation information can be found, including single nucleotide polymorphism (SNP), insertion deletion (INDEL).

Script
Input
Task nameAttribute nameTypeDescription
* WGS split_chrBoolean GATK variant calling by chromosome
* WGS snp_filter_optionString option of GATK snp filter
* WGS sample_nameString sample name
* WGS run_fqfilterBoolean run fastq filter or not
* WGS ref_nameString reference name
* WGS refFile reference sequence file
* WGS indel_filter_optionString option of GATK indel filter
* WGS fqfilter_threadInt threads number used in fastq filter
* WGS fqfilter_qualSysInt input fastq quality system
* WGS fqfilter_qualRateFloat low quality rate in fastq filter
* WGS fqfilter_outQualSysInt output fastq quality system
* WGS fqfilter_nRateFloat N rate threshold in fastq filter
* WGS fqfilter_lowQualInt low quality threshold in fastq filter
* WGS fqfilter_adapter2String adapter sequence of read2
* WGS fqfilter_adapter1String adapter sequence of read1
* WGS fq2Array[File] paired-end sequenceing read2
* WGS fq1Array[File] paired-end sequenceing read1
* WGS align_threadInt number threads used in bwa mem
* WGS align_puString PU in bwa RG
* WGS align_plString PL in bwa RG
* WGS align_libString LIB in bwa RG
* WGS align_idString ID in bwa RG
* WGS align_cnString CN in bwa RG
Output
Task nameAttribute nameTypeDescription
* WGS fqStatFile output fastq statistical results
* WGS markdupBamFile output bam which duplicate reads was tags or remove
* WGS MarkdupBamBaiFile index of markdup bam
* WGS gvcfFile gvcf file of variant calling
* WGS gvcfTbiFile gvcf index file
* WGS vcfFile vcf file of variant calling
* WGS vcfTbiFile vcf index file
* WGS filterSnpFile filter snp
* WGS filterIndelFile filter indel
* WGS snpStatFile snp statistical results
* WGS indelStatFile indel statistical results
* WGS snpTypeFile snp type statistical
* WGS indelLengthFile indel length statistical
* WGS bamStatFile output bam statistical
* WGS bamCumuFile align cumulative depth distribution
* WGS bamHisFile align depth distribution
* WGS bamInsertFile insertion length distribution