02/13/2018 12:05:05 PM - kneaddata.knead_data - INFO: Running kneaddata v0.7.0 02/13/2018 12:05:05 PM - kneaddata.knead_data - INFO: Output files will be written to: /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main 02/13/2018 12:05:05 PM - kneaddata.knead_data - DEBUG: Running with the following arguments: verbose = False bmtagger_path = None minscore = 50 bowtie2_path = /n/sw/centos6/bowtie2-2.2.1/bowtie2 maxperiod = 500 no_discordant = False serial = True fastqc_start = False bmtagger = False cat_final_output = True log_level = DEBUG log = /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.log max_memory = 500m remove_intermediate_output = True fastqc_path = None output_dir = /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main trf_path = None remove_temp_output = True reference_db = /n/huttenhower_lab/data/kneaddata_databases/Homo_sapiens input = /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/fastq/MSMB4LZR_R1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/fastq/MSMB4LZR_R2.fastq pi = 10 reorder = False pm = 80 trimmomatic_path = /n/huttenhower_lab/tools/kneaddata/bin_v0.7.0_devel/trimmomatic-0.33.jar store_temp_output = False mismatch = 7 threads = 6 delta = 7 bowtie2_options = --very-sensitive --phred33 bypass_trim = False processes = 1 trimmomatic_quality_scores = -phred33 fastqc_end = False trf = False trimmomatic_options = None output_prefix = MSMB4LZR match = 2 02/13/2018 12:05:32 PM - kneaddata.utilities - INFO: READ COUNT: raw pair1 : Initial number of reads ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/fastq/MSMB4LZR_R1.fastq ): 22629224 02/13/2018 12:06:00 PM - kneaddata.utilities - INFO: READ COUNT: raw pair2 : Initial number of reads ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/fastq/MSMB4LZR_R2.fastq ): 22629224 02/13/2018 12:06:00 PM - kneaddata.utilities - DEBUG: Checking input file to Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/fastq/MSMB4LZR_R1.fastq 02/13/2018 12:06:00 PM - kneaddata.utilities - DEBUG: Checking input file to Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/fastq/MSMB4LZR_R2.fastq 02/13/2018 12:06:00 PM - kneaddata.utilities - INFO: Running Trimmomatic ... 02/13/2018 12:06:00 PM - kneaddata.utilities - INFO: Execute command: java -Xmx500m -d64 -jar /n/huttenhower_lab/tools/kneaddata/bin_v0.7.0_devel/trimmomatic-0.33.jar PE -threads 6 -phred33 /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/fastq/MSMB4LZR_R1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/fastq/MSMB4LZR_R2.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.2.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.2.fastq ILLUMINACLIP:/n/huttenhower_lab/tools/kneaddata/lib_v0.7.0_devel/kneaddata/adapters/NexteraPE-PE.fa:2:30:10:8:TRUE SLIDINGWINDOW:4:20 MINLEN:50 02/13/2018 12:10:02 PM - kneaddata.utilities - DEBUG: TrimmomaticPE: Started with arguments: -threads 6 -phred33 /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/fastq/MSMB4LZR_R1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/fastq/MSMB4LZR_R2.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.2.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.2.fastq ILLUMINACLIP:/n/huttenhower_lab/tools/kneaddata/lib_v0.7.0_devel/kneaddata/adapters/NexteraPE-PE.fa:2:30:10:8:TRUE SLIDINGWINDOW:4:20 MINLEN:50 Using PrefixPair: 'AGATGTGTATAAGAGACAG' and 'AGATGTGTATAAGAGACAG' Using Long Clipping Sequence: 'GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG' Using Long Clipping Sequence: 'TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG' Using Long Clipping Sequence: 'CTGTCTCTTATACACATCTGACGCTGCCGACGA' Using Long Clipping Sequence: 'CTGTCTCTTATACACATCTCCGAGCCCACGAGAC' ILLUMINACLIP: Using 1 prefix pairs, 4 forward/reverse sequences, 0 forward only sequences, 0 reverse only sequences Input Read Pairs: 22629224 Both Surviving: 14059585 (62.13%) Forward Only Surviving: 1389426 (6.14%) Reverse Only Surviving: 2369029 (10.47%) Dropped: 4811184 (21.26%) TrimmomaticPE: Completed successfully 02/13/2018 12:10:02 PM - kneaddata.utilities - DEBUG: Checking output file from Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.1.fastq 02/13/2018 12:10:02 PM - kneaddata.utilities - DEBUG: Checking output file from Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.1.fastq 02/13/2018 12:10:02 PM - kneaddata.utilities - DEBUG: Checking output file from Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.2.fastq 02/13/2018 12:10:02 PM - kneaddata.utilities - DEBUG: Checking output file from Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.2.fastq 02/13/2018 12:10:18 PM - kneaddata.utilities - INFO: READ COUNT: trimmed pair1 : Total reads after trimming ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.1.fastq ): 14059585 02/13/2018 12:10:30 PM - kneaddata.utilities - INFO: READ COUNT: trimmed pair2 : Total reads after trimming ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.2.fastq ): 14059585 02/13/2018 12:10:36 PM - kneaddata.utilities - INFO: READ COUNT: trimmed orphan1 : Total reads after trimming ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.1.fastq ): 1389426 02/13/2018 12:10:41 PM - kneaddata.utilities - INFO: READ COUNT: trimmed orphan2 : Total reads after trimming ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.2.fastq ): 2369029 02/13/2018 12:10:41 PM - kneaddata.run - INFO: Decontaminating ... 02/13/2018 12:10:46 PM - kneaddata.utilities - DEBUG: Checking input file to bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.1.fastq 02/13/2018 12:10:46 PM - kneaddata.utilities - DEBUG: Checking input file to bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.2.fastq 02/13/2018 12:10:46 PM - kneaddata.utilities - DEBUG: Checking input file to bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.1.fastq 02/13/2018 12:10:46 PM - kneaddata.utilities - DEBUG: Checking input file to bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.2.fastq 02/13/2018 12:10:46 PM - kneaddata.utilities - INFO: Running bowtie2 ... 02/13/2018 12:10:46 PM - kneaddata.utilities - INFO: Execute command: kneaddata_bowtie2_discordant_pairs --bowtie2 /n/sw/centos6/bowtie2-2.2.1/bowtie2 --threads 6 -x /n/huttenhower_lab/data/kneaddata_databases/Homo_sapiens --bowtie2-options "--very-sensitive --phred33" -1 /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.1.fastq -2 /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.2.fastq --un-pair /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_paired_clean_%.fastq --al-pair /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_paired_contam_%.fastq -U /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.1.fastq,/n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.trimmed.single.2.fastq --un-single /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_unmatched_%_clean.fastq --al-single /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_unmatched_%_contam.fastq -S /dev/null 02/13/2018 12:30:32 PM - kneaddata.utilities - DEBUG: 31877625 reads; of these: 31877625 (100.00%) were unpaired; of these: 28170824 (88.37%) aligned 0 times 2842622 (8.92%) aligned exactly 1 time 864179 (2.71%) aligned >1 times 11.63% overall alignment rate pair1_aligned : 1618663 pair2_aligned : 1618663 orphan1_unaligned : 1227184 orphan2_unaligned : 2098706 orphan2_aligned : 288778 pair2_unaligned : 12422467 pair1_unaligned : 12422467 orphan1_aligned : 180697 02/13/2018 12:30:33 PM - kneaddata.utilities - DEBUG: Checking output file from bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_paired_clean_1.fastq 02/13/2018 12:30:33 PM - kneaddata.utilities - DEBUG: Checking output file from bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_paired_clean_2.fastq 02/13/2018 12:30:46 PM - kneaddata.run - INFO: Total contaminate sequences in file ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_paired_contam_1.fastq ) : 1618663 02/13/2018 12:30:48 PM - kneaddata.run - INFO: Total contaminate sequences in file ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_paired_contam_2.fastq ) : 1618663 02/13/2018 12:30:48 PM - kneaddata.run - INFO: Total contaminate sequences in file ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_unmatched_1_contam.fastq ) : 180697 02/13/2018 12:30:48 PM - kneaddata.run - INFO: Total contaminate sequences in file ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_unmatched_2_contam.fastq ) : 288778 02/13/2018 12:31:48 PM - kneaddata.utilities - INFO: READ COUNT: decontaminated Homo_sapiens pair1 : Total reads after removing those found in reference database ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_paired_clean_1.fastq ): 12422467 02/13/2018 12:32:03 PM - kneaddata.utilities - INFO: READ COUNT: decontaminated Homo_sapiens pair2 : Total reads after removing those found in reference database ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_paired_clean_2.fastq ): 12422467 02/13/2018 12:32:09 PM - kneaddata.utilities - INFO: READ COUNT: final pair1 : Total reads after merging results from multiple databases ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_paired_1.fastq ): 12422467 02/13/2018 12:32:14 PM - kneaddata.utilities - INFO: READ COUNT: final pair2 : Total reads after merging results from multiple databases ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_paired_2.fastq ): 12422467 02/13/2018 12:32:14 PM - kneaddata.utilities - WARNING: Unable to remove file: /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_paired_clean_1.fastq 02/13/2018 12:32:14 PM - kneaddata.utilities - WARNING: Unable to remove file: /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_paired_clean_2.fastq 02/13/2018 12:32:15 PM - kneaddata.utilities - INFO: READ COUNT: decontaminated Homo_sapiens orphan1 : Total reads after removing those found in reference database ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_unmatched_1_clean.fastq ): 1227184 02/13/2018 12:32:15 PM - kneaddata.utilities - INFO: READ COUNT: final orphan1 : Total reads after merging results from multiple databases ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_unmatched_1.fastq ): 1227184 02/13/2018 12:32:15 PM - kneaddata.utilities - WARNING: Unable to remove file: /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_unmatched_1_clean.fastq 02/13/2018 12:32:16 PM - kneaddata.utilities - INFO: READ COUNT: decontaminated Homo_sapiens orphan2 : Total reads after removing those found in reference database ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_unmatched_2_clean.fastq ): 2098706 02/13/2018 12:32:17 PM - kneaddata.utilities - INFO: READ COUNT: final orphan2 : Total reads after merging results from multiple databases ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_unmatched_2.fastq ): 2098706 02/13/2018 12:32:17 PM - kneaddata.utilities - WARNING: Unable to remove file: /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR_Homo_sapiens_bowtie2_unmatched_2_clean.fastq 02/13/2018 12:32:49 PM - kneaddata.knead_data - INFO: Final output file created: /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2017-11-17/WGS/kneaddata/main/MSMB4LZR.fastq