02/14/2018 10:37:18 PM - kneaddata.knead_data - INFO: Running kneaddata v0.7.0
02/14/2018 10:37:18 PM - kneaddata.knead_data - INFO: Output files will be written to: /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main
02/14/2018 10:37:18 PM - kneaddata.knead_data - DEBUG: Running with the following arguments: 
verbose = False
bmtagger_path = None
minscore = 50
bowtie2_path = /n/sw/centos6/bowtie2-2.2.1/bowtie2
maxperiod = 500
no_discordant = False
serial = True
fastqc_start = False
bmtagger = False
cat_final_output = True
log_level = DEBUG
log = /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.log
max_memory = 500m
remove_intermediate_output = True
fastqc_path = None
output_dir = /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main
trf_path = None
remove_temp_output = True
reference_db = /n/huttenhower_lab/data/kneaddata_databases/Homo_sapiens
input = /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/fastq/CSM79HIV_R1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/fastq/CSM79HIV_R2.fastq
pi = 10
reorder = False
pm = 80
trimmomatic_path = /n/huttenhower_lab/tools/kneaddata/bin_v0.7.0_devel/trimmomatic-0.33.jar
store_temp_output = False
mismatch = 7
threads = 6
delta = 7
bowtie2_options = --very-sensitive --phred33
bypass_trim = False
processes = 1
trimmomatic_quality_scores = -phred33
fastqc_end = False
trf = False
trimmomatic_options = None
output_prefix = CSM79HIV
match = 2

02/14/2018 10:38:07 PM - kneaddata.utilities - INFO: READ COUNT: raw pair1 : Initial number of reads ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/fastq/CSM79HIV_R1.fastq ): 14606116
02/14/2018 10:39:00 PM - kneaddata.utilities - INFO: READ COUNT: raw pair2 : Initial number of reads ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/fastq/CSM79HIV_R2.fastq ): 14606116
02/14/2018 10:39:00 PM - kneaddata.utilities - DEBUG: Checking input file to Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/fastq/CSM79HIV_R1.fastq
02/14/2018 10:39:00 PM - kneaddata.utilities - DEBUG: Checking input file to Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/fastq/CSM79HIV_R2.fastq
02/14/2018 10:39:00 PM - kneaddata.utilities - INFO: Running Trimmomatic ... 
02/14/2018 10:39:00 PM - kneaddata.utilities - INFO: Execute command: java -Xmx500m -d64 -jar /n/huttenhower_lab/tools/kneaddata/bin_v0.7.0_devel/trimmomatic-0.33.jar PE -threads 6 -phred33 /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/fastq/CSM79HIV_R1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/fastq/CSM79HIV_R2.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.2.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.2.fastq ILLUMINACLIP:/n/huttenhower_lab/tools/kneaddata/lib_v0.7.0_devel/kneaddata/adapters/NexteraPE-PE.fa:2:30:10:8:TRUE SLIDINGWINDOW:4:20 MINLEN:50
02/14/2018 10:41:42 PM - kneaddata.utilities - DEBUG: TrimmomaticPE: Started with arguments: -threads 6 -phred33 /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/fastq/CSM79HIV_R1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/fastq/CSM79HIV_R2.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.1.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.2.fastq /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.2.fastq ILLUMINACLIP:/n/huttenhower_lab/tools/kneaddata/lib_v0.7.0_devel/kneaddata/adapters/NexteraPE-PE.fa:2:30:10:8:TRUE SLIDINGWINDOW:4:20 MINLEN:50
Using PrefixPair: 'AGATGTGTATAAGAGACAG' and 'AGATGTGTATAAGAGACAG'
Using Long Clipping Sequence: 'GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG'
Using Long Clipping Sequence: 'TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG'
Using Long Clipping Sequence: 'CTGTCTCTTATACACATCTGACGCTGCCGACGA'
Using Long Clipping Sequence: 'CTGTCTCTTATACACATCTCCGAGCCCACGAGAC'
ILLUMINACLIP: Using 1 prefix pairs, 4 forward/reverse sequences, 0 forward only sequences, 0 reverse only sequences
Input Read Pairs: 14606116 Both Surviving: 11766071 (80.56%) Forward Only Surviving: 512509 (3.51%) Reverse Only Surviving: 436136 (2.99%) Dropped: 1891400 (12.95%)
TrimmomaticPE: Completed successfully

02/14/2018 10:41:42 PM - kneaddata.utilities - DEBUG: Checking output file from Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.1.fastq
02/14/2018 10:41:42 PM - kneaddata.utilities - DEBUG: Checking output file from Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.1.fastq
02/14/2018 10:41:42 PM - kneaddata.utilities - DEBUG: Checking output file from Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.2.fastq
02/14/2018 10:41:42 PM - kneaddata.utilities - DEBUG: Checking output file from Trimmomatic : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.2.fastq
02/14/2018 10:41:47 PM - kneaddata.utilities - INFO: READ COUNT: trimmed pair1 : Total reads after trimming ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.1.fastq ): 11766071
02/14/2018 10:41:52 PM - kneaddata.utilities - INFO: READ COUNT: trimmed pair2 : Total reads after trimming ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.2.fastq ): 11766071
02/14/2018 10:41:53 PM - kneaddata.utilities - INFO: READ COUNT: trimmed orphan1 : Total reads after trimming ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.1.fastq ): 512509
02/14/2018 10:41:53 PM - kneaddata.utilities - INFO: READ COUNT: trimmed orphan2 : Total reads after trimming ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.2.fastq ): 436136
02/14/2018 10:41:53 PM - kneaddata.run - INFO: Decontaminating ...
02/14/2018 10:41:53 PM - kneaddata.utilities - DEBUG: Checking input file to bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.1.fastq
02/14/2018 10:41:53 PM - kneaddata.utilities - DEBUG: Checking input file to bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.2.fastq
02/14/2018 10:41:53 PM - kneaddata.utilities - DEBUG: Checking input file to bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.1.fastq
02/14/2018 10:41:53 PM - kneaddata.utilities - DEBUG: Checking input file to bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.2.fastq
02/14/2018 10:41:53 PM - kneaddata.utilities - INFO: Running bowtie2 ... 
02/14/2018 10:41:53 PM - kneaddata.utilities - INFO: Execute command: kneaddata_bowtie2_discordant_pairs --bowtie2 /n/sw/centos6/bowtie2-2.2.1/bowtie2 --threads 6 -x /n/huttenhower_lab/data/kneaddata_databases/Homo_sapiens --bowtie2-options "--very-sensitive --phred33" -1 /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.1.fastq -2 /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.2.fastq --un-pair /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_paired_clean_%.fastq --al-pair /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_paired_contam_%.fastq -U /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.1.fastq,/n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.trimmed.single.2.fastq --un-single /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_unmatched_%_clean.fastq --al-single /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_unmatched_%_contam.fastq -S /dev/null
02/14/2018 10:54:49 PM - kneaddata.utilities - DEBUG: 24480787 reads; of these:
  24480787 (100.00%) were unpaired; of these:
    22014065 (89.92%) aligned 0 times
    1905779 (7.78%) aligned exactly 1 time
    560943 (2.29%) aligned >1 times
10.08% overall alignment rate
pair1_aligned : 1174751
pair2_aligned : 1174751
orphan1_unaligned : 464646
orphan2_unaligned : 391157
orphan2_aligned : 57168
pair2_unaligned : 10579131
pair1_unaligned : 10579131
orphan1_aligned : 60052

02/14/2018 10:54:49 PM - kneaddata.utilities - DEBUG: Checking output file from bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_paired_clean_1.fastq
02/14/2018 10:54:49 PM - kneaddata.utilities - DEBUG: Checking output file from bowtie2 : /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_paired_clean_2.fastq
02/14/2018 10:54:49 PM - kneaddata.run - INFO: Total contaminate sequences in file ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_paired_contam_1.fastq ) : 1174751
02/14/2018 10:54:50 PM - kneaddata.run - INFO: Total contaminate sequences in file ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_paired_contam_2.fastq ) : 1174751
02/14/2018 10:54:50 PM - kneaddata.run - INFO: Total contaminate sequences in file ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_unmatched_1_contam.fastq ) : 60052
02/14/2018 10:54:50 PM - kneaddata.run - INFO: Total contaminate sequences in file ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_unmatched_2_contam.fastq ) : 57168
02/14/2018 10:54:56 PM - kneaddata.utilities - INFO: READ COUNT: decontaminated Homo_sapiens pair1 : Total reads after removing those found in reference database ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_paired_clean_1.fastq ): 10579131
02/14/2018 10:55:01 PM - kneaddata.utilities - INFO: READ COUNT: decontaminated Homo_sapiens pair2 : Total reads after removing those found in reference database ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_paired_clean_2.fastq ): 10579131
02/14/2018 10:55:06 PM - kneaddata.utilities - INFO: READ COUNT: final pair1 : Total reads after merging results from multiple databases ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_paired_1.fastq ): 10579131
02/14/2018 10:55:10 PM - kneaddata.utilities - INFO: READ COUNT: final pair2 : Total reads after merging results from multiple databases ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_paired_2.fastq ): 10579131
02/14/2018 10:55:10 PM - kneaddata.utilities - WARNING: Unable to remove file: /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_paired_clean_1.fastq
02/14/2018 10:55:10 PM - kneaddata.utilities - WARNING: Unable to remove file: /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_paired_clean_2.fastq
02/14/2018 10:55:10 PM - kneaddata.utilities - INFO: READ COUNT: decontaminated Homo_sapiens orphan1 : Total reads after removing those found in reference database ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_unmatched_1_clean.fastq ): 464646
02/14/2018 10:55:11 PM - kneaddata.utilities - INFO: READ COUNT: final orphan1 : Total reads after merging results from multiple databases ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_unmatched_1.fastq ): 464646
02/14/2018 10:55:11 PM - kneaddata.utilities - WARNING: Unable to remove file: /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_unmatched_1_clean.fastq
02/14/2018 10:55:11 PM - kneaddata.utilities - INFO: READ COUNT: decontaminated Homo_sapiens orphan2 : Total reads after removing those found in reference database ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_unmatched_2_clean.fastq ): 391157
02/14/2018 10:55:11 PM - kneaddata.utilities - INFO: READ COUNT: final orphan2 : Total reads after merging results from multiple databases ( /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_unmatched_2.fastq ): 391157
02/14/2018 10:55:11 PM - kneaddata.utilities - WARNING: Unable to remove file: /n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV_Homo_sapiens_bowtie2_unmatched_2_clean.fastq
02/14/2018 10:55:23 PM - kneaddata.knead_data - INFO: 
Final output file created: 
/n/regal/huttenhower_lab/carze/data/hmp2/workflow/processing/hmp2/2018-02-14/WGS/kneaddata/main/CSM79HIV.fastq